Certainly one of the shared, fundamental goals of most chemistry researchers is the necessity to predict a molecule’s properties, equivalent to its boiling or melting point. Once researchers can pinpoint that prediction, they’re in a position to move forward with their work yielding discoveries that result in medicines, materials, and more. Historically, nevertheless, the normal methods of unveiling these predictions are related to a major cost — expending time and wear and tear on equipment, along with funds.
Enter a branch of artificial intelligence generally known as machine learning (ML). ML has lessened the burden of molecule property prediction to a level, however the advanced tools that the majority effectively expedite the method — by learning from existing data to make rapid predictions for brand spanking new molecules — require the user to have a major level of programming expertise. This creates an accessibility barrier for a lot of chemists, who may not have the numerous computational proficiency required to navigate the prediction pipeline.
To alleviate this challenge, researchers within the McGuire Research Group at MIT have created ChemXploreML, a user-friendly desktop app that helps chemists make these critical predictions without requiring advanced programming skills. Freely available, easy to download, and functional on mainstream platforms, this app can also be built to operate entirely offline, which helps keep research data proprietary. The exciting recent technology is printed in an article published recently in the Journal of Chemical Information and Modeling.
One specific hurdle in chemical machine learning is translating molecular structures right into a numerical language that computers can understand. ChemXploreML automates this complex process with powerful, built-in “molecular embedders” that transform chemical structures into informative numerical vectors. Next, the software implements state-of-the-art algorithms to discover patterns and accurately predict molecular properties like boiling and melting points, all through an intuitive, interactive graphical interface.
“The goal of ChemXploreML is to democratize using machine learning within the chemical sciences,” says Aravindh Nivas Marimuthu, a postdoc within the McGuire Group and lead creator of the article. “By creating an intuitive, powerful, and offline-capable desktop application, we’re putting state-of-the-art predictive modeling directly into the hands of chemists, no matter their programming background. This work not only accelerates the search for brand spanking new drugs and materials by making the screening process faster and cheaper, but its flexible design also opens doors for future innovations.”
ChemXploreML is designed to to evolve over time, in order future techniques and algorithms are developed, they will be seamlessly integrated into the app, ensuring that researchers are at all times in a position to access and implement the most modern methods. The appliance was tested on five key molecular properties of organic compounds — melting point, boiling point, vapor pressure, critical temperature, and important pressure — and achieved high accuracy scores of as much as 93 percent for the critical temperature. The researchers also demonstrated that a brand new, more compact approach to representing molecules (VICGAE) was nearly as accurate as standard methods, equivalent to Mol2Vec, but was as much as 10 times faster.
“We envision a future where any researcher can easily customize and apply machine learning to resolve unique challenges, from developing sustainable materials to exploring the complex chemistry of interstellar space,” says Marimuthu. Joining him on the paper is senior creator and Class of 1943 Profession Development Assistant Professor of Chemistry Brett McGuire.