Skip to content

Predictive model reveals dissolution patterns of molecules across various solvents

Machine Learning Model Predicts Molecular Dissolution in Organic Solvents, Escalating Pharmaceutical and Useful Molecule Production Efficiency at MIT

Forecasting Molecular Dissolution Rates Across Various Solvents via a Novel Predictive Model
Forecasting Molecular Dissolution Rates Across Various Solvents via a Novel Predictive Model

Predictive model reveals dissolution patterns of molecules across various solvents

In the realm of chemical engineering, a groundbreaking computational model developed by MIT researchers is set to transform the way chemists approach solubility predictions. This innovative model, named FastSolv, predicts the solubility of any given molecule in an organic solvent, offering a significant leap forward in the field [1].

The model's accuracy surpasses that of previous models, with predictions two to three times more accurate than those of SolProp, the previous best model. Interestingly, FastSolv uses static embeddings, a model like FastProp, and doesn't learn new representations during training, yet it makes accurate predictions for solubility [2].

The researchers behind FastSolv trained the model on over 40,000 data points from BigSolDB, a comprehensive dataset released in 2023. This dataset compiles data from nearly 800 published papers [3].

Common organic solvents, such as ethanol and acetone, are widely used in chemical reactions, but their more hazardous counterparts, like dichloromethane, chloroform, THF, and toluene, pose significant risks due to their toxicity, flammability, and environmental persistence. These solvents are often used in pharmaceutical synthesis but are increasingly regulated due to health and environmental concerns [2].

The environmental impact of these solvents is substantial. They are toxic to humans and ecosystems, contribute to volatile organic compound (VOC) emissions, and require energy-intensive recovery and disposal processes. Modern efforts in pharmaceutical synthesis prioritise green chemistry principles, aiming to replace hazardous solvents with safer, more benign alternatives [4].

Ethanol, a relatively benign, biodegradable, and less toxic solvent, has been successfully used to replace more hazardous ones in large-scale pharmaceutical processes. For instance, Pfizer’s redesign of sertraline manufacturing replaced methylene chloride, THF, toluene, and hexane with ethanol, resulting in improved efficiency, safety, and reduced environmental footprint [2].

FastSolv could be particularly useful for identifying solvents that are less hazardous than some of the most commonly used industrial solvents. The researchers behind the model are excited about potential uses of FastSolv outside of formulation and drug discovery, with possibilities extending to various fields [5].

The researchers have made FastSolv freely available, and many companies and labs have already started using it. This model could revolutionise the way chemists choose the right solvent for any given reaction, thereby minimising environmental impact [6]. However, the researchers concluded that the main limitation on the models' performance is the quality of the data [7].

FastSolv represents a significant step forward in the application of machine learning to chemical engineering problems. The model grew out of a project that Attia and Burns worked on together in an MIT course on applying machine learning to chemical engineering problems [8].

References: 1. MIT News 2. Chemical & Engineering News 3. Green Solvents for Sustainable Chemistry 4. Green Chemistry: Theory and Practice 5. FastSolv GitHub 6. Chemical & Engineering News 7. FastSolv Research Paper 8. MIT Department of Chemical Engineering

Read also:

Latest