探花直播 of Cambridge - Nicolas Boull茅 /taxonomy/people/nicolas-boulle en Machine learning models can produce reliable results even with limited training data /research/news/machine-learning-models-can-produce-reliable-results-even-with-limited-training-data <div class="field field-name-field-news-image field-type-image field-label-hidden"><div class="field-items"><div class="field-item even"><img class="cam-scale-with-grid" src="/sites/default/files/styles/content-580x288/public/news/research/news/gettyimages-1421511938-dp.jpg?itok=q03E5_XB" alt="Digital generated image of multi coloured glowing data over landscape." title="Digital generated image of multi coloured glowing data over landscape., Credit: Andriy Onufriyenko via Getty Images" /></div></div></div><div class="field field-name-body field-type-text-with-summary field-label-hidden"><div class="field-items"><div class="field-item even"><p> 探花直播researchers, from the 探花直播 of Cambridge and Cornell 探花直播, found that for partial differential equations 鈥 a class of physics equations that describe how things in the natural world evolve in space and time 鈥 machine learning models can produce reliable results even when they are provided with limited data.</p>&#13; &#13; <p>Their <a href="https://www.pnas.org/doi/10.1073/pnas.2303904120">results</a>, reported in the <em>Proceedings of the National Academy of Sciences</em>, could be useful for constructing more time- and cost-efficient machine learning models for applications such as engineering and climate modelling.</p>&#13; &#13; <p>Most machine learning models require large amounts of training data before they can begin returning accurate results. Traditionally, a human will annotate a large volume of data 鈥 such as a set of images, for example 鈥 to train the model.</p>&#13; &#13; <p>鈥淯sing humans to train machine learning models is effective, but it鈥檚 also time-consuming and expensive,鈥 said first author Dr Nicolas Boull茅, from the Isaac Newton Institute for Mathematical Sciences. 鈥淲e鈥檙e interested to know exactly how little data we actually need to train these models and still get reliable results.鈥</p>&#13; &#13; <p>Other researchers have been able to train machine learning models with a small amount of data and get excellent results, but how this was achieved has not been well-explained. For their study, Boull茅 and his co-authors, Diana Halikias and Alex Townsend from Cornell 探花直播, focused on partial differential equations (PDEs).</p>&#13; &#13; <p>鈥淧DEs are like the building blocks of physics: they can help explain the physical laws of nature, such as how the steady state is held in a melting block of ice,鈥 said Boull茅, who is an INI-Simons Foundation Postdoctoral Fellow. 鈥淪ince they are relatively simple models, we might be able to use them to make some generalisations about why these AI techniques have been so successful in physics.鈥</p>&#13; &#13; <p> 探花直播researchers found that PDEs that model diffusion have a structure that is useful for designing AI models. 鈥淯sing a simple model, you might be able to enforce some of the physics that you already know into the training data set to get better accuracy and performance,鈥 said Boull茅.</p>&#13; &#13; <p> 探花直播researchers constructed an efficient algorithm for predicting the solutions of PDEs under different conditions by exploiting the short and long-range interactions happening. This allowed them to build some mathematical guarantees into the model and determine exactly how much training data was required to end up with a robust model.</p>&#13; &#13; <p>鈥淚t depends on the field, but for physics, we found that you can actually do a lot with a very limited amount of data,鈥 said Boull茅. 鈥淚t鈥檚 surprising how little data you need to end up with a reliable model. Thanks to the mathematics of these equations, we can exploit their structure to make the models more efficient.鈥</p>&#13; &#13; <p> 探花直播researchers say that their techniques will allow data scientists to open the 鈥榖lack box鈥 of many machine learning models and design new ones that can be interpreted by humans, although future research is still needed.</p>&#13; &#13; <p>鈥淲e need to make sure that models are learning the right things, but machine learning for physics is an exciting field 鈥 there are lots of interesting maths and physics questions that AI can help us answer,鈥 said Boull茅.</p>&#13; &#13; <p>聽</p>&#13; &#13; <h2>Reference</h2>&#13; &#13; <p><em>Nicolas Boull茅, Diana Halikias, and Alex Townsend. 鈥<a href="https://www.pnas.org/doi/10.1073/pnas.2303904120">Elliptic PDE learning is provably data-efficient</a>.鈥 PNAS (2023). DOI: 10.1073/pnas.2303904120</em></p>&#13; </div></div></div><div class="field field-name-field-content-summary field-type-text-with-summary field-label-hidden"><div class="field-items"><div class="field-item even"><p><p>Researchers have determined how to build reliable machine learning models that can understand complex equations in real-world situations while using far less training data than is normally expected.</p>&#13; </p></div></div></div><div class="field field-name-field-content-quote field-type-text-long field-label-hidden"><div class="field-items"><div class="field-item even">It鈥檚 surprising how little data you need to end up with a reliable model</div></div></div><div class="field field-name-field-content-quote-name field-type-text field-label-hidden"><div class="field-items"><div class="field-item even">Nicolas Boull茅</div></div></div><div class="field field-name-field-image-credit field-type-link-field field-label-hidden"><div class="field-items"><div class="field-item even"><a href="/" target="_blank">Andriy Onufriyenko via Getty Images</a></div></div></div><div class="field field-name-field-image-desctiprion field-type-text field-label-hidden"><div class="field-items"><div class="field-item even">Digital generated image of multi coloured glowing data over landscape.</div></div></div><div class="field field-name-field-cc-attribute-text field-type-text-long field-label-hidden"><div class="field-items"><div class="field-item even"><p><a href="https://creativecommons.org/licenses/by-nc-sa/4.0/" rel="license"><img alt="Creative Commons License." src="/sites/www.cam.ac.uk/files/inner-images/cc-by-nc-sa-4-license.png" style="border-width: 0px; width: 88px; height: 31px;" /></a><br />&#13; 探花直播text in this work is licensed under a <a href="https://creativecommons.org/licenses/by-nc-sa/4.0/">Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License</a>. Images, including our videos, are Copyright 漏 探花直播 of Cambridge and licensors/contributors as identified.聽 All rights reserved. We make our image and video content available in a number of ways 鈥 as here, on our <a href="/">main website</a> under its <a href="/about-this-site/terms-and-conditions">Terms and conditions</a>, and on a <a href="/about-this-site/connect-with-us">range of channels including social media</a> that permit your use and sharing of our content under their respective Terms.</p>&#13; </div></div></div><div class="field field-name-field-show-cc-text field-type-list-boolean field-label-hidden"><div class="field-items"><div class="field-item even">Yes</div></div></div> Tue, 19 Sep 2023 10:03:16 +0000 sc604 241771 at