
We’ll Run Out of Words Before We Run Out of Oil – Darlinez News.

<div>
<div class="entry-content column content primary is-two-thirds">
<div class="" style="padding-bottom: 10px;">
<div class="">
<p><span class="tag is-dark is-uppercase">Technology</span></p>
<div class="byline-container">
<div class="post-date is-italic has-text-grey is-size-7 has-text-weight-medium ">
<p>December 22, 2022 10:18 am</p>
</div>
</div>
</div>
</div>
</div>
<p>An artificial intelligence (AI) program called ChatGPT became the latest overnight sensation a couple of weeks ago. The app generates responses to users’ questions that are alarmingly accurate in detail, and extremely accurate grammatically and syntactically. Once the developers at OpenAI launched the app publicly, their servers began to struggle with demand.</p>
<p>Among the questions ChatGPT raises is this: now that we have a really good natural language processor, can it be made perfect? Most of us are likely to say it is not possible, that human intelligence and ingenuity will always prevail. But what if the stock of words we use to talk about the world is finite? Large, surely, but countable.</p>
<p>A team of researchers (Pablo Villalobos, Jaime Sevilla, Lennart Heim, Tamay Besiroglu, Marius Hobbhahn, and Anson Ho) has concluded that machine learning (ML) datasets will run out of “high-quality language data” by 2026. Low-quality language data is likely to last until sometime between 2030 and 2050, and there is enough low-quality image data to keep ML programs busy for 10 years longer.</p>
<p>High-quality language data includes books, news articles, scientific papers, Wikipedia and filtered web content. The common element is that the data in these sources has passed through a filter for usefulness or quality. There are two general sources of high-quality data: dedicated contributors to web content and subject matter experts. The former add to the stock of high-quality data in response to demand for digital content; the latter’s output depends on the strength of the economy and on government investment in research and development. In all, about 7 trillion high-quality words are present in all these datasets.</p>
<p>The researchers estimate the stock of low-quality language data using five general models: recorded speech, internet users, popular platforms, CommonCrawl and indexed websites. CommonCrawl is a nonprofit repository of web crawl data that is open to anyone. The researchers estimate that there are 741 trillion low-quality words available.</p>
<p>Language datasets as of October 2022 contain about 2 trillion words and have been growing at a rate of about 50% annually. The “stock” of language grows by about 7% annually and is estimated to hold 70 trillion to 70 quadrillion words. That is 1.5 to 4.5 orders of magnitude larger than the largest datasets currently in use. The growth trends in available language data indicate that models will exhaust language data sometime between 2030 and 2050.</p>
<p>However, estimates of the stock of high-quality data used to train language models range between 4.6 trillion and 17 trillion words. Using these estimates, the researchers note, “We are within one order of magnitude of exhausting high-quality data, and this will likely happen between 2023 and 2027.”</p>
<p>In 2019, the Millennium Alliance for Humanity and the Biosphere at Stanford estimated that the world’s oil reserves would run out by 2052, while natural gas reserves would last until 2060 and coal reserves until 2090.</p>
</div>
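The exhaustion window quoted above for the full stock of language data can be roughly sanity-checked with simple compound-growth arithmetic. The sketch below is not the researchers' method; it assumes constant growth rates (50% a year for datasets, 7% a year for the stock), a 2022 starting point of 2 trillion words, and the 70 trillion to 70 quadrillion range from the article. The function and constant names are illustrative only.

```python
import math

# Figures taken from the article; treating them as constant rates is an assumption.
DATASET_WORDS_2022 = 2e12   # about 2 trillion words in the largest datasets
STOCK_WORDS_LOW = 70e12     # low estimate of the total stock: 70 trillion words
STOCK_WORDS_HIGH = 70e15    # high estimate of the total stock: 70 quadrillion words
DATASET_GROWTH = 0.50       # datasets grow about 50% per year
STOCK_GROWTH = 0.07         # the stock grows about 7% per year
BASE_YEAR = 2022


def orders_of_magnitude(stock_words, dataset_words):
    """Base-10 logarithm of how many times larger the stock is than the dataset."""
    return math.log10(stock_words / dataset_words)


def years_until_exhaustion(dataset_words, stock_words,
                           dataset_growth=DATASET_GROWTH,
                           stock_growth=STOCK_GROWTH):
    """Solve dataset * (1 + g_d)**t = stock * (1 + g_s)**t for t, in years."""
    if dataset_words >= stock_words:
        return 0.0
    relative_growth = (1 + dataset_growth) / (1 + stock_growth)
    if relative_growth <= 1:
        raise ValueError("the dataset never catches up with the stock")
    return math.log(stock_words / dataset_words) / math.log(relative_growth)


for label, stock in (("low", STOCK_WORDS_LOW), ("high", STOCK_WORDS_HIGH)):
    gap = orders_of_magnitude(stock, DATASET_WORDS_2022)
    years = years_until_exhaustion(DATASET_WORDS_2022, stock)
    print(f"{label} stock estimate: {gap:.1f} orders of magnitude, "
          f"exhausted around {BASE_YEAR + years:.0f}")
```

Under these simplifying assumptions the gap works out to about 1.5 and 4.5 orders of magnitude, and the dataset catches up with the stock after roughly 10.5 and 31 years, that is, from the early 2030s to the early 2050s. That is broadly in line with the 2030 to 2050 window reported in the article.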
