<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Ruochen Zhao</title>
    <description>The latest articles on DEV Community by Ruochen Zhao (@ruochenzhao3).</description>
    <link>https://dev.to/ruochenzhao3</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1024717%2Fc2b61a02-bac1-48d5-8b59-ca581566322f.JPG</url>
      <title>DEV Community: Ruochen Zhao</title>
      <link>https://dev.to/ruochenzhao3</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/ruochenzhao3"/>
    <language>en</language>
    <item>
      <title>Can ChatGPT-like Generative Models Guarantee Factual Accuracy? On the Mistakes of Microsoft's New Bing</title>
      <dc:creator>Ruochen Zhao</dc:creator>
      <pubDate>Mon, 13 Feb 2023 13:27:38 +0000</pubDate>
      <link>https://dev.to/ruochenzhao3/can-chatgpt-like-generative-models-guarantee-factual-accuracy-on-the-mistakes-of-microsofts-new-bing-111b</link>
      <guid>https://dev.to/ruochenzhao3/can-chatgpt-like-generative-models-guarantee-factual-accuracy-on-the-mistakes-of-microsofts-new-bing-111b</guid>
      <description>&lt;p&gt;&lt;strong&gt;Authors&lt;/strong&gt;: &lt;a href="https://chiayewken.com/" rel="noopener noreferrer"&gt;Yew Ken Chia&lt;/a&gt;, &lt;a href="https://sg.linkedin.com/in/esther-ruochen-zhao-855357150" rel="noopener noreferrer"&gt;Ruochen Zhao&lt;/a&gt;, &lt;a href="https://xingxuanli.github.io/" rel="noopener noreferrer"&gt;Xingxuan Li&lt;/a&gt;, &lt;a href="https://sg.linkedin.com/in/ding-bosheng-58b3b262" rel="noopener noreferrer"&gt;Bosheng Ding&lt;/a&gt;, &lt;a href="https://lidongbing.github.io/" rel="noopener noreferrer"&gt;Lidong Bing&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Recently, conversational AI models such as OpenAI's &lt;a href="https://openai.com/blog/chatgpt/" rel="noopener noreferrer"&gt;ChatGPT&lt;/a&gt; [1] have captured public imagination with the ability to generate high-quality written contents, hold human-like conversations, answer factual questions, and more. Armed with such potential, Microsoft and Google have &lt;a href="https://www.theverge.com/2023/2/9/23592647/ai-search-bing-bard-chatgpt-microsoft-google-problems-challenges" rel="noopener noreferrer"&gt;announced new services&lt;/a&gt; [2] that combine them with traditional search engines. The new wave of conversation-powered search engines has the potential to naturally answer complex questions, summarize search results, and even serve as a creative tool. However, in doing so, the tech companies now face a greater ethical challenge to ensure that their models do not mislead users with false, ungrounded, or conflicting answers. Hence, the question naturally arises: Can ChatGPT-like models guarantee factual accuracy? In this article, we uncover several factual mistakes in Microsoft's &lt;a href="https://www.bing.com/new" rel="noopener noreferrer"&gt;new Bing&lt;/a&gt; [9] and Google's &lt;a href="https://blog.google/technology/ai/bard-google-ai-search-updates/" rel="noopener noreferrer"&gt;Bard&lt;/a&gt; [3] which suggest that they currently cannot.&lt;/p&gt;

&lt;p&gt;Unfortunately, false expectations can lead to disastrous results. Around the same time as Microsoft's new Bing announcement, Google hastily announced a new conversational AI service named Bard. Despite the hype, expectations were quickly shattered when Bard made a factual mistake in the &lt;a href="https://twitter.com/sundarpichai/status/1622673369480204288" rel="noopener noreferrer"&gt;promotional video&lt;/a&gt; [14], eventually &lt;a href="https://www.bbc.com/news/business-64576225" rel="noopener noreferrer"&gt;tanking Google's share price&lt;/a&gt; [4] by nearly 8% and wiping $100 billion off its market value. On the other hand, there has been less scrutiny regarding Microsoft's new Bing. In the &lt;a href="https://www.youtube.com/watch?v=rOeRWRJ16yY&amp;amp;t=1709s" rel="noopener noreferrer"&gt;demonstration video&lt;/a&gt; [8], we found that the new Bing recommended a rock singer as a top poet, fabricated birth and death dates, and even made up an entire summary of fiscal reports. Despite &lt;a href="https://www.bing.com/new" rel="noopener noreferrer"&gt;disclaimers&lt;/a&gt; [9] that the new Bing's responses may not always be factual, overly optimistic sentiments may inevitably lead to disillusionment. Hence, our goal is to draw attention to the factual challenges faced by conversation-powered search engines so that we may better address them in the future. &lt;/p&gt;

&lt;h3&gt;
  
  
  What factual mistakes did Microsoft's new Bing demonstrate?
&lt;/h3&gt;

&lt;p&gt;Microsoft released the new Bing search engine powered by AI, claiming that it will revolutionize the scope of traditional search engines. Is this really the case? We dived deeper into the &lt;a href="https://www.youtube.com/watch?v=rOeRWRJ16yY&amp;amp;t=1709s" rel="noopener noreferrer"&gt;demonstration video&lt;/a&gt; [8] and &lt;a href="https://www.bing.com/new" rel="noopener noreferrer"&gt;examples&lt;/a&gt; [9], and found three main types of factual issues:&lt;br&gt;
● Claims that conflict with the reference sources.&lt;br&gt;
● Claims that don't exist in the reference sources.&lt;br&gt;
● Claims that don't have a reference source, and are inconsistent with multiple web sources.&lt;/p&gt;

&lt;h4&gt;
  
  
  Fabricated numbers in financial reports: be careful when you trust the new Bing!
&lt;/h4&gt;

&lt;p&gt;To our surprise, the new Bing fabricated an entire summary of the financial report in the demonstration! &lt;br&gt;
When Microsoft executive Yusuf Mehdi showed the audience how to use the command "key takeaways from the page" to auto-generate a summary of the &lt;a href="https://s24.q4cdn.com/508879282/files/doc_financials/2022/q3/3Q22-EPR-FINAL-with-Tables.pdf" rel="noopener noreferrer"&gt;Gap Inc. 2022 Q3 Fiscal Report &lt;/a&gt;[10a], he received the following results:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Faff6sfq1hvl56enwin7q.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Faff6sfq1hvl56enwin7q.jpg" alt="Trulli" width="800" height="387"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 1. Summary of the Gap Inc. fiscal report by the new Bing in Press Release.



&lt;p&gt;However, upon closer examination, all the key figures in the generated summary are inaccurate. We will show excerpts from the original financial report below as validating references. According to the new Bing, the operating margin after adjustment was 5.9%, while it was actually 3.9% in the source report.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2ersjjkgycre7iy0yyus.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2ersjjkgycre7iy0yyus.jpg" alt="Trulli" width="800" height="234"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 2. Gap Inc. fiscal report excerpt on operating margins.



&lt;p&gt;Similarly, the adjusted diluted earnings per share was generated as $0.42, while it should be $0.71.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkyu3syml2eb2uzq3qype.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkyu3syml2eb2uzq3qype.jpg" alt="Trulli" width="800" height="171"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 3. Gap Inc. fiscal report excerpt on diluted earnings per share.



&lt;p&gt;Regarding net sales, the new Bing's summary claimed "growth in the low double digits", while the original report stated that "net sales could be down mid-single digits".&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0sz3k8gclzd2vhtu49gk.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0sz3k8gclzd2vhtu49gk.jpg" alt="Trulli" width="800" height="643"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 4: Gap Inc. Fiscal Report on 2022 outlook.



&lt;p&gt;In addition to the generated figures which conflicted with actual figures in the source report, we observe that the new Bing may also produce hallucinated facts that do not exist in the source. In the new Bing's generated summary, the "operating margin of about 7% and diluted earnings per share of $1.60 to $1.75" are nowhere to be found in the source report. &lt;/p&gt;

&lt;p&gt;Unfortunately, the situation worsened when the new Bing was instructed to "compare this with Lululemon in a table". The financial comparison table generated by the new Bing contained numerous mistakes:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fy6ej9xuboudco4jznmka.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fy6ej9xuboudco4jznmka.jpg" alt="Trulli" width="800" height="411"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 5: The comparison table generated by the new Bing in press release.



&lt;p&gt;This table, in fact, is half wrong. Out of all the numbers, 3 out of 6 figures are wrong in the column for Gap Inc., and same for Lululemon. As mentioned before, Gap Inc.'s true operating margin is 4.6% (or 3.9% after adjusting) and diluted earnings per share should be $0.77 (or $0.71 after adjusting). The new Bing also claimed that Gap Inc.'s cash and cash equivalents amounted to $1.4 billion, while it was actually $679 million.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fq797ys05e0g0hox724qd.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fq797ys05e0g0hox724qd.jpg" alt="Trulli" width="800" height="116"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 6: Gap Inc. fiscal report excerpt on cash.



&lt;p&gt;According to &lt;a href="https://corporate.lululemon.com/media/press-releases/2022/12-08-2022-210558496#:~:text=For%20the%20third%20quarter%20of%202022%2C%20compared%20to%20the%20third,%2C%20and%20increased%2041%25%20internationally" rel="noopener noreferrer"&gt;Lululemon's 2022 Q3 Fiscal Report&lt;/a&gt; [10b], the gross margin should be 55.9%, while the new Bing claims it's 58.7%. The operating margin should be 19.0%, while the new Bing claims it to be 20.7%. The diluted earnings per share was actually $2.00, while the new Bing claims it to be $1.65.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fql7em6jrhkdjsko9tg1a.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fql7em6jrhkdjsko9tg1a.jpg" alt="Trulli" width="800" height="159"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 7: Lululemon 2022 Q3 fiscal report excerpt.



&lt;p&gt;So where did these figures come from? You may be wondering whether it's a number that was misplaced from another part in the original document. The answer is no. Curiously, these numbers are nowhere to be found in the original document and are entirely fabricated. In fact, it is still an open research challenge to constrain the outputs of generative models to be more factually grounded. Plainly speaking, the popular generative AI models such as ChatGPT are picking words to generate from a fixed vocabulary, instead of strictly copying and pasting facts from the source. Hence, factual correctness is one of the innate challenges of generative AI, and cannot be strictly guaranteed with current models. This is a major concern when it comes to search engines as users rely on the results to be trustworthy and factually accurate.&lt;/p&gt;

&lt;h4&gt;
  
  
  Japanese top poet: secretly a rock singer?
&lt;/h4&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fy689vn3sykoboos0t52c.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fy689vn3sykoboos0t52c.jpg" alt="Trulli" width="800" height="370"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 8: Top Japanese poets summary generated by the new Bing in press release.



&lt;p&gt;We observe that the new Bing produces factual mistakes not just for numbers but also for personal details of specific entities, as shown in the response above when the new Bing was queried about "top Japanese poets". The generated date of birth, death, and occupation factually conflict with the referenced source. According to &lt;a href="https://de.wikipedia.org/wiki/Eriko_Kishida" rel="noopener noreferrer"&gt;Wikipedia&lt;/a&gt; [11a] and &lt;a href="https://www.imdb.com/name/nm1063814/" rel="noopener noreferrer"&gt;IMDB&lt;/a&gt; [11a], Eriko Kishida was born in 1929 and died in 2011. She was not a playwright and essayist, but a children's book author and translator.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiaov57xks1ixd5lirhaj.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiaov57xks1ixd5lirhaj.jpg" alt="Trulli" width="800" height="92"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 9. Wikipedia page on Eriko Kishida (translated page from German).



&lt;p&gt;The new Bing continued blundering when it proclaimed Gackt as a top Japanese poet, when he is in fact a famous rockstar in Japan. According to the &lt;a href="https://en.wikipedia.org/wiki/Gackt" rel="noopener noreferrer"&gt;Wikipedia source&lt;/a&gt; [11b], he is an actor, musician, and singer. There is no information on him publishing poems of any kind in the source. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg3qpwxgc56pzuda2chjg.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg3qpwxgc56pzuda2chjg.jpg" alt="Trulli" width="800" height="177"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 10. Wikipedia page on Gackt.



&lt;h4&gt;
  
  
  Following Bing's nightclub recommendations? You could be facing a closed door.
&lt;/h4&gt;

&lt;p&gt;Furthermore, the new Bing made a list of possible nightclubs to visit in Mexico City when asked "Where is the nightlife?". Alarmingly, almost all the clubs' opening times are wrongly generated:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3lqaih1bnr2e87lp7r3m.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3lqaih1bnr2e87lp7r3m.jpg" alt="Trulli" width="800" height="621"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 11. Nightlife suggestions in Mexico City generated by the new Bing in the press release.



&lt;p&gt;We cross-checked the opening times with multiple sources, which are also appended at the end of the article. While &lt;a href="https://goo.gl/maps/NS76tw3hCemEyCu48" rel="noopener noreferrer"&gt;El Almacen&lt;/a&gt; [12a] actually opens from 7:00 pm to 3:00 am from Tuesday to Sunday, new Bing claims it to be "open from 5:00 pm to 11:00 pm from Tuesday to Sunday". &lt;a href="https://goo.gl/maps/gF17R21McmR8syeKA" rel="noopener noreferrer"&gt;El Marra&lt;/a&gt; [12b] actually opens from 6:00 pm to 2:30 am from Thursday to Saturday, but is claimed to be "open from 6:00 pm to 3:00 am from Thursday to Sunday". &lt;a href="https://goo.gl/maps/mScM1kh2TK7UphQ69" rel="noopener noreferrer"&gt;Guadalajara de Noche&lt;/a&gt; [12c] is open from 5:30 pm to 1:30 am or 12:30 am every day, while new Bing claims it to be "open from 8:00 pm to 3:00 am every day". &lt;/p&gt;

&lt;p&gt;Besides opening times, almost all the descriptions on review stars and numbers mentioned by the new Bing are inaccurate. Matching review scores cannot be found despite searching on Yelp, Tripadvisor, or Google Maps. In addition to the cases mentioned above, we also found other issues in their demonstration video, such as product price mismatches, store address errors, and time-related mistakes. You are welcome to verify them if interested.&lt;/p&gt;

&lt;h4&gt;
  
  
  Potential Concerns in the Limited Bing Demo
&lt;/h4&gt;

&lt;p&gt;Although the new Bing search engine is not fully accessible yet, we can examine a handful of &lt;a href="https://www.bing.com/new" rel="noopener noreferrer"&gt;demonstration examples&lt;/a&gt; [9] provided by Microsoft. Upon closer examination, even these cherry-picked examples show potential issues on factual grounding. &lt;/p&gt;

&lt;p&gt;In the demo titled “what art ideas can I do with my kid?”，the new Bing produced an &lt;a href="https://www.bing.com/search?q=Arts%20and%20crafts%20ideas,%20with%20instructions%20for%20a%20toddler%20using%20only%20cardboard%20boxes,%20plastic%20bottles,%20paper%20and%20string&amp;amp;iscopilotedu=1&amp;amp;form=MA13G7" rel="noopener noreferrer"&gt;insufficient list of crafting materials for each recommendation&lt;/a&gt; [13]. For example, when suggesting making a cardboard box guitar, it listed the supplies: "a tissue box, a cardboard tube, some rubber bands, paint and glue". However, it failed to include construction paper, scissors, washi tape, foam stickers, and wooden beads suggested by the &lt;a href="https://happytoddlerplaytime.com/cardboard-box-guitar-craft-for-kids/" rel="noopener noreferrer"&gt;cited website&lt;/a&gt; [13a].&lt;/p&gt;

&lt;p&gt;Another potential concern is that the new Bing produced content that had no factual basis in the reference sources, for at least 21 times across the 12 demonstration examples. The lack of factual grounding and failure to cite a complete list of sources could lead users to question the trustworthiness of the new Bing. &lt;/p&gt;

&lt;h3&gt;
  
  
  What factual mistakes did Google's Bard demonstrate?
&lt;/h3&gt;

&lt;p&gt;Google also unveiled a conversational AI service called &lt;a href="https://blog.google/technology/ai/bard-google-ai-search-updates/" rel="noopener noreferrer"&gt;Bard&lt;/a&gt; [3]. Instead of typing in traditional search queries, users can have a casual and informative conversation with the web-powered chatbot. For example, a user may initially ask about the best constellations for stargazing, and then follow up by asking about the best time of year to see them. However, a clear disclaimer is that Bard may give "inaccurate or inappropriate information". Let's investigate the factual accuracy of Bard in their &lt;a href="https://twitter.com/sundarpichai/status/1622673775182626818?ref_src=twsrc%5Etfw%7Ctwcamp%5Etweetembed%7Ctwterm%5E1622673775182626818%7Ctwgr%5E0ed320eee2b8b4717de9676c0064f6fa27275a3f%7Ctwcon%5Es1_&amp;amp;ref_url=https%3A%2F%2Fwww.cnet.com%2Fscience%2Fspace%2Fgoogles-chatgpt-rival-bard-called-out-for-nasa-webb-space-telescope-error%2F" rel="noopener noreferrer"&gt;twitter post&lt;/a&gt; [14] and &lt;a href="https://www.youtube.com/watch?v=yLWXJ22LUEc" rel="noopener noreferrer"&gt;video demonstration&lt;/a&gt; [15].&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fu7i5wyr0scbzqov64vmj.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fu7i5wyr0scbzqov64vmj.jpg" alt="Trulli" width="800" height="342"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 12. Summary on Telescope discoveries generated by Bard in demo.



&lt;p&gt;Google CEO Sundar Pichai recently posted a &lt;a href="https://twitter.com/sundarpichai/status/1622673775182626818?s=20&amp;amp;t=XgLhJ0c_yAwXknILlrOzkQ" rel="noopener noreferrer"&gt;short video&lt;/a&gt; [14] to demonstrate the capabilities of Bard. However, the answer contained an error regarding which telescope captured the first exoplanet images, which was &lt;a href="https://twitter.com/astrogrant/status/1623091683603918849" rel="noopener noreferrer"&gt;quickly pointed out by astrophysicists&lt;/a&gt; [16a]. As confirmed by &lt;a href="https://exoplanets.nasa.gov/resources/300/2m1207-b-first-image-of-an-exoplanet/" rel="noopener noreferrer"&gt;NASA&lt;/a&gt; [16b], the first images of an exoplanet were captured by the Very Large Telescope (VLT) instead of the James Webb Space Telescope (JWST).  Unfortunately, Bard turned out to be a costly experiment as &lt;a href="https://www.bbc.com/news/business-64576225" rel="noopener noreferrer"&gt;Google's stock price sharply declined&lt;/a&gt;[4] after news of the factual mistake was reported.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fb3coykj4uk9231pjmgvu.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fb3coykj4uk9231pjmgvu.jpg" alt="Trulli" width="800" height="620"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 13. Answer to the visibility of the constellations generated by Bard in demo.



&lt;p&gt;Regarding Bard's video demonstration, the image above shows how Google's &lt;a href="https://www.youtube.com/live/yLWXJ22LUEc?feature=share&amp;amp;t=1009" rel="noopener noreferrer"&gt;Bard answers the question of when the constellations are visible&lt;/a&gt; [16]. However, the timing of Orion is inconsistent with multiple sources. According to the &lt;a href="https://www.google.com/search?client=safari&amp;amp;rls=en&amp;amp;q=when+is+orion+visible&amp;amp;ie=UTF-8&amp;amp;oe=UTF-8" rel="noopener noreferrer"&gt;top Google search result&lt;/a&gt; [17a], the constellation is most visible from January to March. According to &lt;a href="https://en.wikipedia.org/wiki/Orion_(constellation)" rel="noopener noreferrer"&gt;Wikipedia&lt;/a&gt; [17b], it is most visible from January to April. Furthermore, the answer is incomplete as the visibility of the constellation also depends on whether the user is in the Northern or Southern hemisphere.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuzg4bs8snx4rn9mwmly0.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuzg4bs8snx4rn9mwmly0.jpg" alt="Trulli" width="800" height="295"&gt;&lt;/a&gt;&lt;/p&gt;
Figure 14. Google search result on visibility of the constellations.



&lt;h3&gt;
  
  
  How do Bing and Bard compare?
&lt;/h3&gt;

&lt;p&gt;The new Bing and Bard services may not be equally trustworthy in practice. This is due to factors such as the quality of search results, the quality of conversational models, and the transparency of the provided answers. Currently, both services rely on relevant information sources to guide the responses of their conversational AI models. Hence, the factual accuracy of the answers depends on the quality of the &lt;a href="https://nlp.stanford.edu/IR-book/pdf/irbookonlinereading.pdf" rel="noopener noreferrer"&gt;information retrieval systems&lt;/a&gt; [18], and how well the conversational model can generate answers that are factually grounded to the information sources. As the full details of the services are not released to the public, it's unclear which one can achieve higher factual accuracy without deeper testing. On the other hand, we feel that transparency is just as important for trustworthiness. For instance, we observe that the new Bing is more transparent regarding the source of its answers, as it provides the reference links in most cases. This enables users to independently conduct fact-checking, and we hope that future conversational services also provide this feature. &lt;/p&gt;

&lt;h3&gt;
  
  
  How can the factual limitations be addressed?
&lt;/h3&gt;

&lt;p&gt;Through the numerous factual mistakes shown above, it is clear that conversational AI models such as ChatGPT may produce conflicting or non-existent facts even when presented with reliable sources. As mentioned previously, it is a pressing research challenge to ensure the factual grounding of ChatGPT-like models. Due to their generative nature, it is difficult to &lt;a href="http://proceedings.mlr.press/v70/hu17e/hu17e.pdf" rel="noopener noreferrer"&gt;control their outputs&lt;/a&gt; [19], and even harder to guarantee that the generated output is factually consistent with the information sources. A short-term solution could be to impose restrictions to prevent the conversational AI from producing unsafe or unfactual outputs. However, malicious parties can eventually &lt;a href="https://arstechnica.com/information-technology/2023/02/now-open-fee-based-telegram-service-that-uses-chatgpt-to-generate-malware/" rel="noopener noreferrer"&gt;bypass the safety restrictions&lt;/a&gt; [7], while &lt;a href="https://aclanthology.org/N18-1074.pdf" rel="noopener noreferrer"&gt;fact verification&lt;/a&gt; [20] is another unsolved research challenge. In the long-term, we may have to accept that human and machine writers alike will likely remain imperfect. To progress towards more trustworthy AI, the conversational AI models like ChatGPT cannot remain as &lt;a href="https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&amp;amp;arnumber=8466590" rel="noopener noreferrer"&gt;inscrutable black boxes&lt;/a&gt; [21]. They should be fully transparent about their data sources and potential biases, report when they have low confidence in their answers, and explain their reasoning processes.&lt;/p&gt;

&lt;h3&gt;
  
  
  What does the future hold for ChatGPT-like models?
&lt;/h3&gt;

&lt;p&gt;After a systematic overview, we have found significant factual limitations demonstrated by the new wave of search engines powered by conversational AI like ChatGPT. Despite disclaimers of potential factual inaccuracy and warnings to use our judgment before making decisions, we encountered many factual mistakes even in the cherry-picked demonstrations. Thus, we cannot help but wonder: What is the purpose of search engines, if not to provide reliable and factual answers? In a new era of the web filled with AI-generated fabrications, how will we ensure truthfulness? Despite the massive resources of tech giants like Microsoft and Google, the current ChatGPT-like models cannot ensure factual accuracy. Even so, we are still optimistic about the potential of conversational models and the development of more trustworthy AI. Models like ChatGPT have shown great potential and will undoubtedly improve many industries and aspects of our daily lives. However, if they continue to generate fabricated content and unfactual answers, the public may become even more wary of artificial intelligence. Therefore, rather than criticizing specific models or companies, we hope to call on researchers and developers to focus on improving the transparency and factual correctness of AI services, allowing humans to place a higher level of trust in the new technology in the foreseeable future.&lt;/p&gt;

&lt;h3&gt;
  
  
  Sources
&lt;/h3&gt;

&lt;h5&gt;
  
  
  Reference Articles
&lt;/h5&gt;

&lt;ol&gt;
&lt;li&gt;ChatGPT: Optimizing Language Models for Dialogue:&lt;a href="https://openai.com/blog/chatgpt/" rel="noopener noreferrer"&gt;https://openai.com/blog/chatgpt/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;7 problems facing Bing, Bard, and the future of AI search: &lt;a href="https://www.theverge.com/2023/2/9/23592647/ai-search-bing-bard-chatgpt-microsoft-google-problems-challenges" rel="noopener noreferrer"&gt;https://www.theverge.com/2023/2/9/23592647/ai-search-bing-bard-chatgpt-microsoft-google-problems-challenges&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Google: An important next step on our AI journey: &lt;a href="https://blog.google/technology/ai/bard-google-ai-search-updates/" rel="noopener noreferrer"&gt;https://blog.google/technology/ai/bard-google-ai-search-updates/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Google's Bard AI bot mistake wipes $100bn off shares: &lt;a href="https://www.bbc.com/news/business-64576225" rel="noopener noreferrer"&gt;https://www.bbc.com/news/business-64576225&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Reinventing search with a new AI-powered Microsoft Bing and Edge, your copilot for the web: &lt;a href="https://blogs.microsoft.com/blog/2023/02/07/reinventing-search-with-a-new-ai-powered-microsoft-bing-and-edge-your-copilot-for-the-web/" rel="noopener noreferrer"&gt;https://blogs.microsoft.com/blog/2023/02/07/reinventing-search-with-a-new-ai-powered-microsoft-bing-and-edge-your-copilot-for-the-web/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Google shares lose $100 billion after company’s AI chatbot makes an error during demo: &lt;a href="https://www.cnn.com/2023/02/08/tech/google-ai-bard-demo-error" rel="noopener noreferrer"&gt;https://www.cnn.com/2023/02/08/tech/google-ai-bard-demo-error&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Hackers are selling a service that bypasses ChatGPT restrictions on malware: &lt;a href="https://arstechnica.com/information-technology/2023/02/now-open-fee-based-telegram-service-that-uses-chatgpt-to-generate-malware/" rel="noopener noreferrer"&gt;https://arstechnica.com/information-technology/2023/02/now-open-fee-based-telegram-service-that-uses-chatgpt-to-generate-malware/&lt;/a&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;h5&gt;
  
  
  Appendix: How we validated the demos
&lt;/h5&gt;

&lt;h6&gt;
  
  
  The New Bing demo source:
&lt;/h6&gt;

&lt;ol start="8"&gt;
  &lt;li&gt;Microsoft's press release video(&lt;a href="https://www.youtube.com/watch?v=rOeRWRJ16yY" rel="noopener noreferrer"&gt;https://www.youtube.com/watch?v=rOeRWRJ16yY&lt;/a&gt;)&lt;/li&gt;
  &lt;li&gt;Microsoft's demo page: (&lt;a href="https://www.bing.com/new" rel="noopener noreferrer"&gt;https://www.bing.com/new&lt;/a&gt;) &lt;/li&gt;
&lt;/ol&gt;

&lt;h6&gt;
  
  
  Verification:
&lt;/h6&gt;

&lt;ol start="10"&gt;
  &lt;li&gt;The new Bing and Fiscal Report: &lt;ol&gt;
  &lt;li&gt;Gap Inc. Fiscal report  shown in the video: &lt;a href="https://s24.q4cdn.com/508879282/files/doc_financials/2022/q3/3Q22-EPR-FINAL-with-Tables.pdf" rel="noopener noreferrer"&gt;https://s24.q4cdn.com/508879282/files/doc_financials/2022/q3/3Q22-EPR-FINAL-with-Tables.pdf&lt;/a&gt;
&lt;/li&gt;
  &lt;li&gt;Lululemon Fiscal report  found on their official website: &lt;a href="https://corporate.lululemon.com/media/press-releases/2022/12-08-2022-210558496#:~:text=For%20the%20third%20quarter%20of%202022%2C%20compared%20to%20the%20third,%2C%20and%20increased%2041%25%20internationally" rel="noopener noreferrer"&gt;https://corporate.lululemon.com/media/press-releases/2022/12-08-2022-210558496#:~:text=For%20the%20third%20quarter%20of%202022%2C%20compared%20to%20the%20third,%2C%20and%20increased%2041%25%20internationally&lt;/a&gt;
&lt;/li&gt;
&lt;/ol&gt;


&lt;/li&gt;

  &lt;li&gt;The new Bing and Japanese Poets:&lt;ol&gt;

  &lt;li&gt;Eriko Kishida: Wikipedia (&lt;a href="https://twitter.com/sundarpichai/status/1622673369480204288),%20IMDB%20(https://www.imdb.com/name/nm1063814/" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://twitter.com/sundarpichai/status/1622673369480204288" rel="noopener noreferrer"&gt;https://twitter.com/sundarpichai/status/1622673369480204288&lt;/a&gt;), IMDB (&lt;a href="https://www.imdb.com/name/nm1063814/" rel="noopener noreferrer"&gt;https://www.imdb.com/name/nm1063814/&lt;/a&gt;)&lt;/li&gt;

  &lt;li&gt;Gacket: Wikipedia (&lt;a href="https://en.wikipedia.org/wiki/Gackt" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://en.wikipedia.org/wiki/Gackt" rel="noopener noreferrer"&gt;https://en.wikipedia.org/wiki/Gackt&lt;/a&gt;)&lt;/li&gt;

&lt;/ol&gt;

&lt;/li&gt;

&lt;li&gt;The new Bing and Nightclubs in Mexico:&lt;ol&gt;

  &lt;li&gt;El Almacen: Google Maps (&lt;a href="https://goo.gl/maps/3BL27XgWpDVzLLnaA" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://goo.gl/maps/3BL27XgWpDVzLLnaA" rel="noopener noreferrer"&gt;https://goo.gl/maps/3BL27XgWpDVzLLnaA&lt;/a&gt;), Restaurant Guru(&lt;a href="https://restaurantguru.com/El-Almacen-Mexico-City" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://restaurantguru.com/El-Almacen-Mexico-City" rel="noopener noreferrer"&gt;https://restaurantguru.com/El-Almacen-Mexico-City&lt;/a&gt;)&lt;/li&gt;

  &lt;li&gt;El Marra: Google Maps (&lt;a href="https://goo.gl/maps/HZFe8xY7uTk1SB6s5" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://goo.gl/maps/HZFe8xY7uTk1SB6s5" rel="noopener noreferrer"&gt;https://goo.gl/maps/HZFe8xY7uTk1SB6s5&lt;/a&gt;), Restaurant Guru(&lt;a href="https://restaurantguru.com/El-Marra-Mexico-City" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://restaurantguru.com/El-Marra-Mexico-City" rel="noopener noreferrer"&gt;https://restaurantguru.com/El-Marra-Mexico-City&lt;/a&gt;)&lt;/li&gt;

&lt;li&gt;Guadalajara de Noche: Tripadvisor (&lt;a href="https://www.tripadvisor.es/Attraction_Review-g150800-d3981435-Reviews-Guadalajara_de_Noche-Mexico_City_Central_Mexico_and_Gulf_Coast.html" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://www.tripadvisor.es/Attraction_Review-g150800-d3981435-Reviews-Guadalajara_de_Noche-Mexico_City_Central_Mexico_and_Gulf_Coast.html" rel="noopener noreferrer"&gt;https://www.tripadvisor.es/Attraction_Review-g150800-d3981435-Reviews-Guadalajara_de_Noche-Mexico_City_Central_Mexico_and_Gulf_Coast.html&lt;/a&gt;), Google Maps(&lt;a href="https://goo.gl/maps/UeHCm1EeJZFP7wZYA" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://goo.gl/maps/UeHCm1EeJZFP7wZYA" rel="noopener noreferrer"&gt;https://goo.gl/maps/UeHCm1EeJZFP7wZYA&lt;/a&gt;)&lt;/li&gt;

&lt;/ol&gt;

&lt;/li&gt;

&lt;li&gt;The new Bing and craft ideas (&lt;a href="https://www.bing.com/search?q=Arts%20and%20crafts%20ideas,%20with%20instructions%20for%20a%20toddler%20using%20only%20cardboard%20boxes,%20plastic%20bottles,%20paper%20and%20string&amp;amp;iscopilotedu=1&amp;amp;form=MA13G7" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://www.bing.com/search?q=Arts%20and%20crafts%20ideas,%20with%20instructions%20for%20a%20toddler%20using%20only%20cardboard%20boxes,%20plastic%20bottles,%20paper%20and%20string&amp;amp;iscopilotedu=1&amp;amp;form=MA13G7" rel="noopener noreferrer"&gt;https://www.bing.com/search?q=Arts%20and%20crafts%20ideas,%20with%20instructions%20for%20a%20toddler%20using%20only%20cardboard%20boxes,%20plastic%20bottles,%20paper%20and%20string&amp;amp;amp;iscopilotedu=1&amp;amp;amp;form=MA13G7&lt;/a&gt;):&lt;ol&gt;

  &lt;li&gt;cited website: Happy Toddler Playtime (&lt;a href="https://happytoddlerplaytime.com/cardboard-box-guitar-craft-for-kids/" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://happytoddlerplaytime.com/cardboard-box-guitar-craft-for-kids/" rel="noopener noreferrer"&gt;https://happytoddlerplaytime.com/cardboard-box-guitar-craft-for-kids/&lt;/a&gt;)&lt;/li&gt;

&lt;/ol&gt;

&lt;/li&gt;

&lt;/ol&gt;

&lt;h6&gt;
  
  
  Bard demo source:
&lt;/h6&gt;

&lt;ol start="14"&gt;
  &lt;li&gt;Promotional blog (&lt;a href="https://twitter.com/sundarpichai/status/1622673369480204288" rel="noopener noreferrer"&gt;https://twitter.com/sundarpichai/status/1622673369480204288&lt;/a&gt;) and video (&lt;a href="https://twitter.com/sundarpichai/status/1622673775182626818" rel="noopener noreferrer"&gt;https://twitter.com/sundarpichai/status/1622673775182626818&lt;/a&gt;)&lt;/li&gt;
  &lt;li&gt;Video demonstration (&lt;a href="https://www.youtube.com/watch?v=yLWXJ22LUEc" rel="noopener noreferrer"&gt;https://www.youtube.com/watch?v=yLWXJ22LUEc&lt;/a&gt;)&lt;/li&gt;
&lt;/ol&gt;

&lt;h6&gt;
  
  
  Verification:
&lt;/h6&gt;

&lt;ol start="16"&gt;
&lt;li&gt;which telescope captured the first exoplanet images&lt;ol&gt;
  &lt;li&gt;Twitter by Grant Tremblay (American astrophysicist): &lt;a href="https://twitter.com/astrogrant/status/1623091683603918849" rel="noopener noreferrer"&gt;https://twitter.com/astrogrant/status/1623091683603918849&lt;/a&gt;
&lt;/li&gt;
  &lt;li&gt;NASA: 2M1207 b - First image of an exoplanet: &lt;a href="https://exoplanets.nasa.gov/resources/300/2m1207-b-first-image-of-an-exoplanet/" rel="noopener noreferrer"&gt;https://exoplanets.nasa.gov/resources/300/2m1207-b-first-image-of-an-exoplanet/&lt;/a&gt;
&lt;/li&gt;
&lt;/ol&gt;


&lt;/li&gt;

  &lt;li&gt;When the constellations are visible&lt;ol&gt;

  &lt;li&gt;Google (&lt;a href="https://www.google.com/search?client=safari&amp;amp;rls=en&amp;amp;q=when+is+orion+visible&amp;amp;ie=UTF-8&amp;amp;oe=UTF-8" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://www.google.com/search?client=safari&amp;amp;rls=en&amp;amp;q=when+is+orion+visible&amp;amp;ie=UTF-8&amp;amp;oe=UTF-8" rel="noopener noreferrer"&gt;https://www.google.com/search?client=safari&amp;amp;amp;rls=en&amp;amp;amp;q=when+is+orion+visible&amp;amp;amp;ie=UTF-8&amp;amp;amp;oe=UTF-8&lt;/a&gt;) top result: Byju's (&lt;a href="https://byjus.com/question-answer/in-which-season-of-the-year-is-the-constellation-orion-visible-in-the-sky/" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://byjus.com/question-answer/in-which-season-of-the-year-is-the-constellation-orion-visible-in-the-sky/" rel="noopener noreferrer"&gt;https://byjus.com/question-answer/in-which-season-of-the-year-is-the-constellation-orion-visible-in-the-sky/&lt;/a&gt;)&lt;/li&gt;

  &lt;li&gt;Wikipedia page "Orion (constellation)": &lt;a href="https://en.wikipedia.org/wiki/Orion_(constellation)" rel="noopener noreferrer"&gt;&lt;/a&gt;&lt;a href="https://en.wikipedia.org/wiki/Orion_(constellation)" rel="noopener noreferrer"&gt;https://en.wikipedia.org/wiki/Orion_(constellation)&lt;/a&gt;
&lt;/li&gt;

&lt;/ol&gt;

&lt;/li&gt;

&lt;/ol&gt;

&lt;h5&gt;
  
  
  Academic References
&lt;/h5&gt;

&lt;ol start="18"&gt;
  &lt;li&gt;An Introduction to Information Retrieval: &lt;a href="https://nlp.stanford.edu/IR-book/pdf/irbookonlinereading.pdf" rel="noopener noreferrer"&gt;https://nlp.stanford.edu/IR-book/pdf/irbookonlinereading.pdf&lt;/a&gt;
&lt;/li&gt;
  &lt;li&gt;Toward Controlled Generation of Text: &lt;a href="http://proceedings.mlr.press/v70/hu17e/hu17e.pdf" rel="noopener noreferrer"&gt;http://proceedings.mlr.press/v70/hu17e/hu17e.pdf&lt;/a&gt;
&lt;/li&gt;
  &lt;li&gt;FEVER: a large-scale dataset for Fact Extraction and VERification: &lt;a href="https://aclanthology.org/N18-1074.pdf" rel="noopener noreferrer"&gt;https://aclanthology.org/N18-1074.pdf&lt;/a&gt;
&lt;/li&gt;
  &lt;li&gt;Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI): &lt;a href="https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&amp;amp;arnumber=8466590" rel="noopener noreferrer"&gt;https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&amp;amp;arnumber=8466590&lt;/a&gt;
&lt;/li&gt;
&lt;/ol&gt;

</description>
      <category>systemdesign</category>
      <category>backenddevelopment</category>
      <category>discuss</category>
    </item>
  </channel>
</rss>
