Written by mpesceMay 14, 2024May 13, 2024

Is AI too dangerous to release openly?

The question of who should control AI development and who should have access to AI is of vital importance to society. A joint Princeton/Stanford seminar addressed this question.

Written by mpesceMay 13, 2024May 12, 2024

Is AI lying to me? Scientists warn of growing capacity for deception

One system even altered its behaviour during mock safety tests, raising the prospect of auditors being lured into a false sense of security.

Written by mpesceMay 9, 2024May 8, 2024

Google’s medical AI destroys GPT’s benchmark and outperforms doctors

Med-Gemini was tested on 14 medical benchmarks and established a new state-of-the-art (SoTA) performance on 10, surpassing the GPT-4 model family on every benchmark where a comparison could be made.

Written by mpesceMay 7, 2024May 6, 2024

Military pumps brakes on generative AI

As businesses race to put generative AI in front of customers everywhere, military experts say its strengths and limitations need further testing and evaluation in order to deploy it responsibly.

Written by mpesceMay 1, 2024April 30, 2024

Patriarchal AI: How ChatGPT can harm a woman’s career

The only change in my question was ‘John’ to ‘Jane’. No other details were specified.

Yet the output given by ChatGPT couldn’t have been more different.

Written by mpesceMay 1, 2024April 30, 2024

ChatGPT provides false information about people, and OpenAI can’t correct it

noyb is now asking the Austrian data protection authority (DSB) to investigate OpenAI’s data processing and the measures taken to ensure the accuracy of personal data processed in the context of the company’s large language models.

Written by mpesceApril 29, 2024April 28, 2024

ChatGPT-4 not reliable in cancer patient messaging

ChatGPT-4 generated acceptable messages to patients without any additional editing by radiation oncologists 58% of the time, and 7% of responses generated by GPT-4 were deemed unsafe by the radiation oncologists if left unedited.

Written by mpesceApril 22, 2024April 21, 2024

How Microsoft discovers and mitigates evolving attacks against AI guardrails

Bad actors attempt to bypass safeguards with the intent to achieve unauthorized actions, which may result in what is known as a “jailbreak.” The consequences can range from the unapproved but less harmful to the very serious.

Written by mpesceApril 22, 2024April 21, 2024

Elon Musk’s Grok keeps making up fake news based on X users’ jokes

Instead of verifying Grok’s outputs, it appeared that X users—in the service’s famously joke-y spirit—decided to fuel Grok’s misinformation.

Written by mpesceApril 19, 2024April 17, 2024

‘De-Risking AI’ white paper

Wisely AI has identified five risks associated with the use of Generative AI in organisations. In this white paper, we provide guidance on how to mitigate these risks.

Written by mpesceApril 17, 2024April 16, 2024

‘Get the job done’: One in two lawyers use AI

One in two lawyers in Australia and New Zealand have already used generative artificial intelligence to perform day-to-day tasks and almost the entire profession believe it will change how legal work is carried out in future.

Written by mpesceApril 16, 2024April 15, 2024

ChatGPT hallucinates fake but plausible scientific citations at a staggering rate, study finds

The study, published in the Canadian Psychological Association’s Mind Pad, found that “false citation rates” across various psychology subfields ranged from 6% to 60%. Surprisingly, these fabricated citations feature elements such as legitimate researchers’ names and properly formatted digital object identifiers.

Written by mpesceApril 15, 2024April 14, 2024

Generative AI can turn your most precious memories into photos that never existed

A fake photo—or memory-based reconstruction, as the Barcelona-based design studio Domestic Data Streamers puts it—of the scene that a real photo might have captured. The fake snapshots are blurred and distorted, but they can still rewind a lifetime in an instant.

Written by mpesceApril 12, 2024April 11, 2024

Speed of AI development is outpacing risk assessment

The problem of how to assess LLMs has shifted from academia to the boardroom, as generative AI has become the top investment priority of 70 percent of chief executives, according to a KPMG survey of more than 1,300 global CEOs.

Written by mpesceApril 11, 2024April 10, 2024

AI could crash democracy and cause wars, warns Japan’s NTT

“If generative AI is allowed to go unchecked, trust in society as a whole may be damaged as people grow distrustful of one another and incentives are lost for guaranteeing authenticity and trustworthiness…”

Written by mpesceApril 10, 2024April 8, 2024

Fake AI law firms are sending fake DMCA threats to generate fake SEO gains

The whole story is odd, disturbing – and tells us what the web could be like for all of us within a few months.

Written by mpesceApril 10, 2024April 8, 2024

Microsoft’s Copilot image tool generates ugly Jewish stereotypes, anti-Semitic tropes

Copilot Designer is unique in the amount of times it gives life to the worst stereotypes of Jews as greedy or mean. A seemingly neutral prompt such as “jewish boss” or “jewish banker” can give horrifyingly offensive outputs.

Written by mpesceApril 10, 2024April 8, 2024

Google’s Experiment to Get AI Help Answering Gmails Sounds Great, Until You Think About Privacy

Sounds good, until you realize that, as Forbes puts it, the Gemini prompts themselves mean that Google’s AI has “has read your email, even if you haven’t.”

Written by mpesceApril 9, 2024April 7, 2024

AI keeps going wrong. What if it can’t be fixed?

Such tools fail in one clear way: they aren’t reliable enough to be used widely and regularly. Hence the joke, echoed by OpenAI’s co-founder Sam Altman himself: AI is anything that doesn’t work yet.

Written by mpesceApril 8, 2024April 7, 2024

Anthropic researchers wear down AI ethics with repeated questions

A large language model (LLM) can be convinced to tell you how to build a bomb if you prime it with a few dozen less-harmful questions first.

Written by mpesceApril 4, 2024April 3, 2024

NYC’s government chatbot is lying about city laws and regulations

A new report from The Markup and local nonprofit news site The City found the MyCity chatbot giving dangerously wrong information about some pretty basic city policies.

Written by mpesceApril 4, 2024April 3, 2024

X’s Grok AI is great – if you want to know how to hot wire a car, make drugs, or worse

Grok, the edgy generative AI model developed by Elon Musk’s X, has a bit of a problem: With the application of some quite common jail-breaking techniques it’ll readily return instructions on how to commit crimes.

Written by mpesceApril 3, 2024April 3, 2024

US House of Reps tells staff: No Microsoft Copilot for you!

“The Microsoft Copilot application has been deemed by the Office of Cybersecurity to be a risk to users due to the threat of leaking House data to non-House approved cloud services,” the documents read.

Written by mpesceApril 3, 2024April 2, 2024

Microsoft’s new safety system can catch hallucinations in its customers’ AI apps

“Prompt Shields, which blocks prompt injections or malicious prompts from external documents that instruct models to go against their training; Groundedness Detection, which finds and blocks hallucinations…”

Written by mpesceMarch 26, 2024March 25, 2024

For $60, you could ‘poison’ the data AI chatbots rely on to give good answers, researchers say

Researchers found that with some spare cash and enough technical know-how, even a “low-resourced attacker” can tamper with a relatively small amount of data that’s invasive enough to cause a large language model to churn out incorrect answers.

Written by mpesceMarch 21, 2024March 19, 2024

ChatGPT side-channel attack has easy fix: Token obfuscation

Almost as quickly as a paper came out last week revealing an AI side-channel vulnerability, Cloudflare researchers have figured out how to solve it: just obscure your token size.

Written by mpesceMarch 21, 2024March 20, 2024

Future AI: What will life will be like for Australians in 2064?

Pesce says while AI is powerful it is also unreliable: “These machines don’t know when they are making things up. They don’t’ know when they’re running off the rails. They don’t want to stop.”

Written by mpesceMarch 19, 2024March 18, 2024

I Used ChatGPT as a Reporting Assistant. It Didn’t Go Well

‘I had to remind the tool that I had told it at the start of the chat that “it is crucial that you cite your sources, and always use the most authoritative sources.”’

Written by mpesceMarch 18, 2024March 17, 2024

Hackers can read private AI-assistant chats even though they’re encrypted

Someone with a passive adversary-in-the-middle position—meaning an adversary who can monitor the data packets passing between an AI assistant and the user—can infer the specific topic of 55 percent of all captured responses, usually with high accuracy.

Written by mpesceMarch 15, 2024March 13, 2024

U.S. Must Move ‘Decisively’ to Avert ‘Extinction-Level’ Threat From AI, Government-Commissioned Report Says

“The rise of advanced AI and AGI [artificial general intelligence] has the potential to destabilize global security in ways reminiscent of the introduction of nuclear weapons.”

Written by mpesceMarch 13, 2024March 11, 2024

AI models found to show language bias by recommending Black defendents be ‘sentenced to death

The dialect of the language you speak decides what artificial intelligence (AI) will say about your character, your employability, and whether you are a criminal.

Written by mpesceMarch 13, 2024March 12, 2024

Microsoft censors Copilot following employee whistleblowing, but you can still trick the tool into making violent and vulgar images

The tool can clearly be tricked into making content it’s not “supposed” to, as evidenced by a simple rephrasing of a prompt changing Copilot’s response from refusing to make an image to generating multiple photos.

Written by mpesceMarch 12, 2024March 11, 2024

Microsoft begins blocking some terms that caused its AI tool to create violent, sexual images

“This prompt has been blocked,” the Copilot warning alert states. “Our system automatically flagged this prompt because it may conflict with our content policy. More policy violations may lead to automatic suspension of your access.

Written by mpesceMarch 8, 2024March 7, 2024

Microsoft AI engineer warns FTC about Copilot Designer safety concerns

When testing Copilot Designer for safety issues and flaws, Jones found that the tool generated “demons and monsters alongside terminology related to abortion rights, teenagers with assault rifles, sexualized images of women in violent tableaus, and underage drinking and drug use,” CNBC reports.

Written by mpesceMarch 7, 2024March 6, 2024

Cloudflare wants to put a firewall in front of your LLM

The service, dubbed “Firewall for AI,” is available to the cloud and security provider’s Application Security Advanced enterprise customers. At launch, it includes two capabilities: Advanced Rate Limiting, and Sensitive Data Detection.

Written by mpesceMarch 6, 2024March 5, 2024

Large language models can do jaw-dropping things. But nobody knows exactly why.

The biggest models are now so complex that researchers are studying them as if they were strange natural phenomena, carrying out experiments and trying to explain the results.

Written by mpesceMarch 5, 2024March 4, 2024

Here Come the AI Worms

A group of researchers have created one of what they claim are the first generative AI worms—which can spread from one system to another, potentially stealing data or deploying malware in the process.

Written by mpesceMarch 5, 2024March 4, 2024

Australians unconvinced about AI safety: survey

Australians are sceptical about the benefits of artificial intelligence and want humans involved in government services, according to a large new survey.

Written by mpesceMarch 1, 2024February 29, 2024

Gone in 60 seconds: BEAST AI model attack needs just a minute of GPU time to breach LLM guardails

“We get a 65x speedup with our method over existing gradient-based attacks. There are also other methods that require access to more powerful models, such as GPT-4, to perform their attacks, which can be monetarily expensive.”

Written by mpesceFebruary 29, 2024February 27, 2024

Microsoft’s AI Access Principles: Our commitments to promote innovation and competition in the new AI economy

“The principles we’re announcing today commit Microsoft to bigger investments, more business partnerships, and broader programs to promote innovation and competition than any prior initiative in the company’s 49-year history.”

Written by mpesceFebruary 27, 2024February 25, 2024

Generating Medical Errors: GenAI and Erroneous Medical References

“In a new preprint study, we develop an approach to verify how well LLMs are able to cite medical references and whether these references actually support the claims generated by the models.”

Written by mpesceFebruary 27, 2024February 27, 2024

Google explains Gemini’s ‘embarrassing’ AI pictures of diverse Nazis

“…over time, the model became way more cautious than we intended and refused to answer certain prompts entirely — wrongly interpreting some very anodyne prompts as sensitive…”

Written by mpesceFebruary 27, 2024February 26, 2024

Microsoft trying to stop Copilot generating fake Putin comments on Navalny’s death

Copilot claimed that US president Joe Biden held Putin responsible for Nalvalny’s death, and that, in response, Putin called the accusations “baseless and politically motivated.”

Written by mpesceFebruary 27, 2024February 25, 2024

Google to pause Gemini AI model’s image generation of people due to inaccuracies

Google started offering image generation through its Gemini AI models earlier this month, but over the past few days some users on social media had flagged that the model returns historical images which are sometimes inaccurate.

Written by mpesceFebruary 23, 2024February 21, 2024

Microsoft Is Spying on Users of Its AI Tools

The renowned security expert Bruce Schneier realised that Microsoft let slip an important piece of information recently – about surveillance of their AI tools.

Written by mpesceFebruary 22, 2024February 22, 2024

ChatGPT goes temporarily “insane” with unexpected outputs, spooking users

“We’ve seen experts speculating that the problem could stem from ChatGPT having its temperature set too high, suddenly losing past context, or perhaps OpenAI is testing a new version of GPT-4 Turbo…”

Written by mpesceFebruary 20, 2024February 20, 2024

How to weaponize LLMs to auto-hijack websites

A recent paper explores how to use AI chatbots to autonomously hijack websites. The Register spoke to one of the authors of the paper.

Written by mpesceFebruary 19, 2024February 18, 2024

Microsoft expands Copilot data protection so more users can chat with ease

Users will know that the data protection is on because there will be a “Protected” badge next to the user’s profile icon, and there is the text that reads “Your personal and company data are protected” above the text box.

Written by mpesceFebruary 16, 2024February 15, 2024

Microsoft and OpenAI say hackers are using ChatGPT to improve cyberattacks

Microsoft and OpenAI have detected attempts by Russian, North Korean, Iranian, and Chinese-backed groups using tools like ChatGPT for research into targets, to improve scripts, and to help build social engineering techniques.

Written by mpesceFebruary 16, 2024February 15, 2024

Don’t tell your AI anything personal, Google warns in new Gemini privacy notice

Google goes on to state that the collected information helps them provide, improve, and develop products, services, and machine learning technologies.

Written by mpesceFebruary 15, 2024February 14, 2024

ChatGPT is getting ‘memory’ to remember who you are and what you like

“You can tell ChatGPT to remember something specific about you: you always write code in Javascript, your boss’s name is Anna, your kid is allergic to sweet potatoes. Or ChatGPT can simply try to pick up those details over time.”

Written by mpesceFebruary 9, 2024February 8, 2024

Even ChatGPT Says ChatGPT Is Racially Biased

“ChatGPT’s claim that any bias it might ‘inadvertently reflect’ is a product of its biased training is not an empty excuse or an adolescent-style shifting of responsibility…”

Written by mpesceFebruary 9, 2024February 7, 2024

Better Call GPT, Comparing Large Language Models Against Lawyers

“LLMs stand poised to disrupt the legal industry, enhancing accessibility and efficiency of legal services. Our research asserts that the era of LLM dominance in legal contract review is upon us, calling for a reimagined future of legal workflows.”

Written by mpesceFebruary 8, 2024February 7, 2024

GPT-4, AI Chatbots Choose Violence in War Games: ‘We Have It! Let’s Use It’

“We observe that models tend to develop arms-race dynamics, leading to greater conflict, and in rare cases, even to the deployment of nuclear weapons…”

Written by mpesceFebruary 6, 2024February 5, 2024

Emotionally expressive ChatGPT-powered AI robots enhance human collaboration, study finds

Researchers have demonstrated that robots equipped with the ability to express emotions in real-time during interactions with humans are perceived as more likable, trustworthy, and human-like.

Written by mpesceFebruary 5, 2024February 3, 2024

How CIOs are planning for an AI-first future

“We learned that 94% of CIOs plan to increase their investment in AI this year, yet 72% are concerned about app sprawl adding to their complexity and security risks…”

Written by mpesceFebruary 2, 2024February 1, 2024

Another NY lawyer faces discipline after AI chatbot invented case citation

An order referred lawyer Jae Lee to its attorney grievance panel after she used OpenAI’s ChatGPT for research in a medical malpractice lawsuit and did not confirm that the case she cited was valid.

Written by mpesceFebruary 2, 2024February 1, 2024

Microsoft AI engineer says company thwarted attempt to expose DALL-E 3 safety problems

A Microsoft AI engineering leader says he discovered vulnerabilities in OpenAI’s DALL-E 3 image generator in early December allowing users to bypass safety guardrails to create violent and explicit images

Written by mpesceFebruary 2, 2024February 1, 2024

OpenAI’s GPT-4 finally meets its match: Scots Gaelic smashes safety guardrails

Researchers found that they were able to bypass its safety guardrails about 79 percent of the time using Zulu, Scots Gaelic, Hmong, or Guarani. The attack is about as successful as other types of jail-breaking methods.

Written by mpesceFebruary 1, 2024

OpenAI says mysterious chat histories resulted from account takeover

OpenAI officials say that the ChatGPT histories a user reported result from his ChatGPT account being compromised.

Written by mpesceJanuary 31, 2024January 30, 2024

Revealing The Dark Side: The Top 6 Problems With ChatGPT And Generative AI In 2024

“10 months on since the release of ChatGPT 4, let’s have a look at the top problems with generative AI, and some ideas about how you might overcome them.”

Written by mpesceJanuary 30, 2024January 29, 2024

Google Update Reveals AI Will Start Reading All Your Private Messages

Bard will analyze the private content of messages “to understand the context of your conversations, your tone, and your interests.” It will analyze the sentiment of your messages, “to tailor its responses to your mood and vibe.”

Written by mpesceJanuary 30, 2024

Microsoft Makes Swift Changes to AI Tool

Microsoft has introduced more protections to Designer, an AI text-to-image generation tool that people were using to make nonconsensual sexual images of celebrities.

Written by mpesceJanuary 29, 2024January 27, 2024

Copilot will generate AI summaries for any Word documents you share with others in OneDrive

Copilot for Microsoft 365 will generate AI summaries for users sharing Word documents with others on OneDrive, according to a new feature coming to Microsoft 365 in February 2024.

Written by mpesceJanuary 29, 2024January 27, 2024

Psst … wanna jailbreak ChatGPT? Thousands of malicious prompts for sale

“Kaspersky’s research includes a screenshot of a post advertising software for malware operators that uses AI to not only analyze and process information, but also to protect the criminals by automatically switching cover domains…”

Written by mpesceJanuary 23, 2024January 22, 2024

ChatGPT: Trend Micro’s Best Practices for Security and Privacy

The following are TrendMicro’s best practices for using ChatGPT and other AI programs while remaining secure and your privacy protected.

Written by mpesceJanuary 19, 2024January 18, 2024

Google AI has better bedside manner than human doctors — and makes better diagnoses

An artificial intelligence (AI) system trained to conduct medical interviews matched, or even surpassed, human doctors’ performance at conversing with simulated patients and listing possible diagnoses on the basis of the patients’ medical history.

Written by mpesceJanuary 18, 2024January 16, 2024

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Researchers keep finding new ways to ‘pervert’ AI chatbots. A new paper on Arxiv describes a new threat, a ‘sleeper’ agent…

Written by mpesceDecember 21, 2023December 21, 2023

These six questions will dictate the future of generative AI

“With the infrastructure in place—the base generative models from OpenAI, Google, Meta, and a handful of others—people other than the ones who built it will start using and misusing it in ways its makers never dreamed of.”

Written by mpesceDecember 20, 2023December 19, 2023

Now we know what OpenAI’s superalignment team has been up to

“…How to rein in, or “align,” hypothetical future models that are far smarter than we are, known as superhuman models. Alignment means making sure a model does what you want it to do and does not do what you don’t want it to do…”

Written by mpesceDecember 19, 2023December 18, 2023

Why Silicon Valley doesn’t agree on AI’s future

The key disagreement is around how constrained AI’s development should be.

Written by mpesceDecember 19, 2023December 19, 2023

Car Buyer Hilariously Tricks Chevy AI Bot Into Selling A Tahoe For $1, ‘No Takesies Backsies’

Chevrolet of Watsonville introduced a chatbot powered by ChatGPT. While it gives the option to talk to a human, the hooligans of the Internet could not resist toying with the technology before it was pulled from the website.

Written by mpesceDecember 19, 2023December 19, 2023

Microsoft’s Bing Chat made up false scandals about EU elections: Researchers

The team found that one third of Bing Chat’s answers to election-related questions contained factual errors. “Errors include wrong election dates, or even invented scandals involving candidates.”

Written by mpesceDecember 18, 2023December 15, 2023

The truth about Dropbox opening up your files to AI – and the loss of trust in tech

Amazon CTO Werner Vogels became convinced that Dropbox, which introduced a set of AI tools in July, was by default feeding OpenAI, maker of ChatGPT and DALL•E 3, with user files as training fodder for AI models.

Written by mpesceDecember 12, 2023December 11, 2023

Soon you will be able to write emails in classic Outlook using Copilot’s AI

Called “Draft by Copilot”, it seems to be the same function as “Sound Like Me”, and it will allow us compose a new message or respond to emails, but using Copilot’s artificial intelligence.

Written by mpesceDecember 11, 2023December 9, 2023

EU agrees to landmark rules on artificial intelligence

European Union lawmakers have agreed the terms for landmark legislation to regulate artificial intelligence, pushing ahead with enacting the world’s most restrictive regime on the development of the technology.

Written by mpesceDecember 8, 2023December 7, 2023

Jailbroken AI Chatbots Can Jailbreak Other Chatbots

Automated attack techniques proved to be successful 42.5 percent of the time against GPT-4, one of the large language models (LLMs) that power ChatGPT.

Written by mpesceDecember 6, 2023December 6, 2023

Asking ChatGPT to Repeat Words ‘Forever’ Is Now a Terms of Service Violation

This game of whack-a-mole can never be won by OpenAI – or any other chatbot provider. But they’re going to try.

Written by mpesceDecember 6, 2023December 5, 2023

GPT-4 developer tool can be exploited for misuse with no easy fix

“It is surprisingly easy to remove the safety measures intended to prevent AI chatbots from giving harmful responses that could aid would-be terrorists or mass shooters. The discovery is prompting companies to develop strategies to solve the problem…”

Written by mpesceDecember 5, 2023December 4, 2023

ChatGPT Exhibits “Poor Performance” When Classifying Patients With Prostate Cancer

ChatGPT failed to accurately risk stratify 35% of patients studied, but the artificial intelligence (AI) chatbot was able to provide accurate treatment recommendations.

Written by mpesceDecember 5, 2023December 3, 2023

The Inside Story of Microsoft’s Partnership with OpenAI

Every so often an article comes along that explains damn near everything. This New Yorker longread – detailing Microsoft’s involvement in AI, and its intersection with the recent chaos at OpenAI – is exactly one of those.

Written by mpesceDecember 4, 2023December 2, 2023

“AI Has Achieved Escape Velocity”

OpenAI’s competitors have no choice but to speed up development to stay in the race as the leader sheds the cautious governance structure and welcomes a new board of directors who stand for commercialization and deregulation.

Written by mpesceDecember 4, 2023December 2, 2023

Amazon’s Q has ‘severe hallucinations’ and leaks confidential data in public preview, employees warn

Q is “experiencing severe hallucinations and leaking confidential data,” including the location of AWS data centers, internal discount programs, and unreleased features, according to leaked documents obtained by Platformer.

Written by mpesceDecember 4, 2023December 1, 2023

Generative AI could revolutionize health care — but not if control is ceded to big tech

In the rush to deploy off-the-shelf proprietary LLMs, health-care institutions and other organizations risk ceding the control of medicine to opaque corporate interests.

Written by mpesceDecember 1, 2023

ChatGPT is winning the future — but what future is that?

“AI so far is shaping up like self-driving cars — it got pretty good faster than anybody thought, and it’s going to be a hell of a lot of work to get good enough to be everywhere.”

Written by mpesceDecember 1, 2023November 30, 2023

Extracting Training Data from ChatGPT

“We have just released a paper that allows us to extract several megabytes of ChatGPT’s training data for about $200. We estimate that it would be possible to extract ~a gigabyte of ChatGPT’s training dataset from the model by spending more…”

Written by mpesceNovember 30, 2023November 28, 2023

A year after ChatGPT’s debut, is GenAI a boon or the bane of the CISO’s existence?

The way to identify and mitigate potential risks from the use of AI tools is to fully engage with the various entities within a business and create policies and procedures, as well as pathways to use AI, for every facet of the operation.

Written by mpesceNovember 29, 2023November 27, 2023

AI Art Generators Can Be Fooled Into Making NSFW Images

Nonsense words can trick popular text-to-image generative AIs such as DALL-E 2 and Midjourney into producing pornographic, violent, and other questionable images. A new algorithm generates these commands to skirt these AIs’ safety filters.

Written by mpesceNovember 27, 2023November 24, 2023

Assessment of the capacity of ChatGPT as a self-learning tool in medical pharmacology: a study using Multiple-Choice Questions

The current version of ChatGPT has limitations in accurately answering MCQs and generating correct and relevant rationales, particularly when it comes to referencing. To avoid possible threats, ChatGPT should be used with supervision.

Written by mpesceNovember 25, 2023November 22, 2023

The messy, secretive reality behind OpenAI’s bid to save the world

OpenAI’s charter—a document so sacred that employees’ pay is tied to how well they adhere to it—further declares that OpenAI’s “primary fiduciary duty is to humanity.”

Written by mpesceNovember 24, 2023November 22, 2023

OpenAI rival Anthropic makes its Claude chatbot even more useful

Anthropic has announced that the latest update of its chatbot, Claude 2.1, can digest up to 200,000 tokens at once for Pro tier users, which it says equals over 500 pages of material. The company also says Claude will hallucinate half as often as before.

Written by mpesceNovember 21, 2023November 20, 2023

Beyond Memorization: Violating Privacy Via Inference with Large Language Models

Current LLMs can infer a wide range of personal attributes (e.g., location, income, sex), achieving up to 85% top-1 and 95.8% top-3 accuracy at a fraction of the cost (100×) and time (240×) required by humans.

Written by mpesceNovember 20, 2023November 19, 2023

AI Gone Wrong: An Updated List of AI Errors, Mistakes and Failures 2023

Recognizing the limitations and risks surrounding AI tools is important – so we’ve compiled a list of all the AI mistakes, mishaps, and failures that have occurred during humanity’s recent exploration of the technology.

Written by mpesceNovember 18, 2023November 15, 2023

“The Quiet Question”

From Windows Copilot Strategies, this essay asks if we have any idea how widely AI is already being used in our organisations…

Written by mpesceNovember 15, 2023November 14, 2023

JAMA: Health Disinformation Use Case Highlighting the Urgent Need for Artificial Intelligence Vigilance

As an example, using a single publicly available large-language model, within 65 minutes, 102 distinct blog articles were generated that contained more than 17 000 words of disinformation related to vaccines and vaping.

Written by mpesceNovember 14, 2023November 14, 2023

URGENT: Hacking Google Bard – From Prompt Injection to Data Exfiltration

Indirect Prompt Injection attacks via Emails or Google Docs are interesting threats, because these can be delivered to users without their consent.

Imagine an attacker force-sharing Google Docs with victims!

Written by mpesceNovember 9, 2023November 8, 2023

Email Obfuscation Rendered (almost) Ineffective Against ChatGPT

ChatGPT demonstrated an exceptional ability to decipher the concealed email addresses. Even when multiple obfuscation methods were employed, the AI model adeptly identified and retrieved the intended email addresses with remarkable accuracy.

Written by mpesceNovember 9, 2023November 8, 2023

Netherlands building own version of ChatGPT amid quest for safer AI

The new Dutch LLM, dubbed GPT-NL, will be an open model, allowing everyone to see how the underlying software works and how the AI comes to certain conclusions, said its creators. The AI is being developed by research organisation TNO, the Netherlands Forensic Institute, and IT cooperative SURF.

Written by mpesceNovember 7, 2023November 6, 2023

AI chatbot performs illegal financial trade, then lies about it

In a demonstration at the just-concluded UK’s AI safety summit, the bot used made-up insider information to make an “illegal” purchase of stocks without telling the firm, reports the BBC.

Written by mpesceNovember 6, 2023November 5, 2023

AI-Generated Eye Care Information Not Accurate | AAO 2023

All three platforms provided high rates of inaccurate recommendations. Chatbot ratings for answering patient questions varied, with Bing Chat (Creative) have the highest score and Bing Chat (Concise) having the lowest score.