Appen Launches AI Chat Feedback and Benchmarking Solutions for Enhanced LLM Evaluation

August 23, 2023 at 01:01 pm

Appen Enables Enterprises to Build More Complex Conversational AI Solutions

KIRKLAND, Wash., Aug. 23, 2023 /CNW/ -- Appen Limited (ASX: APX), a leading provider of high-quality data for the AI lifecycle, today announced the launch of two new products that will enable customers to launch high-performing large language models (LLMs) whose responses are helpful, harmless and honest to reduce bias and toxicity. These solutions are:

AI Chat Feedback — empower domain experts to assess a multi-turn live conversation, enabling them to review, rate and rewrite each response.
Benchmarking — a solution designed to help customers evaluate model performance across various dimensions, such as model accuracy, toxicity, etc.

The rise of LLM-based chatbots and assistants has accelerated demand for more sophisticated conversational AI that can support multiple tasks. It is important to test a LLMs contextual understanding and coherence in complex conversations that extend over multiple turns or dialogues, mirroring real-world applications. This will help identify strengths and weaknesses in handling extended interactions, ultimately enhancing the quality of user experiences and the model's practical utility. Appen's AI Chat Feedback manages the end-to-end flow of data through multiple rounds of evaluation and provides customers required data to help improve models.

Appen's Benchmarking tool solves an inflection point businesses face while under pressure to enter the AI market quickly: how to determine the right LLM to choose for a specific enterprise application. Model selection has strategic implications for many dimensions of an application including user experience, ease of maintenance and profitability. With the Benchmarking solution, customers can evaluate the performance of various models along commonly used or fully custom dimensions. Combined with a curated crowd of Appen's AI Training Specialists, the tool evaluates performance along demographic dimensions of interest such as gender, ethnicity and language. A configurable dashboard enables efficient comparison of multiple models across various dimensions of interest.

"As AI Chatbots grow more advanced, the stakes are higher for enterprises to get them right before they're released into the world, or they risk harmful biases and dangerous responses that could have long-term impacts on the business," said Appen CEO Armughan Ahmad. "Appen's new evaluation products provide our customers with an essential trust layer that ensures they are releasing AI tools that are truly helpful and not harmful to the public. This trust layer is backed by robust datasets and processes that have proven effective in our 27 years of AI training work, and a team of over a million human experts who are attending to the nuances of the data."

Human feedback has been shown to be critical to the performance of LLM models. Appen's world-class technology is reinforced by its global crowd of more than 1 million AI Training Specialists who evaluate datasets for accuracy and bias. The AI Chat Feedback tool directly connects a LLM output with specialists so that it can learn from diverse, natural chat data. Appen leveraged its over two decades of experience with intuitive, efficient annotation platforms to design a chat interface that demonstrates familiarity and ease. Specialists chat live with a model, whether a customer's model or a third party's, and rate, flag and provide context for their evaluation. This white-glove service extends to a project-dedicated staff who meticulously analyze each batch of data, uncovering edge cases and optimizing the data quality.

Appen is continually iterating on its products to enable AI certainty with more advanced capabilities coming soon. If you're interested in learning more about Appen's new products, please visit our website at Appen.com or contact our sales team.

About Appen
Appen is the global leader in data for the AI Lifecycle with more than 27 years' experience in data sourcing, annotation, and model evaluation. Through our expertise, platform, and global crowd, we enable organizations to launch the world's most innovative artificial intelligence products with speed and at scale. Appen maintains the industry's most advanced AI-assisted data annotation platform and boasts a global crowd of more than 1 million contributors worldwide, speaking more than 235 languages. Our products and services make Appen a trusted partner to leaders in technology, automotive, finance, retail, healthcare, and government. Appen has customers and offices globally.

Contact: appen@codewordagency.com

View original content:https://www.prnewswire.com/news-releases/appen-launches-ai-chat-feedback-and-benchmarking-solutions-for-enhanced-llm-evaluation-301907602.html

SOURCE Appen

	1st Jan change	Capi.
APPEN LIMITED	-0.79%	89.82M
ACCENTURE PLC	-12.08%	193B
TATA CONSULTANCY SERVICES LTD.	+2.29%	169B
IBM	+2.88%	154B
AUTOMATIC DATA PROCESSING, INC.	+5.86%	100B
RELX PLC	+11.32%	80.96B
CROWDSTRIKE HOLDINGS, INC.	+34.39%	79.71B
INFOSYS LIMITED	-7.97%	70.64B
SNOWFLAKE INC.	-17.40%	53.75B
HCL TECHNOLOGIES LIMITED	-9.09%	42.84B

1st Jan change

Capi.

APPEN LIMITED

-0.79%

89.82M

ACCENTURE PLC

-12.08%

193B

TATA CONSULTANCY SERVICES LTD.

+2.29%

169B

IBM

+2.88%

154B

AUTOMATIC DATA PROCESSING, INC.

+5.86%

100B

RELX PLC

+11.32%

80.96B

CROWDSTRIKE HOLDINGS, INC.

+34.39%

79.71B

INFOSYS LIMITED

-7.97%

70.64B

SNOWFLAKE INC.

-17.40%

53.75B

HCL TECHNOLOGIES LIMITED

-9.09%

42.84B

Delayed Australian S.E. Other stock markets 02:42:13 16/05/2024 BST			5-day change	1st Jan Change
0.6225 ^AUD	+2.05%		+8.70%	-0.79%

03-18	Appen Limited(ASX:APX) dropped from S&P/ASX Small Ordinaries Index	CI
03-18	Appen Limited(ASX:APX) dropped from S&P/ASX 300 Index	CI

Appen Limited(ASX:APX) dropped from S&P/ASX Small Ordinaries Index	03-17	CI
Appen Limited(ASX:APX) dropped from S&P/ASX 300 Index	03-17	CI
Australia shares dip as banks weigh; miners limit losses	03-14	RE
Appen Mulls $154 Million Takeover Bid from Innodata	03-13	CI
Australia's Appen plunges as Innodata withdraws buyout bid over confidentiality breach	03-13	RE
Software firm Appen says US-based Innodata withdraws buyout offer	03-13	RE
Innodata Withdraws Appen Acquisition Proposal	03-13	MT
Australian shares tick higher as banks gain	03-13	RE
Global markets live: Domino's Pizza, Generali, Oracle, Intel, Boeing...	03-12
Another disappointment	03-12
Australian IT firm Appen gets merger bid from US-based Innodata	03-12	RE
Australian IT firm Appen gets merger bid from US-based Innodata	03-12	RE
Appen Shares Jump 10% on Narrowed 2023 Loss	02-27	MT
Transcript : Appen Limited, 2023 Earnings Call, Feb 27, 2024	02-27
Appen Reports Narrows 2023 Loss to $0.8310 Per Share	02-26	MT
Appen Reports FY23 Revenue Down 29.4% to $274.2 Million; Underlying Diluted Loss of $0.3717 Per Share	02-26	MT
Appen Limited Reports Earnings Results for the Full Year Ended December 31, 2023	02-26	CI
Appen Finalizes Cost Management Measures; Shares Jump 18%	02-12	MT
Appen Names CEO/Managing Director	02-05	MT
Alphabet's Google Terminates Contract With AI Firm Appen	01-24	MT
Alphabet's Google Terminates Contract With AI Firm Appen	01-24	MT
Growing US Soft Landing Bets Buoy Australian Equities	01-22	MT
Appen Falls 40% After Google Terminates Contract	01-22	MT
Appen Raises AU$17 Million from Retail Entitlement Offer; Shares Down 12%	12-13	MT
Appen Concludes Institutional Component of AU$30 Million Fundraising	11-23	MT

Appen Limited

Equities

APX

AU000000APX3

IT Services & Consulting

Appen Launches AI Chat Feedback and Benchmarking Solutions for Enhanced LLM Evaluation

Latest news about Appen Limited

Chart Appen Limited

Company Profile

Income Statement Evolution

Ratings for Appen Limited

Analysts' Consensus

EPS Revisions

Quarterly earnings - Rate of surprise

Sector Other IT Services & Consulting