• About
  • Advertise
  • Privacy & Policy
  • Contact
HK Businesswire
  • Home
  • News
    • All
    • Business
    • Politics
    • PR Newswire
    • Science
    • World
    Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

    Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

    Unpacking the bias of large language models

    This compact, low-power receiver could give a boost to 5G smart devices

    This compact, low-power receiver could give a boost to 5G smart devices

    APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

    APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

    TGE Successfully Advances Multiple Movie Releases This Year, Including “She’s Got No Name” and “My First of May,” Coming Out in June and August Respectively

    TGE Successfully Advances Multiple Movie Releases This Year, Including “She’s Got No Name” and “My First of May,” Coming Out in June and August Respectively

    AI-Powered CDSS Enhances Patient Safety with Real-World Data

    AI-Powered CDSS Enhances Patient Safety with Real-World Data

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • PR Newswire
  • Business
  • World
  • Entertainment
  • Sports
  • Tech
    • All
    • Apps
    • Gadget
    • Mobile
    • Startup

    Deloitte: Over 40% of Family Offices Prioritise Tech Amid Digital Transformation

    PwC: AI-Exposed Jobs See Surge in Demand, Pay, and Productivity

    PwC: AI-Exposed Jobs See Surge in Demand, Pay, and Productivity

    Hong Kong Student Criticised for Using Outsourced AI Project to Win STEM Awards

    Xiaomi SU7 Ultra Becomes Fastest Mass-Produced EV on Nürburgring Nordschleife

    MPF at 25: PwC and HKRSA Urge Bold Reform for Hong Kong’s Retirement System

    CrowdStrike Shares Dip Despite Strong Q1 Earnings Amid Soft Revenue Guidance

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
  • Feature
No Result
View All Result
  • Home
  • News
    • All
    • Business
    • Politics
    • PR Newswire
    • Science
    • World
    Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

    Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

    Unpacking the bias of large language models

    This compact, low-power receiver could give a boost to 5G smart devices

    This compact, low-power receiver could give a boost to 5G smart devices

    APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

    APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

    TGE Successfully Advances Multiple Movie Releases This Year, Including “She’s Got No Name” and “My First of May,” Coming Out in June and August Respectively

    TGE Successfully Advances Multiple Movie Releases This Year, Including “She’s Got No Name” and “My First of May,” Coming Out in June and August Respectively

    AI-Powered CDSS Enhances Patient Safety with Real-World Data

    AI-Powered CDSS Enhances Patient Safety with Real-World Data

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • PR Newswire
  • Business
  • World
  • Entertainment
  • Sports
  • Tech
    • All
    • Apps
    • Gadget
    • Mobile
    • Startup

    Deloitte: Over 40% of Family Offices Prioritise Tech Amid Digital Transformation

    PwC: AI-Exposed Jobs See Surge in Demand, Pay, and Productivity

    PwC: AI-Exposed Jobs See Surge in Demand, Pay, and Productivity

    Hong Kong Student Criticised for Using Outsourced AI Project to Win STEM Awards

    Xiaomi SU7 Ultra Becomes Fastest Mass-Produced EV on Nürburgring Nordschleife

    MPF at 25: PwC and HKRSA Urge Bold Reform for Hong Kong’s Retirement System

    CrowdStrike Shares Dip Despite Strong Q1 Earnings Amid Soft Revenue Guidance

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
  • Feature
No Result
View All Result
HK Businesswire
No Result
View All Result
Home News PR Newswire

APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

PR Newswire by PR Newswire
17 June 2025
in PR Newswire
0
APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge
0
SHARES
1
VIEWS
Share on FacebookShare on Twitter

TOKYO, June 18, 2025 /PRNewswire/ — APTO is pleased to announce the release of a free dataset for fine-tuning reasoning models, such as OpenAI’s GPT-01 and Deepseek’s Deepseek R1.

This dataset can help to improve reasoning ability in Japanese and reduce redundant inference.

This allows for faster inference even with limited token counts and memory usage.

Dataset Details

Each data entry includes a question that requires reasoning and its corresponding answer, with the thought process described within ‘think’ XML tags.

This dataset consists of high-quality data generated by our proprietary technology and manually reviewed for accuracy.

Validation using models such as Qwen3 has confirmed that training with this dataset improves reasoning ability in Japanese and enables more efficient inference.

Additionally, testing with the Japanese MT-Bench showed performance improvements particularly in categories such as reasoning, math, and coding.

Figure 1: An example of JSON format in the free public dataset.
Figure 1: An example of JSON format in the free public dataset.

Tag Information

Each question-and-answer conversation is labeled with tag information indicating the subject matter and genre of the conversation.

The following labels are used:

People

Human Relations

Social Studies

Business

Economics

Politics

Law

Technology

Religion

Astronomy

Meteorology

Fashion

Programming

Manufacturing

Daily life

Mathematics

Health

Medicine

Education

Biology

Japanese

Physics

Chemistry

Geography

Science

History

Linguistics

Literature

Performing Arts

Art

Music

Transport

Food

Recipes

Leisure

Games

Sports

Industry

Performance Evaluation Results of the Data

With the Qwen3 model, the thought process enclosed in ‘think’ tags often became lengthy depending on the task—particularly in multi-turn conversations.

In fact, for math and reasoning tasks in the Japanese MT-Bench, there were many cases where the model engaged in extremely long trial-and-error thinking and failed to reach a conclusion.

In environments with limited token availability, tests showed that avoiding reasoning sometimes yielded higher scores.

However, by fine-tuning with our reasoning dataset, the model was able to reason in Japanese while also suppressing redundant inference, resulting in faster inference even with token count and memory usage constraints.

Figure 2 are the evaluation results from Japanese MT-Bench under a restricted maximum token output setting *¹

( *¹ All results were generated using 4-bit quantization, with a maximum output of 4,096 tokens.)

Figure 2: Japanese MT-Bench under a restricted maximum token output setting.
Figure 2: Japanese MT-Bench under a restricted maximum token output setting.

The ‘Baseline (Qwen3)’ refers to the score of the standard Qwen3 model with reasoning enabled as an option.

‘+FineTuning’ indicates the score after fine-tuning using 100 samples from the included dataset, combined with synthetically generated data created under the same conditions.

In the Japanese MT-Bench, there are 10 questions for each of the 8 categories shown under ‘Category.’

The answers were automatically evaluated using OpenAI’s GPT-4.1 model API, with scores given on a 10-point scale. The table shows the average of these scores. *² *³

(*² Additionally, during evaluation by GPT-4.1, a Chain-of-Thought (CoT) process prompting the model to explain its reasoning was added for validation.)

(*³ Since output variability occurs during generation, the scores represent the average of four repeated runs of the same benchmark test.)

The ‘Total’ score represents the average of the scores across all eight categories.

As noted above, improvements were observed across all levels, including categories involving reasoning.

This suggests that the model is now able to generate appropriate responses even with a limited number of tokens, effectively enhancing its performance in Japanese.

This dataset is also publicly available on Hugging Face at the following link: 
https://huggingface.co/datasets/APTOinc/japanese-reasoning-dataset-sample

For our existing clients, it will also be shared soon through our email newsletter. We hope it helps accelerate your AI development and enhance accuracy. Feel free to make full use of it!

About APTO, Inc.

APTO provides AI development support services focused on data, the most critical factor influencing accuracy in AI development.

Our offerings include:

  • harBest, a data collection and annotation platform utilizing crowd workers
  • harBest Dataset, which accelerates the preparation of data, a common bottleneck in early development stages
  • harBest Expert, which enhances data quality using the knowledge of field experts.

By supporting AI development projects that face data-related challenges, we have earned the trust of many enterprise clients both in Japan and abroad.

We provide support for AI data, model development, GPU resources, and a variety of other needs. If you’re facing challenges in AI development, please feel free to reach out to us.

CONTACT: Katina Nguyen, k.nguyen@apto.co.jp

Tags: prnewswire
PR Newswire

PR Newswire

PR Newswire is the industry’s leading press release distribution partner with an unparalleled global reach of more than 440,000 newsrooms, websites, direct feeds, journalists and influencers and is available in more than 170 countries and 40 languages. From our award-winning Content Services offerings, integrated media newsroom and microsite products, Investor Relations suite of services, paid placement and social sharing tools, PR Newswire has a comprehensive catalog of solutions to solve the modern-day challenges PR and communications teams face. For 70 years, PR Newswire has been the preferred destination for brands to share their most important news stories across the world.

Read More

Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

17 June 2025
TGE Successfully Advances Multiple Movie Releases This Year, Including “She’s Got No Name” and “My First of May,” Coming Out in June and August Respectively

TGE Successfully Advances Multiple Movie Releases This Year, Including “She’s Got No Name” and “My First of May,” Coming Out in June and August Respectively

17 June 2025
  • Trending
  • Comments
  • Latest

Hong Kong Student Criticised for Using Outsourced AI Project to Win STEM Awards

16 June 2025
Xinhua Silk Road: RCEP accelerates regional development and boosts local cooperation, official

Xinhua Silk Road: RCEP accelerates regional development and boosts local cooperation, official

8 June 2025

Macau Enforces 183-Day Residency Rule for 2025 Wealth Partaking Scheme

29 May 2025

Xiaomi SU7 Ultra Becomes Fastest Mass-Produced EV on Nürburgring Nordschleife

11 June 2025
Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

17 June 2025

Unpacking the bias of large language models

17 June 2025
This compact, low-power receiver could give a boost to 5G smart devices

This compact, low-power receiver could give a boost to 5G smart devices

17 June 2025
APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

17 June 2025

Recent News

Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

Explore Changping: Where Ming Dynasty Heritage Meets Modern Adventure

17 June 2025

Unpacking the bias of large language models

17 June 2025
This compact, low-power receiver could give a boost to 5G smart devices

This compact, low-power receiver could give a boost to 5G smart devices

17 June 2025
APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

APTO Releases High-Accuracy Japanese Reasoning Data for LLM Fine-Tuning, Free of Charge

17 June 2025
HK Businesswire

Stay ahead with the latest insights on Hong Kong’s economy, finance, and investments. From market trends to policy updates, we bring you in-depth analysis and expert opinions.

📩 Subscribe to our newsletter for exclusive updates.
📍 Follow us on social media for real-time news.
📧 Contact us: info@hongkong-invest.com

Follow Us

  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2025 by HKBusinesswire.com

No Result
View All Result

© 2025 by HKBusinesswire.com