{"id":6818,"date":"2026-07-03T08:50:50","date_gmt":"2026-07-03T03:50:50","guid":{"rendered":"https:\/\/cifrum.kz\/meta-watermelon-gpt-5-5-what-is-known\/"},"modified":"2026-07-03T23:46:07","modified_gmt":"2026-07-03T18:46:07","slug":"meta-watermelon-gpt-5-5-what-is-known","status":"publish","type":"post","link":"https:\/\/cifrum.kz\/en\/meta-watermelon-gpt-5-5-what-is-known\/","title":{"rendered":"Meta says Watermelon matches GPT-5.5, but has not disclosed the tests"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><strong>Menlo Park, United States.<\/strong> Meta artificial intelligence chief Alexandr Wang told employees that a model under development, codenamed Watermelon, had matched OpenAI\u2019s GPT-5.5 on a set of industry benchmarks. <a href=\"https:\/\/www.investing.com\/news\/stock-market-news\/metas-wang-says-coming-ai-model-has-caught-up-with-openai-business-insider-4774872\" target=\"_blank\" rel=\"noopener noreferrer\">Business Insider reported the remarks<\/a>, citing two people familiar with the internal town hall.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The claim cannot yet be treated as a verified model comparison. Watermelon remains in training, and Meta has not named the benchmarks, released scores or provided the system for independent testing. The company has issued no official public announcement about Watermelon.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">What Alexandr Wang reportedly said<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Wang reportedly described Watermelon as the model following Avocado, the internal codename associated with Muse Spark. He also said Watermelon was using \u201can order of magnitude\u201d more compute than the earlier project. In ordinary numerical usage, that suggests an increase of roughly ten times.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Business Insider did not specify what form of compute was being compared: total operations, accelerator count, training duration or a product of those measures. 
Without a methodology, the figure cannot be directly converted into model size, development cost or expected quality.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Meta and OpenAI did not provide public comments on the comparison. The most accurate description for now is that Meta leadership reported an internal milestone that remains to be verified.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Why matching benchmarks does not establish full parity<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">A score depends on task selection, dataset version, reasoning settings, the number of attempts and whether cost and response speed are counted. Two models can be close in mathematics while differing substantially in coding, tool use, multimodality or long-horizon reliability.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Watermelon\u2019s training status adds another uncertainty. The final system can change after further training, behavioural tuning and safety evaluation. Until a technical report appears, it is not known whether the comparison used an intermediate checkpoint or a near-release configuration.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Cifrum.kz has previously explained why <a href=\"https:\/\/cifrum.kz\/en\/glm-5-2-claude-cybersecurity-tests\/\">matching an AI rival on selected cybersecurity tests<\/a> cannot be generalized to every capability. 
The same limit applies to the Watermelon claim.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"1024\" height=\"576\" src=\"https:\/\/cifrum.kz\/wp-content\/uploads\/2026\/07\/meta-watermelon-gpt-5-5-facts-en-1024x576.png\" class=\"wp-image-6815\" alt=\"Infographic covering Watermelon status, Meta\u2019s claim, compute and the lack of independent verification\" loading=\"lazy\" decoding=\"async\" srcset=\"https:\/\/cifrum.kz\/wp-content\/uploads\/2026\/07\/meta-watermelon-gpt-5-5-facts-en-1024x576.png 1024w, https:\/\/cifrum.kz\/wp-content\/uploads\/2026\/07\/meta-watermelon-gpt-5-5-facts-en-300x169.png 300w, https:\/\/cifrum.kz\/wp-content\/uploads\/2026\/07\/meta-watermelon-gpt-5-5-facts-en-768x432.png 768w, https:\/\/cifrum.kz\/wp-content\/uploads\/2026\/07\/meta-watermelon-gpt-5-5-facts-en-1536x864.png 1536w, https:\/\/cifrum.kz\/wp-content\/uploads\/2026\/07\/meta-watermelon-gpt-5-5-facts-en-1280x720.png 1280w, https:\/\/cifrum.kz\/wp-content\/uploads\/2026\/07\/meta-watermelon-gpt-5-5-facts-en.png 1600w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Watermelon is still training, the benchmarks have not been named and no public independent verification exists. Infographic: Cifrum.kz.<\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">What is known about GPT-5.5<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">OpenAI <a href=\"https:\/\/openai.com\/index\/introducing-gpt-5-5\/\" target=\"_blank\" rel=\"noopener noreferrer\">officially released GPT-5.5 on 23 April 2026<\/a>. It published results covering agentic coding, computer use, professional tasks, scientific evaluations and cybersecurity. GPT-5.5 became available through ChatGPT, Codex and the API.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">OpenAI\u2019s published figures are also developer-reported and would still benefit from outside replication. 
However, researchers have benchmark names, evaluation conditions and access to the model. No comparable evidence package is available for Watermelon.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">GPT-5.6 has already moved the target<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">On 26 June, OpenAI began a <a href=\"https:\/\/openai.com\/index\/previewing-gpt-5-6-sol\/\" target=\"_blank\" rel=\"noopener noreferrer\">limited preview of the GPT-5.6 series<\/a>. The company says its flagship Sol model improves on GPT-5.5 across several agentic, biology and cybersecurity tasks. Broader availability is planned later, while the initial preview is restricted to a small set of partners.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">That makes GPT-5.5 a clear reference point, but no longer OpenAI\u2019s newest. Even if Meta\u2019s internal parity result is reproducible, it places Watermelon relative to the April model rather than the competitor\u2019s entire current lineup.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">From Muse Spark to Watermelon<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Meta <a href=\"https:\/\/about.fb.com\/news\/2026\/04\/introducing-muse-spark-meta-superintelligence-labs\/\" target=\"_blank\" rel=\"noopener noreferrer\">introduced Muse Spark on 8 April<\/a> as the first model in a new series from Meta Superintelligence Labs. The company describes it as a compact, fast system for complex reasoning and multimodal tasks. Muse Spark powers Meta AI, and a larger next generation was officially in development at the time of the announcement.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The link between the Avocado codename and Muse Spark, as well as the Watermelon name, comes from media reports about internal projects. 
Meta\u2019s official pages do not yet give the next generation a public name, release date or model card.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">What ten times more compute could mean<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">More compute can support a larger model, more training data or a longer training run. Scaling does not guarantee a proportional quality gain. The outcome also depends on architecture, data, optimization algorithms and post-training.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Meta\u2019s own <a href=\"https:\/\/about.fb.com\/news\/2026\/06\/what-is-compute-power-meta-ai-infrastructure\/\" target=\"_blank\" rel=\"noopener noreferrer\">explanation of its computing infrastructure<\/a> describes model training and serving as a combination of GPUs, custom chips, networks and data centers. A \u201ccompute\u201d figure without a unit is therefore a scale indicator rather than a technical specification.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Meta expects up to $145 billion in capital expenditure<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">In its first-quarter results, Meta <a href=\"https:\/\/investor.atmeta.com\/investor-news\/press-release-details\/2026\/Meta-Reports-First-Quarter-2026-Results\/\" target=\"_blank\" rel=\"noopener noreferrer\">raised expected 2026 capital expenditure to $125\u2013145 billion<\/a>. It attributed the revision to higher component prices and additional data-center costs needed for future capacity.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The range includes principal payments on finance leases and covers company infrastructure broadly. 
It should not be described as Watermelon\u2019s budget or assigned entirely to generative AI.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">What would verify Meta\u2019s claim<\/h2>\n\n\n\n<ul class=\"wp-block-list\"><li>benchmark names, dataset versions and complete run conditions;<\/li><li>results for multiple models under the same compute budget;<\/li><li>cost, latency and number of attempts used to produce each score;<\/li><li>a system card describing limitations and safety evaluations;<\/li><li>access for independent researchers or a public API.<\/li><\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Until those materials appear, Watermelon is best described as a promising but unverified model. The internal result may be an important milestone for Meta, but it does not yet establish a new balance of power in the market.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The model race also involves safety, effects on users and interpretation of system behaviour. Cifrum.kz separately examined <a href=\"https:\/\/cifrum.kz\/en\/ai-consciousness-anthropic-google-meta-research\/\">why technology companies are studying possible AI consciousness without claiming it has been detected<\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Sources:<\/strong> the <a href=\"https:\/\/www.investing.com\/news\/stock-market-news\/metas-wang-says-coming-ai-model-has-caught-up-with-openai-business-insider-4774872\" target=\"_blank\" rel=\"noopener noreferrer\">Business Insider report as carried by Investing.com<\/a>, <a href=\"https:\/\/about.fb.com\/news\/2026\/04\/introducing-muse-spark-meta-superintelligence-labs\/\" target=\"_blank\" rel=\"noopener noreferrer\">Meta\u2019s Muse Spark announcement<\/a>, <a href=\"https:\/\/investor.atmeta.com\/investor-news\/press-release-details\/2026\/Meta-Reports-First-Quarter-2026-Results\/\" target=\"_blank\" rel=\"noopener noreferrer\">Meta\u2019s first-quarter results<\/a>, the <a href=\"https:\/\/openai.com\/index\/introducing-gpt-5-5\/\" 
target=\"_blank\" rel=\"noopener noreferrer\">GPT-5.5 announcement<\/a> and the <a href=\"https:\/\/openai.com\/index\/previewing-gpt-5-6-sol\/\" target=\"_blank\" rel=\"noopener noreferrer\">GPT-5.6 preview.<\/a><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em>The lead image was created with artificial intelligence for Cifrum.kz as a conceptual editorial illustration. It does not depict actual Meta or OpenAI servers or verify benchmark results. The infographic was produced by Cifrum.kz.<\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Related analysis:<\/strong> compare that undisclosed benchmark claim with <a href=\"https:\/\/cifrum.kz\/en\/google-tabfm-zero-shot-tabular-data-predictions\/\">Google TabFM\u2019s public code, model access and benchmark evidence<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Business Insider reports that Alexandr Wang told Meta staff Watermelon had matched GPT-5.5. The benchmarks, results and independent verification remain unavailable.<\/p>\n","protected":false},"author":1,"featured_media":6814,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)",
"background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"rank_math_focus_keyword":"Meta Watermelon,GPT-5.5,Alexandr Wang,Muse Spark,Avocado,AI 
benchmarks,GPT-5.6","rank_math_title":"Meta Watermelon vs GPT-5.5: what is known so far today","rank_math_description":"Meta says Watermelon matches GPT-5.5, but the model is still training, benchmarks are undisclosed and no independent verification exists. Here is what is known.","rank_math_canonical_url":"","rank_math_seo_score":"","rank_math_pillar_content":"","rank_math_facebook_title":"","rank_math_facebook_description":"","rank_math_facebook_image":"","rank_math_facebook_image_id":"","rank_math_twitter_title":"","rank_math_twitter_description":"","rank_math_twitter_image":"","rank_math_twitter_image_id":"","rank_math_news_sitemap_genre":"","rank_math_news_sitemap_keywords":"","rank_math_news_sitemap_stock_tickers":"","rank_math_robots":null,"rank_math_advanced_robots":"","rank_math_schema_News":"","footnotes":""},"categories":[2104,11],"tags":[],"cifrum_os_content_type":[],"class_list":["post-6818","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence-en","category-digitalization-news-on-digital-rum"],"acf":[],"_links":{"self":[{"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/posts\/6818","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/comments?post=6818"}],"version-history":[{"count":4,"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/posts\/6818\/revisions"}],"predecessor-version":[{"id":6855,"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/posts\/6818\/revisions\/6855"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/media\/6814"}],"wp:attachment":[{"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/media?parent=6818"}],"wp:term":[{"taxonomy":"categor
y","embeddable":true,"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/categories?post=6818"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/tags?post=6818"},{"taxonomy":"cifrum_os_content_type","embeddable":true,"href":"https:\/\/cifrum.kz\/en\/wp-json\/wp\/v2\/cifrum_os_content_type?post=6818"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}