Discussion Intel Meteor, Arrow, Lunar & Panther Lakes Discussion Threads

Page 205 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Tigerick

Senior member
Apr 1, 2022
679
559
106






As Hot Chips 34 starting this week, Intel will unveil technical information of upcoming Meteor Lake (MTL) and Arrow Lake (ARL), new generation platform after Raptor Lake. Both MTL and ARL represent new direction which Intel will move to multiple chiplets and combine as one SoC platform.

MTL also represents new compute tile that based on Intel 4 process which is based on EUV lithography, a first from Intel. Intel expects to ship MTL mobile SoC in 2023.

ARL will come after MTL so Intel should be shipping it in 2024, that is what Intel roadmap is telling us. ARL compute tile will be manufactured by Intel 20A process, a first from Intel to use GAA transistors called RibbonFET.



Comparison of upcoming Intel's U-series CPU: Core Ultra 100U, Lunar Lake and Panther Lake

ModelCode-NameDateTDPNodeTilesMain TileCPULP E-CoreLLCGPUXe-cores
Core Ultra 100UMeteor LakeQ4 202315 - 57 WIntel 4 + N5 + N64tCPU2P + 8E212 MBIntel Graphics4
?Lunar LakeQ4 202417 - 30 WN3B + N62CPU + GPU & IMC4P + 4E08 MBArc8
?Panther LakeQ1 2026 ??Intel 18A + N3E3CPU + MC4P + 8E4?Arc12



Comparison of die size of Each Tile of Meteor Lake, Arrow Lake, Lunar Lake and Panther Lake

Meteor LakeArrow Lake (20A)Arrow Lake (N3B)Arrow Lake Refresh (N3B)Lunar LakePanther Lake
PlatformMobile H/U OnlyDesktop OnlyDesktop & Mobile H&HXDesktop OnlyMobile U OnlyMobile H
Process NodeIntel 4Intel 20ATSMC N3BTSMC N3BTSMC N3BIntel 18A
DateQ4 2023Q1 2025 ?Desktop-Q4-2024
H&HX-Q1-2025
Q4 2025 ?Q4 2024Q1 2026 ?
Full Die6P + 8P6P + 8E ?8P + 16E8P + 32E4P + 4E4P + 8E
LLC24 MB24 MB ?36 MB ??8 MB?
tCPU66.48
tGPU44.45
SoC96.77
IOE44.45
Total252.15



Intel Core Ultra 100 - Meteor Lake



As mentioned by Tomshardware, TSMC will manufacture the I/O, SoC, and GPU tiles. That means Intel will manufacture only the CPU and Foveros tiles. (Notably, Intel calls the I/O tile an 'I/O Expander,' hence the IOE moniker.)

 

Attachments

  • PantherLake.png
    283.5 KB · Views: 23,969
  • LNL.png
    881.8 KB · Views: 25,441
Last edited:

eek2121

Platinum Member
Aug 2, 2005
2,934
4,033
136
x86-64 apps on ARM require virtualization. Not exactly an ideal situation as the user experience suffers. But we can get most of the apps to working to a certain level but requires patience. Not just MS, but top software companies should start coming out with native ARM apps. Native is still the best.
Emulation, not virtualization. Important difference. The entirety of both x86 and x86-64 instruction sets have to be emulated. Virtualization runs most non-privileged instructions natively and thus can execute code at near native speed.

You are looking at a good 30-50% performance hit for emulated applications.

MTL is a massive battery life win? It clearly depends on laptop, Acer Swift Go 14 SFG14-72 ended up much worse in comparison despite having the same CPU.
On the other hand, MSI did a great job in selecting power efficient components and putting the biggest battery possible inside.

BTW, this was in what you posted.
View attachment 91196
Raptor Lake U managed 954 minutes with only a 68W battery, If I normalized It to 99.9W then It should manage 1401 minutes.
Not really apples to apples comparison for both CPU and laptop, but I wanted to show that It's not really depended on CPU, because they are very efficient light loads, but more on the rest of components(SSD, RAM, display).

A few here have mentioned a microcode update which improves performance (I haven’t looked into it). Not every vendor apparently has implemented this. I would check back sometime in January. At least some benchmarks at various reviewers should be updated.
 

eek2121

Platinum Member
Aug 2, 2005
2,934
4,033
136
A single thread that can be perfectly sliced by RU can be sliced perfectly by the programmer or the compiler as well. You asked for the ideal situation, that's what happens when things are pushed to the limit.
RU has been around in various forms for many years and nobody has been able to make it work. I will be surprised if Intel does.
 
Reactions: SiliconFly

SiliconFly

Golden Member
Mar 10, 2023
1,062
548
96
RU has been around in various forms for many years and nobody has been able to make it work. I will be surprised if Intel does.
Totally agree. Thats what I was thinking too. Getting RU to work is very far-fetched. And the only person who's spoken about RU is MLID (similar articles were based on his leak). So, makes me think, is the leak even legit? Looks like a dud.
 

SiliconFly

Golden Member
Mar 10, 2023
1,062
548
96
A single thread that can be perfectly sliced by RU can be sliced perfectly by the programmer or the compiler as well. You asked for the ideal situation, that's what happens when things are pushed to the limit.
Programmers? Compilers? i thought we were talking about runtime code execution by the CPU. Not writing & compiling them.
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,373
2,868
136
A few here have mentioned a microcode update which improves performance (I haven’t looked into it). Not every vendor apparently has implemented this. I would check back sometime in January. At least some benchmarks at various reviewers should be updated.
You do realize I am talking about Wifi battery life and not performance, right?
What does It have in common with that microcode? You expect It will also increase wifi battery life?
 

naukkis

Senior member
Jun 5, 2002
722
610
136
A single thread that can be perfectly sliced by RU can be sliced perfectly by the programmer or the compiler as well. You asked for the ideal situation, that's what happens when things are pushed to the limit.

No it can't. Rentable units approach can slice workload within thread - if implementation shares register file slicing can happen in register level. But to fully utilize such a approach there's need to really complex front-end or instruction set that support execution partitioning scheme.
 

maddie

Diamond Member
Jul 18, 2010
4,772
4,739
136
It seems that "Rentable units" is simply "Intelese" for the old concept of "reverse hyperthreading". Softmachines and their technology was discussed here many years ago and Anandtech even had an article. AMD, Intel and several other big names were investors until Intel bought the company outright. I always wondered when we would see real world results and I guess it will be soon.

For those interested and not wanting to make ridiculous ignorant claims, here it is. https://www.anandtech.com/print/10025/examining-soft-machines-architecture-visc-ipc
 

mikk

Diamond Member
May 15, 2012
4,152
2,164
136
Those scores don't look like they are for sustained 30W.
Both CB R15 and R23 is practically the same as MSI Prestige, which uses the same CPU and has 110W PL2 and 44W PL1.

The MIS bios is almost 3 months old, does it even run with the pcode update? Hwinfo says 30W current and max 55W in their video. One CB R23 run with this score requires maybe 50 seconds, you have to keep in mind that the PL2 has a big effect if it's a first run score. 155H at 55W can do slightly over 16K with the update pcode. If it run 30 seconds with 55W it could be possible to reach high 14k scores.
 

SiliconFly

Golden Member
Mar 10, 2023
1,062
548
96
It seems that "Rentable units" is simply "Intelese" for the old concept of "reverse hyperthreading". Softmachines and their technology was discussed here many years ago and Anandtech even had an article. AMD, Intel and several other big names were investors until Intel bought the company outright. I always wondered when we would see real world results and I guess it will be soon.

For those interested and not wanting to make ridiculous ignorant claims, here it is. https://www.anandtech.com/print/10025/examining-soft-machines-architecture-visc-ipc
Nothing ridiculous or ignorant about any of the claims. They're perfectly in line with the article you posted. A very interesting piece. It makes one thing very clear that RU may actually be real and might debut with a future Intel product. Thats all.

It says what was exactly discussed before, slicing a single thread into smaller pieces and executing the pieces simultaneously across multiple cores. Soft machine's implementation uses a rather complex and novel approach that we didn't come across earlier.

It uses a translation layer to translate the existing machine code into its own proprietary instructions and then feeds that translated code to a Global Front End (more like splitter+scheduler combined) which then slices the thread into smaller pieces and feeds the pieces to multiple virtual cores (probably emulated).

Like what was discussed earlier, this has the potential to increase ST performance a lot. We do not know the performance penalty of this approach, but the article says it's not much (???). So, if Intel manages to successfully combine 2 cores into a single cluster with RU, it can increase a threads ST performance beyond a single cores ST performance. Maybe by upto 2X under "ideal" conditions. Just guessing.

But the implementation looks rather messy and might over-complicate the architecture and turn it into a sh%t show. Just my opinion.

Also, a hypothetical i9-13900K with RU can have a single core Geekbench 6 score of 5000 or above!
 
Last edited:
Reactions: Henry swagger

naukkis

Senior member
Jun 5, 2002
722
610
136
It seems that "Rentable units" is simply "Intelese" for the old concept of "reverse hyperthreading". Softmachines and their technology was discussed here many years ago and Anandtech even had an article. AMD, Intel and several other big names were investors until Intel bought the company outright. I always wondered when we would see real world results and I guess it will be soon.

For those interested and not wanting to make ridiculous ignorant claims, here it is. https://www.anandtech.com/print/10025/examining-soft-machines-architecture-visc-ipc

Visc is that instruction set supporting execution partitioning. It ain't coming into x86. But, as cpu fron-ends are now extremely complex such a front-end could theoretically extract two independent instruction streams within loops and execute them in independent execution units. Such a approach would probably need shared register file - or at least direct input-output ports between register files to be able extract performance benefits. Relying normal data load/storing would make such a approach only racing from resources. But it's sure doable for x86 too.
 
Reactions: Gideon

maddie

Diamond Member
Jul 18, 2010
4,772
4,739
136
Nothing ridiculous or ignorant about any of the claims. They're perfectly in line with the article you posted. A very interesting piece. It makes one thing very clear that RU may actually be real and might debut with a future Intel product. Thats all.

It says what was exactly discussed before, slicing a single thread into smaller pieces and executing the pieces simultaneously across multiple cores. Soft machine's implementation uses a rather complex and novel approach that we didn't come across earlier.

It uses a translation layer to translate the existing machine code into its own proprietary instructions and then feeds that translated code to a Global Front End (more like splitter+scheduler combined) which then slices the thread into smaller pieces and feeds the pieces to multiple virtual cores (probably emulated).

Like what was discussed earlier, this has the potential to increase ST performance a lot. We do not know the performance penalty of this approach, but the article says it's not much (???). So, if Intel manages to successfully combine 2 cores into a single cluster with RU, it can increase a threads ST performance beyond a single cores ST performance. Maybe by upto 2X under "ideal" conditions. Just guessing.

But the implementation looks rather messy and might over-complicate the architecture and turn it into a sh%t show. Just my opinion.
Seems I read this recently. 8X 1T performance increase for 8 core CPU?
 

SiliconFly

Golden Member
Mar 10, 2023
1,062
548
96
Seems I read this recently. 8X 1T performance increase for 8 core CPU?
Nope. Hypothetically speaking if Intel has a 8 core RU implementation, then yes. In real world, no. I don't think they even have a 8 core RU concept in the drawing board as of now. Maybe a 2 core RU implementation like what one leak suggested.
 

maddie

Diamond Member
Jul 18, 2010
4,772
4,739
136
Visc is that instruction set supporting execution partitioning. It ain't coming into x86. But, as cpu fron-ends are now extremely complex such a front-end could theoretically extract two independent instruction streams within loops and execute them in independent execution units. Such a approach would probably need shared register file - or at least direct input-output ports between register files to be able extract performance benefits. Relying normal data load/storing would make such a approach only racing from resources. But it's sure doable for x86 too.
That's why it says VISC-like. A new concept is often described in terms of known things, as the new unique words don't exist yet.
 

SiliconFly

Golden Member
Mar 10, 2023
1,062
548
96
That's why it says VISC-like. A new concept is often described in terms of known things, as the new unique words don't exist yet.

Visc is that instruction set supporting execution partitioning. It ain't coming into x86. But, as cpu fron-ends are now extremely complex such a front-end could theoretically extract two independent instruction streams within loops and execute them in independent execution units. Such a approach would probably need shared register file - or at least direct input-output ports between register files to be able extract performance benefits. Relying normal data load/storing would make such a approach only racing from resources. But it's sure doable for x86 too.
...It uses a translation layer to translate the existing machine code into its own proprietary instructions and then feeds that translated code to a Global Front End...

This tech can work on any processor. ARM or Intel.
 

maddie

Diamond Member
Jul 18, 2010
4,772
4,739
136
Nope. Hypothetically speaking if Intel has a 8 core RU implementation, then yes. In real world, no. I don't think they even have a 8 core RU concept in the drawing board as of now. Maybe a 2 core RU implementation like what one leak suggested.
Do you think I'm claiming the 800% speedup?

...It uses a translation layer to translate the existing machine code into its own proprietary instructions and then feeds that translated code to a Global Front End...

This tech can work on any processor. ARM or Intel.
Yes, as nearly all, if not all, computing technologies.
 

naukkis

Senior member
Jun 5, 2002
722
610
136
...It uses a translation layer to translate the existing machine code into its own proprietary instructions and then feeds that translated code to a Global Front End...

This tech can work on any processor. ARM or Intel.

Sure they can have transmeta-style executing layer which then also distributes that code into multiple cores. But we haven't yet had transmeta-styled cpu for one core rivaling hardware-based cpus performance so that route sure won't go anywhere. VISC needs data partitioning in instruction set to work.
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,373
2,868
136
The MIS bios is almost 3 months old, does it even run with the pcode update? Hwinfo says 30W current and max 55W in their video. One CB R23 run with this score requires maybe 50 seconds, you have to keep in mind that the PL2 has a big effect if it's a first run score. 155H at 55W can do slightly over 16K with the update pcode. If it run 30 seconds with 55W it could be possible to reach high 14k scores.
From that image you originally posted, the viewer would think that It was at 30W for the whole run but It wasn't, because PL2 was affecting It quite a bit.
That was my point.
 
Last edited:

SiliconFly

Golden Member
Mar 10, 2023
1,062
548
96
Sure they can have transmeta-style executing layer which then also distributes that code into multiple cores. But we haven't yet had transmeta-styled cpu for one core rivaling hardware-based cpus performance so that route sure won't go anywhere. VISC needs data partitioning in instruction set to work.
Actually, the article specifically states that they've overcome the transmeta-style translation bottleneck. The more worrying part is not the translation layer, but the virtual cores itself. God only know it's overhead, but the article again claims even that issue has been solved. Doesn't sound very feasible to me and I believe the overheads will be very significant. Not sure.
 

SiliconFly

Golden Member
Mar 10, 2023
1,062
548
96
Grabs popcorn.
Thats the whole point of this discussion. Rentable Units. Sounds like magic, cos it may not be real at all. Intel may never get it to work due to the complexities involved. But if it is real, then these are the expected numbers.

The whole discussion thats happening is based on "If it is real". Maybe. Maybe not. Who knows!

Either way, time to give RU a break. Even if it's real, we aren't gonna see it for many years.
 
Last edited:

SiliconFly

Golden Member
Mar 10, 2023
1,062
548
96
Do you think I'm claiming the 800% speedup?
What I meant was, in a hypothetical scenario, where RU exists in real world, and Intel manages to get it working and manages to overcome all the issues related to its complexity, then thats the the theoretical maximum that we can expect in a 8 core cluster with RU. Just hypothesizing. It's to explain the significance of RU. Nothing more. Not a real world scenario.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |