Discussion Intel Meteor, Arrow, Lunar & Panther Lakes Discussion Threads

Page 305 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Tigerick

Senior member
Apr 1, 2022
679
559
106






As Hot Chips 34 starting this week, Intel will unveil technical information of upcoming Meteor Lake (MTL) and Arrow Lake (ARL), new generation platform after Raptor Lake. Both MTL and ARL represent new direction which Intel will move to multiple chiplets and combine as one SoC platform.

MTL also represents new compute tile that based on Intel 4 process which is based on EUV lithography, a first from Intel. Intel expects to ship MTL mobile SoC in 2023.

ARL will come after MTL so Intel should be shipping it in 2024, that is what Intel roadmap is telling us. ARL compute tile will be manufactured by Intel 20A process, a first from Intel to use GAA transistors called RibbonFET.



Comparison of upcoming Intel's U-series CPU: Core Ultra 100U, Lunar Lake and Panther Lake

ModelCode-NameDateTDPNodeTilesMain TileCPULP E-CoreLLCGPUXe-cores
Core Ultra 100UMeteor LakeQ4 202315 - 57 WIntel 4 + N5 + N64tCPU2P + 8E212 MBIntel Graphics4
?Lunar LakeQ4 202417 - 30 WN3B + N62CPU + GPU & IMC4P + 4E08 MBArc8
?Panther LakeQ1 2026 ??Intel 18A + N3E3CPU + MC4P + 8E4?Arc12



Comparison of die size of Each Tile of Meteor Lake, Arrow Lake, Lunar Lake and Panther Lake

Meteor LakeArrow Lake (20A)Arrow Lake (N3B)Arrow Lake Refresh (N3B)Lunar LakePanther Lake
PlatformMobile H/U OnlyDesktop OnlyDesktop & Mobile H&HXDesktop OnlyMobile U OnlyMobile H
Process NodeIntel 4Intel 20ATSMC N3BTSMC N3BTSMC N3BIntel 18A
DateQ4 2023Q1 2025 ?Desktop-Q4-2024
H&HX-Q1-2025
Q4 2025 ?Q4 2024Q1 2026 ?
Full Die6P + 8P6P + 8E ?8P + 16E8P + 32E4P + 4E4P + 8E
LLC24 MB24 MB ?36 MB ??8 MB?
tCPU66.48
tGPU44.45
SoC96.77
IOE44.45
Total252.15



Intel Core Ultra 100 - Meteor Lake



As mentioned by Tomshardware, TSMC will manufacture the I/O, SoC, and GPU tiles. That means Intel will manufacture only the CPU and Foveros tiles. (Notably, Intel calls the I/O tile an 'I/O Expander,' hence the IOE moniker.)

 

Attachments

  • PantherLake.png
    283.5 KB · Views: 23,969
  • LNL.png
    881.8 KB · Views: 25,441
Last edited:
Jul 27, 2020
16,820
10,766
106
When it comes to better hardware utilization, for better "AI", just using more CPU cores (that have already been there for years) more efficiently would be one area.

Then there is AVX, AVX512. There are more gaming PCs / CPUs that have AVX512 vs NPU.
This remains to be benchmarked. Can 24 CPU threads in ARL outsmart the embedded NPU with more TOPS?

AVX-512 in Ryzens, yes. But Intel will have to wait till AVX10 for their 2nd chance at AVX-512 in consumer desktops. Until then, they gotta depend on the NPU.
 
Reactions: Tlh97 and carancho

Hulk

Diamond Member
Oct 9, 1999
4,269
2,089
136
Skylake - SunnyCove
micro-ops(decode + uop cache) from 11 to 11 +0%
Dispatch/Rename from 4 to 5 +25%
execution ports from 8 to 10 +25%
With 2xFP/ALU + 2xALU, 1xS/D + 3xAGU
for 2xFP/ALU + 2xALU, 2xS/D + 4xAGU
IPC average +18%

SunnyCove - GoldenCove
micro-ops(decode + uop cache) from 11 to 14 +27%
Dispatch/Rname from 5 to 6 +20%
execution ports from 10 to 12 +20%
With 2xFP/ALU + 2xALU, 2xS/D + 4xAGU
for 3xFP/ALU + 2xALU, 2xS/D + 5xAGU
FPU+ALU from 4 to 5 +25%
IPC average +19%

GoldenCove - LionCove
micro-ops(decode + uop cache) from 14 to 24 +71.4%
Dispatch/Rename from 6 to 8 +33.3%
execution ports from 12 to 18 +50%
With 3xFP/ALU + 2xALU, 2xS/D + 5xAGU
up to 4xFPU, 6xALU, 2xS/D + 6xAGU
FPU+ALU from 5 to 10 +100%
IPC increase +??%
This is great information. Thank you for taking the time to organize and post it.

What is included in "decode + uop cache" and "dispatch/rename?"

How many decoders? How many uop entries?
Reorder buffers? In-Flight Loads/Stores?

I'm not fully following.
 
Last edited:

Saylick

Diamond Member
Sep 10, 2012
3,217
6,585
136
Skylake - SunnyCove
micro-ops(decode + uop cache) from 11 to 11 +0%
Dispatch/Rename from 4 to 5 +25%
execution ports from 8 to 10 +25%
With 2xFP/ALU + 2xALU, 1xS/D + 3xAGU
for 2xFP/ALU + 2xALU, 2xS/D + 4xAGU
IPC average +18%

SunnyCove - GoldenCove
micro-ops(decode + uop cache) from 11 to 14 +27%
Dispatch/Rname from 5 to 6 +20%
execution ports from 10 to 12 +20%
With 2xFP/ALU + 2xALU, 2xS/D + 4xAGU
for 3xFP/ALU + 2xALU, 2xS/D + 5xAGU
FPU+ALU from 4 to 5 +25%
IPC average +19%

GoldenCove - LionCove
micro-ops(decode + uop cache) from 14 to 24 +71.4%
Dispatch/Rename from 6 to 8 +33.3%
execution ports from 12 to 18 +50%
With 3xFP/ALU + 2xALU, 2xS/D + 5xAGU
up to 4xFPU, 6xALU, 2xS/D + 6xAGU
FPU+ALU from 5 to 10 +100%
IPC increase +??%
WCCFTech be like:
 

Joe NYC

Platinum Member
Jun 26, 2021
2,072
2,585
106
This remains to be benchmarked. Can 24 CPU threads in ARL outsmart the embedded NPU with more TOPS?

AVX-512 in Ryzens, yes. But Intel will have to wait till AVX10 for their 2nd chance at AVX-512 in consumer desktops. Until then, they gotta depend on the NPU.

Software support for hardware features lags by too much time to be relevant to make the hardware feature a selling point for the hardware.

Intel now talks a lot about "Centrino moment", but that hardware feature, WIFI, was built to the OS, and was ubiquitous for all applications as part of already widely used networking capabilities.

Support support (for a questionable feature) by 100s of ISV, gaming development teams is not happening in the short run.
 

AMDK11

Senior member
Jul 15, 2019
290
197
116
Fix for Skylake-SunnyCove:

Skylake - SunnyCove
micro-ops(decode + uop cache) from 11 to 11 +0%
Dispatch/Rename from 4 to 5 +25%
execution ports from 8 to 10 +25%
With 2xFP/ALU + 2xALU, 1xS/D + 3xAGU
for 3xFP/ALU + 1xALU, 2xS/D + 4xAGU
IPC average +18%

SunnyCove - GoldenCove
micro-ops(decode + uop cache) from 11 to 14 +27%
Dispatch/Rname from 5 to 6 +20%
execution ports from 10 to 12 +20%
With 3xFP/ALU + 1xALU, 2xS/D + 4xAGU
for 3xFP/ALU + 2xALU, 2xS/D + 5xAGU
FPU+ALU from 4 to 5 +25%
IPC average +19%

GoldenCove - LionCove
micro-ops(decode + uop cache) from 14 to 24 +71.4%
Dispatch/Rename from 6 to 8 +33.3%
execution ports from 12 to 18 +50%
With 3xFP/ALU + 2xALU, 2xS/D + 5xAGU
up to 4xFPU, 6xALU, 2xS/D + 6xAGU
FPU+ALU from 5 to 10 +100%
IPC increase +??%

Two LionCove diagrams from one LunarLake graphic:

 
Last edited:
Reactions: Elfear and Saylick

Ghostsonplanets

Senior member
Mar 1, 2024
388
659
96

AMDK11

Senior member
Jul 15, 2019
290
197
116
This is great information. Thank you for taking the time to organize and post it.

What is included in "decode + uop cache" and "dispatch/rename?"

How many decoders? How many uop entries?
Reorder buffers? In-Flight Loads/Stores?

I'm not fully following.
I gave a general summary of decoding + sending from uop Cache because the LionCove diagram is of too low quality and does not specify how much it is for the decoder.

GoldenCove has 14 uops, including 6 from the decoder and 8 from the uop Cache.

LionCove has 24 uops, but it is not 100% sure whether there are 8 from the decoder and 16 from the uop cache or maybe 10 from the decoder and 14 from the uopcache.

I provided the data that can be read from the LionCove diagram. Much is still unknown.


I won't be surprised at all if there is a 10-Way decoder because Skymont has 3x 3-Way, i.e. a total of 9-Way. But it can also be 8-Way.

I think Intel will present the LionCove microarchitecture in June, and as you can see, they already have core diagrams ready for presentation.
 
Last edited:
Reactions: dr1337

Ghostsonplanets

Senior member
Mar 1, 2024
388
659
96
Not sure if this is reliable. Take with a huge grain of salt.

But, supposedly, ARL-U also gets the new N4P iGP tile from ARL/S (64 EU): N4(P?), XMX units added back and higher clocks.

Basically MTL-U/M with smaller Compute and GPU tile (nodelet shrink to Intel 3 and N4(P?) respectively) for higher yields and cheaper costs.

XMX units will probably provide the TOPs throughout for ARL-U to meet AI PC requirements. Cheaper alternative to Lunar Lake for mainstream U-series designs.
 

Saylick

Diamond Member
Sep 10, 2012
3,217
6,585
136
Holy sheet I had never read their comment section, it truly is below Reddit.

Best quote: "You are the noob as there is nothing wrong with userbenchmark."
If you thought reading that Prakhar guy's tweets made you lose brain cells, WTFTech's comments section devolves you to a Neanderthal.

I refuse to take anyone seriously who regularly posts in their comment's section, even if I see the same people making reasonable, logical statements on Xitter outside of it. I simply will not respect their opinion because they consciously made a decision to be a willing participant in that cesspool to begin with. It's like if I knew someone was actively participating in a Nazi get-together but was a normal behaving human being outside of it. It doesn't matter how that person behaves in a regular setting; they are a Nazi, regardless.

But what can you do. That comment section is probably where they get more than half their ad impressions, and WCCFTech knows it so they will never moderate it.
 

Joe NYC

Platinum Member
Jun 26, 2021
2,072
2,585
106
I don't get it, how is AVX512 any substitute for an NPU ?

Not exactly a substitute, but as a hardware feature for ISV / game developers to use.

It is a very slow process for a hardware feature to become widely used. It's completely unrealistic to expect to have a game released precisely at the time of ARL release supporting a unique feature of ARL.

Example of far more consequential hardware feature: x86-64 instruction set. First introduced in 2003, it took 2 years for first version to support in in 2005 and then it took another decade for the games to start switching to 64 bit.
 
Last edited:
Reactions: spursindonesia

Ghostsonplanets

Senior member
Mar 1, 2024
388
659
96
Mainstream is still raptor.
ARL-U is in a very weird position overall, the only boon is platform comparability with MTL-U.
Right. ARL-U is an odd duck because it still won't be cheap for mainstream, even with the nodelet shrink. But it also can't be priced as premium because the the product doesn’t justify.

Intel lineup will be in a weird position next year

LNL - >$999
ARL-U -> $700 - $900
RPL-U -> $600 and below
 

Hulk

Diamond Member
Oct 9, 1999
4,269
2,089
136
I gave a general summary of decoding + sending from uop Cache because the LionCove diagram is of too low quality and does not specify how much it is for the decoder.

GoldenCove has 14 uops, including 6 from the decoder and 8 from the uop Cache.

LionCove has 24 uops, but it is not 100% sure whether there are 8 from the decoder and 16 from the uop cache or maybe 10 from the decoder and 14 from the uopcache.

I provided the data that can be read from the LionCove diagram. Much is still unknown.


I won't be surprised at all if there is a 10-Way decoder because Skymont has 3x 3-Way, i.e. a total of 9-Way. But it can also be 8-Way.

I think Intel will present the LionCove microarchitecture in June, and as you can see, they already have core diagrams ready for presentation.
Thanks, that makes perfect sense. They are really opening up both the front end and back end of Lion Cove. I still think the IPC increase will be 20% as that seems to always be Intel's target, maybe 25% if things really work out well for them. That 24uopss is a huge increase, which makes me think the front end is currently the bottleneck.
 

Henry swagger

Senior member
Feb 9, 2022
389
246
86

"From what I hear LNL has good number of design wins in last quater...."

"Not going to put numbers because, earlier design wins itself are high... Having something that big on top of them .... I am not confident enough"

Lunar Lake getting a lot of design wins. It tracks with previous leaks that LNL would have 3x as much design wins as Meteor Lake.
Ofcouse LNL will have design wins.. intel has 88% of market share even apple will superior battery life can't eat into it.. LNL eill be intel's m1 for x86
 

DrMrLordX

Lifer
Apr 27, 2000
21,709
10,983
136
All of that seems fine and good, and would certainly negate the need for trained human supervision. I'm perhaps a little jaded on that whole aspect because this is all work I've been trained to do throughout my adult life: finding trends in data, editing photos, concise and clear communications... That's a broadcasting and journalism degree, in so many words. I guess I'm not the target audience, lol.

The goal here is not to necessarily replace you (the operator) performing those functions, but to instead make it easier for you to do the things you already know how to do. Ideally so that someone trained as you are can do the work of 2-3 people, meaning that your boss then gets to lay off some of your colleagues without raising your pay by much, if at all. Isn't that great?

And that is why there's so much hype around NPUs!
 

DavidC1

Senior member
Dec 29, 2023
203
284
96
Is it possible that Skymont might get a micro-op cache?
No. If anything uop caches will be sharing the fate of Hyperthreading and be axed in the future.

Uop caches are a remnant of the extreme clock focus of Netburst. Rather than thinking it brings more performance, you should think of it as maintaining performance while allowing it to raise clocks.
I won't be surprised at all if there is a 10-Way decoder because Skymont has 3x 3-Way, i.e. a total of 9-Way. But it can also be 8-Way.

I think Intel will present the LionCove microarchitecture in June, and as you can see, they already have core diagrams ready for presentation.
Raichu and others said 8-way decode for Lion Cove.

Also, for Skymont to be 8-way, it has to be 2x4. Since the cluster approach is to reduce impact of power/area of decoders, it makes no sense as Intel said cluster 3-way minimizes impact of decoders. Raichu might have thought it was 8-way in the beginning, but later clarified that its 3x3-way.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |