Discussion Intel Meteor, Arrow, Lunar & Panther Lakes Discussion Threads

Page 122 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Tigerick

Senior member
Apr 1, 2022
683
565
106






As Hot Chips 34 starting this week, Intel will unveil technical information of upcoming Meteor Lake (MTL) and Arrow Lake (ARL), new generation platform after Raptor Lake. Both MTL and ARL represent new direction which Intel will move to multiple chiplets and combine as one SoC platform.

MTL also represents new compute tile that based on Intel 4 process which is based on EUV lithography, a first from Intel. Intel expects to ship MTL mobile SoC in 2023.

ARL will come after MTL so Intel should be shipping it in 2024, that is what Intel roadmap is telling us. ARL compute tile will be manufactured by Intel 20A process, a first from Intel to use GAA transistors called RibbonFET.



Comparison of upcoming Intel's U-series CPU: Core Ultra 100U, Lunar Lake and Panther Lake

ModelCode-NameDateTDPNodeTilesMain TileCPULP E-CoreLLCGPUXe-cores
Core Ultra 100UMeteor LakeQ4 202315 - 57 WIntel 4 + N5 + N64tCPU2P + 8E212 MBIntel Graphics4
?Lunar LakeQ4 202417 - 30 WN3B + N62CPU + GPU & IMC4P + 4E08 MBArc8
?Panther LakeQ1 2026 ??Intel 18A + N3E3CPU + MC4P + 8E4?Arc12



Comparison of die size of Each Tile of Meteor Lake, Arrow Lake, Lunar Lake and Panther Lake

Meteor LakeArrow Lake (20A)Arrow Lake (N3B)Arrow Lake Refresh (N3B)Lunar LakePanther Lake
PlatformMobile H/U OnlyDesktop OnlyDesktop & Mobile H&HXDesktop OnlyMobile U OnlyMobile H
Process NodeIntel 4Intel 20ATSMC N3BTSMC N3BTSMC N3BIntel 18A
DateQ4 2023Q1 2025 ?Desktop-Q4-2024
H&HX-Q1-2025
Q4 2025 ?Q4 2024Q1 2026 ?
Full Die6P + 8P6P + 8E ?8P + 16E8P + 32E4P + 4E4P + 8E
LLC24 MB24 MB ?36 MB ??8 MB?
tCPU66.48
tGPU44.45
SoC96.77
IOE44.45
Total252.15



Intel Core Ultra 100 - Meteor Lake



As mentioned by Tomshardware, TSMC will manufacture the I/O, SoC, and GPU tiles. That means Intel will manufacture only the CPU and Foveros tiles. (Notably, Intel calls the I/O tile an 'I/O Expander,' hence the IOE moniker.)

 

Attachments

  • PantherLake.png
    283.5 KB · Views: 23,971
  • LNL.png
    881.8 KB · Views: 25,443
Last edited:

AMDK11

Senior member
Jul 15, 2019
314
206
116
Returning to the RedwoodCove core, I believe that the diagram with changes in the x86 P-core in the GraniteRapids presentation and the data from the MeteorLake presentation complement each other, but do not yet reveal everything that has been changed in RedwoodCove. I really have no reason at this point to think that IPC is substantially similar to Golden/RaptorCove. Why all these changes if the IPC is the same? In my opinion, the issue with RedwoodCove is not whether the IPC is higher, but how much higher it is.

Historically, Core 2 (Conroe - 2006) introduced 33% wider x86 decoding from 3-way to 4-way.We have 4-way decoding up to Sunny/CypressCove, i.e. 2020. That's about 14-15 years of generational improvements, leaving 3 ALUs up to IvyBridge as Haswell introduced 4 ALUs.

Only GoldenCove (2021) introduced wider decoding by as much as 50%, i.e. it moved from a 4-way to 6-way x86 instruction decoder and increased the number of ALU units by 25% from 4 to 5.

I think Intel admitted that the transition from GoldenCove to RedwoodCove is not as serious a change as the transition from SunnyCove to GoldenCove, but it does not rule out IPC gains, because as part of 6-way decoding, through appropriate optimization and core expansion,you can still gain a lot, so I'm not eliminating the higher IPC from RedwoodCove.just because it doesn't have 8-way decoding or 6-8 ALUs.I think optimization and core expansion gives the IPC a gain, the only question is how much.

Personally, I think (this is my feeling) that with the transition to 6-way x86 decoding in Zen 5, AMD will break ground for at least a few generations, where the IPC increase will definitely not be just +5%.I have the impression that AMD whether Intel, despite the rumors, will not switch to more than a 6-way decoder in Zen 6-7 or LionCove-PantherCove.8-way x86 decoding is rather a distant future.But I may be wrong.
 
Last edited:
Reactions: cebri1

Doug S

Platinum Member
Feb 8, 2020
2,370
3,787
136
…except ASUS, Gigabyte, ASRock, and MSI. Why they don’t invest more in smaller form factors is baffling to me. I have a mini PC not much bigger than a Raspberry Pi + case sitting on my desk and it is absolutely fantastic. If I weren’t a gamer, developer, and content creator, it is all I would need. (it can game, just not anything serious)

Selling boards with the CPU soldered on is a risk for them. They can't offer all CPU SKUs so they would have to pick a few, and then hope they sell well enough they don't get stuck with a bunch of boards with a particular offering that proves unpopular. A $100 board without a CPU sitting in inventory can be discounted 20% and that discount costs them $20. A $100 board with a $200 CPU discounted 20% costs them $60 - and if MTL has been replaced with something better they will have to discount a lot more than 20% to sell it. Whereas if the socket is compatible they might merely need a firmware update then that board can be used with the "something better" so that 20% discount will easily sell it.

Maybe they could do it build to order like laptops are (sometimes) made, but motherboard companies have never done that so they'd have to set themselves up to handle direct customer orders and the build to order process in their factory. They would have to believe the market size is big enough to be worth it.
 

cebri1

Member
Jun 13, 2019
126
133
116
GLC on Intel 7 appears to be as efficient (or marginally less) efficient than Zen 3. If RWC is as efficient as Zen 4, I would be pretty impressed tbh, because Zen 4 increased efficiency from node, but then also increased efficiency through architecture. While I think it's likely RWC has slightly higher architectural perf/watt than GLC as well, I doubt it's anywhere near where Zen 4 got vs Zen 3, and thus if RWC is matching Zen 4 in perf/watt, it's because the node turned out better than expected (or Intel's DTCO team went hard)

Nothing to add really, but always good to remember how efficient an 13900K can be if you let it (at the cost of losing the performance crown).

 
Reactions: controlflow

deasd

Senior member
Dec 31, 2013
528
806
136
GLC on Intel 7 appears to be as efficient (or marginally less) efficient than Zen 3. If RWC is as efficient as Zen 4, I would be pretty impressed tbh, because Zen 4 increased efficiency from node, but then also increased efficiency through architecture. While I think it's likely RWC has slightly higher architectural perf/watt than GLC as well, I doubt it's anywhere near where Zen 4 got vs Zen 3, and thus if RWC is matching Zen 4 in perf/watt, it's because the node turned out better than expected (or Intel's DTCO team went hard)

Intel4/RWC needs magic to do so.

As someone tested, at 5.2Ghz 1T Zen4 use only 30% of power of a GLC(or RaptorCove) and 2T <50% of power.

Nobody could tell if GLC is bugged or not. If yes RWC has a chance to get fix.





 
Reactions: igor_kavinski

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
Nothing to add really, but always good to remember how efficient an 13900K can be if you let it (at the cost of losing the performance crown).

View attachment 86386


At 88W the 7950X score 11863 pts, that s 134.8 pts/W, so this graph is BS.

 

cebri1

Member
Jun 13, 2019
126
133
116
At 88W the 7950X score 11863 pts, that s 134.8 pts/W, so this graph is BS.


No it's not redacted and der8auer is pretty well respected tester. Sorry for the facts.

We have a zero tolerance for profanity in the tech forums.
Please don't do that again.

Iron Woode

Super Moderator
 
Last edited by a moderator:

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
No it's not redacted and der8auer is pretty well respected tester. Sorry for the facts.

The fact is that he did implement 90W TDP wich lead to 121.5W PPT, that s the actual power at wich he did set the 7950X, give us the link, we ll see that his score is quite higher than the one of Computerbase at 88W and amount to a test at said 121.5W.

And btw, Computerbase are known for their precision, you can see on the link that i posted that they do a lot of tests at different powers.

Edit : There s no 90W eco mode on the 7950X, the stock eco mode is at 105W TDP/142W PPT, if one set the TDP at 90W then the CPU will push up to its PPT wich is TDP x 1.35.
 
Last edited by a moderator:

AMDK11

Senior member
Jul 15, 2019
314
206
116
I think I know where MLID got the information that GraniteRapids is RedwoodCove+, which is not true.He probably saw these slides from Intel Innovation 23 before and thought that since there are more changes in the description of RedwoodCove on GraniteRapids and when presenting the new P core, MeteorLake is mentioned only L1 64KB, he misinterpreted it as RedwoodCove in MTL it's basically thesame as Golden/RaptorCove, and since more changes were described in GranitaRapids, he assumed it was RedwoodCove+.But this is not true and the guy is misinterpreting the data from his source, which was has already been demonstrated many times.Moreover, he himself admitted that he is not familiar with technical issues and confuses IPC with efficiency and energy efficiency.

EDIT:
RedwoodCove Core:
- Larger instruction cache: 64KB 16-way (Golden/RaptorCove L1-I 32KB 8-way)
- Improved code prefetching
- Smarter code prefetching engine
- Improved branch prediction accuracy and reduced miss-hit penalty
- Micro-op queue 192 for ST or 2x96 for SMT
Golden/RaptorCove 144 ST/2x 72 SMT
Sunny/CypressCove 70 ST/2x 70 SMT
Skylake 64 ST/2x 64 SMT
Haswell 56 ST/2x 56 SMT
- Greater performance, lower instruction latency FPU with 4-5 to 3 (logic extension)
- AMX supports TF32 and FP16
- New L2 design, more preferred LLC(L3) prefetch engine, higher throughput.

I believe that there are more changes, but if they do not bring an increase in IPC, I will be very disappointed.Meanwhile, I'm waiting for the premiere.
 
Last edited:

PJVol

Senior member
May 25, 2020
573
498
136
Last edited:

Geddagod

Golden Member
Dec 28, 2021
1,165
1,049
106
I really have no reason at this point to think that IPC is substantially similar to Golden/RaptorCove. Why all these changes if the IPC is the same? In my opinion, the issue with RedwoodCove is not whether the IPC is higher, but how much higher it is.
Intel themselves are saying RWC is esentially just a GLC port in interviews. There's not going to be much of an IPC increase with RWC. Crestmont has a 4-6% IPC increase, and they listed it out as having an IPC increase, and the fact that they didn't bother with RWC means that the IPC increase for that is almost certainly even lower than 4-6%.
 
Reactions: Kepler_L2 and inf64

inf64

Diamond Member
Mar 11, 2011
3,713
4,088
136
I think I know where MLID got the information that GraniteRapids is RedwoodCove+, which is not true.He probably saw these slides from Intel Innovation 23 before and thought that since there are more changes in the description of RedwoodCove on GraniteRapids and when presenting the new P core, MeteorLake is mentioned only L1 64KB, he misinterpreted it as RedwoodCove in MTL it's basically thesame as Golden/RaptorCove, and since more changes were described in GranitaRapids, he assumed it was RedwoodCove+.But this is not true and the guy is misinterpreting the data from his source, which was has already been demonstrated many times.Moreover, he himself admitted that he is not familiar with technical issues and confuses IPC with efficiency and energy efficiency.

EDIT:
RedwoodCove Core:
- Larger instruction cache: 64KB 16-way (Golden/RaptorCove L1-I 32KB 8-way)
- Improved code prefetching
- Smarter code prefetching engine
- Improved branch prediction accuracy and reduced miss-hit penalty
- Micro-op queue 192 for ST or 2x96 for SMT
Golden/RaptorCove 144 ST/2x 72 SMT
Sunny/CypressCove 70 ST/2x 70 SMT
Skylake 64 ST/2x 64 SMT
Haswell 56 ST/2x 56 SMT
- Greater performance, lower instruction latency FPU with 4-5 to 3 (logic extension)
- AMX supports TF32 and FP16
- New L2 design, more preferred LLC(L3) prefetch engine, higher throughput.

I believe that there are more changes, but if they do not bring an increase in IPC, I will be very disappointed.Meanwhile, I'm waiting for the premiere.
Be prepared to be disappointed, I guess.
 

Hulk

Diamond Member
Oct 9, 1999
4,279
2,099
136
I don’t think anyone made such a claim. Meteor Lake coming to the desktop means literally just that.

I have a small desktop PC sitting on a shelf that runs an AMD Cezanne mobile chip. The system isn’t much bigger than my wallet in terms of size. MTL-P 6+8 in one of those small boxes would be amazing to have.

As I had thought, no Meteor Lake desktop parts. Just mobile chips in AIO and similar.

 

HoveringStyle

Junior Member
Dec 11, 2022
19
21
41
Intel4/RWC needs magic to do so.

As someone tested, at 5.2Ghz 1T Zen4 use only 30% of power of a GLC(or RaptorCove) and 2T <50% of power.

Nobody could tell if GLC is bugged or not. If yes RWC has a chance to get fix.


View attachment 86389


View attachment 86390
Is that in watts? they seem incredibly low values considering the 7600X (5.3GHz) uses 30W in Cinebench 1T


The same as the 13600K which also has a 5.3GHz max clock.
 

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
Is that in watts? they seem incredibly low values considering the 7600X (5.3GHz) uses 30W in Cinebench 1T
View attachment 86397

The same as the 13600K which also has a 5.3GHz max clock.

That s a whole CPU package power, idle CPU package power is 13.5W and with 1 CB thread it increase to 33W, so a 7600X core use about 19.5W in CB ST.

13600K idling is at 6.5W and 1T loaded at 30.8W, that s 24.3W for the core.

The numbers are here on the third graph :

 
Last edited:

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
CB is FP, this seems to use more power than SPECint, wich is an INT code test, should compare with SPECfp to have an idea, but surely that the latter use more power and will mimick CB power wise.
 

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
The 13900K was also tested in SPECint and its power values are still in line with CB but it's entirely possible Zen 4 is just that different in that aspect.
Zen 4 seems to be much more efficient than RPL P core for anything Integer, even when accounting for node difference, so that s likely due to uarchitectural advantage for such code.

FI in Handbrake H264 the 13900K consume 249W and the 7950X 189W while still being 13% faster, even when limited to 142W it still manage to be ahead by 9%

So at equal score it would use something like 110W wich is 2.26x more efficient, that s way more than the advantage provided by TSMC s N5 over Intel 7.

The numbers i quoted are in those tests :

 

Geddagod

Golden Member
Dec 28, 2021
1,165
1,049
106
Zen 4 seems to be much more efficient than RPL P core for anything Integer, even when accounting for node difference, so that s likely due to uarchitectural advantage for such code.
The difference between the spec testing vs the CB testing is way higher than one would expect, even if Zen 4 is more efficient in INT workloads.
FI in Handbrake H264 the 13900K consume 249W and the 7950X 189W while still being 13% faster, even when limited to 142W it still manage to be ahead by 9%

So at equal score it would use something like 110W wich is 2.26x more efficient, that s way more than the advantage provided by TSMC s N5 over Intel 7.

The numbers i quoted are in those tests :
You would want to compare 8+16 with a 12 core AMD processor, not a 16 core one, if you want to try to equalize "core counts"
But honestly, we don't even have to do that. Why bother with the little cores ruining GLC vs Zen 3/Zen 4 perf/watt testing? Just compare a 12400 vs 7600x. Or RPL with e-cores disabled vs a 7600x. Though that perf data is hard to come by from what I've seen.
 

Abwx

Lifer
Apr 2, 2011
11,103
3,780
136
The difference between the spec testing vs the CB testing is way higher than one would expect, even if Zen 4 is more efficient in INT workloads.

In CB ST a Zen 4 core use about 37W while a single RPL core use 47W, in this case that s the process advantage wich is at work, but in the case of Handbrake it need only 115W to match the stock 13900K, that s comparable to the difference in SPECint, wich point to much better uarch efficency for INT.

You would want to compare 8+16 with a 12 core AMD processor, not a 16 core one, if you want to try to equalize "core counts"
But honestly, we don't even have to do that. Why bother with the little cores ruining GLC vs Zen 3/Zen 4 perf/watt testing? Just compare a 12400 vs 7600x. Or RPL with e-cores disabled vs a 7600x. Though that perf data is hard to come by from what I've seen.

Since 2 e cores are supposed to have as much throughput than 1 P core using SMT it s straightforward that 16 e cores are the equivalent of 8 P cores, so 13900K vs 7950X is the adequate comparison.

Now if you want a core/core comparison in Handbrake a 7700X@65W is 2% faster than a 12900K with e cores disabled according to computerbase measurements, knowing that those 8 P cores run at something like 180W.

Granted ADL has a less efficient process than RPL, but that would reduce its power to 130-140W, still far from 65W, we fall again in the 2x better perf/Watt of the 7950X vs 13900K.
 

Geddagod

Golden Member
Dec 28, 2021
1,165
1,049
106
In CB ST a Zen 4 core use about 37W while a single RPL core use 47W, in this case that s the process advantage wich is at work, but in the case of Handbrake it need only 115W to match the stock 13900K, that s comparable to the difference in SPECint, wich point to much better uarch efficency for INT.
Uh no. A) Why again are we using handbrake? B) Why again are we comparing a 13900k vs a 7950x, which is muddled because the core count inequality but also the fact we can't even compare GLC vs Zen 4 due to Gracemont... you know.. existing
The logic here does not track, at all, not even in the slightest.
Since 2 e cores are supposed to have as much throughput than 1 P core using SMT it s straightforward that 16 e cores are the equivalent of 8 P cores, so 13900K vs 7950X is the adequate comparison.
OMG. Idk how many times I have to tell you this. It is asinine to compare 2 E-cores as a big core unless you want to try making Intel seem worse than they really are. If you look at a die shot, which I have sent you plenty, you can see, physically on the die, if Intel didn't develop any E-cores at all, they would have slotted in 1 P-core in the place of a 4-e core cluster. This is the third time I have pointed this out to you. No one compares 2 E-cores to 1 P core, you aren't being weird and quirky, you're not fighting in the face of bias, by equalizing them like that. Everyone uses the 4 E-cores = 1 "P" core measurement because that's how Intel literally developed the E-cores- a 4core cluster all but slots into the place where they would have had a p core on the die.
Now if you want a core/core comparison in Handbrake a 7700X@65W is 2% faster than a 12900K with e cores disabled according to computerbase measurements, knowing that those 8 P cores run at something like 180W.

Granted ADL has a less efficient process than RPL, but that would reduce its power to 130-140W, still far from 65W, we fall again in the 2x better perf/Watt of the 7950X vs 13900K.
This entire statement is something else, man.
First of all, we are going to circle back to what perf/watt means. Again, we had this discussion before. Remember, you equalize the denominator, not the numerator. You're talking about watt/perf. This is an important distinction to make, because saying Intel is 2x behind in perf/watt is just not true and makes them seem much further behind then they really are.
But anyway, this is what I see on computerbase:

At 88 watts, the 7700x has ~35% more perf/watt than a 12900k with only 8 P-cores.
This isn't really ideal, considering I doubt RWC brings more than a 20% increase in perf/watt, meaning Intel would be behind (big core vs big core) by ~10% in perf/watt, but whatever.
 

H433x0n

Senior member
Mar 15, 2023
933
1,032
96
Uh no. A) Why again are we using handbrake? B) Why again are we comparing a 13900k vs a 7950x, which is muddled because the core count inequality but also the fact we can't even compare GLC vs Zen 4 due to Gracemont... you know.. existing
The logic here does not track, at all, not even in the slightest.

OMG. Idk how many times I have to tell you this. It is asinine to compare 2 E-cores as a big core unless you want to try making Intel seem worse than they really are. If you look at a die shot, which I have sent you plenty, you can see, physically on the die, if Intel didn't develop any E-cores at all, they would have slotted in 1 P-core in the place of a 4-e core cluster. This is the third time I have pointed this out to you. No one compares 2 E-cores to 1 P core, you aren't being weird and quirky, you're not fighting in the face of bias, by equalizing them like that. Everyone uses the 4 E-cores = 1 "P" core measurement because that's how Intel literally developed the E-cores- a 4core cluster all but slots into the place where they would have had a p core on the die.

This entire statement is something else, man.
First of all, we are going to circle back to what perf/watt means. Again, we had this discussion before. Remember, you equalize the denominator, not the numerator. You're talking about watt/perf. This is an important distinction to make, because saying Intel is 2x behind in perf/watt is just not true and makes them seem much further behind then they really are.
But anyway, this is what I see on computerbase:
View attachment 86411
At 88 watts, the 7700x has ~35% more perf/watt than a 12900k with only 8 P-cores.
This isn't really ideal, considering I doubt RWC brings more than a 20% increase in perf/watt, meaning Intel would be behind (big core vs big core) by ~10% in perf/watt, but whatever.
There’s no way he’s acting in good faith. It’s a waste of time to engage with him.
 

deasd

Senior member
Dec 31, 2013
528
806
136
Is that in watts? they seem incredibly low values considering the 7600X (5.3GHz) uses 30W in Cinebench 1T


The same as the 13600K which also has a 5.3GHz max clock.
The author test cpu Vcore VRM only, which means SoC(IODIE) is excluded which is a huge variable. Also cinebench is different.
 
Jul 27, 2020
17,174
11,039
106
GB6 numbers (if legit) are probably more telling due to the wide range of test workloads (2502 / 12545 )

MTL is somehow very potent in multicore file compression and object detection tests. Could be higher frequency or the increased L1/L2 or much higher bandwidth RAM.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |