Recent content by knutinh

  1. K

    [Bloomberg] Apple starting process to dump Intel in Macs

    Depends on how important single-thread, general application performance is. They _could_ settle for "fast enough" single-thread performance (a souped up iPhone core), then add multi-threaded/simd performance in number cruncher models by ways of simple multi-core/simd/GPU/ML units (scaling...
  2. K

    [Techreport] All Skylake-X i7 CPUs have two AVX-512 FMA's

    Given that clock speed has stagnated and that most people are perfectly happy with 10year old PCs (or tablets) for updating their FB profile or writing Office documents, what are Intel to do? The biggest obstacle for SIMD usefulness is programmers bothering to use it - through libraries...
  3. K

    [Techreport] All Skylake-X i7 CPUs have two AVX-512 FMA's

    Some applications are compute bound, some bandwidth. Increasing the compute capability of a «well-rounded» cpu by 2x without increasing bandwidth will give better performance in some applications, while a larger percentage of applications will be bandwidth limited. I find it more interesting...
  4. K

    [Techreport] All Skylake-X i7 CPUs have two AVX-512 FMA's

    Large potential for something like Photoshop, sw-based video encoders. -k
  5. K

    Compiler comparisions?

    Anyone aware of somewhat sensible comparisions of compilers for eg HPC workloads for x86 and ARM? I am most interested in the speed of open source compilers (gcc) vs the cpu manufacturers compiler. -k
  6. K

    CPU archtecture getting rid of FPU

    I think that you underestimate how much SIMD is used for number crunching. Either because the application programmer used assembler/intrinsics/a vectorizingncompiler (intel) or because they rely on some library (blas, fftw,...) that is vectorized. If we expect our hardware to do function «A» is...
  7. K

    CPU for Floats crunching

    I'd suggest that the Intel compiler is a pretty good reason to go with Intel hardware if the OP wants to write "clean" code and not mess with dirty optimization. I don't know much about Fortran, but I assume that the situation is similar to C. Also, no-one mentioned Xeon Phi? They are being...
  8. K

    AMD EPYC Server Processor Thread - EPYC 7000 series specs and performance leaked

    My experience with icc and gcc suggests that I would use icc if compiling numerically heavy code to run on Intel hardware that I had to pay for. The alternative with gcc might be to pepper the code with inline assembly in order to get vectorization. Bug-prone, non future-proof and resource...
  9. K

    Will AMD support AVX-512 and Intel TSX ?

    I believe that SIMD calculation amounts to a minute fraction of the cpu area, and a larger (but still small) fraction of the cpu power budget. I have seen energy breakdowns on fetching 64 bits from memory, doing a double-precision multi-acc, and storing the result back to memory. Turns out that...
  10. K

    Will AMD support AVX-512 and Intel TSX ?

    I would be curious to know what the potential would be for Adobe products (Photoshop, Lightroom) if they hired competent programmers/optimizers and targeted AVX512 + high core counts due to the iMac pro being used by "media professionals". Is "pixel processing" the bottle neck in those products...
  11. K

    Ryzen's halved 256bit AVX2 throughput

    So Intel has 2x the peak AVX FMA throughput of AMD. Even with a memory bandwidth of 2x, I would not necessarily expect a 2x speedup of even something like "professional rendering". Perhaps something really streamlined and FMA-centric like matrix multiply or convolution. For maximum performance...
  12. K

    Apple adding ARM coprocessor to future Macs

    So how much better performance:watt does a state of the art ARM core offer vs a state of the art x86 core, say at an operating point of 0.5W? My gut-feeling is that they should be quite similar, and that other factors are more relevant. Such as: 1. Does Apple like to have a credible bargaining...
  13. K

    AVX2 and FMA3 in games

    It was not at all clear to me. I have not seen much in the way of arguments from you, mostly normative claims? Please elaborate why a compiler manufacturer _must_ offer optimal performance on all platforms it supports, and how this relates to clearly not being the case for most products, be it...
  14. K

    AVX2 and FMA3 in games

    I admit that I am heavily biased towards problems that feature deep nested loops and that can execute really well on SIMD hw. Being able to write c code using icc, instead of having to resort to inline assy using gcc means being more productive, having less bugs and that your code can be...
  15. K

    AVX2 and FMA3 in games

    You said: "They don't optimize for specific CPUs, except in the cases of bugs" From your own link: "the compiler or library can make multiple versions of a piece of code, each optimized for a certain processor and instruction set," -k
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |