Skip to main content
Filter by
Sorted by
Tagged with
Advice
1 vote
2 replies
127 views

What is the difference between an interrupt and a context switch? I understand the concept of an interrupt and how it occurs. However, I'm digging deeper into the topic. I studied Computer ...
Gabriele's user avatar
0 votes
1 answer
62 views

I am trying to implement Cache allocation Technology`s impact with my CPU. However, when I use either lscpu to see whether my CPU supports, or cpuid -l 0x10, output is false. How is this possible? How ...
Ali Hosseini's user avatar
1 vote
1 answer
104 views

Building on this question here The term thread divergence is used in CUDA; from my understanding it's a situation where different threads are assigned to do different tasks and this results in a big ...
bigcodeszzer's user avatar
7 votes
1 answer
222 views

I'm experimenting with the IMUL r64, r64 instruction on an Intel Xeon E5-1620 v3 (Haswell architecture, base clock 3.5 GHz, turbo boost up to 3.6 GHz, Hyper Threading is enabled). My test loop is ...
Andrey Dmitriev's user avatar
0 votes
1 answer
51 views

Looking at the CPUUtilized Cloudwatch metric for my Fargate service, it's showing max cpu units used as 1040 over the past 4 weeks, using a sampling period of 1 minute. I have 4 vCPUs provisioned to ...
Seanf123's user avatar
2 votes
0 answers
207 views

I am measuring the latency of instructions. For 64-bit primitives, integer division takes about 25 cycles each, usually on my 2.3GHz Digital Ocean vCPU, while floating point division takes about 10 ...
Zack Light's user avatar
-3 votes
1 answer
108 views

I have a question regarding these two instructions: lw r2, 10(r1) lw r1, 10(r2) Is there a hazard here, do I need stalls in between two of them? I want to know if any kind of hazard happens here? I ...
mer mer's user avatar
  • 17
1 vote
0 answers
84 views

I think the title says it all: i have implemented a popcnt function that counts bits as a loop with shifts and one with inline asm with the actual cpu instruction. This is my c code: #define ...
newbee.a's user avatar
1 vote
0 answers
77 views

I'm doing an in-depth CPU microarchitectural resource analysis. I want to know the requirements of my program on processor microarchitectural resources and compare the requirements of different ...
Gerrie's user avatar
  • 455
0 votes
0 answers
50 views

I'm currently working on a parallel and distributed computing project where I'm comparing the performance of XGBoost running on CPU vs GPU. The goal is to demonstrate how GPU acceleration can improve ...
Mxneeb's user avatar
  • 19
0 votes
1 answer
166 views

I have an Intel Arria 10 SoC FPGA system with 5.4.104-lts Linux built with Yocto 3.3.1 and Poky. The installed FPGA image is doing nothing more than making interrupts to an UIO device, 50 times a sec. ...
yepp's user avatar
  • 1
2 votes
1 answer
105 views

It could operate identically on both 256-bit halves of a 512-bit AVX512 register. Like identical operation on 128-bits lanes of 256-bits registers in AVX/AVX2. Any tech reasons?
Akon's user avatar
  • 481
0 votes
1 answer
98 views

In this article https://www.lighterra.com/papers/modernmicroprocessors it is stated that (under Multiple issue - Superscalar) the fetch and decode/dispatch stages must be enhanced so they can decode ...
Rishi's user avatar
  • 41
-4 votes
1 answer
142 views

What exactly happens at the hardware level when a divergence occurs in SIMD and SIMT architectures, and how does each handle the execution of different instruction paths? I found this question, but ...
Rishi's user avatar
  • 41
1 vote
2 answers
119 views

Okay my question is probably dumb. But I cant find any answers that correct me. I learned that in DDR4 -lets say the stick has 8 chips- each chip parallelly contributes 8 bit to the 64 bit bus width. ...
Rishi's user avatar
  • 41
0 votes
2 answers
249 views

The following code is using for measuring CPU % usage. Public Sub Macro1() Dim strComputer As String Dim objWMIService As Object Dim colItems As Object Dim objItem As Object strComputer = ".&...
Kram Kramer's user avatar
0 votes
1 answer
172 views

This is my code: (Get-Counter '\Processor(_Total)\% Processor Time').CounterSamples.CookedValue I am trying to receive the average CPU Utilization with Get-Counter but every time i try i get this ...
mimi m's user avatar
  • 71
0 votes
1 answer
101 views

While I am benchmarking my Rocketcore CPU, I encountered failed Coremark benchmarking. After some debug, I reduce the issue scope to unsuccessful global initialization of 0 value. In Coremark, it will ...
Jasminy's user avatar
  • 119
1 vote
1 answer
79 views

I have a simple Hello World program written in C, which I statically compiled using: gcc -static -fno-pie -o hello{1|2} hello.c. I expected that executing these two binaries would exhibit cache ...
Khrn's user avatar
  • 354
0 votes
0 answers
227 views

A simple Python script (Selenium + ChromeDriver): # import the By class, which allows you to choose how to search for an element from selenium.webdriver.common.by import By # initialize the browser ...
Sergey Saz's user avatar
0 votes
0 answers
107 views

I was working on a cpu only rendering project with SDL in C. I implemented very good error handling and I got this error when I try to resize the window, "ERROR: SDL Error in render thread: ...
Tejas Patil's user avatar
-1 votes
1 answer
89 views

There were 2 pods running in my micro-service, both of them got restarted with kubernetes reason as OOM killed enter image description here (The above dashboard uses the following query->sum(0,...
Yash Arora's user avatar
0 votes
1 answer
115 views

I'm currently training my simple prediction AI but my GPU is training at 40S per epochs while my CPU is training at 9S per epochs my CPU is i7-4720HQ and my GPU is Nvidia 950m this is my code `import ...
Vio Octavio's user avatar
0 votes
2 answers
91 views

Using ADB in a java application to monitor android device status every three seconds. Height adb commands are used : adb shell settings get global airplane_mode_on adb shell settings get system ...
rejdrouin's user avatar
  • 101
2 votes
1 answer
127 views

I need low level information about the node, like number of cores, core ID and other things which is part of the kubelet in a pod running in the node. How do I get this?
imawful's user avatar
  • 135

1
2 3 4 5
95