-ck hacking: linux-4.9-ck1, MuQSS version 0.150

Monday, 12 December 2016

linux-4.9-ck1, MuQSS version 0.150

Announcing a new -ck release, 4.9-ck1 with new version of the Multiple Queue Skiplist Scheduler, version 0.150. These are patches designed to improve system responsiveness and interactivity with specific emphasis on the desktop, but configurable for any workload.

linux-4.9-ck1

-ck1 patches:
http://ck.kolivas.org/patches/4.0/4.9/4.9-ck1/

Git tree:
https://github.com/ckolivas/linux/tree/4.9-ck

Ubuntu 16.04 LTS packages:
http://ck.kolivas.org/patches/4.0/4.9/4.9-ck1/Ubuntu16.04/

MuQSS

Download:
4.9-sched-MuQSS_150.patch

Git tree:
4.9-muqss

MuQSS 0.150 updates

Regarding MuQSS, apart from a resync to linux-4.9, which has numerous hotplug and cpufreq changes (again!), I've cleaned up the patch to not include any Hz changes of its own, leaving Hz changes up to users to choose, unless they use the -ck patchset.
Additionally, I've modified sched_yield yet again. Since expected behaviour is different for different (inappropriate) users out there of sched_yield, I've made it tunable in /proc/sys/kernel/yield_type and changed the default to what I believe should happen. From the documentation I added in Documentation/sysctl/kernel.txt:

yield_type: (MuQSS CPU scheduler only)

This determines what type of yield calls to sched_yield will perform.

0: No yield.
1: Yield only to better priority/deadline tasks. (default)
2: Expire timeslice and recalculate deadline.

Previous versions of MuQSS defaulted to type 2 above. If you find behavioural regressions with any of your workloads try switching it back to 2.

4.9-ck1 updates

Apart from resyncing with the latest trees from linux-bfq and wb-buf-throttling
- Added a new kernel configuration option to enable threaded IRQs and set it by default
- Changed Hz to default to the safe 100 value, removing 128 which caused spurious issues and had no real world advantage.
- Fixed a build for muqss disabled (why would you use -ck and do that I don't know)
- Made hrtimers not be used if we know we're in suspend which may have caused suspend failures for drivers that did no use correct freezable vs normal timeouts
- Enabled bfq and set it to default
- Enabled writeback throttling by default

Enjoy!
お楽しみ下さい
-ck

170 comments:

Patrick McMunn12 December 2016 at 18:27
Wow! Thanks! You've really been on the ball lately. I'd gotten accustomed to waiting a month or more for resynced patches.
ReplyDelete
Replies
Unknown12 December 2016 at 19:57
CC arch/x86/kernel/setup_percpu.o
kernel/time/timer.c: In function ‘msleep’:
kernel/time/timer.c:1914:62: error: ‘pm_freezing’ undeclared (first use in this function)
if (jiffs < 5 && hrtimer_resolution < NSEC_PER_SEC / HZ && !pm_freezing) {
^~~~~~~~~~~
kernel/time/timer.c:1914:62: note: each undeclared identifier is reported only once for each function it appears in
kernel/time/timer.c: In function ‘msleep_interruptible’:
kernel/time/timer.c:1936:62: error: ‘pm_freezing’ undeclared (first use in this function)
if (jiffs < 5 && hrtimer_resolution < NSEC_PER_SEC / HZ && !pm_freezing) {
^~~~~~~~~~~
make[2]: *** [scripts/Makefile.build:293: kernel/time/timer.o] Error 1
make[1]: *** [scripts/Makefile.build:544: kernel/time] Error 2
ReplyDelete
Replies
Unknown12 December 2016 at 20:26
applied patch, now it' okay. thanx
ReplyDelete
Replies
Anonymous14 December 2016 at 08:51
I really want to try this kernel but at the moment the Nvidia 375.20 drivers are causing a lot of problems so I guess I will have to wait until the next release ><
ReplyDelete
Replies
Anonymous14 December 2016 at 09:04
Just be patient or use the open source driver!
ReplyDelete
Replies
jwh715 December 2016 at 01:54
Working fine so far in Arch on my x64 Athlon64 X2 PC and i686 UP Atom netbook, with a few upstream merges added.

Were there issues with posting yesterday? Couldn't log in with my laptop at home; working from work now though.
ReplyDelete
Replies
Anonymous15 December 2016 at 08:34
I am seeing all soft interrupts for cpu utilization in top on a Core i5 with HyperThreading. On a AMD X6 system, cpu utilization reports user/system as you would expect.
ReplyDelete
Replies
Anonymous15 December 2016 at 11:13
Thanks for the Ubuntu builds!!
ReplyDelete
Replies
Anonymous16 December 2016 at 01:29
You can easily try this out on Ubuntu using this script: https://github.com/Turbine1991/build_ubuntu_kernel_wastedcores
ReplyDelete
Replies
Anonymous16 December 2016 at 03:56
Obviously offtopic and wrong place to ask -- but maybe someone of you knows how to help:
What can I do against these warnings, like e.g:
WARNING: "phys_base" [sound/drivers/snd-dummy.ko] has no CRC!
Many of them occur at compilation time and I don't know if that leads to further problems. Kernel is vanilla 4.9.0 from opensuse src rpm +ck1.

Any hint or link appreciated! Thank you in advance,
BR Manuel Krause
ReplyDelete
Replies
Anonymous16 December 2016 at 06:50
Thx ck for the new yield_type configuration. I'm getting very good results when set to 'No yield' in xonotic. Game feels very responsive input is very consistent. I had already set __GL_YIELD="NOTHING" previously but still it's much better if I also set yield_type to 0. Not sure why this is the case.

duud
ReplyDelete
Replies
Anonymous24 December 2016 at 00:17
Runs nice on core2 duo machine.
yield 0 is awesome.
I had serious mouse lag due to slow integrated intel graphics which is pretty much gone now. :)
Thank you very much.
ReplyDelete
Replies
Peter24 December 2016 at 17:35
BFQv8r6 for Linux 4.9 is out. After reverting patch 0017 from ck1 and applying the new BFQ manually I noticed wbt.h was deleted when reverting. I think wbt shouldn't be in patch 0017 together with BFQ out wasn't meant to be there in the first place.
Merry Christmas and best regards,
Peter
ReplyDelete
Replies
Anonymous25 December 2016 at 00:01
Merry Christmas and thanks for all your work.
ReplyDelete
Replies
Anonymous25 December 2016 at 07:25
@ck In a comment probably above this one, you said that the kernel shouldn't even include sched_yield() anymore because it's mostly not used correctly. This makes me somehow curious as if there would be no sched_yield() in userspace, wouldn't it be quite insufficient (waste of cpu cycles) for an user-space implemented hybrid mutex to do spinning when the lock is only hold for a small amount of time or when the lock is uncontented. After spinning for a constant time while atomically checking for a state change it changes its locking strategy to a futex-based mutex one.
ReplyDelete
Replies
Anonymous27 December 2016 at 01:22
New cyclictest record minimum 756ns (avg 1120 ns) on a 2.66ghz quad core Xeon W3520 using yield_type 0.
Have been hanging around 980/1350 for quite a while and couldn't really improve on it.
Thank you.

ReplyDelete
Replies
Anonymous28 December 2016 at 22:31
As the kernel gets more and more "bloated" and slower almost every new release is there any chance to port this to older kernels like 3.12... ?
ReplyDelete
Replies
Anonymous1 January 2017 at 09:36
A happy new year to everyone.
ReplyDelete
Replies
Christoffer Tibell4 January 2017 at 08:24
I got a freeze trying to use https://github.com/ggreer/the_silver_searcher.

I was in a call in Discord and playing an OpenGL game (minecraft) at the time.
ReplyDelete
Replies
Anonymous5 January 2017 at 05:29
Hi, I forgot to post the updated benchmarks of MuQSS150 I ran some time ago. They are here as usual :
https://docs.google.com/spreadsheets/d/163U3H-gnVeGopMrHiJLeEY1b7XlvND2yoceKbOvQRm4/edit?usp=sharing

I've put some colors to make the results more readable (hopefully).
The reference kernel is the one on the first column. Following the value of the realtime difference between tested kernel and reference kernel, the colors are :
- blue if difference is within 'realtime of reference kernel +/- maximum standard deviation'
- green if difference is lower than 'realtime of reference kernel - maximum standard deviation'
- red if difference is higher than 'realtime of reference kernel + maximum standard deviation'
Overall best and worst are also shown ,if not in between +/- std dev.

I know a standard deviation computed of 3 runs is not very significant, but it's all I've got.

Pedro
ReplyDelete
Replies
Unknown6 January 2017 at 07:47
When I play Counter Strike: Go I experience random pauses. I tried yield_type 0 and 2, in addition I am using schedtool -I -e. This kind of a behavior is not reproducible with cfq. On the other hand with yield_type = 0 the game does not suffer from jitters like cfq.
ReplyDelete
Replies
Anonymous7 January 2017 at 00:13
The kernel parameter skew_tick=1 offsets the timer interrupt on each cpu. Does MuQSS rely on timer interrupts having no offset?

duud
ReplyDelete
Replies
Anonymous7 January 2017 at 01:11
Sry for the mistake, the difference is about 1.1% but it seems to increase with time, it's about 2.2% now.
ReplyDelete
Replies
Anonymous7 January 2017 at 03:11
I don't know if the difference in timer interrupt count is of any importance, but I have issues with input latancy in games. The behavior varies, but most of the time input is very responsive after rebooting and gets very laggy after some time.

So after some time the situation with timer interrupt counts changed completely.
Now CPU0 has a much lower count compared to CPU1, it was the other way around and the relative difference is about 9%.

Does somebody have more information about this behavior? Am I missing some timer interrupts? Maybe because if regions with disabled interrupts? Do u have simmilar behavior on your pcs? I can't find any information about this.
ReplyDelete
Replies
Anonymous7 January 2017 at 05:36
4.9.1 does not build with ck1:

kernel/time/timer.c: In Funktion »msleep«:
kernel/time/timer.c:1914:62: Fehler: »pm_freezing« nicht deklariert (erste Benutzung in dieser Funktion)
if (jiffs < 5 && hrtimer_resolution < NSEC_PER_SEC / HZ && !pm_freezing) {
^
kernel/time/timer.c:1914:62: Anmerkung: jeder nicht deklarierte Bezeichner wird nur einmal für jede Funktion, in der er vorkommt, gemeldet
kernel/time/timer.c: In Funktion »msleep_interruptible«:
kernel/time/timer.c:1936:62: Fehler: »pm_freezing« nicht deklariert (erste Benutzung in dieser Funktion)
if (jiffs < 5 && hrtimer_resolution < NSEC_PER_SEC / HZ && !pm_freezing) {
^
scripts/Makefile.build:293: die Regel für Ziel „kernel/time/timer.o“ scheiterte
make[2]: *** [kernel/time/timer.o] Fehler 1
scripts/Makefile.build:544: die Regel für Ziel „kernel/time“ scheiterte
make[1]: *** [kernel/time] Fehler 2
Makefile:992: die Regel für Ziel „kernel“ scheiterte
make: *** [kernel] Fehler 2
ReplyDelete
Replies
Anonymous11 January 2017 at 23:11
With 4.9.2-ck1 I get multiple KDE Plasma hungs and dmesg spits this:
[11551.712334] INFO: task pool:19698 blocked for more than 120 seconds.
[11551.712336] Not tainted 4.9.2-ck1 #1
[11551.712336] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[11551.712337] pool D 0 19698 2281 0x00000000
[11551.712339] ffff8800b7b16200 ffff8800b7a16200 ffff880092a2c980 ffff8800b1e7c000
[11551.712342] ffff880092a2c980 ffff8800048e3af8 ffffffff817280b1 ffff8800b7b16220
[11551.712345] 0000000000000001 00000b3b56e88e75 ffff88000f874dd8 000000010102c478
[11551.712347] Call Trace:
[11551.712350] [] ? __schedule+0x601/0xb60
[11551.712353] [] ? schedule+0x34/0xc0
[11551.712355] [] ? schedule_preempt_disabled+0xc/0x20
[11551.712357] [] ? __mutex_lock_slowpath+0xba/0x130
[11551.712360] [] ? mutex_lock+0xe/0x20
[11551.712363] [] ? cifs_reconnect_tcon+0xf6/0x220
[11551.712365] [] ? __switch_to+0x307/0x470
[11551.712368] [] ? smb_init+0x34/0x90
[11551.712370] [] ? CIFSSMBQPathInfo+0x51/0x260
[11551.712372] [] ? cifs_query_path_info+0x77/0x1a0
[11551.712374] [] ? lookup_fast+0xe0/0x2f0
[11551.712377] [] ? cifs_get_inode_info+0x2ff/0x590
[11551.712380] [] ? filename_lookup+0xde/0x160
[11551.712382] [] ? __kmalloc+0x2c/0x110
[11551.712385] [] ? build_path_from_dentry+0x154/0x2e0
[11551.712387] [] ? cifs_revalidate_dentry_attr+0xc8/0xe0
[11551.712390] [] ? cifs_getattr+0x5b/0x120
[11551.712393] [] ? vfs_fstatat+0x52/0x90
[11551.712396] [] ? SYSC_newlstat+0x1d/0x40
[11551.712399] [] ? __getnstimeofday64+0x32/0xc0
[11551.712402] [] ? do_gettimeofday+0x10/0x60
[11551.712405] [] ? SyS_gettimeofday+0x31/0x70
[11551.712408] [] ? entry_SYSCALL_64_fastpath+0x13/0x94
ReplyDelete
Replies
Florian15 January 2017 at 02:46
Installed 4.9.3-1 on my Arch system with nvidia-340xx-dkms and got with AND without FORCE_IRQ_THREADING

Jan 14 16:02:03 steinrose kernel: WARNING: CPU: 1 PID: 1927 at fs/proc/generic.c:345 proc_register+0x116/0x12f
Jan 14 16:02:03 steinrose kernel: proc_dir_entry 'driver/nvidia' already registered
Jan 14 16:02:03 steinrose kernel: Modules linked in: nvidia(PO+) ipt_REJECT nf_reject_ipv4 xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 f71882fg xt_recent xt_conntrack adt7475 nf_conntrack ipta
Jan 14 16:02:03 steinrose kernel: CPU: 1 PID: 1927 Comm: modprobe Tainted: P O 4.9.3-1-ck #1
Jan 14 16:02:03 steinrose kernel: Hardware name: MICRO-STAR INTERNATIONAL CO.,LTD MS-7512/MS-7512, BIOS V1.0 05/21/2008
Jan 14 16:02:03 steinrose kernel: ffffc90000843af8 ffffffff81380458 ffffc90000843b48 0000000000000000
Jan 14 16:02:03 steinrose kernel: ffffc90000843b38 ffffffff8106a1c2 0000015900843b60 00000000ffffffef
Jan 14 16:02:03 steinrose kernel: ffff880236830e40 ffff880234bf6985 ffff880234f15048 ffff880234bf6900
Jan 14 16:02:03 steinrose kernel: Call Trace:
Jan 14 16:02:03 steinrose kernel: [] dump_stack+0x62/0x78
Jan 14 16:02:03 steinrose kernel: [] __warn+0xda/0xf2
Jan 14 16:02:03 steinrose kernel: [] warn_slowpath_fmt+0x6e/0x85
Jan 14 16:02:03 steinrose kernel: [] ? preempt_count_add+0xbb/0xcc
Jan 14 16:02:03 steinrose kernel: [] proc_register+0x116/0x12f
Jan 14 16:02:03 steinrose kernel: [] proc_mkdir_data+0x76/0x9a
Jan 14 16:02:03 steinrose kernel: [] proc_mkdir_mode+0x26/0x28
Jan 14 16:02:03 steinrose kernel: [] nv_register_procfs+0x4c/0x1c9 [nvidia]
Jan 14 16:02:03 steinrose kernel: [] nvidia_init_module+0x29c/0x79f [nvidia]
Jan 14 16:02:03 steinrose kernel: [] ? nv_drm_init+0x15/0x15 [nvidia]
Jan 14 16:02:03 steinrose kernel: [] nvidia_frontend_init_module+0x50/0x84c [nvidia]
Jan 14 16:02:03 steinrose kernel: [] do_one_initcall+0x5b/0x15e
Jan 14 16:02:03 steinrose kernel: [] ? vfree+0x41/0x8e
Jan 14 16:02:03 steinrose kernel: [] do_init_module+0x72/0x202
Jan 14 16:02:03 steinrose kernel: [] load_module+0x2104/0x28b3
Jan 14 16:02:03 steinrose kernel: [] ? symbol_put_addr+0x69/0x69
Jan 14 16:02:03 steinrose kernel: [] ? vfs_read+0x105/0x125
Jan 14 16:02:03 steinrose kernel: [] SyS_finit_module+0xf3/0x121
Jan 14 16:02:03 steinrose kernel: [] entry_SYSCALL_64_fastpath+0x13/0x94
Jan 14 16:02:03 steinrose kernel: ---[ end trace c68407c4b37c7644 ]---
Jan 14 16:02:03 steinrose kernel: NVRM: failed to register procfs!
Jan 14 16:02:03 steinrose kernel: NVRM: request_mem_region failed for 16M @ 0xfd000000. This can
NVRM: occur when a driver such as rivatv is loaded and claims
NVRM: ownership of the device's registers.
Jan 14 16:02:03 steinrose kernel: nvidia: probe of 0000:01:00.0 failed with error -1
Jan 14 16:02:03 steinrose kernel: Error: Driver 'nvidia' is already registered, aborting...
Jan 14 16:02:03 steinrose kernel: NVRM: DRM init failed

How can nvidia driver be already registered when I do have to use a kernel module? Does anything significantly got changed since linux kernel 4.9 (upgraded from 4.8.17-1-ck) concerning video driver? Never had this before and did not change anything with nvidia kernel module.

Thanks, Florian.
ReplyDelete
Replies
Anonymous16 January 2017 at 08:10
Con,

Is vm.swappiness=10 (as on help.ubunu)recommended with your kernel patch ?

https://help.ubuntu.com/community/SwapFaq#What_is_swappiness_and_how_do_I_change_it.3F
ReplyDelete
Replies
Anonymous16 January 2017 at 13:38
Runs nice and fast (ck1) although I had to downgrade from 4.9.4 to 4.9.1 because of latency.
ReplyDelete
Replies
monotykamary18 January 2017 at 04:05
My osu! problems are gone. You're a wizard ck.
ReplyDelete
Replies
Anonymous19 January 2017 at 01:54
First time feedback ever...

Thank you very much! Without your Patchset, and later BFQ Linux always felt broken. I began using them aeons ago on a P3@933Mhz which i bought refurbished. And using them now on an old Thinkpad T60P with CoreDuo T2600 and 3GB Ram. Yes! 32Bits! Why? No Money! Anyways, right now everything runs very smooth at 4.9.4 which Greysky kindly supplies via his repo for Archlinux. That couldn't be said for the whole of 4.8 which forced me to gnarlingly fall back to default upstream, and experimentally using ZEN. Which worked less buggy, but not flawless. But the pain is gone now.

Very good job! :-)

ReplyDelete
Replies
Unknown19 January 2017 at 02:07
Hi see ur blog for many months and I have to say that u do nice job!!! I have a question.I have many years to do hacks so I have forget some basics.I remember how can I make a phising url.I want to ask where I have to upload a phising url.(with purpose to steal someone's password.)Im not native english speaker.Please answer me..
ReplyDelete
Replies
paines19 January 2017 at 06:21
I am noticing heavy stuttering with graphics (games: Tomb Raider in-game benchmarking option and Chromium + Imgur scrolling) on Ubuntu 16.04 + nVidia GFX 950m + 375.20 driver + 4.9-ck1 drivers. With stock Ubuntu Kernels it behaves normal.
ReplyDelete
Replies
Anonymous20 January 2017 at 12:23
After downgrading from 4.9.4-ck1 to 4.9.0-ck1 (git) because of latency I downgraded to 4.8-ck (git).
Feels much faster than the 4.9... bunch.
It seems the kernel gets more and more bloated and slower every release.
ReplyDelete
Replies
Anonymous21 January 2017 at 07:10
My name is Jennifer Lora me and my husband are here to testify about how we
use Lisa ATM CARD to make money and also have our own business
today. Go get your blank ATM card today and be among the lucky ones. This
PROGRAMMED blank ATM card is capable of hacking into any ATM
machine,anywhere in the world.It has really changed our life for good and
now we can say we are rich and we can never be poor again. You can withdraw
the maximum of $ 10,000 daily We can proudly say our business is doing fine
and we have up to 20,000 000 (20 millions dollars in our account) Is not
illegal,there is no risk of being caught ,because it has been programmed in
such a way that it is not traceable,it also has a technique that makes it
impossible for the CCTV to detect you..For details on how to get yours today, email her on : [ lisaatmcard@gmail.com ]
or call her on
( +12678734910 )
ReplyDelete
Replies
Anonymous24 January 2017 at 03:14
Like a previous poster, I also got a process hard freeze when using the silver searcher while compiling the Linux kernel in the background. It was at a point where 'killall -9 ag' would not kill the process
ReplyDelete
Replies
Anonymous26 January 2017 at 08:44
Hi, I have high cpu and unresponsive machine at any program using 4.9-CK, using yield_type=1 or 2, this is in a Haswell Laptop. Had to downgradde to 4.8-ck .

journal at the moment of freeze : http://pastebin.com/3s6VvmHZ

Had to hard reset the laptop. Any ideas why?
ReplyDelete
Replies
Anonymous26 January 2017 at 18:34
Hi, ck.
After some time from the first MuQss release I have tried again your patch but i still have problems.

Wine is not usable since no application can be executed due to the error:
"kernel: usercopy: kernel memory overwrite attempt detected"
With an Atom Z520 i still have some intermittent boot panic. When boot goes well, then everything runs smooth for many days.

Any suggestion?

Many thanks.
ReplyDelete
Replies
Anonymous29 January 2017 at 07:39
I just enabled UBSAN in the kernel and it found an integer overflow in MuQSS, apparently in its iso ticks calculation.

================================================================================
UBSAN: Undefined behaviour in kernel/sched/MuQSS.c:3230:33
signed integer overflow:
4204941 * 522 cannot be represented in type 'int [40]'
CPU: 0 PID: 0 Comm: MuQSS/0 Tainted: P O 4.9.6-ck1 #1
Hardware name: System manufacturer System Product Name/M2N-SLI, BIOS ASUS M2N SLI ACPI BIOS Revision 0903 06/18/2008
0000000000000000 ffffffffa0a32ba1 000000000000002a dd38ca36cb270427
ffff9dcb77c03e18 000000000000020a ffffffffa0a9c5f9 ffffffffa14b9c00
ffffffffa0a9d0e9 0000002aa14c6120 0000000000000002 0031343934303234
Call Trace:

[] ? dump_stack+0x5a/0x99
[] ? ubsan_epilogue+0x9/0x40
[] ? handle_overflow+0xf9/0x120
[] ? sched_clock_local+0x1b/0xa0
[] ? scheduler_tick+0x857/0xa70
[] ? rcu_check_callbacks+0x17a/0x5a0
[] ? tick_sched_handle+0xa0/0xa0
[] ? update_process_times+0x46/0x60
[] ? tick_sched_timer+0x3d/0x90
[] ? __hrtimer_run_queues+0x10c/0x470
[] ? hrtimer_interrupt+0xd7/0x260
[] ? smp_apic_timer_interrupt+0x45/0x70
[] ? apic_timer_interrupt+0x7c/0x90

[] ? default_idle+0x15/0x1b0
[] ? amd_e400_idle+0x37/0x140
[] ? cpu_startup_entry+0x205/0x2d0
[] ? start_kernel+0x459/0x479
================================================================================
ReplyDelete
Replies
Anonymous2 February 2017 at 04:06
Very impressed. Maxing out all 4 cores with 2 different compiler jobs and still the machine is responsible like there's nothing going on.
ReplyDelete
Replies
Anonymous3 February 2017 at 03:00
[OFF-TOPIC]
Sorry for disturbing... Am currently upgrading my openSUSE and want to ask, what's the currently recommended (mature) gcc compiler version for (mainly) kernel compilation.

Thanks in advance and best regards,
Manuel Krause
ReplyDelete
Replies
Anonymous8 February 2017 at 10:36
Hey,

I've been running linux 4.9.7 with muqss for quite some time now without any issue. But today I wanted to try golang and by simply issuing one command, the application segfaulted. Well, I thought this must be a golang error but before I wanted to report this I tried this with the stock archlinux vanilla kernel and it didn't seg faulted which means that somehow its muqss fault. I also tried comparing both kernel configs and they are equivalent with some obvious exceptions like bfq.

To reproduce this:
mkdir go && cd go
export GOPATH=$(pwd)
go get -u -v github.com/nsf/gocode
ReplyDelete
Replies
Anonymous9 February 2017 at 11:21
1.4.2
ReplyDelete
Replies
Anonymous9 February 2017 at 17:41
Hi,

With Linux 4.9.8-ck1 I get the following stack trace upon resuming from suspend. Happens with HZ=250/300, I haven't noticed it with 1000.

[21889.468401] ------------[ cut here ]------------
[21889.468414] WARNING: CPU: 0 PID: 16898 at kernel/sched/MuQSS.c:1950 valid_task_cpu+0xa7/0xb0
[21889.468415] Modules linked in: nvidia_uvm(PO) nvidia(PO) bbswitch(O) nf_log_ipv4 nf_log_common xt_LOG ipt_REJECT nf_reject_ipv4 xt_state iptable_mangle iptable_nat nf_nat_ipv4 nf_nat iptable_filter rndis_host cdc_ether usbnet mii vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) nfsd ipv6 crc_ccitt fuse algif_skcipher af_alg uvcvideo btusb videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 btrtl btbcm videobuf2_core btintel bluetooth videodev snd_hda_codec_realtek snd_hda_codec_generic intel_rapl iosf_mbi x86_pkg_temp_thermal intel_powerclamp kvm_intel iwlmvm kvm snd_hda_intel snd_hda_codec snd_hwdep irqbypass i915 snd_hda_core crct10dif_pclmul snd_pcm ghash_clmulni_intel iwlwifi alx snd_timer psmouse mei_me i2c_dev snd serio_raw mei mdio asus_nb_wmi asus_wmi sparse_keymap mxm_wmi wmi
[21889.468488] CPU: 0 PID: 16898 Comm: kworker/3:0 Tainted: P W O 4.9.8-ck1-smp #12
[21889.468490] Hardware name: ASUSTeK COMPUTER INC. N56VM/N56VM, BIOS N56VM.214 08/28/2012
[21889.468501] ffffc9000b8c7d20 ffffffff8143b52d 0000000000000000 0000000000000000
[21889.468506] ffffc9000b8c7d60 ffffffff81098746 0000079e00000002 ffff8802223dbc00
[21889.468511] 0000000000017680 ffff880225c1d988 0000000000000282 ffffc9000b8c7e18
[21889.468515] Call Trace:
[21889.468524] [] dump_stack+0x4f/0x72
[21889.468530] [] __warn+0xc6/0xe0
[21889.468534] [] warn_slowpath_null+0x18/0x20
[21889.468537] [] valid_task_cpu+0xa7/0xb0
[21889.468541] [] do_set_cpus_allowed+0x37/0xa0
[21889.468545] [] __kthread_bind_mask+0x3b/0x70
[21889.468549] [] kthread_bind_mask+0xe/0x10
[21889.468552] [] create_worker+0xfb/0x1a0
[21889.468554] [] worker_thread+0x318/0x4e0
[21889.468557] [] ? process_one_work+0x4a0/0x4a0
[21889.468561] [] kthread+0xd4/0xf0
[21889.468564] [] ? kthread_park+0x60/0x60
[21889.468571] [] ret_from_fork+0x22/0x30
[21889.468574] ---[ end trace 2557c3739e4d37b5 ]---
[21889.497983] Task kworker/3:1 (pid=17134) is on cpu 3 (state=0, flags=4208040)
[21889.521652] Removed affinity for 617 processes to cpu 4
[21889.522233] smpboot: CPU 4 is now offline
[21889.568254] Removed affinity for 618 processes to cpu 5
[21889.568266] smpboot: CPU 5 is now offline
[21889.621578] Removed affinity for 617 processes to cpu 6
[21889.621588] smpboot: CPU 6 is now offline
[21889.671566] Removed affinity for 618 processes to cpu 7
[21889.671579] smpboot: CPU 7 is now offline
[21889.696021] ACPI: Low-level resume complete
ReplyDelete
Replies
Anonymous11 February 2017 at 02:14
Sad story. 4.4.14 vanilla kernel (~180k config) feels more responsive than a custom 4.9.9-ck1 kernel (~70k config).
The kernel is getting too bloated.
Seems like no one cares about speed/latency/efficiency anymore.
Or it is by intent to sell more new cpus.

/rant.
ReplyDelete
Replies
Anonymous13 February 2017 at 16:16
Hi ck,

I am using Linux 4.9.9-ck1 and I have the following stack trace upon resuming from suspend. This happens when HZ=250/300 but doesn't seem to happen when HZ=1000.

[26639.048008] Removed affinity for 589 processes to cpu 2
[26639.048021] smpboot: CPU 2 is now offline
[26639.051410] ------------[ cut here ]------------
[26639.051423] WARNING: CPU: 0 PID: 13564 at kernel/sched/MuQSS.c:1950 valid_task_cpu+0xa7/0xb0
[26639.051424] Modules linked in: nf_log_ipv4 nf_log_common xt_LOG ipt_REJECT nf_reject_ipv4 xt_state iptable_mangle iptable_nat nf_nat_ipv4 nf_nat iptable_filter vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) nfsd ipv6 crc_ccitt fuse algif_skcipher af_alg uvcvideo btusb btrtl btbcm videobuf2_vmalloc btintel videobuf2_memops videobuf2_v4l2 bluetooth videobuf2_core videodev rndis_host cdc_ether usbnet mii snd_hda_codec_realtek snd_hda_codec_generic intel_rapl iosf_mbi x86_pkg_temp_thermal intel_powerclamp kvm_intel iwlmvm snd_hda_intel kvm snd_hda_codec snd_hwdep i915 snd_hda_core irqbypass snd_pcm crct10dif_pclmul snd_timer ghash_clmulni_intel iwlwifi snd mei_me alx psmouse i2c_dev mei mdio asus_nb_wmi serio_raw asus_wmi sparse_keymap mxm_wmi wmi
[26639.051493] CPU: 0 PID: 13564 Comm: kworker/2:2 Tainted: G O 4.9.9-ck1-smp #13
[26639.051494] Hardware name: ASUSTeK COMPUTER INC. N56VM/N56VM, BIOS N56VM.214 08/28/2012
[26639.051505] ffffc9000ccd7d20 ffffffff8143a6ed 0000000000000000 0000000000000000
[26639.051510] ffffc9000ccd7d60 ffffffff81097786 0000079e00000002 ffff88004e82b000
[26639.051514] 0000000000017680 ffff880225c1d948 0000000000000282 ffffc9000ccd7e18
[26639.051518] Call Trace:
[26639.051528] [] dump_stack+0x4f/0x72
[26639.051533] [] __warn+0xc6/0xe0
[26639.051537] [] warn_slowpath_null+0x18/0x20
[26639.051541] [] valid_task_cpu+0xa7/0xb0
[26639.051544] [] do_set_cpus_allowed+0x37/0xa0
[26639.051549] [] __kthread_bind_mask+0x3b/0x70
[26639.051552] [] kthread_bind_mask+0xe/0x10
[26639.051555] [] create_worker+0xfb/0x1a0
[26639.051558] [] worker_thread+0x318/0x4e0
[26639.051561] [] ? process_one_work+0x4a0/0x4a0
[26639.051564] [] kthread+0xd4/0xf0
[26639.051568] [] ? kthread_park+0x60/0x60
[26639.051574] [] ret_from_fork+0x22/0x30
[26639.051577] ---[ end trace 6e2ff89d2389b048 ]---
[26639.077517] Task kworker/2:1 (pid=14035) is on cpu 2 (state=0, flags=4208040)
[26639.117982] Removed affinity for 590 processes to cpu 3
[26639.118006] smpboot: CPU 3 is now offline
[26639.167935] Removed affinity for 590 processes to cpu 4
[26639.167950] smpboot: CPU 4 is now offline
[26639.217857] Removed affinity for 589 processes to cpu 5
[26639.217868] smpboot: CPU 5 is now offline
[26639.267850] Removed affinity for 589 processes to cpu 6
[26639.267861] smpboot: CPU 6 is now offline
[26639.317831] Removed affinity for 589 processes to cpu 7
[26639.317842] smpboot: CPU 7 is now offline
[26639.342205] ACPI: Low-level resume complete
[26639.342259] ACPI : EC: EC started
[26639.342260] PM: Restoring platform NVS memory
[26639.342626] Suspended for 106421.939 seconds
[26639.342710] Enabling non-boot CPUs ...

Thank you.
ReplyDelete
Replies
Anonymous16 February 2017 at 03:23
Hi Con,

I've made some scaling tests with CFS and MuQSS, to see why MuQSS is performing poorly under half load.
They are here :
https://docs.google.com/spreadsheets/d/163U3H-gnVeGopMrHiJLeEY1b7XlvND2yoceKbOvQRm4/edit?usp=sharing
in the '4.9.9 Scaling test' sheet.

I remember that you said that it might be related to load balancing and Intel turbo boost.
However, I've found that my motherboard set the CPU to it's max turbo boost frequency when XMP memory profile is enabled (and XMP is always enabled on my computer).
So my 4770k CPU always runs at a maximum frequency of 3.9, whether 1, 2, 3 or more cores are loaded. I've checked that with turbostat.
So I believe it's not a turbo boost issue.
I've also done some tests with XMP disabled and turbo boost working as intended.

The only thing I've found, using 'turbostat make -jN', is that with CFS, load is distributed evenly across physical cores and logical cores, whereas MuQSS puts more load on physical cpu.
I don't know if it's intended or if it can cause this performance issue.
I just write this to let you know.

Pedro
ReplyDelete
Replies
Anonymous17 February 2017 at 04:13
Hi Con, my last post about performance under half load has been filtered.
Can you please bring it up?

Pedro
ReplyDelete
Replies
Anonymous17 February 2017 at 14:04
Low-latency 4.9.9-ck1 kernel config.

(Based on Slackware64 14.2 4.4.14 kernel config, should run on any hardware.)

Getting 1.1 µs average latency on all 4 cores on a Xeon W3565 3.2 GHz quad-core using cyclictest.

http://pastebin.com/vvwsT3mE

No initramfs, cgroups, namespaces, etc. support, adjust as needed.
ReplyDelete
Replies
Anonymous18 February 2017 at 07:15
Hi, when I use an external usb wifi in 4.9-CK kernel the system hangs/freeze and only happens with that kernel ck, when I use 4.9 vanilla or Zen it doesnt happen? syslog doesnt show anything I had to press the power button to restart again.
ReplyDelete
Replies
Unknown18 January 2018 at 10:46
The United States Federal Government and WordPress Private Grant Foundations give away billions in free money every year to millions of US and Canada Citizens just like me and you. These are free cash grants that all US and Canada taxpaying citizens are entitled to and should take advantage of. This free money can be used for almost anything you can imagine. In fact right now people are being approved for large sums of money to start a business, even to buy a house. I am a witness and i was given $200,000 cash so don't sit back and watch these opportunities pass you by.It doesn't even matter if you have debt, or a bad credit rating; you can still qualify. Grant Programs are not loans,and no matter how much free government money you receive you will never have to pay it back. Visit the federal grant official website for more details (federalgrantrefundcom.wordpress.com)
ReplyDelete
Replies
Unknown18 January 2018 at 10:47

If you are in desperate need of a hacker for hire? This dude's is a cyber guru, he is involved with Getting your bank blank atm cards which could debit money from any atm machine. Bank transfers and wire transfers as well as Paypal jobs, hes that good,had to make him my personal hacker. You could mail him cyberhacker906@gmail.com as well if you got issues, he's as discreet and professional too. He's kinda picky though so make mention of the reference. Bryan referred you.
ReplyDelete
Replies
Unknown25 February 2018 at 07:19
*Cheating Spouse *University grades changing *Bank accounts hack *Twitters hack *email accounts hack *Grade Changes hack *Website crashed hack *server crashed hack *Retrieval of lost file/documents *Erase criminal records hack *Databases hack *Sales of Dumps cards of all kinds *Untraceable Ip *Individual computers hack *Websites hack *Facebook hack *Control devices remotely hack *Burner Numbers hack *Verified Paypal Accounts hack *Any social media account hack *Android & iPhone Hack *Word Press Blogs hack *Text message interception hack *email interception hack

contact: hackwithjonny at gmail dot com +17272202668
ReplyDelete
Replies

Add comment