lc0 | The rewritten engine, originally for tensorflow Now all other backends have been ported here | Game Engine library

by LeelaChessZero C++ Version: v0.30.0-rc1 License: GPL-3.0

X-Ray Key Features Code Snippets Community Discussions(10)Vulnerabilities Install Support

kandi X-RAY | lc0 Summary

lc0 is a C++ library typically used in Gaming, Game Engine applications. lc0 has no bugs, it has no vulnerabilities, it has a Strong Copyleft License and it has medium support. You can download it from GitHub.

Lc0 is a UCI-compliant chess engine designed to play chess via neural network, specifically those of the LeelaChessZero project.

Support

Quality

Security

License

Reuse

Support

lc0 has a medium active ecosystem.

It has 2107 star(s) with 454 fork(s). There are 91 watchers for this library.

It had no major release in the last 12 months.

There are 80 open issues and 499 have been closed. On average issues are closed in 362 days. There are 101 open pull requests and 0 closed requests.

It has a neutral sentiment in the developer community.

The latest version of lc0 is v0.30.0-rc1

Quality

lc0 has no bugs reported.

Security

lc0 has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

License

lc0 is licensed under the GPL-3.0 License. This license is Strong Copyleft.

Strong Copyleft licenses enforce sharing, and you can use them when creating open source projects.

Reuse

lc0 releases are available to install and integrate.

Installation instructions are not available. Examples and code snippets are available.

Top functions reviewed by kandi - BETA

kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of lc0

Get all kandi verified functions for this library.

lc0 Key Features

No Key Features are available at this moment for lc0.

lc0 Examples and Code Snippets

No Code Snippets are available at this moment for lc0.

Community Discussions

Trending Discussions on lc0

Why does my empty loop run twice as fast if called as a function, on Intel Skylake CPUs?

C array not initialized as expected when cross-compiling for baremetal ARM

Why is `printf("%s", "foo")` not being optimized to `fputs("foo", stdout)`?

Assembly: How to return a pointer?

Different ways of negating a float variable generate different assembly

How is struct copied from stack to uninitialized data segment in GNU as?

Why does printf print a different number for 1.0 vs. 1. when you incorrectly use a format that isn't a float format?

According to the current C Standard, what is the default value assigned to an `int` which is declared but not defined?

Why is the compiler using the frame pointer and link register?

Why -O1 is faster than -O2

QUESTION

Why does my empty loop run twice as fast if called as a function, on Intel Skylake CPUs?

Asked 2021-Jun-08 at 02:35

I was running some tests to compare C to Java and ran into something interesting. Running my exactly identical benchmark code with optimization level 1 (-O1) in a function called by main, rather than in main itself, resulted in roughly double performance. I'm printing out the size of test_t to verify beyond any doubt that the code is being compiled to x64.

I sent the executables to my friend who's running an i7-7700HQ and got similar results. I'm running an i7-6700.

Here's the slower code:

...

ANSWER

Answered 2021-Jun-07 at 22:21

The slow version:

Note that the sub rax, 1 \ jne pair goes right across the boundary of the ..80 (which is a 32byte boundary). This is one of the cases mentioned in Intels document regarding this issue namely as this diagram:

So this op/branch pair is affected by the fix for the JCC erratum (which would cause it to not be cached in the µop cache). I'm not sure if that is the reason, there are other things at play too, but it's a thing.

In the fast version, the branch is not "touching" a 32byte boundary, so it is not affected.

There may be other effects that apply. Still due to crossing a 32byte boundary, in the slow case the loop is spread across 2 chunks in the µop cache, even without the fix for JCC erratum that may cause it to run at 2 cycles per iteration if the loop cannot execute from the Loop Stream Detector (which is disabled on some processors by an other fix for an other erratum, SKL150). See eg this answer about loop performance.

To address the various comments saying they cannot reproduce this, yes there are various ways that could happen:

Whichever effect was responsible for the slowdown, it is likely caused by the exact placement of the op/branch pair across a 32byte boundary, which happened by pure accident. Compiling from source is unlikely to reproduce the same circumstances, unless you use the same compiler with the same setup as was used by the original poster.
Even using the same binary, regardless of which of the effects is responsible, the weird effect would only happen on particular processors.

Source https://stackoverflow.com/questions/67877913

QUESTION

C array not initialized as expected when cross-compiling for baremetal ARM

Asked 2021-May-26 at 22:08

My aim here is to implement a simple baremetal program for ARM, compile it manually and analyze it in GDB. A simple example main.c that shows my problem is:

...

ANSWER

Answered 2021-May-26 at 22:08

Answer is:

2) The compilation process / usage of the toolchain is wrong.

You may have several problems, an important one being that the use of the -kernel option requires the start address of your program to be 0x00010000. And you don't have a startup file, nor a linker script.

The following example should work fine, and is just adapted from a seminal article from Francesco Balducci on his blog.

startup.s:

Source https://stackoverflow.com/questions/67645035

QUESTION

Why is `printf("%s", "foo")` not being optimized to `fputs("foo", stdout)`?

Asked 2021-May-02 at 09:25

So both GCC and Clang are smart enough to optimize printf("%s\n", "foo") to puts("foo") (GCC, Clang). That's good and all.

But when I run this function through Compiler Explorer:

...

ANSWER

Answered 2021-May-02 at 09:25

Some very specific situations are optimized, like the one you showed, but it's very superficial, if you add something to your format string, even a space, it immediately discards the puts and goes back to printf.

I guess that there would be nothing to stop a more broad optimization, my speculation is that, since the performance gains are not that great, further adding more special cases was deemed as not being worth it.

In my speculation, the lack of fputs optimization would fall in that not being worth it category.

This old gcc printf optimization document sheds some light on these optimizations, I doubt that it would much different today.

Specifically:

2.3 %s\n
A printf call with the format string %s\n [line 4679-4687] is converted to a puts() call.

Source https://stackoverflow.com/questions/67344987

QUESTION

Assembly: How to return a pointer?

Asked 2021-Apr-12 at 04:17

The function inventory take an array of device pointers and call evaluate to find out what the variation is. The inventory function then returns a pointer that has the highest variation.

...

ANSWER

Answered 2021-Apr-12 at 03:24

The bug

The bug is movl %eax, 36(%rdi) at line 38 of calibrate.s. This is apparently supposed to write to the avg member of the relevant Device, but it's a 32-bit store and Device::avg is a 16-bit short. So it should be movw %ax, 36(%rdi).

How I found it with gdb

Hopefully this will provide some information about what gdb can do and how to use it effectively.

I set a breakpoint at the second printf in inventory, at which point I did x/s $rdx to see what string %rdx points to:

Source https://stackoverflow.com/questions/67014883

QUESTION

Different ways of negating a float variable generate different assembly

Asked 2021-Apr-03 at 14:55

I have the following code:

...

ANSWER

Answered 2021-Apr-03 at 14:55

Output is different, for the simple reason that compiler does not 'think' like human but follows standard.

Source https://stackoverflow.com/questions/66929770

QUESTION

How is struct copied from stack to uninitialized data segment in GNU as?

Asked 2021-Mar-19 at 14:55

Having this simple c:

...

ANSWER

Answered 2021-Mar-19 at 12:41

movq instruction will copy 8 bytes, so the data of entire struct foo is copied here:

Source https://stackoverflow.com/questions/66708302

QUESTION

Why does printf print a different number for 1.0 vs. 1. when you incorrectly use a format that isn't a float format?

Asked 2021-Mar-03 at 06:21

This is a strange behaviour:

...

ANSWER

Answered 2021-Mar-03 at 03:52

The value that's printed has nothing at all to do with the value you pass in the printf call, since integer and floating point arguments are passed in separate areas (at least, this is a common convention, and I assume that you're operating on a machine where it's true). When you ask to printf %d without passing any integer argument, you get whatever happened to be previously sitting in the space reserved for the first integer argument, which could be anything. It might be deterministic between different runs of the same compiled program (as a result of some C runtime initialization leaving a predictable value in a register, for instance) or it might be dependent on the execution environment or the phase of the moon. You really don't know, and to be honest, this isn't a case where it's worthwhile to figure out exactly how that value got there. It's junk, and that's that.

Source https://stackoverflow.com/questions/66450308

QUESTION

According to the current C Standard, what is the default value assigned to an `int` which is declared but not defined?

Asked 2021-Mar-02 at 11:42

Question of C Standard

Easy question, but couldn't seem to find the answer with a duckduckgo or by searching SO (here).

I am aware that in C, the standard states that uninitialized arrays of ints results in undefined behaviour. (Or at least most compilers behave this way.)

...

ANSWER

Answered 2021-Mar-02 at 11:40

In your example

Source https://stackoverflow.com/questions/66438611

QUESTION

Why is the compiler using the frame pointer and link register?

Asked 2021-Jan-15 at 18:28

I'm trying to understand how GNU interprets several things so my first example is very simple: declaration of an integer and printing it. If no optimization is invoked, the assembly code reads:

...

ANSWER

Answered 2021-Jan-15 at 18:28

For whatever reason -O3 doesn't turn on the -fomit-frame-pointer option when compiling with GCC for ARM64 targets (including GNU Fortran). You'll need to enable this option explicitly for the compiler to optimize away the use of the frame pointer in non-leaf functions:

Source https://stackoverflow.com/questions/65728553

QUESTION

Why -O1 is faster than -O2

Asked 2020-Nov-21 at 08:22

I wrote a C code like this:

...

ANSWER

Answered 2020-Nov-19 at 10:48

-O2 turns on many options in addition to O1, for example -falign-functions -falign-jumps -falign-labels -falign-loops. Each of them seemed to have a negative performance impact on top of -O1. I have i7-8550U and GCC 9.3.0-17ubuntu1~20.04.

I believe the branch prediction failures make this hard on the processor.

Source https://stackoverflow.com/questions/64909830

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install lc0

You can download it from GitHub.

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: