Code-Red | A Graphics Interface for DirectX12 and Vulkan | Graphics library

by LinkClinton C++ Version: Current License: MIT

X-Ray Key Features Code Snippets Community Discussions(1)Vulnerabilities Install Support

kandi X-RAY | Code-Red Summary

Code-Red is a C++ library typically used in User Interface, Graphics applications. Code-Red has no bugs, it has no vulnerabilities, it has a Permissive License and it has low support. You can download it from GitHub.

A Graphics Interface for DirectX12 and Vulkan

Support

Quality

Security

License

Reuse

Support

Code-Red has a low active ecosystem.

It has 28 star(s) with 5 fork(s). There are 3 watchers for this library.

It had no major release in the last 6 months.

There are 3 open issues and 4 have been closed. On average issues are closed in 12 days. There are no pull requests.

It has a neutral sentiment in the developer community.

The latest version of Code-Red is current.

Quality

Code-Red has no bugs reported.

Security

Code-Red has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

License

Code-Red is licensed under the MIT License. This license is Permissive.

Permissive licenses have the least restrictions, and you can use them in most projects.

Reuse

Code-Red releases are not available. You will need to build from source code and install.

Installation instructions, examples and code snippets are available.

Top functions reviewed by kandi - BETA

kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of Code-Red

Get all kandi verified functions for this library.

Code-Red Key Features

No Key Features are available at this moment for Code-Red.

Code-Red Examples and Code Snippets

No Code Snippets are available at this moment for Code-Red.

Community Discussions

Trending Discussions on Code-Red

Why this unnecessary MOVAPD copy in gcc 9.1, in a tiny function

QUESTION

Why this unnecessary MOVAPD copy in gcc 9.1, in a tiny function

Asked 2020-Jul-28 at 18:20

Consider the following code:

...

ANSWER

Answered 2020-Jul-28 at 17:45

This is a GCC missed optimization; this is unfortunately not rare for GCC in tiny functions when its register allocator does a poor job with hard-register constraints imposed by the calling convention; apparently GCC is not usually dumb like this between parts of larger functions.

The pxor-zeroing is there to break the (false) output dependency of cvtss2sd, which exists because of Intel's short-sighted design for single-source scalar instructions to leave the upper part of the destination vector unmodified. They started this with SSE1 for PIII, where it gave a short-term gain because PIII handled XMM regs as two 64-bit halves, so only writing one half let instructions like sqrtss be single-uop.

But they unfortunately kept this pattern even for SSE2 (new with Pentium 4). And later declined to fix it with the AVX version of SSE instructions. So compilers are stuck choosing between the risks of creating a long loop-carried dependency chain through a false dependency, or of using pxor-zeroing. GCC conservatively always uses pxor at -O3, omitting it at -Os. (2-source operations like mulsd already depend on the destination as an input so this is unnecessary).

In this case, with its poor choice of register allocation, leaving out pxor-zeroing would mean that converting (float)b back to double couldn't start until a was ready. So if the critical path was a being ready (b ready early), omitting it would increase the latency from a->result by 5 cycles on Skylake (for the 2-uop cvtss2sd to run only after a was ready, because the output has to merge into the register that originally held a.) Otherwise it's just the mulsd that has to wait for a, with all the stuff involving b done ahead of time.

foo same,same is another way to work around an output dependency; that's what clang is doing. (And what GCC tries to do for popcnt, which unexpectedly has one on Sandybridge-family that's not architecturally required, unlike these stupid SSE ones.)

BTW, AVX 3-operand instructions do sometimes provide a way to work around the false dependencies, using a "cold" register, or one that was xor-zeroed, as the register to merge into. Including for scalar int->FP, although clang sometimes just uses movd plus packed-conversion for that.

Related: Why does adding an xorps instruction make this function using cvtsi2ss and addss ~5x faster? (I should have just linked that, I forgot I already wrote this up in that much detail on Stack Overflow recently.)

The movapd and pxor zeroing don't cost any latency on modern CPUs, but nothing is ever free. They still cost a front-end uop, and code size (L1i cache footprint). movapd has zero latency in the back-end, and doesn't need an execution unit, but that's all - Can x86's MOV really be "free"? Why can't I reproduce this at all?

Source https://stackoverflow.com/questions/63139043

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install Code-Red

Clone or download the repository.
Download and install the requisite SDK.
Open the solution of CodeRed with Visual Studio 2019.
Build the source to a library(or reference the source).

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: