rsp | A Really Simple Proxy | Proxy library

by gpjt C Version: Current License: MIT

X-Ray Key Features Code Snippets Community Discussions(10)Vulnerabilities Install Support

kandi X-RAY | rsp Summary

rsp is a C library typically used in Networking, Proxy applications. rsp has no bugs, it has no vulnerabilities, it has a Permissive License and it has low support. You can download it from GitHub.

A Really Simple Proxy

Support

Quality

Security

License

Reuse

Support

rsp has a low active ecosystem.

It has 36 star(s) with 11 fork(s). There are 5 watchers for this library.

It had no major release in the last 6 months.

rsp has no issues reported. There are no pull requests.

It has a neutral sentiment in the developer community.

The latest version of rsp is current.

Quality

rsp has no bugs reported.

Security

rsp has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

License

rsp is licensed under the MIT License. This license is Permissive.

Permissive licenses have the least restrictions, and you can use them in most projects.

Reuse

rsp releases are not available. You will need to build from source code and install.

Top functions reviewed by kandi - BETA

kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of rsp

Get all kandi verified functions for this library.

rsp Key Features

No Key Features are available at this moment for rsp.

rsp Examples and Code Snippets

No Code Snippets are available at this moment for rsp.

Community Discussions

Trending Discussions on rsp

Why does the .NET CLR not inline this properly?

GCC emits a label that's not jumped to by anything outside that label?

error LNK2019: unresolved external symbol referenced when compile HTTPD

Mapping WebAssembly binary to its source code

OpenGL extensions not linking on Windows

Why does my empty loop run twice as fast if called as a function, on Intel Skylake CPUs?

Error occurred when 'il2cpp Android build' in Unity

Azure Pipeline fails on `dotnet build` with error "command or file was not found"

Where is the const& args stored?

Strategy for AMD64 cache optimization - stacks, symbols, variables and strings tables

QUESTION

Why does the .NET CLR not inline this properly?

Asked 2021-Jun-15 at 19:35

I ran into less than ideal inlining behavior of the .NET JIT compiler. The following code is stripped of its context, but it demonstrates the problem:

...

ANSWER

Answered 2021-Jun-15 at 19:35

The functions Hash_Inline and Hash_FunctionCall are not equivalent:

The first statement in Hash_Inline rotates by 1, but in Hash_FunctionCall it rotates by curIndex.
For RotateLeft you may have probably meant:

Source https://stackoverflow.com/questions/67991820

QUESTION

GCC emits a label that's not jumped to by anything outside that label?

Asked 2021-Jun-14 at 11:27

Taking the following C code

...

ANSWER

Answered 2021-Jun-14 at 11:23

If you read the assembler code from the top you will see that it reaches .L3, plus it also jumps to it with jne .L3, which is your for loop in C.

Source https://stackoverflow.com/questions/67969134

QUESTION

error LNK2019: unresolved external symbol referenced when compile HTTPD

Asked 2021-Jun-13 at 19:58

I'm compiling HTTPD 2.4.48 along with Lua, Zlib, cURL, jansson and OpenSSL.

Here is the list of files and software I use:

httpd-2.4.48
apr-1.7.0
apr-util-1.6.1
cURL 7.77.0
expat-2.4.1
jansson 2.13.1
Lua 5.4.3
mod_fcgid 2.3.9
openssl-1.1.1k
pcre-8.44
ZLIB 1.2.11
ActivePerl v5.28.1.2801 (x64)
CMake v3.20.3 (x64)
NASM v2.15.05 (x64)
Gawk v3.1.6-1 (x86)

The whole compile statement I use:

Visual Studio 2015: call "C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\vcvarsall.bat" amd64

...

ANSWER

Answered 2021-Jun-13 at 19:58

Whenever you fix issues, start by the first one (cause solving that may remove the remaining), which in you case seems to be:

Source https://stackoverflow.com/questions/67944929

QUESTION

Mapping WebAssembly binary to its source code

Asked 2021-Jun-10 at 15:38

Compiling C/C++ code with the -g flag results in debug information in the produced binary file. In particular, there is a mapping of source code to binary code:

...

ANSWER

Answered 2021-Jun-10 at 15:38

llvm-objdump -S should work in the same way that it does for native object files.

If you are looking for nice display of code that lacks debug info you might also want to take a look at wasm-decompile which is part of the wabt project. Its able to do a much better job of making something readable than normal/native decompilers.

Source https://stackoverflow.com/questions/67920853

QUESTION

OpenGL extensions not linking on Windows

Asked 2021-Jun-10 at 14:30

I'm trying to link OpenGL to an application for Windows (building on Windows).

I'm using Conan as package manager, CMake for building and MSVC as compiler (and CLion as IDE).

The program compiles, but I have linker errors, for what I believe to be extension functions in OpenGL:

...

ANSWER

Answered 2021-Jun-10 at 14:30

I'm compiling with GL_GLEXT_PROTOTYPES=1.

Well, don't do that. That is never going to work in a portable way. On windows, the opengl32.dll always exports only the functions which are in OpenGL 1.1, and for everything beyond that, you have to rely to the OpenGL extension loading mechanism at runtime.

I have tried:

[...]

Adding GLEW

That's a step in the right direction. But this does not make things to magically work. A GL loader like GLEW typically brings its own header as a replacement for GL.h and glext.h etc., and the typical GL loader (like GLEW) simply re-define every GL functions as a macro, like this:

Source https://stackoverflow.com/questions/67921973

QUESTION

Why does my empty loop run twice as fast if called as a function, on Intel Skylake CPUs?

Asked 2021-Jun-08 at 02:35

I was running some tests to compare C to Java and ran into something interesting. Running my exactly identical benchmark code with optimization level 1 (-O1) in a function called by main, rather than in main itself, resulted in roughly double performance. I'm printing out the size of test_t to verify beyond any doubt that the code is being compiled to x64.

I sent the executables to my friend who's running an i7-7700HQ and got similar results. I'm running an i7-6700.

Here's the slower code:

...

ANSWER

Answered 2021-Jun-07 at 22:21

The slow version:

Note that the sub rax, 1 \ jne pair goes right across the boundary of the ..80 (which is a 32byte boundary). This is one of the cases mentioned in Intels document regarding this issue namely as this diagram:

So this op/branch pair is affected by the fix for the JCC erratum (which would cause it to not be cached in the µop cache). I'm not sure if that is the reason, there are other things at play too, but it's a thing.

In the fast version, the branch is not "touching" a 32byte boundary, so it is not affected.

There may be other effects that apply. Still due to crossing a 32byte boundary, in the slow case the loop is spread across 2 chunks in the µop cache, even without the fix for JCC erratum that may cause it to run at 2 cycles per iteration if the loop cannot execute from the Loop Stream Detector (which is disabled on some processors by an other fix for an other erratum, SKL150). See eg this answer about loop performance.

To address the various comments saying they cannot reproduce this, yes there are various ways that could happen:

Whichever effect was responsible for the slowdown, it is likely caused by the exact placement of the op/branch pair across a 32byte boundary, which happened by pure accident. Compiling from source is unlikely to reproduce the same circumstances, unless you use the same compiler with the same setup as was used by the original poster.
Even using the same binary, regardless of which of the effects is responsible, the weird effect would only happen on particular processors.

Source https://stackoverflow.com/questions/67877913

QUESTION

Error occurred when 'il2cpp Android build' in Unity

Asked 2021-Jun-07 at 21:04

error :

...

ANSWER

Answered 2021-Jun-07 at 21:04

This is a bug in version 6.0.0 of the Google Mobile Ads Unity plugin. Tracked in https://github.com/googleads/googleads-mobile-unity/issues/1613.

Source https://stackoverflow.com/questions/67876930

QUESTION

Azure Pipeline fails on `dotnet build` with error "command or file was not found"

Asked 2021-Jun-07 at 01:59

Recently I had Azure Pipeline builds start failing, without any changes to my build scripts/yaml. The errors are as follows but they're still pretty light on the details.

...

ANSWER

Answered 2021-Jun-07 at 01:59

The issue was in fact due to the FscToolPath evaluating to an empty string.

Existing error message accurately conveys the issue; it’s not F#-specific. Something in the .props/.targets files evaluates to dotnet $(PathToFsc) some/file.rsp and the variable $(PathToFsc) (or whatever is in your build scripts) is evaluating to an empty string. The final command that’s executed is then dotnet some/file.rsp and the normal dotnet behavior is to look for dotnet- as an executable.

The second factor was that the location of FSC did change due to an update of Visual Studio on the VM Image.

Not an answer, but I wonder if it's related to this: stackoverflow.com/questions/67800998/… - it seems things may have moved between VS 16.9 and 16.10.

Finally why it impacted me was because I was setting the FscCompilerPath manually due to a TypeProvider that did not support the dotnet core pipeline due to a dependency on System.Data.SqlClient.

Source https://stackoverflow.com/questions/67815717

QUESTION

Where is the const& args stored?

Asked 2021-Jun-06 at 11:25

Here is the function definition

...

ANSWER

Answered 2021-Jun-06 at 09:04

1. The function

The language doesn't define where arguments to functions are stored. Different ABIs, for different platforms, define this.

Typically, a function argument, before any optimization, is stored on the stack. A reference is no different in this respect. What's actually stored would be a pointer to the refered-to object. Think of it this way:

Source https://stackoverflow.com/questions/67856951

QUESTION

Strategy for AMD64 cache optimization - stacks, symbols, variables and strings tables

Asked 2021-Jun-05 at 00:12

Intro

I am going to write my own FORTH "engine" in GNU assembler (GAS) for Linux x86-64 (specifically for AMD Ryzen 9 3900X that is siting on my table).

(If it will be success, I may use similar idea for make firmware for retro 6502 and similar home-brewed computer)

I want to add some interesting debugging features, as saving comments with the compiled code in for of "NOP words" with attached strings, which would do nothing in runtime, but when disassembling/printing out already defined words it would print those comment too, so it would not loose all the headers ( a b -- c) and comments like ( here goes this particular little trick ) and I would be able try to define new words with documentation, and later print all definitions in some nice way and make new library from those, which I consider good. (And have switch to just ignore comments for "production release")

I had read too much of optimalization here and I am not able to understand all of that in few weeks, so I will put out microoptimalisation until it will suffer performance problems and then I will start with profiling.

But I want to start with at least decent architectural decisions.

What I understood yet:

it would be nice, if the programs was run mainly from CPU cache, not from memory
the cache is filled somehow "automagically", but having related data/code compact and as near as possible may help a lot
I identified some areas, that would be good candidates for caching and some, that are not so good - I sorted it in order of importance:
- assembler code - the engine and basic words like "+" - used all the time (fixed size, .text section)
- both stacks - also used all the time (dynamic, I will probably use rsp for data stack and implement return stack independly - not sure yet, which will be "native" and which "emulated")
- forth bytecode - the defined and compiled words - used at runtime, when the speed matters (still growing size)
- variables, constants, strings, other memory allocations (used in runtime)
- names of words ("DUP", "DROP" - used only when defining new words in compilation phase)
- comments (used one daily or so)

Question:

As there is lot of "heaps" that grows up (well, there is not "free" used, so it may be also stack, or stack growing up) (and two stacks that grows down) I am unsure how to implement it, so the CPU cache will cover it somehow decently.

My idea is to use one "big heap" (and increse it with brk() when needed), and then allocate big chunks of alligned memory on it, implementing "smaller heaps" in each chunk and extend them to another big chunk when the old one is filled up.

I hope, that the cache would automagically get the most used blocks first keep it most of the time and the less used blocks would be mostly ignored by the cache (respective it would occupy only small parts and get read and kicked out all the time), but maybe I did not it correctly.

But maybe is there some better strategy for that?

...

ANSWER

Answered 2021-Jun-04 at 23:53

Your first stops for further reading should probably be:

What Every Programmer Should Know About Memory? re: cache
https://agner.org/optimize/ re: everything else about writing efficient asm.
https://uops.info/ for a better version of Agner Fog's instruction tables.
See also other links in https://stackoverflow.com/tags/x86/info

so I will put out microoptimalisation until it will suffer performance problems and then I will start with profiling.

Yes, probably good to start trying stuff so you have something to profile with HW performance counters, so you can correlate what you're reading about performance stuff with what actually happens. And so you get some ideas of possible details you hadn't thought of yet before you go too far into optimizing your overall design idea. You can learn a lot about asm micro-optimization by starting with something very small scale, like a single loop somewhere without any complicated branching.

Since modern CPUs use split L1i and L1d caches and first-level TLBs, it's not a good idea to place code and data next to each other. (Especially not read-write data; self-modifying code is handled by flushing the whole pipeline on any store too near any code that's in-flight anywhere in the pipeline.)

Related: Why do Compilers put data inside .text(code) section of the PE and ELF files and how does the CPU distinguish between data and code? - they don't, only obfuscated x86 programs do that. (ARM code does sometimes mix code/data because PC-relative loads have limited range on ARM.)

Yes, making sure all your data allocations are nearby should be good for TLB locality. Hardware normally uses a pseudo-LRU allocation/eviction algorithm which generally does a good job at keeping hot data in cache, and it's generally not worth trying to manually clflushopt anything to help it. Software prefetch is also rarely useful, especially in linear traversal of arrays. It can sometimes be worth it if you know where you'll want to access quite a few instructions later, but the CPU couldn't predict that easily.

AMD's L3 cache may use adaptive replacement like Intel does, to try to keep more lines that get reused, not letting them get evicted as easily by lines that tend not to get reused. But Zen2's 512kiB L2 is relatively big by Forth standards; you probably won't have a significant amount of L2 cache misses. (And out-of-order exec can do a lot to hide L1 miss / L2 hit. And even hide some of the latency of an L3 hit.) Contemporary Intel CPUs typically use 256k L2 caches; if you're cache-blocking for generic modern x86, 128kiB is a good choice of block size to assume you can write and then loop over again while getting L2 hits.

The L1i and L1d caches (32k each), and even uop cache (up to 4096 uops, about 1 or 2 per instruction), on a modern x86 like Zen2 (https://en.wikichip.org/wiki/amd/microarchitectures/zen_2#Architecture) or Skylake, are pretty large compared to a Forth implementation; probably everything will hit in L1 cache most of the time, and certainly L2. Yes, code locality is generally good, but with more L2 cache than the whole memory of a typical 6502, you really don't have much to worry about :P

Of more concern for an interpreter is branch prediction, but fortunately Zen2 (and Intel since Haswell) have TAGE predictors that do well at learning patterns of indirect branches even with one "grand central dispatch" branch: Branch Prediction and the Performance of Interpreters - Don’t Trust Folklore

Source https://stackoverflow.com/questions/67841704

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install rsp

You can download it from GitHub.

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: