path: root/src/video_core/shader/node.h
Commit message (Author, Date, Files changed, Lines -/+)
* gl_shader_decompiler: Use uint for images and fix SUATOM (ReinUsesLisp, 2019-09-21, 1 file, -26/+20)
|     In the process remove implementation of SUATOM.MIN and SUATOM.MAX as these require a distinction between U32 and S32. These have to be implemented with imageCompSwap loop.
* shader/image: Implement SULD and remove irrelevant code (ReinUsesLisp, 2019-09-21, 1 file, -1/+3)
|     * Implement SULD as float.
|     * Remove conditional declaration of GL_ARB_shader_viewport_layer_array.
* Merge pull request #2855 from ReinUsesLisp/shfl (bunnei, 2019-09-20, 1 file, -0/+10)
|\
| |     shader_ir/warp: Implement SHFL for Nvidia devices
| * shader_ir/warp: Implement SHFL (ReinUsesLisp, 2019-09-17, 1 file, -0/+10)
| |
* | Merge pull request #2784 from ReinUsesLisp/smem (bunnei, 2019-09-18, 1 file, -1/+15)
|\ \
| |/
|/|
| |     shader_ir: Implement shared memory
| * shader_ir: Implement ST_S (ReinUsesLisp, 2019-09-05, 1 file, -1/+15)
| |     This instruction writes to a memory buffer shared with threads within the same work group. It is known as "shared" memory in GLSL.
* | shader/image: Implement SUATOM and fix SUST (ReinUsesLisp, 2019-09-11, 1 file, -14/+43)
| |
* | gl_shader_decompiler: Keep track of written images and mark them as modified (ReinUsesLisp, 2019-09-06, 1 file, -17/+31)
| |
* | kepler_compute: Implement texture queries (ReinUsesLisp, 2019-09-06, 1 file, -0/+4)
|/
* shader_ir: Implement VOTE (ReinUsesLisp, 2019-08-21, 1 file, -0/+5)
|     Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics
|
|     Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders.
|
|     To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated:
|
|     * anyThreadNV(value) -> value
|     * allThreadsNV(value) -> value
|     * allThreadsEqualNV(value) -> true
|
|     ballotARB, also known as "uint64_t(activeThreadsNV())", emits
|
|         VOTE.ANY Rd, PT, PT;
|
|     on nouveau's compiler. This doesn't match exactly to Nvidia's code
|
|         VOTE.ALL Rd, PT, PT;
|
|     Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers.
* Shader_Ir: Implement F16 Variants of F2F, F2I, I2F. (Fernando Sahmkow, 2019-07-20, 1 file, -11/+14)
|     This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done.
* shader/half_set_predicate: Fix HSETP2 implementation (ReinUsesLisp, 2019-07-20, 1 file, -2/+1)
|
* shader_ir: Unify blocks in decompiled shaders. (Fernando Sahmkow, 2019-07-09, 1 file, -6/+6)
|
* shader_ir: Implement BRX & BRA.CC (Fernando Sahmkow, 2019-07-09, 1 file, -0/+1)
|
* texture_cache: Style and Corrections (Fernando Sahmkow, 2019-06-21, 1 file, -1/+2)
|
* shader: Implement bindless images (ReinUsesLisp, 2019-06-21, 1 file, -0/+9)
|
* shader: Decode SUST and implement backing image functionality (ReinUsesLisp, 2019-06-21, 1 file, -1/+41)
|
* shader: Split SSY and PBK stack (ReinUsesLisp, 2019-06-07, 1 file, -1/+6)
|     Hardware testing revealed that SSY and PBK push to a different stack, allowing code like this:
|
|         SSY label1;
|         PBK label2;
|         SYNC;
|         label1: PBK;
|         label2: EXIT;
* shader/node: Minor changes (ReinUsesLisp, 2019-06-07, 1 file, -50/+54)
|     Reflect std::shared_ptr nature of Node on initializers and remove constant members in nodes. Add some commentaries.
* shader: Move Node declarations out of the shader IR header (ReinUsesLisp, 2019-06-07, 1 file, -0/+510)
      Analysis passes do not have a good reason to depend on shader_ir.h to work on top of nodes. This splits node-related declarations to their own file and leaves the IR in shader_ir.h