summaryrefslogtreecommitdiffstats
path: root/src/video_core/shader/decode (follow)
Commit message (Collapse)AuthorAgeFilesLines
* shader: Implement MEMBAR.GLReinUsesLisp2019-12-101-0/+6
| | | | Implement using memoryBarrier in GLSL and OpMemoryBarrier on SPIR-V.
* shader_ir/other: Implement S2R InvocationIdReinUsesLisp2019-12-101-0/+2
|
* shader: Keep track of shaders using warp instructionsReinUsesLisp2019-12-101-0/+3
|
* shader_ir/memory: Implement patch storesReinUsesLisp2019-12-101-16/+18
|
* Merge pull request #3109 from FernandoS27/new-instrbunnei2019-12-072-7/+65
|\ | | | | Implement FLO & TXD Instructions on GPU Shaders
| * Shader_IR: Address FeedbackFernando Sahmkow2019-11-182-10/+8
| |
| * Shader_IR: Implement TXD instruction.Fernando Sahmkow2019-11-141-7/+49
| |
| * Shader_IR: Implement FLO instruction.Fernando Sahmkow2019-11-141-0/+18
| |
* | shader/texture: Handle TLDS texture type mismatchesReinUsesLisp2019-11-231-1/+10
| | | | | | | | | | | | | | | | | | | | Some games like "Fire Emblem: Three Houses" bind 2D textures to offsets used by instructions of 1D textures. To handle the discrepancy this commit uses the the texture type from the binding and modifies the emitted code IR to build a valid backend expression. E.g.: Bound texture is 2D and instruction is 1D, the emitted IR samples a 2D texture in the coordinate ivec2(X, 0).
* | shader/texture: Deduce texture buffers from lockerReinUsesLisp2019-11-231-61/+41
| | | | | | | | | | Instead of specializing shaders to separate texture buffers from 1D textures, use the locker to deduce them while they are being decoded.
* | shader/other: Reduce DEPBAR log severityReinUsesLisp2019-11-201-1/+1
|/ | | | | | While DEPBAR is stubbed it doesn't change anything from our end. Shading languages handle what this instruction does implicitly. We are not getting anything out fo this log except noise.
* shader_ir/warp: Implement FSWZADDReinUsesLisp2019-11-081-0/+9
|
* gl_shader_decompiler: Reimplement shuffles with platform agnostic intrinsicsReinUsesLisp2019-11-081-33/+35
|
* shader/decode: Reduce severity of arithmetic rounding warningsReinUsesLisp2019-11-076-15/+17
|
* shader/arithmetic: Reduce RRO stub severityReinUsesLisp2019-11-071-1/+2
|
* shader/texture: Remove NODEP warningsReinUsesLisp2019-11-071-35/+0
| | | | | These warnings don't offer meaningful information while decoding shaders. Remove them.
* Merge pull request #3039 from ReinUsesLisp/cleanup-samplersRodrigo Locatti2019-11-062-54/+55
|\ | | | | shader/node: Unpack bindless texture encoding
| * shader/node: Unpack bindless texture encodingReinUsesLisp2019-10-302-54/+55
| | | | | | | | | | | | | | | | | | Bindless textures were using u64 to pack the buffer and offset from where they come from. Drop this in favor of separated entries in the struct. Remove the usage of std::set in favor of std::list (it's not std::vector to avoid reference invalidations) for samplers and images.
* | Shader_IR: Fix regression on TLD4Fernando Sahmkow2019-10-311-4/+3
| | | | | | | | | | | | Originally on the last commit I thought TLD4 acted the same as TLD4S and didn't have a mask. It actually does have a component mask. This commit corrects that.
* | Shader_IR: Fix TLD4 and add Bindless Variant.Fernando Sahmkow2019-10-301-8/+24
|/ | | | | | This commit fixes an issue where not all 4 results of tld4 were being written, the color component was defaulted to red, among other things. It also implements the bindless variant.
* Merge pull request #2976 from FernandoS27/cache-fast-brx-rebasedRodrigo Locatti2019-10-261-20/+50
|\ | | | | Implement Fast BRX, fix TXQ and addapt the Shader Cache for it
| * Shader_IR: Address Feedback.Fernando Sahmkow2019-10-261-22/+16
| |
| * Shader_IR: allow lookup of texture samplers within the shader_ir for instructions that don't provide itFernando Sahmkow2019-10-251-18/+54
| |
* | Merge pull request #3013 from FernandoS27/tld4s-fixRodrigo Locatti2019-10-261-4/+4
|\ \ | |/ |/| Shader_Ir: Fix TLD4S from using a component mask.
| * Shader_Ir: Fix TLD4S from using a component mask.Fernando Sahmkow2019-10-221-4/+4
| | | | | | | | | | | | TLD4S always outputs 4 values, the previous code checked a component mask and omitted those values that weren't part of it. This commit corrects that and makes sure all 4 values are set.
* | video_core/shader: Resolve instances of variable shadowingLioncash2019-10-246-11/+12
| | | | | | | | Silences a few -Wshadow warnings.
* | shader_ir/memory: Ignore global memory when tracking failsReinUsesLisp2019-10-221-16/+23
|/ | | | | | | | | | | Ignore global memory operations instead of invoking undefined behaviour when constant buffer tracking fails and we are blasting through asserts, ignore the operation. In the case of LDG this means filling the destination registers with zeroes; for STG this means ignore the instruction as a whole. The default behaviour is still to abort execution on failure.
* shader/half_set_predicate: Fix HSETP2 for constant buffersReinUsesLisp2019-10-071-0/+2
| | | | | HSETP2 when used with a constant buffer parses the second operand type as F32. This is not configurable.
* shader/half_set_predicate: Reduce DEBUG_ASSERT to LOG_DEBUGReinUsesLisp2019-10-071-1/+2
|
* Merge pull request #2869 from ReinUsesLisp/suldbunnei2019-09-241-60/+77
|\ | | | | shader/image: Implement SULD and fix SUATOM
| * gl_shader_decompiler: Use uint for images and fix SUATOMReinUsesLisp2019-09-211-37/+29
| | | | | | | | | | | | In the process remove implementation of SUATOM.MIN and SUATOM.MAX as these require a distinction between U32 and S32. These have to be implemented with imageCompSwap loop.
| * shader/image: Implement SULD and remove irrelevant codeReinUsesLisp2019-09-211-24/+49
| | | | | | | | | | * Implement SULD as float. * Remove conditional declaration of GL_ARB_shader_viewport_layer_array.
* | Merge pull request #2878 from FernandoS27/icmpRodrigo Locatti2019-09-211-0/+29
|\ \ | |/ |/| shader_ir: Implement ICMP
| * Shader_IR: ICMP corrections and fixesFernando Sahmkow2019-09-211-6/+9
| |
| * Shader_IR: Implement ICMP.Fernando Sahmkow2019-09-201-0/+26
| |
* | Merge pull request #2855 from ReinUsesLisp/shflbunnei2019-09-201-0/+47
|\ \ | |/ |/| shader_ir/warp: Implement SHFL for Nvidia devices
| * shader_ir/warp: Implement SHFLReinUsesLisp2019-09-171-0/+47
| |
* | Merge pull request #2784 from ReinUsesLisp/smembunnei2019-09-181-19/+29
|\ \ | |/ |/| shader_ir: Implement shared memory
| * shader_ir: Implement LD_SReinUsesLisp2019-09-051-10/+13
| | | | | | | | Loads from shared memory.
| * shader_ir: Implement ST_SReinUsesLisp2019-09-051-9/+16
| | | | | | | | | | This instruction writes to a memory buffer shared with threads within the same work group. It is known as "shared" memory in GLSL.
* | shader/image: Implement SUATOM and fix SUSTReinUsesLisp2019-09-111-21/+71
| |
* | Merge pull request #2823 from ReinUsesLisp/shr-clampbunnei2019-09-101-6/+13
|\ \ | | | | | | shader/shift: Implement SHR wrapped and clamped variants
| * | shader/shift: Implement SHR wrapped and clamped variantsReinUsesLisp2019-09-041-6/+13
| | | | | | | | | | | | | | | | | | Nvidia defaults to wrapped shifts, but this is undefined behaviour on OpenGL's spec. Explicitly mask/clamp according to what the guest shader requires.
* | | gl_shader_decompiler: Keep track of written images and mark them as modifiedReinUsesLisp2019-09-061-21/+19
| |/ |/|
* | half_set_predicate: Fix predicate assignmentsReinUsesLisp2019-09-041-10/+9
|/
* Merge pull request #2812 from ReinUsesLisp/f2i-selectorbunnei2019-09-041-6/+16
|\ | | | | shader_ir/conversion: Implement F2I and F2F F16 selector
| * shader_ir/conversion: Split int and float selector and implement F2F H1ReinUsesLisp2019-08-281-18/+16
| |
| * shader_ir/conversion: Implement F2I F16 Ra.H1ReinUsesLisp2019-08-281-4/+16
| |
* | Merge pull request #2811 from ReinUsesLisp/fsetp-fixbunnei2019-09-041-4/+5
|\ \ | | | | | | float_set_predicate: Add missing negation bit for the second operand
| * | float_set_predicate: Add missing negation bit for the second operandReinUsesLisp2019-08-281-4/+5
| |/
* | video_core: Silent miscellaneous warnings (#2820)Rodrigo Locatti2019-08-305-5/+0
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * texture_cache/surface_params: Remove unused local variable * rasterizer_interface: Add missing documentation commentary * maxwell_dma: Remove unused rasterizer reference * video_core/gpu: Sort member declaration order to silent -Wreorder warning * fermi_2d: Remove unused MemoryManager reference * video_core: Silent unused variable warnings * buffer_cache: Silent -Wreorder warnings * kepler_memory: Remove unused MemoryManager reference * gl_texture_cache: Add missing override * buffer_cache: Add missing include * shader/decode: Remove unused variables
* | Merge pull request #2758 from ReinUsesLisp/packed-tidbunnei2019-08-291-0/+7
|\ \ | | | | | | shader/decode: Implement S2R Tic
| * | shader/decode: Implement S2R TicReinUsesLisp2019-07-221-0/+7
| | |
* | | shader_ir: Implement VOTEReinUsesLisp2019-08-211-0/+55
| |/ |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers.
* | Merge pull request #2777 from ReinUsesLisp/hsetp2-fe3h-fixbunnei2019-08-211-1/+1
|\ \ | | | | | | half_set_predicate: Fix HSETP2_C constant buffer offset
| * | half_set_predicate: Fix HSETP2_C constant buffer offsetReinUsesLisp2019-08-041-1/+1
| | |
* | | Merge pull request #2753 from FernandoS27/float-convertbunnei2019-08-211-5/+25
|\ \ \ | | | | | | | | Shader_Ir: Implement F16 Variants of F2F, F2I, I2F.
| * | | Shader_Ir: Implement F16 Variants of F2F, F2I, I2F.Fernando Sahmkow2019-07-201-5/+25
| | |/ | |/| | | | | | | | | | This commit takes care of implementing the F16 Variants of the conversion instructions and makes sure conversions are done.
* | | Merge pull request #2778 from ReinUsesLisp/nopbunnei2019-08-181-0/+6
|\ \ \ | | | | | | | | shader_ir: Implement NOP
| * | | shader_ir: Implement NOPReinUsesLisp2019-08-041-0/+6
| | |/ | |/|
* / | decode/half_set_predicate: Fix predicatesReinUsesLisp2019-07-261-3/+3
|/ /
* | Merge pull request #2743 from FernandoS27/surpress-assertbunnei2019-07-255-13/+20
|\ \ | |/ |/| Downgrade and suppress a series of GPU asserts and debug messages.
| * Shader_Ir: Change Debug Asserts for Log WarningsFernando Sahmkow2019-07-203-10/+17
| |
| * Shader_Ir: correct clang formatFernando Sahmkow2019-07-181-2/+2
| |
| * Shader_Ir: Downgrade precision and rounding asserts to debug asserts.Fernando Sahmkow2019-07-185-10/+10
| | | | | | | | | | | | This commit reduces the sevirity of asserts for FP precision and rounding as this are well known and have little to no consequences in gpu's accuracy.
* | shader/half_set_predicate: Fix HSETP2 implementationReinUsesLisp2019-07-201-17/+14
| |
* | shader/half_set_predicate: Implement missing HSETP2 variantsReinUsesLisp2019-07-201-13/+29
| |
* | Merge pull request #2738 from lioncash/shader-irbunnei2019-07-183-30/+30
|\ \ | |/ |/| shader-ir: Minor cleanup-related changes
| * shader_ir: Rename Get/SetTemporal to Get/SetTemporaryLioncash2019-07-173-30/+30
| | | | | | | | | | | | This is more accurate in terms of describing what the functions are actually doing. Temporal relates to time, not the setting of a temporary itself.
* | Merge pull request #2740 from lioncash/braFernando Sahmkow2019-07-171-1/+1
|\ \ | |/ |/| shader/decode/other: Correct branch indirect argument within BRA handling
| * shader/decode/other: Correct branch indirect argument within BRA handlingLioncash2019-07-161-1/+1
| | | | | | | | | | This appears to have been a copy/paste error introduced within 8a6fc529a968e007f01464abadd32f9b5eb0a26c
* | shader: Allow tracking of indirect buffers without variable offsetReinUsesLisp2019-07-153-23/+10
|/ | | | | | While changing this code, simplify tracking code to allow returning the base address node, this way callers don't have to manually rebuild it on each invocation.
* Merge pull request #2692 from ReinUsesLisp/tlds-f16Fernando Sahmkow2019-07-141-1/+7
|\ | | | | shader/texture: Add F16 support for TLDS
| * shader/texture: Add F16 support for TLDSReinUsesLisp2019-07-071-1/+7
| |
* | shader_ir: Unify blocks in decompiled shaders.Fernando Sahmkow2019-07-091-7/+23
| |
* | shader_ir: Implement BRX & BRA.CCFernando Sahmkow2019-07-091-4/+38
| |
* | Delete decode_integer_set.cppTobias2019-07-071-0/+0
|/
* decode/texture: Address feedbackReinUsesLisp2019-06-241-0/+1
|
* shader_ir: Fix image copy rebase issuesFernando Sahmkow2019-06-211-2/+7
|
* shader: Implement bindless imagesReinUsesLisp2019-06-211-2/+28
|
* shader: Decode SUST and implement backing image functionalityReinUsesLisp2019-06-211-0/+89
|
* shader: Implement texture buffersReinUsesLisp2019-06-211-0/+44
|
* shader: Split SSY and PBK stackReinUsesLisp2019-06-071-10/+8
| | | | | | | | | | | Hardware testing revealed that SSY and PBK push to a different stack, allowing code like this: SSY label1; PBK label2; SYNC; label1: PBK; label2: EXIT;
* shader: Use shared_ptr to store nodes and move initialization to fileReinUsesLisp2019-06-0626-8/+34
| | | | | | | | | Instead of having a vector of unique_ptr stored in a vector and returning star pointers to this, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class.
* Merge pull request #2446 from ReinUsesLisp/tidbunnei2019-05-291-14/+28
|\ | | | | shader: Implement S2R Tid{XYZ} and CtaId{XYZ}
| * shader: Implement S2R Tid{XYZ} and CtaId{XYZ}ReinUsesLisp2019-05-201-14/+28
| |
* | Merge pull request #2485 from ReinUsesLisp/generic-memorybunnei2019-05-251-27/+55
|\ \ | | | | | | shader/memory: Implement generic memory stores and loads (ST and LD)
| * | shader/memory: Implement ST (generic memory)ReinUsesLisp2019-05-211-21/+35
| | |
| * | shader/memory: Implement LD (generic memory)ReinUsesLisp2019-05-211-7/+21
| |/
* | shader/decode/*: Add missing newline to files lacking themLioncash2019-05-2318-18/+18
| | | | | | | | Keeps the shader code file endings consistent.
* | shader/decode/*: Eliminate indirect inclusionsLioncash2019-05-236-1/+5
| | | | | | | | | | | | | | Amends cases where we were using things that were indirectly being satisfied through other headers. This way, if those headers change and eliminate dependencies on other headers in the future, we don't have cascading compilation errors.
* | shader/decode/memory: Remove left in debug pragmaLioncash2019-05-221-2/+0
|/
* Merge pull request #2441 from ReinUsesLisp/al2pbunnei2019-05-192-10/+33
|\ | | | | shader: Implement AL2P and ALD.PHYS
| * shader_ir/other: Implement IPA.IDXReinUsesLisp2019-05-031-5/+8
| |
| * shader_ir/memory: Assert on non-32 bits ALD.PHYSReinUsesLisp2019-05-031-0/+3
| |
| * shader: Add physical attributes commentariesReinUsesLisp2019-05-031-1/+1
| |
| * shader_ir/memory: Implement physical input attributesReinUsesLisp2019-05-031-3/+6
| |
| * shader: Remove unused AbufNode Ipa modeReinUsesLisp2019-05-032-6/+3
| |
| * shader_ir/memory: Emit AL2P IRReinUsesLisp2019-05-031-0/+17
| |
* | video_core/shader/decode/texture: Remove unused variable from GetTld4Code()Lioncash2019-05-101-1/+0
| |
* | shader/decode/texture: Remove unused variableLioncash2019-05-041-1/+0
|/ | | | This isn't used anywhere, so we can get rid of it.
* Merge pull request #2435 from ReinUsesLisp/misc-vcbunnei2019-04-291-1/+1
|\ | | | | shader_ir: Miscellaneous fixes
| * shader_ir/texture: Fix sampler const buffer key shiftReinUsesLisp2019-04-261-1/+1
| |
* | Merge pull request #2322 from ReinUsesLisp/wswitchbunnei2019-04-292-5/+7
|\ \ | | | | | | video_core: Silent -Wswitch warnings
| * | video_core: Silent -Wswitch warningsReinUsesLisp2019-04-182-5/+7
| | |
* | | Merge pull request #2423 from FernandoS27/half-correctbunnei2019-04-292-15/+16
|\ \ \ | |_|/ |/| | Corrections on Half Float operations: HADD2 HMUL2 and HFMA2
| * | Corrections Half Float operations on const buffers and implement saturation.Fernando Sahmkow2019-04-212-15/+16
| | |
* | | Merge pull request #2407 from FernandoS27/f2fbunnei2019-04-201-16/+53
|\ \ \ | |/ / |/| | Do some corrections in conversion shader instructions.
| * | Do some corrections in conversion shader instructions.Fernando Sahmkow2019-04-161-16/+53
| |/ | | | | | | | | | | Corrects encodings for I2F, F2F, I2I and F2I Implements Immediate variants of all four conversion types. Add assertions to unimplemented stuffs.
* | Merge pull request #2409 from ReinUsesLisp/half-floatsbunnei2019-04-205-36/+32
|\ \ | | | | | | shader_ir/decode: Miscellaneous fixes to half-float decompilation
| * | shader_ir/decode: Fix half float pre-operations and remove MetaHalfArithmeticReinUsesLisp2019-04-165-29/+21
| | | | | | | | | | | | | | | | | | | | | Operations done before the main half float operation (like HAdd) were managing a packed value instead of the unpacked one. Adding an unpacked operation allows us to drop the per-operand MetaHalfArithmetic entry, simplifying the code overall.
| * | shader_ir/decode: Implement half float saturationReinUsesLisp2019-04-161-4/+2
| | |
| * | shader_ir/decode: Reduce severity of unimplemented half-float FTZReinUsesLisp2019-04-163-3/+9
| |/
* | Merge pull request #2348 from FernandoS27/guest-bindlessbunnei2019-04-181-19/+94
|\ \ | | | | | | Implement Bindless Textures on Shader Decompiler and GL backend
| * | Adapt Bindless to work with AOFFIFernando Sahmkow2019-04-081-7/+18
| | |
| * | Move ConstBufferAccessor to Maxwell3d, correct mistakes and clang format.Fernando Sahmkow2019-04-081-1/+2
| | |
| * | Fix TMMLFernando Sahmkow2019-04-081-5/+7
| | |
| * | Refactor GetTextureCode and GetTexCode to use an optional instead of optional parametersFernando Sahmkow2019-04-081-23/+24
| | |
| * | Implement TXQ_BFernando Sahmkow2019-04-081-2/+8
| | |
| * | Implement TMML_BFernando Sahmkow2019-04-081-5/+10
| | |
| * | Corrections to TEX_BFernando Sahmkow2019-04-081-4/+5
| | |
| * | Unify both sampler types.Fernando Sahmkow2019-04-081-10/+12
| | |
| * | Implement Bindless Samplers and TEX_B in the IR.Fernando Sahmkow2019-04-081-6/+52
| | |
* | | Merge pull request #2315 from ReinUsesLisp/severity-decompilerbunnei2019-04-171-4/+5
|\ \ \ | | | | | | | | shader_ir/decode: Reduce the severity of common assertions
| * | | shader_ir/memory: Reduce severity of LD_L cache management and log itReinUsesLisp2019-04-031-2/+2
| | | |
| * | | shader_ir/memory: Reduce severity of ST_L cache management and log itReinUsesLisp2019-04-031-2/+3
| | | |
* | | | shader_ir: Implement STG, keep track of global memory usage and flushReinUsesLisp2019-04-141-35/+74
| |_|/ |/| |
* | | Correct XMAD mode, psl and high_b on different encodings.Fernando Sahmkow2019-04-081-9/+30
| |/ |/|
* | shader_ir/decode: Silent implicit sign conversion warningMat M2019-03-311-2/+2
| | | | | | Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc>
* | shader_ir/decode: Implement AOFFI for TEX and TLD4ReinUsesLisp2019-03-301-24/+88
|/
* shader/decode: Remove extras from MetaTextureReinUsesLisp2019-02-261-14/+23
|
* shader/decode: Split memory and texture instructions decodingReinUsesLisp2019-02-262-493/+525
|
* Merge pull request #2118 from FernandoS27/ipa-improvebunnei2019-02-252-3/+14
|\ | | | | shader_decompiler: Improve Accuracy of Attribute Interpolation.
| * shader_decompiler: Improve Accuracy of Attribute Interpolation.Fernando Sahmkow2019-02-142-3/+14
| |
* | gl_shader_decompiler: Re-implement TLDS lodReinUsesLisp2019-02-121-1/+1
|/
* Merge pull request #2108 from FernandoS27/fix-ccbunnei2019-02-121-2/+2
|\ | | | | Fix incorrect value for CC bit in IADD
| * Fix incorrect value for CC bit in IADDFernando Sahmkow2019-02-111-2/+2
| |
* | Merge pull request #2109 from FernandoS27/fix-f2ibunnei2019-02-121-3/+3
|\ \ | | | | | | Corrected F2I None mode to RoundEven.
| * | Corrected F2I None mode to RoundEven.Fernando Sahmkow2019-02-111-3/+3
| |/
* | shader_ir: Remove F4 prefix to texture operationsReinUsesLisp2019-02-071-8/+7
| | | | | | | | | | | | This was originally included because texture operations returned a vec4. These operations now return a single float and the F4 prefix doesn't mean anything.
* | shader_ir: Clean texture management codeReinUsesLisp2019-02-071-96/+58
|/ | | | | | | | | Previous code relied on GLSL parameter order (something that's always ill-formed on an IR design). This approach passes spatial coordiantes through operation nodes and array and depth compare values in the the texture metadata. It still contains an "extra" vector containing generic nodes for bias and component index (for example) which is still a bit ill-formed but it should be better than the previous approach.
* Merge pull request #2083 from ReinUsesLisp/shader-ir-cbuf-trackingbunnei2019-02-0725-34/+34
|\ | | | | shader/track: Add a more permissive global memory tracking
| * shader_ir: Rename BasicBlock to NodeBlockReinUsesLisp2019-02-0325-33/+32
| | | | | | | | It's not always used as a basic block. Rename it for consistency.
| * shader_ir: Pass decoded nodes as a whole instead of per basic blocksReinUsesLisp2019-02-0325-26/+27
| | | | | | | | | | | | | | | | | | Some games call LDG at the top of a basic block, making the tracking heuristic to fail. This commit lets the heuristic the decoded nodes as a whole instead of per basic blocks. This may lead to some false positives but allows it the heuristic to track cases it previously couldn't.
* | Merge pull request #2081 from ReinUsesLisp/lmem-64bunnei2019-02-051-12/+43
|\ \ | | | | | | shader_ir/memory: Add LD_L 64 bits loads
| * | shader_ir/memory: Add ST_L 64 and 128 bits storesReinUsesLisp2019-02-031-3/+11
| | |
| * | shader_ir/memory: Add LD_L 128 bits loadsReinUsesLisp2019-02-031-7/+19
| | |
| * | shader_bytecode: Rename BytesN enums to BitsNReinUsesLisp2019-02-031-4/+4
| | |
| * | shader_ir/memory: Add LD_L 64 bits loadsReinUsesLisp2019-02-031-6/+17
| |/
* | Merge pull request #2082 from FernandoS27/txq-stlbunnei2019-02-051-6/+9
|\ \ | |/ |/| Fix TXQ not using the component mask.
| * Fix TXQ not using the component mask.Fernando Sahmkow2019-02-031-6/+9
| |
* | shader_ir: Unify constant buffer offset valuesReinUsesLisp2019-01-3013-21/+23
|/ | | | | | | Constant buffer values on the shader IR were using different offsets if the access direct or indirect. cbuf34 has a non-multiplied offset while cbuf36 does. On shader decoding this commit multiplies it by four on cbuf34 queries.
* shader_decode: Implement LDG and basic cbuf trackingReinUsesLisp2019-01-301-0/+49
|
* shader_ir: Fixup clang buildReinUsesLisp2019-01-161-4/+6
|
* shader_decode: Fixup XMADReinUsesLisp2019-01-151-1/+1
|
* shader_ir: Pass to decoder functions basic block's codeReinUsesLisp2019-01-1525-25/+25
|
* shader_decode: Improve zero flag implementationReinUsesLisp2019-01-1513-73/+53
|
* shader_ir: Remove composite primitives and use temporals insteadReinUsesLisp2019-01-151-145/+149
|
* shader_decode: Use proper primitive namesReinUsesLisp2019-01-152-8/+8
|
* shader_decode: Use BitfieldExtract instead of shift + andReinUsesLisp2019-01-155-46/+18
|
* shader_ir: Remove Ipa primitiveReinUsesLisp2019-01-151-3/+2
|
* shader_ir: Remove RZ and use Register::ZeroIndex insteadReinUsesLisp2019-01-151-6/+11
|
* shader_decode: Implement TEXS.F16ReinUsesLisp2019-01-151-13/+25
|
* shader_decode: Fixup R2PReinUsesLisp2019-01-151-2/+3
|
* shader_decode: Fixup WriteLogicOperation zero comparisonReinUsesLisp2019-01-151-1/+1
|
* shader_decode: Fixup PSETReinUsesLisp2019-01-151-2/+3
|
* shader_decode: Fixup clang-formatReinUsesLisp2019-01-152-2/+4
|
* video_core: Implement IR based geometry shadersReinUsesLisp2019-01-151-0/+25
|
* shader_decode: Implement VMAD and VSETPReinUsesLisp2019-01-151-0/+120
|
* shader_decode: Implement HSET2ReinUsesLisp2019-01-151-1/+43
|
* shader_decode: Rework HSETP2ReinUsesLisp2019-01-151-3/+5
|
* shader_decode: Implement R2PReinUsesLisp2019-01-151-1/+28
|
* shader_decode: Implement CSETPReinUsesLisp2019-01-151-14/+37
|
* shader_decode: Implement PSETReinUsesLisp2019-01-151-1/+16
|
* shader_decode: Implement HFMA2ReinUsesLisp2019-01-151-1/+53
|
* shader_decode: Implement POPCReinUsesLisp2019-01-151-0/+10
|
* shader_decode: Implement TLDS (untested)ReinUsesLisp2019-01-151-8/+61
|
* shader_decode: Update TLD4 reflecting #1862 changesReinUsesLisp2019-01-151-52/+49
|
* shader_ir: Fixup TEX and TEXS and partially fix TLD4 decompilingReinUsesLisp2019-01-151-50/+49
|
* shader_decode: Fixup FSETReinUsesLisp2019-01-151-2/+2
|
* shader_decode: Implement IADD32IReinUsesLisp2019-01-151-0/+11
|
* video_core: Return safe values after an assert hitsReinUsesLisp2019-01-156-8/+12
|
* shader_decode: Implement FFMAReinUsesLisp2019-01-151-1/+36
|
* shader_ir: Fixup file inclusions and clang-formatReinUsesLisp2019-01-151-1/+1
|
* shader_decode: Fixup clang-formatReinUsesLisp2019-01-152-3/+2
|
* shader_decode: Implement LEAReinUsesLisp2019-01-151-0/+55
|
* shader_decode: Implement IADD3ReinUsesLisp2019-01-151-0/+61
|
* shader_decode: Implement LOP3ReinUsesLisp2019-01-151-0/+60
|
* shader_decode: Implement ST_LReinUsesLisp2019-01-151-0/+17
|
* shader_decode: Implement LD_LReinUsesLisp2019-01-151-0/+18
|
* shader_decode: Implement HSETP2ReinUsesLisp2019-01-151-1/+37
|
* shader_decode: Implement HADD2 and HMUL2ReinUsesLisp2019-01-151-1/+48
|
* shader_decode: Implement HADD2_IMM and HMUL2_IMMReinUsesLisp2019-01-151-1/+28
|
* shader_decode: Implement MOV_SYSReinUsesLisp2019-01-151-0/+27
|
* shader_decode: Implement IMNMXReinUsesLisp2019-01-151-0/+16
|
* shader_decode: Implement F2F_CReinUsesLisp2019-01-151-2/+10
|
* shader_decode: Implement I2IReinUsesLisp2019-01-151-0/+26
|
* shader_decode: Implement BRA internal flagReinUsesLisp2019-01-151-4/+8
|
* shader_decode: Implement ISCADDReinUsesLisp2019-01-151-0/+15
|
* shader_decode: Implement XMADReinUsesLisp2019-01-151-1/+85
|
* shader_decode: Implement PBK and BRKReinUsesLisp2019-01-151-1/+22
|
* shader_decode: Implement LOPReinUsesLisp2019-01-151-0/+15
|
* shader_decode: Implement SELReinUsesLisp2019-01-151-0/+8
|
* shader_decode: Implement IADDReinUsesLisp2019-01-151-1/+28
|
* shader_decode: Implement ISETPReinUsesLisp2019-01-151-1/+30
|
* shader_decode: Implement BFIReinUsesLisp2019-01-151-1/+22
|
* shader_decode: Implement ISETReinUsesLisp2019-01-151-1/+27
|
* shader_decode: Implement LD_CReinUsesLisp2019-01-151-0/+31
|
* shader_decode: Implement SHLReinUsesLisp2019-01-151-0/+8
|
* shader_decode: Implement SHRReinUsesLisp2019-01-151-1/+26
|
* shader_decode: Implement LOP32IReinUsesLisp2019-01-151-1/+67
|
* shader_decode: Implement BFEReinUsesLisp2019-01-151-1/+25
|
* shader_decode: Implement FSETReinUsesLisp2019-01-151-1/+36
|
* shader_decode: Implement F2IReinUsesLisp2019-01-151-0/+37
|
* shader_decode: Implement I2FReinUsesLisp2019-01-151-0/+23
|
* shader_decode: Implement F2FReinUsesLisp2019-01-151-1/+37
|
* shader_decode: Stub DEPBARReinUsesLisp2019-01-151-0/+4
|
* shader_decode: Implement SSY and SYNCReinUsesLisp2019-01-151-0/+19
|
* shader_decode: Implement PSETPReinUsesLisp2019-01-151-1/+21
|
* shader_decode: Implement TMMLReinUsesLisp2019-01-151-3/+45
|
* shader_decode: Implement TEX and TXQReinUsesLisp2019-01-151-0/+219
|
* shader_decode: Implement TEXS (F32)ReinUsesLisp2019-01-151-0/+199
|
* shader_decode: Implement FSETPReinUsesLisp2019-01-151-1/+33
|
* shader_decode: Partially implement BRAReinUsesLisp2019-01-151-0/+12
|
* shader_decode: Implement IPAReinUsesLisp2019-01-151-0/+12
|
* shader_decode: Implement EXITReinUsesLisp2019-01-151-1/+32
|
* shader_decode: Implement ST_AReinUsesLisp2019-01-151-0/+30
|
* shader_decode: Implement LD_AReinUsesLisp2019-01-151-1/+39
|
* shader_decode: Implement FADD32IReinUsesLisp2019-01-151-0/+12
|
* shader_decode: Implement FMUL32_IMMReinUsesLisp2019-01-151-0/+10
|
* shader_decode: Implement MOV32_IMMReinUsesLisp2019-01-151-1/+9
|
* shader_decode: Stub RRO_C, RRO_R and RRO_IMMReinUsesLisp2019-01-151-0/+9
|
* shader_decode: Implement FMNMX_C, FMNMX_R and FMNMX_IMMReinUsesLisp2019-01-151-0/+18
|
* shader_decode: Implement MUFUReinUsesLisp2019-01-151-0/+29
|
* shader_decode: Implement FADD_C, FADD_R and FADD_IMMReinUsesLisp2019-01-151-0/+15
|
* shader_decode: Implement FMUL_C, FMUL_R and FMUL_IMMReinUsesLisp2019-01-151-0/+42
|
* shader_decode: Implement MOV_C and MOV_RReinUsesLisp2019-01-151-1/+23
|
* shader_ir: Initial implementationReinUsesLisp2019-01-1525-0/+576