Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | shader/texture: Join separate image and sampler pairs offline | ReinUsesLisp | 2020-06-05 | 1 | -2/+2 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | Games using D3D idioms can join images and samplers when a shader executes, instead of baking them into a combined sampler image. This is also possible on Vulkan. One approach to this solution would be to use separate samplers on Vulkan and leave this unimplemented on OpenGL, but we can't do this because there's no consistent way of determining which constant buffer holds a sampler and which one an image. We could in theory find the first bit and if it's in the TIC area, it's an image; but this falls apart when an image or sampler handle use an index of zero. The used approach is to track for a LOP.OR operation (this is done at an IR level, not at an ISA level), track again the constant buffers used as source and store this pair. Then, outside of shader execution, join the sample and image pair with a bitwise or operation. This approach won't work on games that truly use separate samplers in a meaningful way. For example, pooling textures in a 2D array and determining at runtime what sampler to use. This invalidates OpenGL's disk shader cache :) - Used mostly by D3D ports to Switch | ||||
* | shader/track: Move bindless tracking to a separate function | ReinUsesLisp | 2020-06-05 | 1 | -2/+8 |
| | |||||
* | Merge pull request #3693 from ReinUsesLisp/clean-samplers | bunnei | 2020-05-02 | 1 | -16/+18 |
|\ | | | | | shader/texture: Support multiple unknown sampler properties | ||||
| * | shader/texture: Support multiple unknown sampler properties | ReinUsesLisp | 2020-04-23 | 1 | -11/+13 |
| | | | | | | | | | | | | | | | | | | This allows deducing some properties from the texture instruction before asking the runtime. By doing this we can handle type mismatches in some instructions from the renderer instead of the shader decoder. Fixes texelFetch issues with games using 2D texture instructions on a 1D sampler. | ||||
| * | shader_ir: Turn classes into data structures | ReinUsesLisp | 2020-04-23 | 1 | -10/+10 |
| | | |||||
* | | shader/memory_util: Deduplicate code | ReinUsesLisp | 2020-04-26 | 1 | -2/+1 |
|/ | | | | | | | | Deduplicate code shared between vk_pipeline_cache and gl_shader_cache as well as shader decoder code. While we are at it, fix a bug in gl_shader_cache where compute shaders had an start offset of a stage shader. | ||||
* | Merge pull request #3578 from ReinUsesLisp/vmnmx | Fernando Sahmkow | 2020-04-12 | 1 | -0/+3 |
|\ | | | | | shader/video: Partially implement VMNMX | ||||
| * | shader/video: Partially implement VMNMX | ReinUsesLisp | 2020-04-12 | 1 | -0/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implements the common usages for VMNMX. Inputs with a different size than 32 bits are not supported and sign mismatches aren't supported either. VMNMX works as follows: It grabs Ra and Rb and applies a maximum/minimum on them (this is defined by .MX), having in mind the input sign. This result can then be saturated. After the intermediate result is calculated, it applies another operation on it using Rc. These operations are merges, accumulations or another min/max pass. This instruction allows to implement with a more flexible approach GCN's min3 and max3 instructions (for instance). | ||||
* | | shader_decode: SULD.D using std::pair instead of out parameter | namkazy | 2020-04-06 | 1 | -2/+2 |
| | | |||||
* | | shader_decode: SULD.D implement bits64 and reverse shader ir init method to removed shader stage. | namkazy | 2020-04-06 | 1 | -3/+6 |
| | | |||||
* | | add shader stage when init shader ir | namkazy | 2020-04-05 | 1 | -2/+3 |
|/ | |||||
* | shader/shader_ir: Track usage in input attribute and of legacy varyings | ReinUsesLisp | 2020-03-16 | 1 | -0/+8 |
| | |||||
* | video_core: Rename "const buffer locker" to "registry" | ReinUsesLisp | 2020-03-09 | 1 | -3/+3 |
| | |||||
* | Shader_IR: Address feedback. | Fernando Sahmkow | 2020-01-25 | 1 | -1/+1 |
| | |||||
* | Shader_IR: Change name of TrackSampler function so it does not confuse with the type. | Fernando Sahmkow | 2020-01-24 | 1 | -1/+2 |
| | |||||
* | Shader_IR: Propagate bindless index into the GL compiler. | Fernando Sahmkow | 2020-01-24 | 1 | -3/+2 |
| | |||||
* | Shader_IR: Implement Injectable Custom Variables to the IR. | Fernando Sahmkow | 2020-01-24 | 1 | -0/+9 |
| | |||||
* | Shader_IR: deduce size of indexed samplers | Fernando Sahmkow | 2020-01-24 | 1 | -0/+1 |
| | |||||
* | Shader_IR: Implement initial code for tracking indexed samplers. | Fernando Sahmkow | 2020-01-24 | 1 | -0/+3 |
| | |||||
* | GPU: Implement guest driver profile and deduce texture handler sizes. | Fernando Sahmkow | 2020-01-24 | 1 | -0/+1 |
| | |||||
* | shader_ir/memory: Implement u16 and u8 for STG and LDG | ReinUsesLisp | 2020-01-09 | 1 | -1/+1 |
| | | | | | | | Using the same technique we used for u8 on LDG, implement u16. In the case of STG, load memory and insert the value we want to set into it with bitfieldInsert. Then set that value. | ||||
* | Shader_IR: Address Feedback | Fernando Sahmkow | 2020-01-04 | 1 | -4/+4 |
| | |||||
* | Shader_IR: add the ability to amend code in the shader ir. | Fernando Sahmkow | 2019-12-30 | 1 | -0/+8 |
| | | | | | | | This commit introduces a mechanism by which shader IR code can be amended and extended. This useful for track algorithms where certain information can derived from before the track such as indexes to array samplers. | ||||
* | shader/texture: Implement TLD4.PTP | ReinUsesLisp | 2019-12-16 | 1 | -1/+4 |
| | |||||
* | Shader_Ir: Correct TLD4S encoding and implement f16 flag. | Fernando Sahmkow | 2019-12-12 | 1 | -1/+1 |
| | |||||
* | Shader_Ir: default failed tracks on bindless samplers to null values. | Fernando Sahmkow | 2019-12-12 | 1 | -2/+2 |
| | |||||
* | shader: Keep track of shaders using warp instructions | ReinUsesLisp | 2019-12-10 | 1 | -0/+5 |
| | |||||
* | shader/texture: Deduce texture buffers from locker | ReinUsesLisp | 2019-11-23 | 1 | -4/+9 |
| | | | | | Instead of specializing shaders to separate texture buffers from 1D textures, use the locker to deduce them while they are being decoded. | ||||
* | video_core: Silence implicit conversion warnings | ReinUsesLisp | 2019-11-08 | 1 | -3/+3 |
| | |||||
* | Merge pull request #3039 from ReinUsesLisp/cleanup-samplers | Rodrigo Locatti | 2019-11-06 | 1 | -7/+5 |
|\ | | | | | shader/node: Unpack bindless texture encoding | ||||
| * | shader/node: Unpack bindless texture encoding | ReinUsesLisp | 2019-10-30 | 1 | -7/+5 |
| | | | | | | | | | | | | | | | | | | Bindless textures were using u64 to pack the buffer and offset from where they come from. Drop this in favor of separated entries in the struct. Remove the usage of std::set in favor of std::list (it's not std::vector to avoid reference invalidations) for samplers and images. | ||||
* | | Shader_IR: Fix regression on TLD4 | Fernando Sahmkow | 2019-10-31 | 1 | -1/+1 |
| | | | | | | | | | | | | Originally on the last commit I thought TLD4 acted the same as TLD4S and didn't have a mask. It actually does have a component mask. This commit corrects that. | ||||
* | | Shader_IR: Fix TLD4 and add Bindless Variant. | Fernando Sahmkow | 2019-10-30 | 1 | -2/+2 |
|/ | | | | | | This commit fixes an issue where not all 4 results of tld4 were being written, the color component was defaulted to red, among other things. It also implements the bindless variant. | ||||
* | Merge pull request #2976 from FernandoS27/cache-fast-brx-rebased | Rodrigo Locatti | 2019-10-26 | 1 | -8/+16 |
|\ | | | | | Implement Fast BRX, fix TXQ and addapt the Shader Cache for it | ||||
| * | gl_shader_decompiler: Move entries to a separate function | ReinUsesLisp | 2019-10-25 | 1 | -6/+6 |
| | | |||||
| * | Shader_IR: allow lookup of texture samplers within the shader_ir for instructions that don't provide it | Fernando Sahmkow | 2019-10-25 | 1 | -3/+9 |
| | | |||||
| * | Shader_Cache: setup connection of ConstBufferLocker | Fernando Sahmkow | 2019-10-25 | 1 | -1/+2 |
| | | |||||
| * | VideoCore: Unify const buffer accessing along engines and provide ConstBufferLocker class to shaders. | Fernando Sahmkow | 2019-10-25 | 1 | -0/+1 |
| | | |||||
* | | Merge pull request #3013 from FernandoS27/tld4s-fix | Rodrigo Locatti | 2019-10-26 | 1 | -1/+1 |
|\ \ | |/ |/| | Shader_Ir: Fix TLD4S from using a component mask. | ||||
| * | Shader_Ir: Fix TLD4S from using a component mask. | Fernando Sahmkow | 2019-10-22 | 1 | -1/+1 |
| | | | | | | | | | | | | TLD4S always outputs 4 values, the previous code checked a component mask and omitted those values that weren't part of it. This commit corrects that and makes sure all 4 values are set. | ||||
* | | shader_ir/memory: Ignore global memory when tracking fails | ReinUsesLisp | 2019-10-22 | 1 | -2/+3 |
|/ | | | | | | | | | | | Ignore global memory operations instead of invoking undefined behaviour when constant buffer tracking fails and we are blasting through asserts, ignore the operation. In the case of LDG this means filling the destination registers with zeroes; for STG this means ignore the instruction as a whole. The default behaviour is still to abort execution on failure. | ||||
* | vk_shader_compiler: Implement the decompiler in SPIR-V | Fernando Sahmkow | 2019-10-05 | 1 | -0/+4 |
| | |||||
* | Shader_Ir: Refactor Decompilation process and allow multiple decompilation modes. | Fernando Sahmkow | 2019-10-05 | 1 | -1/+9 |
| | |||||
* | gl_shader_decompiler: Implement AST decompiling | Fernando Sahmkow | 2019-10-05 | 1 | -8/+17 |
| | |||||
* | shader_ir: Declare Manager and pass it to appropiate programs. | Fernando Sahmkow | 2019-10-05 | 1 | -0/+2 |
| | |||||
* | Merge pull request #2869 from ReinUsesLisp/suld | bunnei | 2019-09-24 | 1 | -6/+3 |
|\ | | | | | shader/image: Implement SULD and fix SUATOM | ||||
| * | gl_shader_decompiler: Use uint for images and fix SUATOM | ReinUsesLisp | 2019-09-21 | 1 | -6/+3 |
| | | | | | | | | | | | | In the process remove implementation of SUATOM.MIN and SUATOM.MAX as these require a distinction between U32 and S32. These have to be implemented with imageCompSwap loop. | ||||
* | | VideoCore: Corrections to the MME Inliner and removal of hacky instance management. | Fernando Sahmkow | 2019-09-19 | 1 | -0/+10 |
|/ | |||||
* | Merge pull request #2784 from ReinUsesLisp/smem | bunnei | 2019-09-18 | 1 | -1/+5 |
|\ | | | | | shader_ir: Implement shared memory | ||||
| * | shader_ir: Implement ST_S | ReinUsesLisp | 2019-09-05 | 1 | -1/+5 |
| | | | | | | | | | | This instruction writes to a memory buffer shared with threads within the same work group. It is known as "shared" memory in GLSL. | ||||
* | | shader/image: Implement SUATOM and fix SUST | ReinUsesLisp | 2019-09-11 | 1 | -2/+8 |
| | | |||||
* | | gl_shader_decompiler: Keep track of written images and mark them as modified | ReinUsesLisp | 2019-09-06 | 1 | -4/+4 |
|/ | |||||
* | Merge pull request #2758 from ReinUsesLisp/packed-tid | bunnei | 2019-08-29 | 1 | -0/+3 |
|\ | | | | | shader/decode: Implement S2R Tic | ||||
| * | shader/decode: Implement S2R Tic | ReinUsesLisp | 2019-07-22 | 1 | -0/+3 |
| | | |||||
* | | shader_ir: Implement VOTE | ReinUsesLisp | 2019-08-21 | 1 | -0/+1 |
|/ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers. | ||||
* | shader_ir: Rename Get/SetTemporal to Get/SetTemporary | Lioncash | 2019-07-17 | 1 | -4/+4 |
| | | | | | | This is more accurate in terms of describing what the functions are actually doing. Temporal relates to time, not the setting of a temporary itself. | ||||
* | shader_ir: Remove unused includes | Lioncash | 2019-07-17 | 1 | -3/+0 |
| | | | | Removes unnecessary header dependencies. | ||||
* | Merge pull request #2565 from ReinUsesLisp/track-indirect | Fernando Sahmkow | 2019-07-16 | 1 | -1/+1 |
|\ | | | | | shader/track: Track indirect buffers | ||||
| * | shader: Allow tracking of indirect buffers without variable offset | ReinUsesLisp | 2019-07-15 | 1 | -1/+1 |
| | | | | | | | | | | | | While changing this code, simplify tracking code to allow returning the base address node, this way callers don't have to manually rebuild it on each invocation. | ||||
* | | Merge pull request #2695 from ReinUsesLisp/layer-viewport | Fernando Sahmkow | 2019-07-15 | 1 | -0/+15 |
|\ \ | |/ |/| | gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders | ||||
| * | gl_shader_decompiler: Implement gl_ViewportIndex and gl_Layer in vertex shaders | ReinUsesLisp | 2019-07-08 | 1 | -0/+15 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | This commit implements gl_ViewportIndex and gl_Layer in vertex and geometry shaders. In the case it's used in a vertex shader, it requires ARB_shader_viewport_layer_array. This extension is available on AMD and Nvidia devices (mesa and proprietary drivers), but not available on Intel on any platform. At the moment of writing this description I don't know if this is a hardware limitation or a driver limitation. In the case that ARB_shader_viewport_layer_array is not available, writes to these registers on a vertex shader are ignored, with the appropriate logging. | ||||
* | | shader_ir: Add comments on missing instruction. | Fernando Sahmkow | 2019-07-09 | 1 | -0/+4 |
| | | | | | | | | Also shows Nvidia's address space on comments. | ||||
* | | shader_ir: Unify blocks in decompiled shaders. | Fernando Sahmkow | 2019-07-09 | 1 | -0/+6 |
| | | |||||
* | | shader_ir: Decompile Flow Stack | Fernando Sahmkow | 2019-07-09 | 1 | -0/+3 |
| | | |||||
* | | shader_ir: propagate shader size to the IR | Fernando Sahmkow | 2019-07-09 | 1 | -1/+2 |
| | | |||||
* | | shader_ir: Remove the old scanner. | Fernando Sahmkow | 2019-07-09 | 1 | -11/+0 |
|/ | |||||
* | shader: Implement bindless images | ReinUsesLisp | 2019-06-21 | 1 | -0/+3 |
| | |||||
* | shader: Decode SUST and implement backing image functionality | ReinUsesLisp | 2019-06-21 | 1 | -0/+9 |
| | |||||
* | shader: Implement texture buffers | ReinUsesLisp | 2019-06-21 | 1 | -0/+2 |
| | |||||
* | shader: Move Node declarations out of the shader IR header | ReinUsesLisp | 2019-06-07 | 1 | -492/+1 |
| | | | | | | Analysis passes do not have a good reason to depend on shader_ir.h to work on top of nodes. This splits node-related declarations to their own file and leaves the IR in shader_ir.h | ||||
* | shader: Use shared_ptr to store nodes and move initialization to file | ReinUsesLisp | 2019-06-06 | 1 | -83/+25 |
| | | | | | | | | | Instead of having a vector of unique_ptr stored in a vector and returning star pointers to this, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class. | ||||
* | Merge pull request #2446 from ReinUsesLisp/tid | bunnei | 2019-05-29 | 1 | -1/+7 |
|\ | | | | | shader: Implement S2R Tid{XYZ} and CtaId{XYZ} | ||||
| * | shader: Implement S2R Tid{XYZ} and CtaId{XYZ} | ReinUsesLisp | 2019-05-20 | 1 | -1/+7 |
| | | |||||
* | | Merge pull request #2485 from ReinUsesLisp/generic-memory | bunnei | 2019-05-25 | 1 | -4/+2 |
|\ \ | | | | | | | shader/memory: Implement generic memory stores and loads (ST and LD) | ||||
| * | | shader/memory: Implement LD (generic memory) | ReinUsesLisp | 2019-05-21 | 1 | -4/+2 |
| |/ | |||||
* / | shader/shader_ir: Make Comment() take a std::string by value | Lioncash | 2019-05-23 | 1 | -1/+1 |
|/ | | | | | | | | | | | | | | | | | | | | This allows for forming comment nodes without making unnecessary copies of the std::string instance. e.g. previously: Comment(fmt::format("Base address is c[0x{:x}][0x{:x}]", cbuf->GetIndex(), cbuf_offset)); Would result in a copy of the string being created, as CommentNode() takes a std::string by value (a const ref passed to a value parameter results in a copy). Now, only one instance of the string is ever moved around. (fmt::format returns a std::string, and since it's returned from a function by value, this is a prvalue (which can be treated like an rvalue), so it's moved into Comment's string parameter), we then move it into the CommentNode constructor, which then moves the string into its member variable). | ||||
* | Merge pull request #2441 from ReinUsesLisp/al2p | bunnei | 2019-05-19 | 1 | -18/+27 |
|\ | | | | | shader: Implement AL2P and ALD.PHYS | ||||
| * | shader: Add physical attributes commentaries | ReinUsesLisp | 2019-05-03 | 1 | -2/+4 |
| | | |||||
| * | gl_shader_decompiler: Implement GLSL physical attributes | ReinUsesLisp | 2019-05-03 | 1 | -1/+1 |
| | | |||||
| * | shader_ir/memory: Implement physical input attributes | ReinUsesLisp | 2019-05-03 | 1 | -3/+17 |
| | | |||||
| * | shader: Remove unused AbufNode Ipa mode | ReinUsesLisp | 2019-05-03 | 1 | -16/+4 |
| | | |||||
| * | shader_ir/memory: Emit AL2P IR | ReinUsesLisp | 2019-05-03 | 1 | -0/+5 |
| | | |||||
* | | shader/shader_ir: Remove unnecessary inline specifiers | Lioncash | 2019-05-19 | 1 | -2/+2 |
| | | | | | | | | | | constexpr internally links by default, so the inline specifier is unnecessary. | ||||
* | | shader/shader_ir: Simplify constructors for OperationNode | Lioncash | 2019-05-19 | 1 | -15/+6 |
| | | | | | | | | | | | | | | | | | | | | Many of these constructors don't even need to be templated. The only ones that need to be templated are the ones that actually make use of the parameter pack. Even then, since std::vector accepts an initializer list, we can supply the parameter pack directly to it instead of creating our own copy of the list, then copying it again into the std::vector. | ||||
* | | shader/shader_ir: Remove unnecessary template parameter packs from Operation() overloads where applicable | Lioncash | 2019-05-19 | 1 | -2/+0 |
| | | | | | | | | | | These overloads don't actually make use of the parameter pack, so they can be turned into regular non-template function overloads. | ||||
* | | shader/shader_ir: Mark tracking functions as const member functions | Lioncash | 2019-05-19 | 1 | -3/+4 |
| | | | | | | | | | | These don't actually modify instance state, so they can be marked as const member functions | ||||
* | | shader/shader_ir: Place implementations of constructor and destructor in cpp file | Lioncash | 2019-05-19 | 1 | -5/+2 |
|/ | | | | | | Given the class contains quite a lot of non-trivial types, place the constructor and destructor within the cpp file to avoid inlining construction and destruction code everywhere the class is used. | ||||
* | shader_ir: Move Sampler index entry in operand< to sort declarations | ReinUsesLisp | 2019-04-26 | 1 | -2/+2 |
| | |||||
* | shader_ir: Add missing entry to Sampler operand< comparison | ReinUsesLisp | 2019-04-26 | 1 | -2/+3 |
| | |||||
* | Merge pull request #2409 from ReinUsesLisp/half-floats | bunnei | 2019-04-20 | 1 | -22/+25 |
|\ | | | | | shader_ir/decode: Miscellaneous fixes to half-float decompilation | ||||
| * | shader_ir/decode: Fix half float pre-operations and remove MetaHalfArithmetic | ReinUsesLisp | 2019-04-16 | 1 | -16/+10 |
| | | | | | | | | | | | | | | Operations done before the main half float operation (like HAdd) were managing a packed value instead of the unpacked one. Adding an unpacked operation allows us to drop the per-operand MetaHalfArithmetic entry, simplifying the code overall. | ||||
| * | shader_ir/decode: Implement half float saturation | ReinUsesLisp | 2019-04-16 | 1 | -0/+3 |
| | | |||||
| * | renderer_opengl: Implement half float NaN comparisons | ReinUsesLisp | 2019-04-16 | 1 | -6/+12 |
| | | |||||
* | | Merge pull request #2348 from FernandoS27/guest-bindless | bunnei | 2019-04-18 | 1 | -5/+35 |
|\ \ | | | | | | | Implement Bindless Textures on Shader Decompiler and GL backend | ||||
| * | | Move ConstBufferAccessor to Maxwell3d, correct mistakes and clang format. | Fernando Sahmkow | 2019-04-08 | 1 | -2/+2 |
| | | | |||||
| * | | Refactor GetTextureCode and GetTexCode to use an optional instead of optional parameters | Fernando Sahmkow | 2019-04-08 | 1 | -11/+9 |
| | | | |||||
| * | | Implement Bindless Handling on SetupTexture | Fernando Sahmkow | 2019-04-08 | 1 | -4/+3 |
| | | | |||||
| * | | Unify both sampler types. | Fernando Sahmkow | 2019-04-08 | 1 | -8/+28 |
| | | | |||||
| * | | Implement Bindless Samplers and TEX_B in the IR. | Fernando Sahmkow | 2019-04-08 | 1 | -9/+22 |
| |/ | |||||
* / | shader_ir: Implement STG, keep track of global memory usage and flush | ReinUsesLisp | 2019-04-14 | 1 | -3/+13 |
|/ | |||||
* | shader_ir/decode: Implement AOFFI for TEX and TLD4 | ReinUsesLisp | 2019-03-30 | 1 | -3/+6 |
| | |||||
* | shader_ir: Implement immediate register tracking | ReinUsesLisp | 2019-03-30 | 1 | -0/+3 |
| | |||||
* | shader/decode: Remove extras from MetaTexture | ReinUsesLisp | 2019-02-26 | 1 | -1/+3 |
| | |||||
* | shader/decode: Split memory and texture instructions decoding | ReinUsesLisp | 2019-02-26 | 1 | -0/+1 |
| | |||||
* | shader_ir: Remove F4 prefix to texture operations | ReinUsesLisp | 2019-02-07 | 1 | -6/+6 |
| | | | | | | This was originally included because texture operations returned a vec4. These operations now return a single float and the F4 prefix doesn't mean anything. | ||||
* | shader_ir: Clean texture management code | ReinUsesLisp | 2019-02-07 | 1 | -5/+5 |
| | | | | | | | | | Previous code relied on GLSL parameter order (something that's always ill-formed on an IR design). This approach passes spatial coordiantes through operation nodes and array and depth compare values in the the texture metadata. It still contains an "extra" vector containing generic nodes for bias and component index (for example) which is still a bit ill-formed but it should be better than the previous approach. | ||||
* | Merge pull request #2083 from ReinUsesLisp/shader-ir-cbuf-tracking | bunnei | 2019-02-07 | 1 | -45/+46 |
|\ | | | | | shader/track: Add a more permissive global memory tracking | ||||
| * | shader_ir: Rename BasicBlock to NodeBlock | ReinUsesLisp | 2019-02-03 | 1 | -46/+46 |
| | | | | | | | | It's not always used as a basic block. Rename it for consistency. | ||||
| * | shader_ir: Pass decoded nodes as a whole instead of per basic blocks | ReinUsesLisp | 2019-02-03 | 1 | -25/+26 |
| | | | | | | | | | | | | | | | | | | Some games call LDG at the top of a basic block, making the tracking heuristic to fail. This commit lets the heuristic the decoded nodes as a whole instead of per basic blocks. This may lead to some false positives but allows it the heuristic to track cases it previously couldn't. | ||||
* | | gl_shader_disk_cache: Save GLSL and entries into the precompiled file | ReinUsesLisp | 2019-02-07 | 1 | -0/+9 |
|/ | |||||
* | shader_ir: Unify constant buffer offset values | ReinUsesLisp | 2019-01-30 | 1 | -1/+1 |
| | | | | | | | Constant buffer values on the shader IR were using different offsets if the access direct or indirect. cbuf34 has a non-multiplied offset while cbuf36 does. On shader decoding this commit multiplies it by four on cbuf34 queries. | ||||
* | shader_decode: Implement LDG and basic cbuf tracking | ReinUsesLisp | 2019-01-30 | 1 | -4/+34 |
| | |||||
* | shader/shader_ir: Amend three comment typos | Lioncash | 2019-01-28 | 1 | -3/+3 |
| | | | | | Given we're in the area, these are three trivial typos that can be corrected. | ||||
* | shader/shader_ir: Amend constructor initializer ordering for AbufNode | Lioncash | 2019-01-28 | 1 | -2/+2 |
| | | | | | Orders the class members in the same order that they would actually be initialized in. Gets rid of two compiler warnings. | ||||
* | shader_ir: Pass to decoder functions basic block's code | ReinUsesLisp | 2019-01-15 | 1 | -25/+25 |
| | |||||
* | shader_decode: Improve zero flag implementation | ReinUsesLisp | 2019-01-15 | 1 | -2/+7 |
| | |||||
* | shader_ir: Remove composite primitives and use temporals instead | ReinUsesLisp | 2019-01-15 | 1 | -30/+30 |
| | |||||
* | shader_decode: Use proper primitive names | ReinUsesLisp | 2019-01-15 | 1 | -7/+5 |
| | |||||
* | shader_decode: Use BitfieldExtract instead of shift + and | ReinUsesLisp | 2019-01-15 | 1 | -2/+7 |
| | |||||
* | shader_ir: Remove Ipa primitive | ReinUsesLisp | 2019-01-15 | 1 | -2/+0 |
| | |||||
* | shader_ir: Remove RZ and use Register::ZeroIndex instead | ReinUsesLisp | 2019-01-15 | 1 | -2/+0 |
| | |||||
* | shader_decode: Implement TEXS.F16 | ReinUsesLisp | 2019-01-15 | 1 | -2/+6 |
| | |||||
* | video_core: Implement IR based geometry shaders | ReinUsesLisp | 2019-01-15 | 1 | -0/+3 |
| | |||||
* | shader_decode: Implement VMAD and VSETP | ReinUsesLisp | 2019-01-15 | 1 | -0/+4 |
| | |||||
* | shader_decode: Implement HSET2 | ReinUsesLisp | 2019-01-15 | 1 | -0/+1 |
| | |||||
* | shader_decode: Rework HSETP2 | ReinUsesLisp | 2019-01-15 | 1 | -7/+8 |
| | |||||
* | shader_decode: Implement HFMA2 | ReinUsesLisp | 2019-01-15 | 1 | -0/+1 |
| | |||||
* | shader_decode: Implement POPC | ReinUsesLisp | 2019-01-15 | 1 | -1/+3 |
| | |||||
* | shader_decode: Implement TLDS (untested) | ReinUsesLisp | 2019-01-15 | 1 | -0/+4 |
| | |||||
* | shader_decode: Update TLD4 reflecting #1862 changes | ReinUsesLisp | 2019-01-15 | 1 | -0/+3 |
| | |||||
* | shader_ir: Fixup TEX and TEXS and partially fix TLD4 decompiling | ReinUsesLisp | 2019-01-15 | 1 | -1/+3 |
| | |||||
* | video_core: Address feedback | ReinUsesLisp | 2019-01-15 | 1 | -11/+10 |
| | |||||
* | shader_ir: Fixup file inclusions and clang-format | ReinUsesLisp | 2019-01-15 | 1 | -1/+0 |
| | |||||
* | shader_ir: Move comment node string | Mat M | 2019-01-15 | 1 | -2/+2 |
| | | | Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | ||||
* | shader_ir: Address feedback to avoid UB in bit casting | ReinUsesLisp | 2019-01-15 | 1 | -2/+4 |
| | |||||
* | shader_decode: Implement LOP3 | ReinUsesLisp | 2019-01-15 | 1 | -0/+2 |
| | |||||
* | shader_decode: Implement LOP32I | ReinUsesLisp | 2019-01-15 | 1 | -0/+5 |
| | |||||
* | shader_decode: Implement TEX and TXQ | ReinUsesLisp | 2019-01-15 | 1 | -0/+4 |
| | |||||
* | shader_decode: Implement TEXS (F32) | ReinUsesLisp | 2019-01-15 | 1 | -0/+18 |
| | |||||
* | shader_ir: Add condition code helper | ReinUsesLisp | 2019-01-15 | 1 | -0/+3 |
| | |||||
* | shader_ir: Add predicate combiner helper | ReinUsesLisp | 2019-01-15 | 1 | -0/+3 |
| | |||||
* | shader_ir: Add comparison helpers | ReinUsesLisp | 2019-01-15 | 1 | -0/+9 |
| | |||||
* | shader_ir: Add half float helpers | ReinUsesLisp | 2019-01-15 | 1 | -0/+7 |
| | |||||
* | shader_ir: Add integer helpers | ReinUsesLisp | 2019-01-15 | 1 | -0/+5 |
| | |||||
* | shader_ir: Add float helpers | ReinUsesLisp | 2019-01-15 | 1 | -0/+5 |
| | |||||
* | shader_ir: Add setters | ReinUsesLisp | 2019-01-15 | 1 | -0/+8 |
| | |||||
* | shader_ir: Add local memory getters | ReinUsesLisp | 2019-01-15 | 1 | -0/+3 |
| | |||||
* | shader_ir: Add internal flag getters | ReinUsesLisp | 2019-01-15 | 1 | -0/+2 |
| | |||||
* | shader_ir: Add attribute getters | ReinUsesLisp | 2019-01-15 | 1 | -0/+5 |
| | |||||
* | shader_ir: Add constant buffer getters | ReinUsesLisp | 2019-01-15 | 1 | -0/+4 |
| | |||||
* | shader_ir: Add register getter | ReinUsesLisp | 2019-01-15 | 1 | -0/+2 |
| | |||||
* | shader_ir: Add immediate node constructors | ReinUsesLisp | 2019-01-15 | 1 | -1/+18 |
| | |||||
* | shader_ir: Initial implementation | ReinUsesLisp | 2019-01-15 | 1 | -0/+662 |