Commit message (Collapse) | Author | Files | Lines | ||
---|---|---|---|---|---|
2019-07-09 | shader_ir: Add comments on missing instruction. | Fernando Sahmkow | 2 | -2/+9 | |
Also shows Nvidia's address space on comments. | |||||
2019-07-09 | shader_ir: limit exploration to best known program size. | Fernando Sahmkow | 1 | -1/+1 | |
2019-07-09 | control_flow: Correct block breaking algorithm. | Fernando Sahmkow | 1 | -17/+17 | |
2019-07-09 | control_flow: Assert shaders bigger than limit. | Fernando Sahmkow | 1 | -0/+2 | |
2019-07-09 | control_flow: Address feedback. | Fernando Sahmkow | 1 | -89/+37 | |
2019-07-09 | shader_ir: Correct parsing of scheduling instructions and correct sizing | Fernando Sahmkow | 2 | -13/+30 | |
2019-07-09 | shader_ir: Correct max sizing | Fernando Sahmkow | 2 | -2/+2 | |
2019-07-09 | shader_ir: Remove unnecessary constructors and use optional for ScanFlow result | Fernando Sahmkow | 3 | -28/+17 | |
2019-07-09 | shader_ir: Corrections, documenting and asserting control_flow | Fernando Sahmkow | 3 | -52/+54 | |
2019-07-09 | shader_ir: Unify blocks in decompiled shaders. | Fernando Sahmkow | 7 | -58/+85 | |
2019-07-09 | shader_ir: Decompile Flow Stack | Fernando Sahmkow | 4 | -11/+206 | |
2019-07-09 | shader_ir: propagate shader size to the IR | Fernando Sahmkow | 6 | -17/+28 | |
2019-07-09 | shader_ir: Implement BRX & BRA.CC | Fernando Sahmkow | 6 | -4/+76 | |
2019-07-09 | shader_ir: Remove the old scanner. | Fernando Sahmkow | 2 | -77/+0 | |
2019-07-09 | shader_ir: Implement a new shader scanner | Fernando Sahmkow | 4 | -16/+473 | |
2019-07-07 | Delete decode_integer_set.cpp | Tobias | 1 | -0/+0 | |
2019-07-05 | texture_cache: Address Feedback | Fernando Sahmkow | 4 | -12/+13 | |
2019-07-05 | texture_cache: Correct Texture Buffer Uploading | Fernando Sahmkow | 3 | -2/+18 | |
2019-07-04 | gl_shader_cache: Make CachedShader constructor private | Zach Hilman | 2 | -5/+5 | |
Fixes missing review comments introduced. | |||||
2019-07-01 | rasterizer_cache: Protect inherited caches from submission level | Fernando Sahmkow | 3 | -1/+5 | |
2019-06-30 | texture_cache: Pack sibling queries inside a method | ReinUsesLisp | 1 | -6/+8 | |
2019-06-30 | texture_cache: Use std::vector reservation for sampled_textures | ReinUsesLisp | 1 | -17/+10 | |
2019-06-30 | texture_cache: Style changes | ReinUsesLisp | 3 | -17/+13 | |
2019-06-29 | texture_cache: Use std::array for siblings_table | ReinUsesLisp | 1 | -10/+13 | |
2019-06-29 | texture_cache: Address feedback | ReinUsesLisp | 4 | -30/+13 | |
2019-06-26 | texture_cache: Correct variable naming. | Fernando Sahmkow | 1 | -3/+3 | |
2019-06-26 | gl_texture_cache: Correct asserts | Fernando Sahmkow | 2 | -2/+2 | |
2019-06-26 | texture_cache: Corrections, documentation and asserts | Fernando Sahmkow | 1 | -42/+42 | |
2019-06-26 | surface_params: Corrections, asserts and documentation. | Fernando Sahmkow | 2 | -43/+58 | |
2019-06-25 | copy_params: use constexpr for constructor | Fernando Sahmkow | 1 | -3/+4 | |
2019-06-25 | gl_texture_cache: Corrections and fixes | Fernando Sahmkow | 2 | -13/+9 | |
2019-06-25 | gl_resource_manager: Correct MakeStreamCopy | Fernando Sahmkow | 2 | -3/+2 | |
2019-06-25 | texture_cache: Query MemoryManager from the system | Fernando Sahmkow | 5 | -20/+7 | |
2019-06-24 | texture_cache: Include "core/core.h" | ReinUsesLisp | 1 | -4/+1 | |
2019-06-24 | gl_texture_cache: Explicitly add indirect include | ReinUsesLisp | 1 | -0/+1 | |
2019-06-24 | texture_cache/surface_view: Address feedback | ReinUsesLisp | 1 | -1/+0 | |
2019-06-24 | texture_cache/surface_base: Address feedback | ReinUsesLisp | 2 | -2/+10 | |
2019-06-24 | video_core/surface: Address feedback | ReinUsesLisp | 1 | -2/+2 | |
2019-06-24 | decode/texture: Address feedback | ReinUsesLisp | 1 | -0/+1 | |
2019-06-24 | renderer_opengl/utils: Remove unused includes and unused forward declaration | ReinUsesLisp | 1 | -4/+0 | |
2019-06-24 | gl_texture_cache: Address some feedback | ReinUsesLisp | 1 | -2/+4 | |
2019-06-24 | gl_shader_disk_cache: Address feedback | ReinUsesLisp | 2 | -4/+8 | |
2019-06-24 | gl_shader_decompiler: Address feedback | ReinUsesLisp | 1 | -11/+12 | |
2019-06-24 | shader_bytecode: Include missing <array> | ReinUsesLisp | 1 | -0/+1 | |
2019-06-21 | texture_cache: Style and Corrections | Fernando Sahmkow | 7 | -71/+75 | |
2019-06-21 | shader_cache: Correct versioning and size calculation. | Fernando Sahmkow | 2 | -2/+7 | |
2019-06-21 | texture_cache: Eliminate linear textures fallthrough | Fernando Sahmkow | 1 | -4/+0 | |
2019-06-21 | texture_cache: Correct format R16U as sibling | Fernando Sahmkow | 2 | -1/+2 | |
2019-06-21 | texture_cache: Implement texception detection and texture barriers. | Fernando Sahmkow | 2 | -7/+40 | |
2019-06-21 | texture_cache: Corrections to buffers and shadow formats use. | Fernando Sahmkow | 1 | -10/+34 | |
2019-06-21 | texture_cache: Implement Irregular Views in surfaces | Fernando Sahmkow | 2 | -4/+24 | |
2019-06-21 | surface: Correct format S8Z24 | Fernando Sahmkow | 4 | -9/+5 | |
2019-06-21 | texture_cache: Initialize all siblings to invalid pixel format. | Fernando Sahmkow | 1 | -6/+15 | |
2019-06-21 | gl_texture_cache: Use Stream Buffers instead of Persistent for Buffer Copies. | Fernando Sahmkow | 3 | -5/+4 | |
2019-06-21 | gl_texture_cache: Correct Image Blit | Fernando Sahmkow | 1 | -1/+1 | |
2019-06-21 | decoders: correct block calculation | Fernando Sahmkow | 7 | -29/+41 | |
2019-06-21 | texture_cache: Use siblings textures on Rebuild and fix possible error on blitting | Fernando Sahmkow | 2 | -11/+24 | |
2019-06-21 | texture_cache: Remove old rasterizer cache | Fernando Sahmkow | 2 | -1956/+0 | |
2019-06-21 | texture_cache: Implement siblings texture formats. | Fernando Sahmkow | 2 | -12/+31 | |
2019-06-21 | fermi2d: Correct Origin Mode | Fernando Sahmkow | 1 | -5/+10 | |
2019-06-21 | texture_cache: correct texture buffer on surface params | Fernando Sahmkow | 1 | -4/+11 | |
2019-06-21 | texture_cache: eliminate accelerated depth->color/color->depth copies due to driver instability. | Fernando Sahmkow | 4 | -22/+6 | |
2019-06-21 | texture_cache: correct mutex locks | Fernando Sahmkow | 1 | -4/+4 | |
2019-06-21 | shader_ir: Fix image copy rebase issues | Fernando Sahmkow | 1 | -2/+7 | |
2019-06-21 | texture_cache: Don't Image Copy if component types differ | Fernando Sahmkow | 1 | -1/+2 | |
2019-06-21 | texture_cache: move some large methods to cpp files | Fernando Sahmkow | 4 | -139/+135 | |
2019-06-21 | texture_cache: Optimize GetSurface and use references on functions that don't change a surface. | Fernando Sahmkow | 3 | -12/+12 | |
2019-06-21 | texture_cache: Implement Buffer Copy and detect Turing GPUs Image Copies | Fernando Sahmkow | 8 | -12/+148 | |
2019-06-21 | texture_cache uncompress-compress is untopological. | Fernando Sahmkow | 5 | -19/+53 | |
This causes conflicts between uncompressed and compressed textures to be recycled automatically. It also limits the number of mipmaps a texture can have if it goes above its limit. | |||||
2019-06-21 | texture_cache: Correct copying between compressed and uncompressed formats | Fernando Sahmkow | 3 | -10/+27 | |
2019-06-21 | texture_cache: Only load on recycle with accurate GPU. | Fernando Sahmkow | 1 | -2/+3 | |
Testing so far has proven this to be quite safe as texture memory read added a 2-5ms load to the current cache. | |||||
2019-06-21 | Fix rebase errors | Fernando Sahmkow | 3 | -3/+13 | |
2019-06-21 | texture_cache: Handle uncontinuous surfaces. | Fernando Sahmkow | 4 | -21/+82 | |
2019-06-21 | texture_cache: return null surface on invalid address | Fernando Sahmkow | 1 | -0/+12 | |
2019-06-21 | texture_cache: Add checks for texture buffers. | Fernando Sahmkow | 1 | -2/+16 | |
2019-06-21 | texture_cache: Fermi2D reform and implement View Mirage | Fernando Sahmkow | 11 | -77/+125 | |
This also does some fixes on compressed textures reinterpret and on the Fermi2D engine in general. | |||||
2019-06-21 | gl_shader_decompiler: Implement image binding settings | ReinUsesLisp | 5 | -24/+52 | |
2019-06-21 | shader: Implement bindless images | ReinUsesLisp | 3 | -2/+40 | |
2019-06-21 | shader: Decode SUST and implement backing image functionality | ReinUsesLisp | 8 | -3/+282 | |
2019-06-21 | gl_rasterizer: Track texture buffer usage | ReinUsesLisp | 6 | -74/+119 | |
2019-06-21 | video_core: Make ARB_buffer_storage a required extension | ReinUsesLisp | 3 | -8/+5 | |
2019-06-21 | gl_rasterizer_cache: Use texture buffers to emulate texture buffers | ReinUsesLisp | 5 | -11/+35 | |
2019-06-21 | maxwell_3d: Partially implement texture buffers as 1D textures | ReinUsesLisp | 4 | -10/+24 | |
2019-06-21 | gl_shader_decompiler: Allow 1D textures to be texture buffers | ReinUsesLisp | 1 | -4/+38 | |
2019-06-21 | shader: Implement texture buffers | ReinUsesLisp | 3 | -0/+62 | |
2019-06-21 | texture_cache: loose TryReconstructSurface when accurate GPU is not on. | Fernando Sahmkow | 3 | -4/+20 | |
Also corrects some asserts. | |||||
2019-06-21 | texture_cache: Document the most important methods. | Fernando Sahmkow | 1 | -8/+87 | |
2019-06-21 | texture_cache: Try to Reconstruct Surface on bigger than overlap. | Fernando Sahmkow | 1 | -4/+11 | |
This fixes clouds in SMO Cap Kingdom and lens on Cloud Kingdom. Also moved accurate_gpu setting check to Pick Strategy | |||||
2019-06-21 | texture_cache: Implement Guard mechanism | Fernando Sahmkow | 2 | -1/+12 | |
2019-06-21 | texture_cache: General Fixes | Fernando Sahmkow | 8 | -47/+170 | |
Fixed ASTC mipmaps loading. Fixed alignment on OpenGL upload/download. Fixed Block Height Calculation. Removed unalign_height. | |||||
2019-06-21 | surface_params: Ensure pitch is always written to avoid surface leaks | ReinUsesLisp | 1 | -0/+2 | |
2019-06-21 | gl_framebuffer_cache: Use a hashed struct to cache framebuffers | ReinUsesLisp | 6 | -62/+148 | |
2019-06-21 | texture_cache return invalid buffer on deactivated color_mask | Fernando Sahmkow | 2 | -2/+9 | |
2019-06-21 | engine_upload: Adapt to new Texture Cache | Fernando Sahmkow | 2 | -5/+5 | |
2019-06-21 | surface_params: Optimize CreateForTexture | ReinUsesLisp | 2 | -72/+76 | |
Instead of using Common::AlignUp, use Common::AlignBits to align the texture compression factor. | |||||
2019-06-21 | gl_texture_cache: Make main views be proxy textures instead of a full view. | Fernando Sahmkow | 2 | -11/+25 | |
2019-06-21 | texture_cache: Add ASync Protections | Fernando Sahmkow | 1 | -0/+10 | |
2019-06-21 | Remove Framebuffer reconfiguration and restrict rendertarget protection | Fernando Sahmkow | 4 | -39/+27 | |
2019-06-21 | texture_cache: Implement GPU Dirty Flags | Fernando Sahmkow | 1 | -15/+22 | |
2019-06-21 | texture_cache: Optimize GetMipBlockHeight and GetMipBlockDepth | Fernando Sahmkow | 1 | -13/+6 | |
2019-06-21 | texture_cache: Implement L1_Inner_cache | Fernando Sahmkow | 1 | -13/+30 | |
2019-06-21 | video_core: Use un-shifted block sizes to avoid integer divisions | ReinUsesLisp | 9 | -60/+73 | |
Instead of storing all block widths, heights and depths in their shifted form (block_width = 1U << block_shift), store them as they are provided by the emulated hardware (their block_shift form). This way we can avoid the costly Common::AlignUp operation when aligning texture sizes and replace CPU integer divisions with bitwise logic (defined in Common::AlignBits). | |||||
2019-06-21 | texture_cache: Change internal cache from lists to vectors | ReinUsesLisp | 1 | -6/+7 | |
2019-06-21 | Reduce amount of size calculations. | Fernando Sahmkow | 7 | -88/+86 | |
2019-06-21 | texture_cache: Correct premature texceptions | Fernando Sahmkow | 4 | -14/+51 | |
Due to our current infrastructure, it is possible for a mipmap to be set as a render target before a texception of that mipmap's superset is set up afterwards. This is problematic as we rely on texture views to set up texceptions and to protect render targets for 3D texture rendering. One simple solution is to configure framebuffers after texture setup, but this brings other problems. This solution forces a reconfiguration of the framebuffers after such an event happens. | |||||
2019-06-21 | texture_cache: Implement guest flushing | Fernando Sahmkow | 3 | -10/+29 | |
2019-06-21 | Fixes to mipmap's process and reconstruct process | Fernando Sahmkow | 2 | -3/+3 | |
2019-06-21 | surface_base: Add parenthesis to EmplaceOverview's predicate | ReinUsesLisp | 1 | -3/+2 | |
2019-06-21 | Texture Cache: Implement Blitting and Fermi Copies | Fernando Sahmkow | 7 | -100/+93 | |
2019-06-21 | surface_view: Add constructor for ViewParams | ReinUsesLisp | 3 | -39/+23 | |
2019-06-21 | surface_base: Split BreakDown into layered and non-layered variants | ReinUsesLisp | 1 | -45/+48 | |
2019-06-21 | surface_base: Silence truncation warnings and minor renames and reordering | ReinUsesLisp | 2 | -32/+37 | |
2019-06-21 | copy_params: Use constructor instead of C-like initialization | ReinUsesLisp | 3 | -47/+39 | |
2019-06-21 | Correct Mipmaps View method in Texture Cache | Fernando Sahmkow | 3 | -32/+29 | |
2019-06-21 | Change texture_cache caching from GPUAddr to CacheAddr | Fernando Sahmkow | 7 | -101/+60 | |
This also reverses the changes to make invalidation and flushing through the GPU address. | |||||
2019-06-21 | Corrections to Structural Matching | Fernando Sahmkow | 2 | -24/+53 | |
The texture will now be reconstructed if the width only matches on GoB alignment. | |||||
2019-06-21 | Implement Texture Cache V2 | Fernando Sahmkow | 6 | -381/+568 | |
2019-06-21 | Correct Surface Base and Views for new Texture Cache | Fernando Sahmkow | 7 | -380/+466 | |
2019-06-21 | Add OGLTextureView | Fernando Sahmkow | 2 | -0/+43 | |
2019-06-21 | Deglobalize Memory Manager on texture cache and Implement Invalidation and Flushing using GPUVAddr | Fernando Sahmkow | 4 | -1/+20 | |
2019-06-21 | texture_cache: Remove execution context copies from the texture cache | ReinUsesLisp | 7 | -168/+59 | |
This is done to simplify the OpenGL implementation, it is needed for Vulkan. | |||||
2019-06-21 | gl_texture_cache: Implement fermi copies | ReinUsesLisp | 5 | -2/+105 | |
2019-06-21 | texture_cache: Split texture cache into different files | ReinUsesLisp | 12 | -876/+965 | |
2019-06-21 | texture_cache: Move staging buffer into a generic implementation | ReinUsesLisp | 4 | -181/+211 | |
2019-06-21 | texture_cache: Flush 3D textures in the order they are drawn | ReinUsesLisp | 5 | -19/+44 | |
2019-06-21 | gl_texture_cache: Minor changes | ReinUsesLisp | 5 | -140/+185 | |
2019-06-21 | gl_texture_cache: Add copy from multiple overlaps into a single surface | ReinUsesLisp | 3 | -6/+84 | |
2019-06-21 | gl_texture_cache: Attach surface textures instead of views | ReinUsesLisp | 3 | -20/+32 | |
2019-06-21 | gl_texture_cache: Add fast copy path | ReinUsesLisp | 4 | -7/+60 | |
2019-06-21 | gl_texture_cache: Initial implementation | ReinUsesLisp | 9 | -47/+809 | |
2019-06-18 | core: Remove unused CiTrace source files | Lioncash | 1 | -1/+0 | |
These source files have been unused for the entire lifecycle of the project. They're a hold-over from Citra and only add to the build time of the project, so they can be removed. There's also likely no way this would ever work in yuzu in its current form without revamping quite a bit of it, given how different the GPU on the Switch is compared to the 3DS. | |||||
2019-06-12 | gl_device: Fix TestVariableAoffi test | ReinUsesLisp | 1 | -1/+2 | |
This test is intended to be invalid GLSL, but it was invalid at two points instead of one. The intention is to use a non-immediate parameter in a textureOffset-like function. The problem is that this shader was being compiled as a separable shader object and the text was writing to gl_Position without a redeclaration, which is invalid GLSL. Address that issue by using a user-defined output attribute. | |||||
2019-06-09 | GPUVM: Correct GPU VM virtual address space | Fernando Sahmkow | 1 | -2/+2 | |
2019-06-08 | kepler_compute: Use std::array for cbuf info | ReinUsesLisp | 1 | -2/+3 | |
2019-06-08 | kepler_compute: Fix block_dim_x encoding | ReinUsesLisp | 1 | -1/+1 | |
2019-06-08 | gl_shader_cache: Use static constructors for CachedShader initialization | ReinUsesLisp | 2 | -52/+53 | |
2019-06-08 | gl_rasterizer: Remove unused parameters in descriptor uploads | ReinUsesLisp | 2 | -8/+6 | |
2019-06-08 | video_core/engines: Move ConstBufferInfo out of Maxwell3D | ReinUsesLisp | 6 | -49/+64 | |
2019-06-07 | shader: Split SSY and PBK stack | ReinUsesLisp | 4 | -27/+78 | |
Hardware testing revealed that SSY and PBK push to a different stack, allowing code like this: SSY label1; PBK label2; SYNC; label1: PBK; label2: EXIT; | |||||
2019-06-07 | shader/node: Minor changes | ReinUsesLisp | 1 | -50/+54 | |
Reflect std::shared_ptr nature of Node on initializers and remove constant members in nodes. Add some commentaries. | |||||
2019-06-07 | shader: Move Node declarations out of the shader IR header | ReinUsesLisp | 4 | -493/+518 | |
Analysis passes do not have a good reason to depend on shader_ir.h to work on top of nodes. This splits node-related declarations to their own file and leaves the IR in shader_ir.h | |||||
2019-06-06 | shader: Use shared_ptr to store nodes and move initialization to file | ReinUsesLisp | 35 | -248/+296 | |
Instead of storing unique_ptr instances in a vector and returning raw pointers to them, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class. | |||||
2019-06-05 | core/core_timing_util: Use std::chrono types for specifying time units | Lioncash | 1 | -1/+1 | |
Makes the interface more type-safe and consistent in terms of return values. | |||||
2019-06-04 | shader_bytecode: Mark EXIT as flow instruction | Fernando Sahmkow | 1 | -1/+1 | |
2019-06-03 | gl_shader_decompiler: Remove guest "position" varying | ReinUsesLisp | 2 | -36/+21 | |
"position" was being written but not read anywhere besides geometry shaders, where it had the same value as gl_Position. This commit replaces "position" with gl_Position, reducing the complexity of our code and the emitted GLSL code. | |||||
2019-05-30 | gl_shader_cache: Store a system class and drop global accessors | ReinUsesLisp | 2 | -7/+9 | |
2019-05-30 | gl_shader_cache: Add commentaries explaining the intention in shaders creation | ReinUsesLisp | 1 | -0/+2 | |
2019-05-30 | gl_shader_cache: Flip if condition in GetStageProgram to reduce indentation | ReinUsesLisp | 1 | -25/+26 | |
2019-05-30 | gl_buffer_cache: Remove unused ReserveMemory method | ReinUsesLisp | 2 | -13/+0 | |
2019-05-30 | maxwell_to_gl: Use GL_CLAMP to emulate Clamp wrap mode | ReinUsesLisp | 3 | -7/+4 | |
2019-05-30 | gl_rasterizer: Move alpha testing to the OpenGL pipeline | ReinUsesLisp | 8 | -71/+33 | |
Removes the alpha testing code from each fragment shader invocation. | |||||
2019-05-30 | gl_rasterizer: Use GL_QUADS to emulate quads rendering | ReinUsesLisp | 6 | -132/+5 | |
2019-05-27 | gl_device: Add commentary to AOFFI unit test source code | ReinUsesLisp | 1 | -0/+1 | |
The intention behind this commit is to hint someone inspecting an apitrace dump to ignore this ill-formed GLSL code. | |||||
2019-05-27 | gl_shader_gen: Always declare extensions after the version declaration | ReinUsesLisp | 2 | -7/+5 | |
This addresses a bug on geometry shaders where code was being written before all #extension declarations were done. Ref to #2523 | |||||
2019-05-26 | vk_device: Let formats array type be deduced | ReinUsesLisp | 1 | -33/+33 | |
2019-05-26 | vk_shader_decompiler: Misc fixes | ReinUsesLisp | 2 | -45/+67 | |
Fix missing OpSelectionMerge instruction. This caused device losses on most hardware; Intel didn't care. Fix [-1;1] -> [0;1] depth conversions. Conditionally use VK_EXT_scalar_block_layout. This allows us to use non-std140 layouts on UBOs. Update external Vulkan headers. | |||||
2019-05-26 | vk_device: Enable features when available and misc changes | ReinUsesLisp | 2 | -43/+151 | |
Keeps track of native ASTC support, VK_EXT_scalar_block_layout availability and SSBO range. Check for independentBlend and vertexPipelineStorageAndAtomics as required features. Always enable them. Use vk::to_string format to log Vulkan enums. Style changes. | |||||
2019-05-25 | renderer_opengl/utils: Use a std::string_view with LabelGLObject() | Lioncash | 2 | -10/+10 | |
Uses a std::string_view instead of a std::string, given the pointed to string isn't modified and is only used in a formatting operation. This is nice because a few usages directly supply a string literal to the function, allowing these usages to otherwise not heap allocate, unlike the std::string overloads. While we're at it, we can combine the address formatting into a single formatting call. | |||||
2019-05-24 | gl_shader_decompiler: Use an if based cbuf indexing for broken drivers | ReinUsesLisp | 1 | -3/+20 | |
The following code is broken on AMD's proprietary GLSL compiler: ```glsl uint idx = ...; vec4 values = ...; float some_value = values[idx & 3]; ``` It indexes the wrong components. To work around this, the following pessimized code is emitted when that bug is present: ```glsl uint idx = ...; vec4 values = ...; float some_value; if ((idx & 3) == 0) some_value = values.x; if ((idx & 3) == 1) some_value = values.y; if ((idx & 3) == 2) some_value = values.z; if ((idx & 3) == 3) some_value = values.w; ``` | |||||
2019-05-24 | gl_device: Add test to detect broken component indexing | ReinUsesLisp | 2 | -0/+60 | |
Component indexing on AMD's proprietary driver is broken. This commit adds a test to detect when we are on a driver that can't successfully manage component indexing. It dispatches a dummy draw with just one vertex shader that writes to an indexed SSBO from the GPU with data sent through uniforms, it then reads that data from the CPU and compares the expected output. | |||||
2019-05-23 | shader/shader_ir: Make Comment() take a std::string by value | Lioncash | 2 | -3/+3 | |
This allows for forming comment nodes without making unnecessary copies of the std::string instance. E.g., previously, Comment(fmt::format("Base address is c[0x{:x}][0x{:x}]", cbuf->GetIndex(), cbuf_offset)); would result in a copy of the string being created, as CommentNode() takes a std::string by value (a const ref passed to a value parameter results in a copy). Now, only one instance of the string is ever moved around: fmt::format returns a std::string by value, which is a prvalue (and can be treated like an rvalue), so it is moved into Comment's string parameter; we then move it into the CommentNode constructor, which in turn moves the string into its member variable. | |||||
2019-05-23 | shader/decode/*: Add missing newline to files lacking them | Lioncash | 18 | -18/+18 | |
Keeps the shader code file endings consistent. | |||||
2019-05-23 | shader/decode/*: Eliminate indirect inclusions | Lioncash | 6 | -1/+5 | |
Amends cases where we were using things that were indirectly being satisfied through other headers. This way, if those headers change and eliminate dependencies on other headers in the future, we don't have cascading compilation errors. | |||||
2019-05-22 | shader/decode/memory: Remove left in debug pragma | Lioncash | 1 | -2/+0 | |
2019-05-21 | renderer_opengl/gl_shader_decompiler: Remove redundant name specification in format string | Lioncash | 1 | -1/+1 | |
This accidentally slipped through a rebase. | |||||
2019-05-21 | gl_shader_cache: Fix clang strict standard build issues | ReinUsesLisp | 3 | -9/+13 | |
2019-05-21 | gl_shader_cache: Use shared contexts to build shaders in parallel | ReinUsesLisp | 6 | -47/+103 | |
2019-05-21 | shader/memory: Implement ST (generic memory) | ReinUsesLisp | 2 | -21/+36 | |
2019-05-21 | shader/memory: Implement LD (generic memory) | ReinUsesLisp | 3 | -15/+38 | |
2019-05-20 | shader: Implement S2R Tid{XYZ} and CtaId{XYZ} | ReinUsesLisp | 4 | -15/+69 | |
2019-05-20 | gl_shader_decompiler: Make GetSwizzle constexpr | ReinUsesLisp | 1 | -7/+7 | |
2019-05-20 | gl_shader_decompiler: Tidy up minor remaining cases of unnecessary std::string concatenation | Lioncash | 1 | -21/+20 | |
2019-05-20 | gl_shader_decompiler: Replace individual overloads with the fmt-based one | Lioncash | 1 | -28/+16 | |
Gets rid of the need to special-case brace handling depending on the overload used, and makes it consistent across the board with how fmt handles them. Strings with compile-time deducible strings are directly forwarded to std::string's constructor, so we don't need to worry about the performance difference here, as it'll be identical. | |||||
2019-05-20 | gl_shader_decompiler: Utilize fmt overload of AddLine() where applicable | Lioncash | 1 | -136/+152 | |
2019-05-19 | Revert #2466 | Fernando Sahmkow | 1 | -1/+3 | |
This reverts a tested behavior on delay slots not exiting if the exit flag is set. Currently new tests are required in order to ensure this behavior. | |||||
2019-05-19 | gl_shader_decompiler: Add AddLine() overload that forwards to fmt | Lioncash | 1 | -0/+11 | |
In a lot of places throughout the decompiler, string concatenation via operator+ is used quite heavily. This is usually fine, when not heavily used, but when used extensively, can be a problem. operator+ creates an entirely new heap allocated temporary string and given we perform expressions like: std::string thing = a + b + c + d; this ends up with a lot of unnecessary temporary strings being created and discarded, which kind of thrashes the heap more than we need to. Given we utilize fmt in some AddLine calls, we can make this a part of the ShaderWriter's API. We can make an overload that simply acts as a passthrough to fmt. This way, whenever things need to be appended to a string, the operation can be done via a single string formatting operation instead of discarding numerous temporary strings. This also has the benefit of making the strings themselves look nicer and makes it easier to spot errors in them. | |||||
2019-05-19 | Dma_pusher: ASSERT on empty command_list | Fernando Sahmkow | 1 | -0/+7 | |
This is a measure to avoid crashes on command list reading as an empty command_list is considered a NOP. | |||||
2019-05-19 | shader/shader_ir: Remove unnecessary inline specifiers | Lioncash | 1 | -2/+2 | |
constexpr internally links by default, so the inline specifier is unnecessary. | |||||
2019-05-19 | shader/shader_ir: Simplify constructors for OperationNode | Lioncash | 1 | -15/+6 | |
Many of these constructors don't even need to be templated. The only ones that need to be templated are the ones that actually make use of the parameter pack. Even then, since std::vector accepts an initializer list, we can supply the parameter pack directly to it instead of creating our own copy of the list, then copying it again into the std::vector. | |||||
2019-05-19 | shader/shader_ir: Remove unnecessary template parameter packs from Operation() overloads where applicable | Lioncash | 1 | -2/+0 | |
These overloads don't actually make use of the parameter pack, so they can be turned into regular non-template function overloads. | |||||
2019-05-19 | shader/shader_ir: Mark tracking functions as const member functions | Lioncash | 2 | -8/+11 | |
These don't actually modify instance state, so they can be marked as const member functions | |||||
2019-05-19 | shader/shader_ir: Place implementations of constructor and destructor in cpp file | Lioncash | 2 | -5/+9 | |
Given the class contains quite a lot of non-trivial types, place the constructor and destructor within the cpp file to avoid inlining construction and destruction code everywhere the class is used. | |||||
2019-05-19 | gl_shader_gen: std::move objects where applicable | Lioncash | 1 | -7/+7 | |
Avoids performing copies into the pair being returned. Instead, we can just move the resources into the pair, avoiding the need to make copies of both the std::string and ShaderEntries struct. | |||||
2019-05-19 | gl_shader_disk_cache: in-class initialize virtual file offset of ShaderDiskCacheOpenGL | Lioncash | 2 | -5/+3 | |
Given the offset is assigned a fixed value in the constructor, we can just assign it directly and get rid of the need to write the name of the variable again in the constructor initializer list. | |||||
2019-05-19 | gl_shader_disk_cache: Default ShaderDiskCacheOpenGL's destructor in the cpp file | Lioncash | 2 | -0/+3 | |
Given the disk shader cache contains non-trivial types, we should default it in the cpp file in order to prevent inlining of the complex destruction logic. | |||||
2019-05-19 | gl_shader_disk_cache: Make hash specializations noexcept | Lioncash | 1 | -2/+2 | |
The standard library expects hash specializations that don't throw exceptions. Make this explicit in the type to allow selection of better code paths if possible in implementations. | |||||
2019-05-19 | gl_shader_disk_cache: Remove redundant code string construction in LoadDecompiledEntry() | Lioncash | 1 | -2/+2 | |
We don't need to load the code into a vector and then construct a string over the data. We can just create a string with the necessary size ahead of time, and read the data directly into it, getting rid of an unnecessary heap allocation. | |||||
2019-05-19 | gl_shader_disk_cache: Make variable non-const in decompiled entry case | Lioncash | 1 | -1/+1 | |
std::move does nothing when applied to a const variable. Resources can't be moved if the object is immutable. With this change, we don't end up making several unnecessary heap allocations and copies. | |||||
2019-05-19 | gl_shader_disk_cache: Special-case boolean handling | Lioncash | 2 | -24/+37 | |
Booleans don't have a guaranteed size, but we still want to have them integrate into the disk cache system without needing to actually use a different type. We can do this by supplying non-template overloads for the bool type. Non-template overloads always have precedence during function resolution, so this is safe to provide. This gets rid of the need to scatter ternary conditionals, as well as the need to use u8 types to store the value in. | |||||
2019-05-18 | gl_rasterizer: Limit OpenGL point size to a minimum of 1 | ReinUsesLisp | 1 | -1/+3 | |
2019-05-18 | maxwell_to_gl: Add TriangleFan primitive topology | ReinUsesLisp | 1 | -0/+2 | |
2019-05-17 | gl_rasterizer: Pass the right number of array quad vertices count | ReinUsesLisp | 1 | -2/+2 | |
2019-05-14 | maxwell_3d: reduce severity of different component formats assert. | Fernando Sahmkow | 1 | -1/+1 | |
This was reduced because it happened in most games, and at such a constant rate that it heavily affected performance for the end user. In general, we are well aware of the assert and an implementation is already planned. | |||||
2019-05-14 | video_core/engines/engine_upload: Amend constructor initializer list order | Lioncash | 1 | -1/+1 | |
Silences a -Wreorder warning. | |||||
2019-05-14 | video_core/engines/engine_upload: Default destructor in the cpp file | Lioncash | 2 | -1/+3 | |
Avoids inlining destruction logic where applicable, and also makes forward declarations not cause unexpected compilation errors depending on where the State class is used. | |||||
2019-05-14 | video_core/engines/engine_upload: Remove unnecessary const on parameters in function declarations | Lioncash | 1 | -2/+2 | |
These only apply in the definition of the function. They can be omitted from the declaration. | |||||
2019-05-14 | video_core/engines/engine_upload: Remove unnecessary includes | Lioncash | 2 | -2/+2 | |
2019-05-14 | video_core/engines/maxwell3d: Get rid of three magic values in CallMethod() | Lioncash | 1 | -3/+3 | |
We can use the named constant instead of using 32 directly. | |||||
2019-05-14 | video_core/engines/maxwell_3d: Simplify for loops into ranged for loops within InitializeRegisterDefaults() | Lioncash | 1 | -15/+15 | |
Lessens the amount of code that needs to be read, and gets rid of the need to introduce an indexing variable. Instead, we just operate on the objects directly. | |||||
2019-05-14 | video_core/engines/maxwell_3d: Add is_trivially_copyable_v check for Regs | Lioncash | 1 | -0/+2 | |
std::memset is used to clear the entire register structure, which requires that the Regs struct be trivially copyable (otherwise undefined behavior is invoked). This prevents the case where a non-trivial type is potentially added to the struct. | |||||
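A minimal sketch of the guard this commit describes, assuming a simplified stand-in for the register structure (the `Regs` layout and `ClearRegs` below are hypothetical, not the real Maxwell3D definitions):

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <type_traits>

// Hypothetical, heavily simplified register block.
struct Regs {
    std::uint32_t raw[4];
    std::uint32_t clear_flags;
};

// Compilation fails here if someone later adds a non-trivial member
// (for example a std::string), which would make the memset below invalid.
static_assert(std::is_trivially_copyable_v<Regs>,
              "Regs must be trivially copyable to be cleared with std::memset");

void ClearRegs(Regs& regs) {
    std::memset(&regs, 0, sizeof(regs));
}
```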
2019-05-14 | yuzu: Remove explicit types from locks where applicable | Lioncash | 2 | -2/+2 | |
With C++17's deduction guides, the type doesn't need to be explicitly specified within locking primitives anymore. | |||||
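As an illustration of the change (the `Counter` type is hypothetical), the two spellings below are equivalent under C++17; the second lets the compiler deduce the mutex type:

```cpp
#include <cassert>
#include <mutex>

struct Counter {
    std::mutex mutex;
    int value = 0;

    // Pre-C++17 style: the mutex type is spelled out.
    void IncrementExplicit() {
        std::lock_guard<std::mutex> lock(mutex);
        ++value;
    }

    // C++17 style: class template argument deduction picks std::mutex.
    void IncrementDeduced() {
        std::lock_guard lock(mutex);
        ++value;
    }
};
```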
2019-05-14 | video_core/gpu_thread: Remove redundant copy constructor for CommandDataContainer | Lioncash | 1 | -6/+0 | |
std::move within a copy constructor (on a data member that isn't mutable) will always result in a copy. Because of that, the behavior of this copy constructor is identical to the one that would be generated automatically by the compiler, so we can remove it. | |||||
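The reasoning can be checked with a small sketch. The `Container` type here is hypothetical and only mirrors the shape of the removed constructor:

```cpp
#include <cassert>
#include <string>
#include <utility>

struct Container {
    explicit Container(std::string d) : data(std::move(d)) {}

    // `other` is const, so std::move(other.data) has type const std::string&&.
    // That cannot bind to std::string's move constructor, so the copy
    // constructor is selected and `other` is left untouched.
    Container(const Container& other) : data(std::move(other.data)) {}

    std::string data;
};
```

Since the source object keeps its contents, this constructor behaves the same as the implicitly generated copy constructor.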
2019-05-12 | GPU/MMEInterpreter: Ignore the 'exit' flag when it's executed inside a delay slot. | Sebastian Valle | 1 | -3/+3 | |
It seems instructions marked with the 'exit' flag will not cause an exit when executed within a delay slot. This was hwtested by fincs. | |||||
2019-05-10 | video_core/memory_manager: Mark IsBlockContinuous() as a const member function | Lioncash | 2 | -4/+4 | |
Corrects the typo in its name and marks the function as a const member function, given it doesn't actually modify memory manager state. | |||||
2019-05-10 | video_core/memory_manager: Mark the constructor as explicit | Lioncash | 1 | -1/+1 | |
Prevents implicit converting constructions of the memory manager. | |||||
2019-05-10 | video_core/memory_manager: Default the destructor within the cpp file | Lioncash | 2 | -0/+3 | |
Makes the class less surprising when it comes to forward declaring the type, and also prevents inlining the destruction code of the class, given it contains non-trivial types. | |||||
2019-05-10 | video_core/memory_manager: Amend doxygen comments | Lioncash | 1 | -7/+7 | |
Corrects references to non-existent parameters and corrects typos. | |||||
2019-05-10 | video_core/memory_manager: Remove superfluous const from function declarations | Lioncash | 1 | -7/+7 | |
These are able to be omitted from the declaration of functions, since they don't do anything at the type system level. The definitions of the functions can retain the use of const though, since they make the variables immutable in the implementation of the function where they're used. | |||||
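A short sketch of the rule, using a hypothetical `Scale` function: top-level const on a by-value parameter is not part of the function's type, so the declaration and definition below refer to the same function.

```cpp
#include <cassert>

// Declaration: top-level const on by-value parameters is omitted.
int Scale(int value, int factor);

// Definition: const only prevents the body from modifying its local copies.
int Scale(const int value, const int factor) {
    return value * factor;
}
```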
2019-05-10 | video_core/renderer_opengl/gl_shader_cache: Correct member initialization order | Lioncash | 1 | -1/+1 | |
Silences a -Wreorder warning. | |||||
2019-05-10 | video_core/shader/decode/texture: Remove unused variable from GetTld4Code() | Lioncash | 1 | -1/+0 | |
2019-05-10 | renderer_vulkan/vk_shader_decompiler: Remove unused variable from DeclareInternalFlags() | Lioncash | 1 | -1/+0 | |
2019-05-10 | video_core/renderer_opengl/gl_shader_decompiler: Remove unused Composite() function | Lioncash | 1 | -11/+0 | |
This isn't used at all, so it can be removed. | |||||
2019-05-10 | video_core/renderer_opengl/gl_rasterizer_cache: Remove unused variable in UploadGLMipmapTexture() | Lioncash | 1 | -1/+0 | |
This variable is unused entirely, so it can be removed. | |||||
2019-05-10 | video_core/gpu_thread: Remove unused local variable | Lioncash | 1 | -1/+1 | |
Instead of retrieving the data from the std::variant instance, we can just check if the variant contains that type of data. This is essentially the same behavior, only it returns a bool indicating whether or not the type in the variant is currently active, instead of actually retrieving the data. | |||||
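A minimal sketch of the pattern, assuming hypothetical command types (the real variant alternatives in gpu_thread may differ):

```cpp
#include <cassert>
#include <variant>

// Hypothetical command types for illustration only.
struct SubmitListCommand {
    int entries;
};
struct EndProcessingCommand {};

using CommandData = std::variant<SubmitListCommand, EndProcessingCommand>;

// Only asks which alternative is active; no unused pointer or local is produced.
bool IsEndProcessing(const CommandData& data) {
    return std::holds_alternative<EndProcessingCommand>(data);
}
```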
2019-05-10 | video_core/textures/astc: Remove unused variables | Lioncash | 1 | -6/+2 | |
Silences a few compilation warnings. | |||||
2019-05-07 | Correct possible error on Rasterizer Caches | Fernando Sahmkow | 1 | -1/+2 | |
There was a weird bug that could happen if the object died directly and the cache address wasn't stored. | |||||
2019-05-04 | shader/decode/texture: Remove unused variable | Lioncash | 1 | -1/+0 | |
This isn't used anywhere, so we can get rid of it. | |||||
2019-05-04 | gl_rasterizer: Silence unused variable warning | Lioncash | 1 | -2/+2 | |
Makes use of src, so it's not considered unused. | |||||
2019-05-03 | shader_ir/other: Implement IPA.IDX | ReinUsesLisp | 2 | -5/+9 | |
2019-05-03 | gl_shader_decompiler: Skip physical unused attributes | ReinUsesLisp | 1 | -18/+27 | |
2019-05-03 | shader_ir/memory: Assert on non-32 bits ALD.PHYS | ReinUsesLisp | 1 | -0/+3 | |
2019-05-03 | shader: Add physical attributes commentaries | ReinUsesLisp | 4 | -4/+8 | |
2019-05-03 | gl_shader_decompiler: Implement GLSL physical attributes | ReinUsesLisp | 2 | -66/+101 | |
2019-05-03 | shader_ir/memory: Implement physical input attributes | ReinUsesLisp | 4 | -6/+32 | |
2019-05-03 | gl_shader_decompiler: Abstract generic attribute operations | ReinUsesLisp | 1 | -29/+26 | |
2019-05-03 | gl_shader_decompiler: Declare all possible varyings on physical attribute usage | ReinUsesLisp | 4 | -27/+88 | |
2019-05-03 | shader: Remove unused AbufNode Ipa mode | ReinUsesLisp | 6 | -35/+14 | |
2019-05-03 | shader_ir/memory: Emit AL2P IR | ReinUsesLisp | 2 | -0/+22 | |
2019-05-03 | shader_bytecode: Add AL2P decoding | ReinUsesLisp | 1 | -2/+15 | |
2019-05-01 | Refactors and name corrections. | Fernando Sahmkow | 6 | -35/+35 | |
2019-05-01 | gl_shader_disk_cache: Skip stored shader variants instead of asserting | ReinUsesLisp | 1 | -1/+4 | |
Instead of asserting on already stored shader variants, silently skip them. This shouldn't be happening but when a shader is invalidated and it is not stored in the shader cache, this assert would hit and save that shader anyways when the asserts are disabled. | |||||
2019-05-01 | Fix Layered ASTC Textures | Fernando Sahmkow | 1 | -1/+3 | |
By adding the missing layer offset in ASTC compression. | |||||
2019-04-26 | shader_ir: Move Sampler index entry in operator< to sort declarations | ReinUsesLisp | 1 | -2/+2 | |
2019-04-26 | shader_ir: Add missing entry to Sampler operator< comparison | ReinUsesLisp | 1 | -2/+3 | |
2019-04-26 | shader_ir/texture: Fix sampler const buffer key shift | ReinUsesLisp | 1 | -1/+1 | |
2019-04-23 | Re added new lines at the end of files | FreddyFunk | 2 | -2/+2 | |
2019-04-23 | gl_shader_disk_cache: Compress precompiled shader cache file with Zstandard | unknown | 1 | -6/+10 | |
2019-04-23 | gl_shader_disk_cache: Use VectorVfsFile for the virtual precompiled shader cache file | unknown | 3 | -101/+168 | |
2019-04-23 | gl_shader_disk_cache: Remove per shader compression | unknown | 2 | -45/+11 | |
2019-04-23 | Fixes and Corrections to DMA Engine | Fernando Sahmkow | 2 | -37/+57 | |
2019-04-23 | Add Swizzle Parameters to the DMA engine | Fernando Sahmkow | 2 | -2/+27 | |
2019-04-23 | Add Documentation Headers to all the GPU Engines | Fernando Sahmkow | 5 | -0/+29 | |
2019-04-23 | Corrections and styling | Fernando Sahmkow | 5 | -6/+9 | |
2019-04-23 | Implement Maxwell3D Data Upload | Fernando Sahmkow | 2 | -3/+32 | |
2019-04-23 | Introduce skeleton of the GPU Compute Engine. | Fernando Sahmkow | 3 | -8/+202 | |
2019-04-23 | Revamp Kepler Memory to use a subengine to manage uploads | Fernando Sahmkow | 6 | -93/+134 | |
2019-04-21 | Rasterizer Cache: Use temporary storage for Surfaces loading/flushing. | Fernando Sahmkow | 4 | -18/+30 | |
This PR should heavily reduce memory usage since temporary buffers are no longer stored per Surface but instead managed by the Rasterizer Cache. | |||||
2019-04-21 | Correct Half Float operations on const buffers and implement saturation. | Fernando Sahmkow | 2 | -15/+16 | |
2019-04-20 | Apply Position Y Direction | Fernando Sahmkow | 1 | -0/+3 | |
2019-04-20 | RasterizerCache Redesign: Flush | Fernando Sahmkow | 6 | -17/+26 | |
Flushing is now the responsibility of children caches instead of the cache object. This change will allow the specific cache to pass extra parameters on flushing and will allow more flexibility. | |||||
2019-04-20 | Make ReadBlockUnsafe and WriteBlockUnsafe ignore invalid pages. | Fernando Sahmkow | 1 | -4/+12 | |
2019-04-19 | gl_state: Fix samplers memory corruption | ReinUsesLisp | 1 | -3/+5 | |
It was possible for "samplers" to be read without being written. This addresses that. | |||||
2019-04-18 | video_core: Silence -Wswitch warnings | ReinUsesLisp | 10 | -77/+106 | |
2019-04-17 | Implement IsBlockContinous | Fernando Sahmkow | 2 | -2/+13 | |
This detects when a GPU Memory Block is not continuous within host CPU memory. | |||||
2019-04-16 | Apply Const correctness to SwizzleKepler and replace u32 for size_t on iterators. | Fernando Sahmkow | 2 | -9/+12 | |
2019-04-16 | Use ReadBlockUnsafe for fetching DMA CommandLists | Fernando Sahmkow | 2 | -4/+2 | |
2019-04-16 | Document unsafe versions and add BlockCopyUnsafe | Fernando Sahmkow | 3 | -16/+45 | |
2019-04-16 | Use ReadBlockUnsafe for Shader Cache | Fernando Sahmkow | 1 | -5/+7 | |
2019-04-16 | Use ReadBlockUnsafe on TIC and TSC reading | Fernando Sahmkow | 2 | -2/+4 | |
Use ReadBlockUnsafe on TIC and TSC reading as memory is never flushed from host GPU there. | |||||
2019-04-16 | GPU MemoryManager: Implement ReadBlockUnsafe and WriteBlockUnsafe | Fernando Sahmkow | 2 | -0/+34 | |
2019-04-16 | Use WriteBlock and ReadBlock. | Fernando Sahmkow | 1 | -10/+6 | |
2019-04-16 | Implement Block Linear copies in Kepler Memory. | Fernando Sahmkow | 3 | -5/+38 | |
2019-04-16 | vk_shader_decompiler: Add missing operations | ReinUsesLisp | 1 | -0/+7 | |
2019-04-16 | shader_ir/decode: Fix half float pre-operations and remove MetaHalfArithmetic | ReinUsesLisp | 9 | -85/+72 | |
Operations done before the main half float operation (like HAdd) were managing a packed value instead of the unpacked one. Adding an unpacked operation allows us to drop the per-operand MetaHalfArithmetic entry, simplifying the code overall. | |||||
2019-04-16 | gl_shader_decompiler: Fix MrgH0 decompilation | ReinUsesLisp | 1 | -2/+2 | |
GLSL decompilation for HMergeH0 was wrong. This addresses that issue. | |||||
2019-04-16 | shader_ir/decode: Implement half float saturation | ReinUsesLisp | 5 | -8/+31 | |
2019-04-16 | shader_ir/decode: Reduce severity of unimplemented half-float FTZ | ReinUsesLisp | 3 | -3/+9 | |
2019-04-16 | renderer_opengl: Implement half float NaN comparisons | ReinUsesLisp | 3 | -36/+59 | |
2019-04-16 | shader_ir: Avoid using static on heap-allocated objects | ReinUsesLisp | 1 | -5/+4 | |
Using static here might be faster at runtime, but it adds a heap allocation called before main. | |||||
2019-04-16 | Do some corrections in conversion shader instructions. | Fernando Sahmkow | 2 | -23/+73 | |
Corrects encodings for I2F, F2F, I2I and F2I. Implements immediate variants of all four conversion types. Adds assertions for unimplemented features. | |||||
2019-04-15 | Correct Kepler Memory on Linear Pushes. | Fernando Sahmkow | 2 | -16/+48 | |
2019-04-15 | Support compressed formats on linear textures. | Fernando Sahmkow | 1 | -2/+5 | |
2019-04-15 | Correct Pitch in Fermi2D | Fernando Sahmkow | 1 | -4/+1 | |
2019-04-14 | gl_shader_decompiler: Use variable AOFFI on supported hardware | ReinUsesLisp | 10 | -71/+102 | |
2019-04-14 | shader_ir: Implement STG, keep track of global memory usage and flush | ReinUsesLisp | 11 | -89/+186 | |
2019-04-12 | video_core/gpu: Create threads separately from initialization | Lioncash | 9 | -14/+47 | |
Like with CPU emulation, we generally don't want to fire off the threads immediately after the relevant classes are initialized; we want to do this after all necessary data is done loading first. This splits the thread creation into its own interface member function to allow controlling when these threads in particular get created. | |||||
2019-04-11 | gl_rasterizer_cache: Relax restrictions on FastCopySurface and FastLayeredCopySurface | Fernando Sahmkow | 1 | -4/+10 | |
2019-04-11 | gl_shader_manager: Move code to source file and minor clean up | ReinUsesLisp | 2 | -34/+61 | |
2019-04-10 | gl_rasterizer: Apply just the needed state on Clear | ReinUsesLisp | 1 | -4/+4 | |
2019-04-10 | gl_device: Implement interface and add uniform offset alignment | ReinUsesLisp | 5 | -13/+70 | |
2019-04-10 | vk_shader_decompiler: Implement flow primitives | ReinUsesLisp | 1 | -5/+82 | |
2019-04-10 | vk_shader_decompiler: Implement most common texture primitives | ReinUsesLisp | 1 | -8/+65 | |
2019-04-10 | vk_shader_decompiler: Implement texture decompilation helper functions | ReinUsesLisp | 1 | -0/+32 | |
2019-04-10 | vk_shader_decompiler: Implement Assign and LogicalAssign | ReinUsesLisp | 1 | -2/+64 | |
2019-04-10 | vk_shader_decompiler: Implement non-OperationCode visits | ReinUsesLisp | 1 | -7/+129 | |
2019-04-10 | vk_shader_decompiler: Implement OperationCode decompilation interface | ReinUsesLisp | 1 | -1/+411 | |
2019-04-10 | vk_shader_decompiler: Implement Visit | ReinUsesLisp | 1 | -1/+50 | |
2019-04-10 | vk_shader_decompiler: Implement labels tree and flow | ReinUsesLisp | 1 | -0/+71 | |
2019-04-10 | vk_shader_decompiler: Implement declarations | ReinUsesLisp | 1 | -3/+457 | |
2019-04-10 | vk_shader_decompiler: Declare and stub interface for a SPIR-V decompiler | ReinUsesLisp | 3 | -0/+127 | |
2019-04-10 | video_core: Add sirit as optional dependency with Vulkan | ReinUsesLisp | 1 | -1/+4 | |
sirit is a runtime assembler for SPIR-V | |||||
2019-04-10 | Remove bounding in LD_C | Fernando Sahmkow | 1 | -2/+1 | |
2019-04-09 | Correct Fermi Copy on Linear Textures. | Fernando Sahmkow | 1 | -0/+4 | |
2019-04-09 | Implement Texture Format ZF32_X24S8. | Fernando Sahmkow | 1 | -0/+2 | |
2019-04-09 | Correct depth compare with color formats for R32F | Fernando Sahmkow | 1 | -2/+17 | |
2019-04-08 | gl_backend: Align Pixel Storage | Fernando Sahmkow | 2 | -4/+12 | |
This commit makes sure GL reads on the correct pack size for the respective texture buffer. | |||||
2019-04-08 | Correct LOP_IMN encoding | Fernando Sahmkow | 1 | -1/+1 | |
2019-04-08 | Correct XMAD mode, psl and high_b on different encodings. | Fernando Sahmkow | 2 | -9/+33 | |
2019-04-08 | Adapt Bindless to work with AOFFI | Fernando Sahmkow | 1 | -7/+18 | |
2019-04-08 | Move ConstBufferAccessor to Maxwell3d, correct mistakes and clang format. | Fernando Sahmkow | 9 | -44/+25 | |
2019-04-08 | Fix bad rebase | Fernando Sahmkow | 1 | -2/+1 | |
2019-04-08 | Fix TMML | Fernando Sahmkow | 1 | -5/+7 | |
2019-04-08 | Simplify ConstBufferAccessor | Fernando Sahmkow | 5 | -53/+22 | |
2019-04-08 | Refactor GetTextureCode and GetTexCode to use an optional instead of optional parameters | Fernando Sahmkow | 2 | -34/+33 | |
2019-04-08 | Implement TXQ_B | Fernando Sahmkow | 2 | -2/+10 | |
2019-04-08 | Implement TMML_B | Fernando Sahmkow | 1 | -5/+10 | |
2019-04-08 | Corrections to TEX_B | Fernando Sahmkow | 2 | -4/+37 | |
2019-04-08 | Fixes to Const Buffer Accessor and Formatting | Fernando Sahmkow | 3 | -10/+10 | |
2019-04-08 | Implement Bindless Handling on SetupTexture | Fernando Sahmkow | 4 | -18/+34 | |
2019-04-08 | Unify both sampler types. | Fernando Sahmkow | 4 | -22/+48 | |
2019-04-08 | Implement Bindless Samplers and TEX_B in the IR. | Fernando Sahmkow | 4 | -16/+77 | |
2019-04-08 | Implement Const Buffer Accessor | Fernando Sahmkow | 5 | -2/+65 | |
2019-04-07 | Permit a Null Shader in case of a bad host_ptr. | Fernando Sahmkow | 1 | -0/+4 | |
2019-04-06 | maxwell_3d: Reduce severity of ProcessSyncPoint | ReinUsesLisp | 1 | -2/+2 | |
2019-04-06 | video_core/textures/convert: Replace include with a forward declaration | Lioncash | 2 | -1/+5 | |
Avoids dragging in a direct dependency in a header. | |||||
2019-04-06 | video_core/textures/texture: Remove unnecessary includes | Lioncash | 6 | -2/+5 | |
Nothing in this header relies on common_funcs or the memory manager. This gets rid of reliance on indirect inclusions in the OpenGL caches. | |||||
2019-04-06 | memory_manager: Improved implementation of read/write/copy block. | bunnei | 3 | -12/+84 | |
- Fixes graphical issues with Chocobo's Mystery Dungeon EVERY BUDDY! - Fixes a crash with Mario Tennis Aces | |||||
2019-04-06 | video_core/macro_interpreter: Remove assertion within FetchParameter() | Lioncash | 1 | -2/+1 | |
We can just use .at(), which essentially does the same thing, but with less code. | |||||
2019-04-06 | video_core/macro_interpreter: Simplify GetRegister() | Lioncash | 1 | -11/+6 | |
Given we already ensure nothing can set the zeroth register in SetRegister(), we don't need to check if the index is zero and special case it. We can just access the register normally, since it's already going to be zero. We can also replace the assertion with .at() to perform the equivalent behavior inline as part of the API. | |||||
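The effect of replacing a manual assertion with `.at()` can be sketched like this. `RegisterFile` and its size are assumptions made for the example, not the interpreter's actual types:

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>

// Hypothetical register file; the real interpreter's size may differ.
using RegisterFile = std::array<std::uint32_t, 4>;

// .at() performs the bounds check itself and throws std::out_of_range
// on a bad index, so no separate assertion is needed.
std::uint32_t GetRegister(const RegisterFile& registers, std::size_t index) {
    return registers.at(index);
}
```

Register zero needs no special case here as long as nothing ever writes to it, which is the invariant the commit message relies on.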
2019-04-06 | video_core/memory_manager: Make Read() a const qualified member function | Lioncash | 2 | -6/+6 | |
Given this doesn't actually alter internal state, this can be made a const member function. | |||||
2019-04-06 | video_core/memory_manager: Make ReadBlock() a const qualified member function | Lioncash | 2 | -2/+2 | |
Now, since we have a const qualified variant of GetPointer(), we can put it to use in ReadBlock() to retrieve the source pointer that is passed into memcpy. Now block reading may be done from a const context. | |||||
2019-04-06 | video_core/memory_manager: Add a const qualified variant of GetPointer() | Lioncash | 2 | -2/+17 | |
Allows retrieving read-only pointers from a const context externally. | |||||
2019-04-06 | video_core/memory_manager: Make FindFreeRegion() a const member function | Lioncash | 2 | -10/+11 | |
This doesn't modify internal state, so it can be made a const member function. | |||||
2019-04-06 | video_core/memory_manager: Make GpuToCpuAddress() a const member function | Lioncash | 2 | -3/+3 | |
This doesn't modify any internal state, so it can be made a const member function to allow its use in const contexts. | |||||
2019-04-06 | Implement SyncPoint Register in the GPU. | Fernando Sahmkow | 2 | -1/+27 | |
2019-04-06 | video_core/gpu_thread: Silence truncation warning in ThreadManager's constructor | Lioncash | 1 | -1/+1 | |
Since c5d41fd812d7eb1a04f36b76c08fe971cee0868c, callback parameters use an s64 instead of an int to represent late cycles, which caused a truncation warning here. Changing the type to s64 is sufficient to silence the warning. | |||||
2019-04-06 | video_core/engines: Make memory manager members private | Lioncash | 9 | -13/+14 | |
These aren't used externally by anything, so they can be made private data members. | |||||
2019-04-06 | video_core/engines: Remove unnecessary inclusions where applicable | Lioncash | 10 | -9/+25 | |
Replaces header inclusions with forward declarations where applicable and also removes unused headers within the cpp file. This reduces a few more dependencies on core/memory.h | |||||
2019-04-06 | renderer_opengl/utils: Skip empty binds | ReinUsesLisp | 1 | -0/+3 | |
2019-04-06 | gl_rasterizer: Use ARB_multi_bind to update SSBOs | ReinUsesLisp | 2 | -9/+9 | |
2019-04-06 | gl_rasterizer: Use ARB_multi_bind to update UBOs across stages | ReinUsesLisp | 4 | -22/+58 | |
2019-04-05 | gl_shader_decompiler: Rename GenerateTemporal() to GenerateTemporary() | Lioncash | 1 | -12/+12 | |
Temporal generally indicates a relation to time, but this is just creating a temporary, so this isn't really an accurate name for what the function is actually doing. | |||||
2019-04-05 | gl_shader_decompiler: Fix TXQ types | ReinUsesLisp | 1 | -2/+3 | |
TXQ returns integer types. Shaders usually do R0 = TXQ(); (which yields an int) followed by R0 = static_cast<float>(R0); If we don't treat it as an integer, it will cast a binary float value as float, resulting in a corrupted number. | |||||
2019-04-04 | video_core/renderer_opengl: Remove unnecessary includes | Lioncash | 13 | -24/+4 | |
Quite a few unused includes have built up over time, particularly on core/memory.h. Removing these includes means the source files including those files will no longer need to be rebuilt if they're changed, making compilation slightly faster in this scenario. | |||||
2019-04-04 | gl_state: Rework to enable individual applies | ReinUsesLisp | 3 | -339/+324 | |
2019-04-03 | shader_ir/memory: Reduce severity of LD_L cache management and log it | ReinUsesLisp | 2 | -2/+9 | |
2019-04-03 | shader_ir/memory: Reduce severity of ST_L cache management and log it | ReinUsesLisp | 2 | -3/+11 | |
2019-04-03 | gl_shader_decompiler: Return early when an operation is invalid | ReinUsesLisp | 1 | -1/+6 | |
2019-04-02 | gl_sampler_cache: Port sampler cache to OpenGL | ReinUsesLisp | 5 | -123/+82 | |
2019-04-02 | video_core: Abstract vk_sampler_cache into a templated class | ReinUsesLisp | 5 | -58/+101 | |
2019-04-02 | gpu_thread: Improve synchronization by using CoreTiming. | bunnei | 3 | -51/+65 | |
2019-04-01 | general: Use deduction guides for std::lock_guard and std::unique_lock | Lioncash | 4 | -17/+17 | |
Since C++17, the introduction of deduction guides for locking facilities means that we no longer need to hardcode the mutex type into the locks themselves, making it easier to switch mutex types, should it ever be necessary in the future. | |||||
2019-03-31 | gl_shader_decompiler: Hide local definitions inside an anonymous namespace | ReinUsesLisp | 1 | -6/+8 | |
2019-03-31 | shader_ir/decode: Silence implicit sign conversion warning | Mat M | 1 | -2/+2 | |
Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | |||||
2019-03-30 | gl_shader_decompiler: Add AOFFI backing implementation | ReinUsesLisp | 1 | -38/+85 | |
2019-03-30 | shader_ir/decode: Implement AOFFI for TEX and TLD4 | ReinUsesLisp | 2 | -27/+94 | |
2019-03-30 | shader_ir: Implement immediate register tracking | ReinUsesLisp | 2 | -1/+19 | |
2019-03-29 | common/zstd_compression: simplify decompression interface | unknown | 1 | -3/+2 | |
2019-03-29 | gl_shader_disk_cache: Fixup clang format | unknown | 1 | -2/+3 | |
2019-03-29 | gl_shader_disk_cache: Use Zstandard for compression | unknown | 1 | -6/+6 | |
2019-03-29 | gl_shader_disk_cache: Use LZ4HC with compression level 9 instead of compression level 12 for less compression time | unknown | 1 | -3/+3 | |
2019-03-29 | Addressed feedback | unknown | 1 | -6/+6 | |
2019-03-29 | gl_shader_disk_cache: Use better compression for transferable and precompiled shader disk cache files | unknown | 1 | -2/+2 | |
2019-03-29 | data_compression: Move LZ4 compression from video_core/gl_shader_disk_cache to common/data_compression | unknown | 2 | -39/+9 | |
2019-03-29 | vk_swapchain: Implement a swapchain manager | ReinUsesLisp | 3 | -1/+305 | |
2019-03-28 | gl_shader_manager: Remove unnecessary gl_shader_manager inclusion | Lioncash | 1 | -2/+0 | |
This isn't used at all in the OpenGL shader cache, so we can remove its include here, meaning one less file needs to be recompiled if any changes ever occur within that header. core/memory.h is also not used within this file at all, so we can remove it as well. | |||||
2019-03-28 | gl_shader_manager: Move using statement into the cpp file | Lioncash | 2 | -4/+4 | |
Avoids introducing Maxwell3D into the namespace for everything that includes the header. | |||||
2019-03-28 | gl_shader_manager: Remove reliance on global accessor within MaxwellUniformData::SetFromRegs() | Lioncash | 3 | -9/+9 | |
We can just pass in the Maxwell3D instance instead of going through the system class to get at it. This also lets us simplify the interface a little bit. Since we pass in the Maxwell3D context now, we only really need to pass the shader stage index value in. | |||||
2019-03-27 | gl_shader_manager: Amend Doxygen string for MaxwellUniformData | Lioncash | 1 | -3/+3 | |
Previously only one line of the whole comment was in proper Doxygen formatting. | |||||
2019-03-27 | gpu_thread: Remove unused dma_pusher class member variable from ThreadManager | Lioncash | 2 | -5/+2 | |
The pusher instance is only ever used in the constructor of the ThreadManager for creating the thread that the ThreadManager instance contains. Aside from that, the member is unused, so it can be removed. | |||||
2019-03-27 | gl_rasterizer: Remove unused reference member variable from RasterizerOpenGL | Lioncash | 3 | -9/+5 | |
This member variable is no longer being used, so it can be removed, removing a dependency on EmuWindow from the rasterizer's interface. | |||||
2019-03-27 | video_core: Amend constructor initializer list order where applicable | Lioncash | 6 | -14/+14 | |
Specifies the members in the same order that initialization would take place in. This also silences -Wreorder warnings. | |||||
2019-03-27 | video_core: Add missing override specifiers | Lioncash | 3 | -4/+4 | |
Ensures that the signatures will always match with the base class. Also silences a few compilation warnings. | |||||
2019-03-27 | video_core/gpu: Amend typo in GPU member variable name | Lioncash | 2 | -7/+8 | |
smaphore -> semaphore | |||||
2019-03-22 | video_core: Implement API agnostic view based texture cache | ReinUsesLisp | 3 | -0/+974 | |
Implements an API agnostic texture view based texture cache. Classes defined here are intended to be inherited by the API implementation and used in API-specific code. This implementation exposes protected virtual functions to be called from the implementer. Before executing any surface copy methods (defined in API-specific code), it tries to detect whether the overlapping surface is a superset and, if it is, creates a view. Views are references to a subset of a surface; a view can also be a superset view (the same as referencing the whole texture). Current code manages 1D, 1D array, 2D, 2D array, cube maps and cube map arrays with layer and mipmap level views. Texture 3D slice views are not implemented. If the view attempt fails, the fast path is invoked with the overlapping textures (defined in the implementer). If that one fails (returning nullptr), it will flush and reload the texture. | |||||
2019-03-22 | Revert "Devirtualize Register/Unregister and use a wrapper instead." | bunnei | 3 | -8/+12 | |
- Fixes graphical issues from transitions in Super Mario Odyssey. | |||||
2019-03-21 | memory_manager: Cleanup FindFreeRegion. | bunnei | 2 | -12/+6 | |
2019-03-21 | memory_manager: Use Common::AlignUp in public interface as needed. | bunnei | 1 | -11/+22 | |
2019-03-21 | memory_manager: Bug fixes and further cleanup. | bunnei | 2 | -73/+72 | |
2019-03-21 | maxwell_dma: Check for valid source in destination before copy. | bunnei | 1 | -0/+10 | |
- Avoid a crash in Octopath Traveler. | |||||
2019-03-21 | memory_manager: Add protections for invalid GPU addresses. | bunnei | 2 | -22/+43 | |
- Avoid a crash in Xenoblade Chronicles 2. | |||||
2019-03-21 | gl_rasterizer_cache: Check that backing memory is valid before creating a surface. | bunnei | 2 | -15/+12 | |
- Fixes a crash in Puyo Puyo Tetris. | |||||
2019-03-21 | gpu: Rewrite virtual memory manager using PageTable. | bunnei | 10 | -201/+472 | |
2019-03-21 | gpu: Move GPUVAddr definition to common_types. | bunnei | 13 | -31/+24 | |
2019-03-17 | gl_rasterizer: Skip zero addr/sized regions on flush/invalidate. | bunnei | 1 | -0/+6 | |
2019-03-16 | memory: Simplify rasterizer cache operations. | bunnei | 1 | -2/+1 | |
2019-03-16 | video_core: Refactor to use MemoryManager interface for all memory access. | bunnei | 19 | -186/+194 | |
# Conflicts: # src/video_core/engines/kepler_memory.cpp # src/video_core/engines/maxwell_3d.cpp # src/video_core/morton.cpp # src/video_core/morton.h # src/video_core/renderer_opengl/gl_global_cache.cpp # src/video_core/renderer_opengl/gl_global_cache.h # src/video_core/renderer_opengl/gl_rasterizer_cache.cpp | |||||
2019-03-15 | gpu: Use host address for caching instead of guest address. | bunnei | 24 | -288/+384 | |
2019-03-13 | video_core/morton: Use enum to describe MortonCopyPixels128 mode | ReinUsesLisp | 3 | -7/+10 | |
2019-03-13 | video_core/morton: Remove unused parameter in MortonSwizzle | ReinUsesLisp | 3 | -8/+7 | |
2019-03-13 | video_core/morton: Remove clang-format off when it's not needed | ReinUsesLisp | 1 | -133/+129 | |
2019-03-13 | video_core/morton: Remove unused functions | ReinUsesLisp | 1 | -39/+0 | |
2019-03-13 | video_core/texture: Fix up sampler lod bias | ReinUsesLisp | 1 | -1/+1 | |
2019-03-13 | vk_sampler_cache: Use operator== instead of memcmp | Mat M | 1 | -1/+1 | |
Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | |||||
2019-03-13 | vk_sampler_cache: Implement a sampler cache | ReinUsesLisp | 4 | -1/+140 | |
2019-03-12 | video_core/texture: Add a raw representation of TSCEntry | ReinUsesLisp | 1 | -24/+29 | |
2019-03-11 | renderer_opengl/gl_global_cache: Replace indexing for assignment with insert_or_assign | Lioncash | 2 | -3/+3 | |
The previous code had some minor issues with it, really not a big deal, but amending it is basically 'free', so I figured, "why not?". With the standard container maps, when: map[key] = thing; is done, this can cause potentially undesirable behavior in certain scenarios. In particular, if there's no value associated with the key, then the map constructs a default initialized instance of the value type. In this case, since it's a std::shared_ptr (as a type alias) that is the value type, this will construct a std::shared_ptr, and then assign over it (with objects that are quite large, or that actively heap allocate, this can be extremely undesirable). We also make the function take the region by value, as we can avoid a copy (and by extension with std::shared_ptr, a copy causes an atomic reference count increment), in certain scenarios when ownership isn't a concern (i.e. when ReserveGlobalRegion is called with an rvalue reference, then no copy at all occurs). So, it's more-or-less a "free" gain without many downsides. | |||||
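The two points made in this message (no default-constructed placeholder, and by-value parameter plus move) can be sketched as below. `Region`, `RegionCache`, `Reserve` and `Find` are hypothetical names, not the actual gl_global_cache interface:

```cpp
#include <cassert>
#include <memory>
#include <unordered_map>
#include <utility>

// Hypothetical cached object and alias.
struct Region {
    int size;
};
using RegionPtr = std::shared_ptr<Region>;

class RegionCache {
public:
    // Taking the pointer by value lets callers hand over an rvalue with no
    // reference count increment; it is then moved into the map.
    void Reserve(int key, RegionPtr region) {
        // Unlike regions[key] = ..., no default-constructed value is created
        // first when the key is missing.
        regions.insert_or_assign(key, std::move(region));
    }

    const Region* Find(int key) const {
        const auto it = regions.find(key);
        return it == regions.end() ? nullptr : it->second.get();
    }

private:
    std::unordered_map<int, RegionPtr> regions;
};
```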
2019-03-11 | renderer_opengl/gl_global_cache: Append missing override specifiers | Lioncash | 1 | -2/+2 | |
Two of the functions here are overridden functions, so we can append these specifiers to make it explicit. | |||||
2019-03-11 | gl_rasterizer: Use system instance passed from argument | ReinUsesLisp | 2 | -29/+31 | |
2019-03-09 | gl_rasterizer: Encapsulate sampler queries into methods | ReinUsesLisp | 3 | -64/+72 | |
2019-03-09 | gl_rasterizer: Minor logger changes | ReinUsesLisp | 1 | -19/+13 | |
2019-03-08 | dma_pusher: Store command_list_header by copy | ReinUsesLisp | 1 | -1/+1 | |
Instead of holding a reference that will get invalidated by dma_pushbuffer.pop(), hold it as a copy. This doesn't have any performance cost since CommandListHeader is 8 bytes long. | |||||
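A small sketch of the hazard and the fix, assuming a simplified header type and a `std::queue` in place of the real pushbuffer (both are stand-ins for illustration):

```cpp
#include <cassert>
#include <cstdint>
#include <queue>

// Simplified stand-in for the 8-byte header mentioned above.
struct CommandListHeader {
    std::uint64_t raw;
};
static_assert(sizeof(CommandListHeader) == 8);

std::uint64_t TakeNextHeader(std::queue<CommandListHeader>& pushbuffer) {
    // Copy the element: a reference to front() would dangle after pop().
    const CommandListHeader header = pushbuffer.front();
    pushbuffer.pop();
    return header.raw;
}
```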
2019-03-07 | video_core/gpu_thread: Remove unimplemented WaitForIdle function prototype | Lioncash | 1 | -3/+0 | |
This function didn't have a definition, so we can remove it to prevent accidentally attempting to use it. | |||||
2019-03-07 | video_core/gpu_thread: Amend constructor initializer list order | Lioncash | 1 | -2/+2 | |
Moves the data members to satisfy the order they're declared as in the constructor initializer list. Silences a -Wreorder warning. | |||||
2019-03-07 | video_core/gpu: Make GPU's destructor virtual | Lioncash | 3 | -3/+3 | |
Because of the recent separation of GPU functionality into sync/async variants, we need to mark the destructor virtual to provide proper destruction behavior, given we use the base class within the System class. Prior to this, it was undefined behavior whether or not the destructor in the derived classes would ever execute. | |||||
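The destruction behavior can be illustrated with a minimal sketch. `GPUBase` and `GPUAsync` are hypothetical names that only mirror the base/derived split described above:

```cpp
#include <cassert>
#include <memory>

// Hypothetical base class held by the owner through a base pointer.
struct GPUBase {
    // Virtual destructor: deleting through GPUBase* runs the derived destructor.
    virtual ~GPUBase() = default;
};

struct GPUAsync : GPUBase {
    explicit GPUAsync(bool& flag) : destroyed(flag) {}
    ~GPUAsync() override {
        destroyed = true;
    }
    bool& destroyed;
};
```

Without `virtual` on the base destructor, deleting a `GPUAsync` through a `GPUBase` pointer would be undefined behavior.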
2019-03-07 | gpu_thread: Fix deadlock with threading idle state check. | bunnei | 2 | -7/+11 | |
2019-03-07 | gpu_thread: (HACK) Ignore flush on FlushAndInvalidateRegion. | bunnei | 1 | -3/+1 | |
2019-03-07 | gpu: Always flush. | bunnei | 2 | -13/+6 | |
2019-03-07 | gpu: Refactor a/synchronous implementations into their own classes. | bunnei | 7 | -63/+155 | |
2019-03-07 | gpu: Move command processing to another thread. | bunnei | 7 | -10/+353 | |
2019-03-07 | gpu: Refactor command and swap buffers interface for asynch. | bunnei | 2 | -3/+22 | |
2019-03-07 | gpu: Refactor to take RendererBase instead of RasterizerInterface. | bunnei | 2 | -17/+22 | |
2019-03-06 | video_core/engines: Remove unnecessary includes | Lioncash | 10 | -11/+11 | |
Removes a few unnecessary dependencies on core-related machinery, such as the core.h and memory.h, which reduces the amount of rebuilding necessary if those files change. This also uncovered some indirect dependencies within other source files. This also fixes those. | |||||
2019-03-05 | video_core/surface: Remove obsolete TODO in PixelFormatFromRenderTargetFormat() | Lioncash | 1 | -2/+0 | |
This isn't needed anymore, according to Hexagon | |||||
2019-03-04 | video_core/renderer_opengl: Replace direct usage of global system object accessors | Lioncash | 2 | -11/+17 | |
We already pass a reference to the system object to the constructor of the renderer, so we can just use that instead of using the global accessor functions. | |||||
2019-03-04 | maxwell_to_vk: Initial implementation | ReinUsesLisp | 4 | -3/+553 | |
2019-03-02 | vk_buffer_cache: Fix clang-format | ReinUsesLisp | 1 | -3/+3 | |
2019-03-02 | fuck git for ruining my day, I will learn but I will not forgive | bunnei | 1 | -1/+1 | |
2019-03-01 | vk_buffer_cache: Implement a buffer cache | ReinUsesLisp | 3 | -0/+205 | |
This buffer cache is just like OpenGL's buffer cache with some minor style changes. It uses VKStreamBuffer. | |||||
2019-02-28 | gl_rasterizer: Remove texture unbinding after dispatching a draw call | ReinUsesLisp | 1 | -12/+0 | |
Unbinding was required when OpenGL delete operations didn't unbind a resource if it was bound. This is no longer needed and can be removed. | |||||
2019-02-28 | gl_state: Fixup multibind bug | ReinUsesLisp | 1 | -2/+2 | |
2019-02-28 | Devirtualize Register/Unregister and use a wrapper instead. | Fernando Sahmkow | 3 | -12/+8 | |
2019-02-28 | Corrections and redesign. | Fernando Sahmkow | 2 | -51/+51 | |
2019-02-28 | Fix linux compile error. | Fernando Sahmkow | 1 | -1/+1 | |
2019-02-28 | Remove NotifyFrameBuffer as we are doing a texception pass every drawcall. | Fernando Sahmkow | 2 | -25/+0 | |
2019-02-28 | Remove certain optimizations that caused texception to fail in certain scenarios. | Fernando Sahmkow | 3 | -24/+1 | |
2019-02-28 | Bug fixes and formatting | Fernando Sahmkow | 2 | -3/+4 | |
2019-02-28 | rasterizer_cache_gl: Implement Texception Pass | Fernando Sahmkow | 3 | -0/+51 | |
2019-02-28 | rasterizer_cache_gl: Implement Partial Reinterpretation of Surfaces. | Fernando Sahmkow | 2 | -0/+100 | |
2019-02-28 | rasterizer_cache: mark reinterpreted surfaces and add ability to reload marked surfaces on next use. | Fernando Sahmkow | 2 | -0/+78 | |
2019-02-28 | rasterizer_cache_gl: Notify on framebuffer change | Fernando Sahmkow | 2 | -4/+23 | |
2019-02-28 | rasterizer_cache: Expose FlushObject to Child classes and allow redefining of Register and Unregister | Fernando Sahmkow | 1 | -11/+11 | |
2019-02-27 | gl_rasterizer_cache: Create texture views for array discrepancies | ReinUsesLisp | 3 | -32/+42 | |
When a texture is sampled in a shader with a different array mode than the cached state, create a texture view and bind that to the shader instead. | |||||
2019-02-27 | vk_memory_manager: Reorder constructor initializer list in terms of member declaration order | Lioncash | 1 | -1/+1 | |
Reorders members in the order that they would actually be initialized in. Silences a -Wreorder warning. | |||||
2019-02-27 | gl_rasterizer: Reorder constructor initializer list in terms of member declaration order | Lioncash | 1 | -2/+2 | |
Orders the members in the order they would actually be initialized in. Silences a -Wreorder warning. | |||||
2019-02-27 | gl_shader_disk_cache: Remove #pragma once from cpp file | Lioncash | 1 | -2/+0 | |
This is only necessary in headers. Silences a warning with clang. | |||||
2019-02-27 | common/math_util: Move contents into the Common namespace | Lioncash | 9 | -23/+23 | |
These types are within the common library, so they should be within the Common namespace. | |||||
2019-02-27 | gl_rasterizer_cache: Move format conversion to its own file | ReinUsesLisp | 7 | -136/+175 | |
2019-02-27 | decoders: Minor style changes | ReinUsesLisp | 2 | -14/+8 | |
2019-02-26 | renderer_opengl: Update pixel format tracking | ReinUsesLisp | 1 | -0/+1 | |
2019-02-26 | maxwell_3d: Use std::bitset to manage dirty flags | ReinUsesLisp | 4 | -52/+51 | |
2019-02-26 | vk_stream_buffer: Remove copy code path | ReinUsesLisp | 2 | -53/+18 | |
2019-02-26 | shader/decode: Remove extras from MetaTexture | ReinUsesLisp | 4 | -40/+65 | |
2019-02-26 | shader/decode: Split memory and texture instructions decoding | ReinUsesLisp | 6 | -501/+537 | |
2019-02-25 | shader/track: Resolve variable shadowing warnings | Lioncash | 1 | -5/+5 | |
2019-02-24 | vk_stream_buffer: Implement a stream buffer | ReinUsesLisp | 3 | -1/+200 | |
This manages two kinds of streaming buffers: one for unified memory models and one for dedicated GPUs. The first one skips the copy from the staging buffer to the real buffer, since it creates a unified buffer. This implementation waits for all fences to finish their operation before "invalidating". This is suboptimal since it should allocate another buffer or start searching from the beginning. There is room for improvement here. This could also handle AMD's "pinned" memory (a heap with 256 MiB) that seems to be designed for buffer streaming. | |||||
2019-02-24 | vk_resource_manager: Minor VKFenceWatch changes | ReinUsesLisp | 2 | -7/+7 | |
2019-02-24 | vk_memory_manager: Fixup commit interval allocation | ReinUsesLisp | 1 | -2/+1 | |
VKMemoryCommitImpl was using as the end of its interval "begin + end". That ended up wasting memory. | |||||
2019-02-24 | gl_rasterizer_cache: Fixup parameter order in layered swizzle | ReinUsesLisp | 1 | -1/+1 | |
2019-02-22 | vk_scheduler: Implement a scheduler | ReinUsesLisp | 3 | -1/+132 | |
The scheduler abstracts command buffer and fence management with an interface that's able to do OpenGL-like operations on Vulkan command buffers. It returns by value a command buffer and fence that have to be used for subsequent operations until Flush or Finish is executed. After that, the current execution context (the pair of command buffers and fences) gets invalidated and a new one must be fetched. Thankfully, validation layers will quickly detect if this is skipped, throwing an error due to modifications to a sent command buffer. | |||||
2019-02-19 | video_core/dma_pusher: Simplyfy Step() logic. | Markus Wick | 2 | -81/+77 | |
As fetching command list headers and the list of command headers is a fixed 1:1 relation now, they can be implemented within a single call. This cleans up the Step() logic quite a bit. | |||||
2019-02-19 | video_core/dma_pusher: The full list of headers at once. | Markus Wick | 2 | -48/+58 | |
Fetching every u32 from memory leads to a big overhead. So let's fetch all of them as a block if possible. This reduces the Memory::* calls by the dma_pusher by a factor of 10. | |||||
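The idea can be sketched like this. `GuestMemory` and its `ReadBlock` member are hypothetical stand-ins for the emulator's memory interface (not the real `Memory::*` functions); the call counter exists only to make the saving visible.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical flat guest memory that counts how often it is accessed.
struct GuestMemory {
    std::vector<std::uint8_t> bytes;
    std::size_t calls = 0;

    void ReadBlock(std::size_t addr, void* dest, std::size_t size) {
        ++calls;
        assert(addr + size <= bytes.size());
        std::memcpy(dest, bytes.data() + addr, size);
    }
};

// Fetches `count` 32-bit command headers with one block read instead of
// `count` separate 32-bit reads.
inline std::vector<std::uint32_t> FetchHeaders(GuestMemory& memory, std::size_t addr,
                                               std::size_t count) {
    std::vector<std::uint32_t> headers(count);
    if (count == 0) {
        return headers;
    }
    memory.ReadBlock(addr, headers.data(), count * sizeof(std::uint32_t));
    return headers;
}

// Reads ten headers and reports how many memory calls that took.
inline std::size_t DemoFetchCalls() {
    GuestMemory memory;
    memory.bytes.resize(64);
    const auto headers = FetchHeaders(memory, 0, 10);
    assert(headers.size() == 10);
    return memory.calls;
}
```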
2019-02-19 | vk_memory_manager: Implement memory manager | ReinUsesLisp | 3 | -0/+342 | |
A memory manager object handles the memory allocations for a device. It allocates chunks of Vulkan memory objects and then suballocates. | |||||
2019-02-16 | video_core: Remove usages of System::GetInstance() within the engines | Lioncash | 8 | -22/+48 | |
Avoids the use of the global accessor in favor of explicitly making the system a dependency within the interface. | |||||
2019-02-16 | core_timing: Convert core timing into a class | Lioncash | 3 | -3/+4 | |
Gets rid of the largest set of mutable global state within the core. This also paves a way for eliminating usages of GetInstance() on the System class as a follow-up. Note that no behavioral changes have been made, and this simply extracts the functionality into a class. This also has the benefit of making dependencies on the core timing functionality explicit within the relevant interfaces. | |||||
2019-02-15 | renderer_opengl: respect the sRGB colorspace for the screenshot feature | fearlessTobi | 1 | -1/+2 | |
Previously, the screenshot feature completely ignored whether the game uses RGB or sRGB. This resulted in screenshot colors that looked off for some titles. | |||||
2019-02-15 | gl_state: Synchronize gl_state even when state is disabled | ReinUsesLisp | 1 | -83/+61 | |
There are some potential edge cases where gl_state may fail to track the state if a related state changes while the toggle is disabled or it didn't change. This addresses that. | |||||
2019-02-14 | vk_resource_manager: Implement a command buffer pool with VKFencedPool | ReinUsesLisp | 2 | -1/+59 | |
2019-02-14 | vk_resource_manager: Add VKFencedPool interface | ReinUsesLisp | 2 | -0/+83 | |
Handles a pool of resources protected by fences. Manages resource overflow allocating more resources. This class is intended to be used through inheritance. | |||||
2019-02-14 | vk_resource_manager: Implement VKResourceManager and fence allocator | ReinUsesLisp | 2 | -0/+85 | |
CommitFence iterates a pool of fences until one is found. If all fences are being used at the same time, allocate more. | |||||
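A rough sketch of the commit-or-grow policy described above, without any Vulkan calls. `Fence`, `FencePool`, `Commit` and the growth step of 4 are illustrative assumptions, not the real VKResourceManager interface.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Illustrative fence: only tracks whether it is currently committed.
struct Fence {
    bool in_use = false;
};

class FencePool {
public:
    explicit FencePool(std::size_t initial) {
        Grow(initial);
    }

    // Returns a free fence, allocating more if every fence is in use.
    Fence& Commit() {
        for (auto& fence : fences) {
            if (!fence->in_use) {
                fence->in_use = true;
                return *fence;
            }
        }
        const std::size_t first_new = fences.size();
        Grow(grow_step);
        fences[first_new]->in_use = true;
        return *fences[first_new];
    }

    std::size_t Size() const {
        return fences.size();
    }

private:
    static constexpr std::size_t grow_step = 4; // Assumed value.

    void Grow(std::size_t count) {
        for (std::size_t i = 0; i < count; ++i) {
            // unique_ptr keeps fence addresses stable while the vector grows.
            fences.push_back(std::make_unique<Fence>());
        }
    }

    std::vector<std::unique_ptr<Fence>> fences;
};

// Exhausts a two-fence pool, releases one fence, and checks that it is reused.
inline bool DemoFenceReuse() {
    FencePool pool(2);
    Fence& a = pool.Commit();
    Fence& b = pool.Commit();
    Fence& c = pool.Commit(); // Pool exhausted: grows by grow_step.
    assert(pool.Size() == 6);
    a.in_use = false; // Release the first fence.
    Fence& d = pool.Commit();
    (void)b;
    (void)c;
    return &d == &a;
}
```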
2019-02-14 | vk_resource_manager: Implement VKFenceWatch | ReinUsesLisp | 2 | -0/+68 | |
A fence watch is used to keep track of the usage of a fence and protect a resource or set of resources without having to inherit from their handlers. | |||||
2019-02-14 | vk_resource_manager: Implement VKFence | ReinUsesLisp | 2 | -0/+131 | |
Fences take ownership of objects, protecting them from GPU-side or driver-side concurrent access. They must be committed from the resource manager. Their usage flow is: commit the fence from the resource manager, protect resources with it and use them, send the fence to an execution queue and Wait for it if needed and then call Release. Used resources will automatically be signaled when they are free to be reused. | |||||
2019-02-14 | vk_resource_manager: Add VKResource interface | ReinUsesLisp | 3 | -1/+43 | |
VKResource is an interface that gets signaled by a fence when it is free to be reused. | |||||
2019-02-14 | shader_decompiler: Improve Accuracy of Attribute Interpolation. | Fernando Sahmkow | 6 | -38/+74 | |
2019-02-13 | rasterizer_cache_gl: Only do fast layered copy on the same format. As | Fernando Sahmkow | 1 | -1/+5 | |
glCopyImageSubData does not support different formats. | |||||
2019-02-13 | vk_device: Abstract device handling into a class | ReinUsesLisp | 3 | -1/+351 | |
VKDevice contains all the data required to manage and initialize a physical device. Its intention is to be passed across Vulkan objects to query device-specific data (for example the logical device and the dispatch loader). | |||||
2019-02-13 | renderer_opengl: Remove reference to global system instance | Lioncash | 1 | -3/+3 | |
We already store a reference to the system instance that the renderer is created with, so we don't need to refer to the system instance via Core::System::GetInstance() | |||||
2019-02-12 | gl_rasterizer_cache: Remove unnecessary newline | Lioncash | 1 | -2/+0 | |
2019-02-12 | gl_rasterizer_cache: Get rid of variable shadowing | Lioncash | 1 | -6/+14 | |
Avoids shadowing the members of the struct itself, which results in a -Wshadow warning. | |||||
2019-02-12 | renderer_vulkan: Add declarations file | ReinUsesLisp | 2 | -0/+52 | |
This file is intended to be included instead of vulkan/vulkan.hpp. It includes declarations of unique handlers using a dynamic dispatcher instead of a static one (which would require linking to a Vulkan library). | |||||
2019-02-12 | gl_shader_decompiler: Re-implement TLDS lod | ReinUsesLisp | 2 | -22/+35 | |
2019-02-12 | core_timing: Rename CoreTiming namespace to Core::Timing | Lioncash | 3 | -3/+3 | |
Places all of the timing-related functionality under the existing Core namespace to keep things consistent, rather than having the timing utilities sitting in its own completely separate namespace. | |||||
2019-02-11 | Corrected F2I None mode to RoundEven. | Fernando Sahmkow | 2 | -4/+4 | |
2019-02-11 | Fix incorrect value for CC bit in IADD | Fernando Sahmkow | 1 | -2/+2 | |
2019-02-10 | kepler_compute: Fixup assert and rename engines | ReinUsesLisp | 6 | -52/+59 | |
When I originally added the compute assert I used the wrong documentation. This addresses that. The dispatch register was tested with homebrew against hardware and is triggered by some games (e.g. Super Mario Odyssey). What exactly is missing to get a valid program bound by this engine requires more investigation. | |||||
2019-02-09 | Implement BGRA8 framebuffer format | greggameplayer | 3 | -0/+4 | |
2019-02-09 | Implement linear textures (#2089) | Fernando Sahmkow | 2 | -5/+39 | |
2019-02-08 | gl_rasterizer_cache: Fixup texture view parameters | ReinUsesLisp | 1 | -2/+2 | |
These parameters were declared as constants and passed to glTextureView but then they were removed on a rebase. This addresses that mistake. | |||||
2019-02-07 | shader_ir: Remove F4 prefix to texture operations | ReinUsesLisp | 3 | -26/+25 | |
This was originally included because texture operations returned a vec4. These operations now return a single float and the F4 prefix doesn't mean anything. | |||||
2019-02-07 | shader_ir: Clean texture management code | ReinUsesLisp | 3 | -133/+104 | |
Previous code relied on GLSL parameter order (something that's always ill-formed on an IR design). This approach passes spatial coordinates through operation nodes and array and depth compare values in the texture metadata. It still contains an "extra" vector containing generic nodes for bias and component index (for example) which is still a bit ill-formed but it should be better than the previous approach. | |||||
2019-02-07 | gl_rasterizer_cache: Mark surface copy destinations as modified. | bunnei | 2 | -4/+18 | |
2019-02-07 | gl_rasterizer: Implement a more accurate fermi 2D copy. | bunnei | 7 | -68/+188 | |
- This is a blit, use the blit registers. | |||||
2019-02-07 | gl_shader_disk_cache: Check LZ4 size limit | Frederic L | 1 | -0/+4 | |
Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | |||||
2019-02-07 | gl_shader_disk_cache: Consider compressed size zero as an error | Frederic L | 1 | -2/+2 | |
Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | |||||
2019-02-07 | gl_shader_disk_cache: Use unordered containers | ReinUsesLisp | 4 | -56/+64 | |
2019-02-07 | gl_shader_cache: Fixup GLSL unique identifiers | ReinUsesLisp | 2 | -3/+3 | |
2019-02-07 | gl_shader_cache: Link loading screen with disk shader cache load | ReinUsesLisp | 5 | -9/+40 | |
2019-02-07 | gl_shader_cache: Set GL_PROGRAM_SEPARABLE to dumped shaders | ReinUsesLisp | 1 | -0/+1 | |
i965 (and probably all mesa drivers) require GL_PROGRAM_SEPARABLE when using glProgramBinary. This is probably required by the standard but it's ignored by permissive proprietary drivers. | |||||
2019-02-07 | gl_shader_disk_cache: Pass core system as argument and guard against games without title ids | ReinUsesLisp | 10 | -17/+57 | |
2019-02-07 | gl_shader_disk_cache: Guard reads and writes against failure | ReinUsesLisp | 2 | -216/+339 | |
2019-02-07 | gl_shader_disk_cache: Address miscellaneous feedback | ReinUsesLisp | 5 | -43/+57 | |
2019-02-07 | gl_shader_disk_cache: Pass return values returning instead of by parameters | ReinUsesLisp | 3 | -39/+37 | |
2019-02-07 | gl_shader_disk_cache: Compress program binaries using LZ4 | ReinUsesLisp | 1 | -7/+28 | |
2019-02-07 | gl_shader_disk_cache: Compress GLSL code using LZ4 | ReinUsesLisp | 2 | -6/+57 | |
2019-02-07 | gl_shader_disk_cache: Save GLSL and entries into the precompiled file | ReinUsesLisp | 9 | -135/+234 | |
2019-02-07 | settings: Hide shader cache behind a setting | ReinUsesLisp | 1 | -0/+21 | |
2019-02-07 | gl_shader_disk_cache: Invalidate shader cache changes with CMake hash | ReinUsesLisp | 1 | -7/+16 | |
2019-02-07 | gl_shader_cache: Refactor to support disk shader cache | ReinUsesLisp | 2 | -121/+388 | |
2019-02-07 | gl_shader_disk_cache: Add transferable cache invalidation | ReinUsesLisp | 2 | -0/+8 | |
2019-02-07 | gl_shader_disk_cache: Add precompiled load | ReinUsesLisp | 2 | -0/+45 | |
2019-02-07 | gl_shader_disk_cache: Add precompiled save | ReinUsesLisp | 2 | -0/+57 | |
2019-02-07 | gl_shader_disk_cache: Add transferable load | ReinUsesLisp | 2 | -0/+56 | |
2019-02-07 | gl_shader_disk_cache: Add transferable stores | ReinUsesLisp | 2 | -0/+194 | |
2019-02-07 | gl_shader_disk_cache: Add ShaderDiskCacheOpenGL class and helpers | ReinUsesLisp | 2 | -0/+76 | |
2019-02-07 | gl_shader_disk_cache: Add file and move BaseBindings declaration | ReinUsesLisp | 4 | -10/+58 | |
2019-02-07 | gl_shader_decompiler: Remove name entries | ReinUsesLisp | 2 | -28/+10 | |
2019-02-07 | gl_shader_util: Add parameter to handle retrievable programs | ReinUsesLisp | 3 | -6/+10 | |
2019-02-07 | rasterizer_interface: Add disk cache entry for the rasterizer | ReinUsesLisp | 5 | -0/+14 | |
2019-02-07 | shader_decode: Implement LDG and basic cbuf tracking | ReinUsesLisp | 1 | -0/+33 | |
2019-02-05 | video_core/texture: Fix BitField size for depth_minus_one | ReinUsesLisp | 1 | -1/+1 | |
2019-02-04 | Update src/video_core/engines/shader_bytecode.h | Mat M | 1 | -1/+1 | |
Co-Authored-By: FernandoS27 <fsahmkow27@gmail.com> | |||||
2019-02-03 | Fix TXQ not using the component mask. | Fernando Sahmkow | 2 | -6/+13 | |
2019-02-03 | shader_ir/memory: Add ST_L 64 and 128 bits stores | ReinUsesLisp | 1 | -3/+11 | |
2019-02-03 | shader/track: Search inside of conditional nodes | ReinUsesLisp | 1 | -0/+11 | |
Some games conditionally use global memory instructions. This allows the heuristic to search inside conditional nodes for the source constant buffer. | |||||
2019-02-03 | shader_ir: Rename BasicBlock to NodeBlock | ReinUsesLisp | 30 | -122/+120 | |
It's not always used as a basic block. Rename it for consistency. | |||||
2019-02-03 | shader_ir: Pass decoded nodes as a whole instead of per basic blocks | ReinUsesLisp | 27 | -57/+62 | |
Some games call LDG at the top of a basic block, causing the tracking heuristic to fail. This commit lets the heuristic search the decoded nodes as a whole instead of per basic block. This may lead to some false positives, but it allows the heuristic to track cases it previously couldn't. | |||||
2019-02-03 | video_core: Assert on invalid GPU to CPU address queries | ReinUsesLisp | 8 | -47/+67 | |
2019-02-03 | maxwell_3d: Allow sampler handles with TSC id zero | ReinUsesLisp | 1 | -10/+6 | |
2019-02-03 | maxwell_3d: Allow texture handles with TIC id zero | ReinUsesLisp | 3 | -21/+7 | |
Also remove "enabled" field from Tegra::Texture::FullTextureInfo because it would become unused. | |||||
2019-02-03 | memory_manager: Check for reserved page status | ReinUsesLisp | 1 | -1/+2 | |
2019-02-03 | shader_ir/memory: Add LD_L 128 bits loads | ReinUsesLisp | 1 | -7/+19 | |
2019-02-03 | shader_bytecode: Rename BytesN enums to BitsN | ReinUsesLisp | 2 | -7/+7 | |
2019-02-03 | shader_ir/memory: Add LD_L 64 bits loads | ReinUsesLisp | 1 | -6/+17 | |
2019-02-01 | rasterizer_interface: Remove unused AccelerateFill operation | ReinUsesLisp | 3 | -11/+0 | |
2019-02-01 | video_core: Remove unused Fill surface type | ReinUsesLisp | 2 | -6/+1 | |
2019-01-30 | gl_rasterizer_cache: Fixup test clause | ReinUsesLisp | 1 | -6/+5 | |
2019-01-30 | gl_rasterizer_cache: Guard clause swizzle testing | Mat M | 1 | -1/+3 | |
Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | |||||
2019-01-30 | gl_state: Remove texture target tracking | ReinUsesLisp | 2 | -5/+0 | |
2019-01-30 | gl_rasterizer_cache: Move swizzling to textures instead of state | ReinUsesLisp | 6 | -28/+35 | |
2019-01-30 | gl_state: Use DSA and multi bind to update texture bindings | ReinUsesLisp | 1 | -8/+22 | |
2019-01-30 | gl_rasterizer: Use DSA for textures | ReinUsesLisp | 5 | -185/+105 | |
2019-01-30 | video_core/dma_pusher: Silence C4828 warnings | Lioncash | 1 | -1/+1 | |
This was previously causing: warning C4828: The file contains a character starting at offset 0xa33 that is illegal in the current source character set (codepage 65001). warnings on Windows when compiling yuzu. | |||||
2019-01-30 | shader_ir: Unify constant buffer offset values | ReinUsesLisp | 17 | -25/+36 | |
Constant buffer values on the shader IR were using different offsets depending on whether the access was direct or indirect. cbuf34 has a non-multiplied offset while cbuf36 has a multiplied one. On shader decoding this commit multiplies it by four on cbuf34 queries. | |||||
2019-01-30 | gl_shader_cache: Use explicit bindings | ReinUsesLisp | 7 | -249/+194 | |
2019-01-30 | gl_rasterizer: Implement global memory management | ReinUsesLisp | 6 | -4/+140 | |
2019-01-30 | shader_decode: Implement LDG and basic cbuf tracking | ReinUsesLisp | 7 | -10/+240 | |
2019-01-30 | video_core/GPU Implemented the GPU PFIFO puller semaphore operations. (#1908) | Kevin | 2 | -12/+242 | |
* Implemented the puller semaphore operations. * Nit: Fix 2 style issues * Nit: Add Break to default case. * Fix style. * Update for comments. Added ReferenceCount method * Forgot to remove GpuSmaphoreAddress union. * Fix the clang-format issues. * More clang formatting. * two more white spaces for the Clang formatting. * Move puller members into the regs union * Updated to use Memory::WriteBlock instead of Memory::Write* * Fix clang style issues * White space clang error * Removing unused functions and other pr comment * Removing unused functions and other pr comment * More union magic for setting regs value. * union magic refcnt as well * Remove local var * Set up the regs and regs_assert_positions up properly * Fix clang error | |||||
2019-01-30 | gl_shader_cache: Fix texture view for cubemaps as cubemap arrays | ReinUsesLisp | 4 | -3/+28 | |
Cubemaps are considered layered and to create a texture view the texture mustn't be a layered texture, resulting in cubemaps being bound as cubemap arrays. To fix this issue this commit introduces an extra surface parameter called "is_array" and uses this to query for texture view creation. Now that texture views for cubemaps are actually being created, this also fixes the number of layers created for the texture view (since they have to be 6 to create a texture view of cubemaps). | |||||
2019-01-30 | gl_rasterizer: Workaround invalid zeta clears | ReinUsesLisp | 2 | -14/+19 | |
Some games (like Xenoblade Chronicles 2) clear both depth and stencil buffers while there's a depth-only texture attached (e.g. D16 Unorm). This commit reads the zeta format of the bound surface on ConfigureFramebuffers and returns if depth and/or stencil attachments were set. This is ignored on DrawArrays but on Clear it's used to just clear those attachments, bypassing an OpenGL error. | |||||
2019-01-28 | shader/shader_ir: Amend three comment typos | Lioncash | 1 | -3/+3 | |
Given we're in the area, these are three trivial typos that can be corrected. | |||||
2019-01-28 | shader/shader_ir: Amend constructor initializer ordering for AbufNode | Lioncash | 1 | -2/+2 | |
Orders the class members in the same order that they would actually be initialized in. Gets rid of two compiler warnings. | |||||
2019-01-28 | shader/decode: Avoid a pessimizing std::move within DecodeRange() | Lioncash | 1 | -1/+1 | |
Using std::move on a local variable in a return statement has the potential to prevent copy elision from occurring, so this can just be converted into a regular return. | |||||
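The difference can be observed with a type that counts its move constructions. This is a sketch assuming C++17; `Counted`, `MakePlain`, `MakePessimized` and `MovesFor` are hypothetical names, unrelated to DecodeRange() itself.

```cpp
#include <utility>

// Counts how many times it is move-constructed.
struct Counted {
    static inline int moves = 0;
    Counted() = default;
    Counted(const Counted&) = default;
    Counted(Counted&&) noexcept {
        ++moves;
    }
};

inline Counted MakePlain() {
    Counted value;
    return value; // Eligible for NRVO: the move may be elided entirely.
}

inline Counted MakePessimized() {
    Counted value;
    return std::move(value); // Forces a move construction; elision is not allowed.
}

// Resets the counter, calls the factory once, and reports the moves it caused.
inline int MovesFor(Counted (*factory)()) {
    Counted::moves = 0;
    Counted result = factory();
    (void)result;
    return Counted::moves;
}
```

`MovesFor(MakePessimized)` always reports one move, while `MovesFor(MakePlain)` reports at most one, and none at all on compilers that apply named return value optimization.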
2019-01-26 | video_core: Silent implicit conversion warning | ReinUsesLisp | 1 | -3/+4 | |
2019-01-24 | frontend: Refactor ScopeAcquireWindowContext out of renderer_opengl. | bunnei | 3 | -28/+2 | |
2019-01-22 | maxwell_3d: Set rt_separate_frag_data to 1 by default | ReinUsesLisp | 2 | -4/+6 | |
Commercial games assume that this value is 1 but they never set it. On the other hand nouveau manually sets this register. On ConfigureFramebuffers we were asserting for what we are actually implementing (according to envytools). | |||||
2019-01-21 | Rename step 1 and step 2 to be a little more descriptive | James Rowe | 1 | -2/+2 | |
2019-01-20 | QT: Upgrade the Loading Bar to look much better | James Rowe | 1 | -0/+9 | |
2019-01-18 | gl_rasterizer: Silent unsafe mix warning | ReinUsesLisp | 1 | -1/+1 | |
2019-01-16 | shader_ir: Fixup clang build | ReinUsesLisp | 1 | -4/+6 | |
2019-01-15 | gl_shader_decompiler: replace std::get<> with std::get_if<> for macOS compatibility | ReinUsesLisp | 1 | -44/+58 | |
2019-01-15 | gl_shader_decompiler: Inline textureGather component | ReinUsesLisp | 1 | -15/+16 | |
2019-01-15 | shader_decode: Fixup XMAD | ReinUsesLisp | 1 | -1/+1 | |
2019-01-15 | shader_ir: Pass to decoder functions basic block's code | ReinUsesLisp | 27 | -82/+83 | |
2019-01-15 | shader_decode: Improve zero flag implementation | ReinUsesLisp | 15 | -75/+79 | |
2019-01-15 | shader_ir: Remove composite primitives and use temporals instead | ReinUsesLisp | 4 | -241/+224 | |
2019-01-15 | gl_shader_decompiler: Fixup AssignCompositeHalf | ReinUsesLisp | 1 | -1/+1 | |
2019-01-15 | shader_decode: Use proper primitive names | ReinUsesLisp | 4 | -25/+21 | |
2019-01-15 | shader_decode: Use BitfieldExtract instead of shift + and | ReinUsesLisp | 8 | -48/+37 | |
2019-01-15 | shader_ir: Remove Ipa primitive | ReinUsesLisp | 3 | -13/+2 | |
2019-01-15 | gl_shader_decompiler: Use rasterizer's UBO size limit | ReinUsesLisp | 1 | -1/+3 | |
2019-01-15 | gl_shader_gen: Fixup code formatting | ReinUsesLisp | 2 | -18/+22 | |
2019-01-15 | video_core: Rename glsl_decompiler to gl_shader_decompiler | ReinUsesLisp | 7 | -7/+7 | |
2019-01-15 | shader_ir: Remove RZ and use Register::ZeroIndex instead | ReinUsesLisp | 3 | -12/+16 | |
2019-01-15 | shader_decode: Implement TEXS.F16 | ReinUsesLisp | 3 | -15/+57 | |
2019-01-15 | shader_decode: Fixup R2P | ReinUsesLisp | 1 | -2/+3 | |
2019-01-15 | glsl_decompiler: Fixup TLDS | ReinUsesLisp | 1 | -1/+0 | |
2019-01-15 | glsl_decompiler: Fixup geometry shaders | ReinUsesLisp | 2 | -15/+17 | |
2019-01-15 | shader_decode: Fixup WriteLogicOperation zero comparison | ReinUsesLisp | 1 | -1/+1 | |
2019-01-15 | glsl_decompiler: Fixup permissive member function declarations | ReinUsesLisp | 1 | -133/+133 | |
2019-01-15 | shader_decode: Fixup PSET | ReinUsesLisp | 1 | -2/+3 | |
2019-01-15 | shader_decode: Fixup clang-format | ReinUsesLisp | 2 | -2/+4 | |
2019-01-15 | video_core: Implement IR based geometry shaders | ReinUsesLisp | 4 | -10/+102 | |
2019-01-15 | shader_decode: Implement VMAD and VSETP | ReinUsesLisp | 5 | -2/+129 | |
2019-01-15 | shader_decode: Implement HSET2 | ReinUsesLisp | 3 | -1/+50 | |
2019-01-15 | shader_decode: Rework HSETP2 | ReinUsesLisp | 4 | -47/+57 | |
2019-01-15 | shader_decode: Implement R2P | ReinUsesLisp | 1 | -1/+28 | |
2019-01-15 | shader_decode: Implement CSETP | ReinUsesLisp | 1 | -14/+37 | |
2019-01-15 | shader_decode: Implement PSET | ReinUsesLisp | 1 | -1/+16 | |
2019-01-15 | shader_decode: Implement HFMA2 | ReinUsesLisp | 4 | -5/+60 | |
2019-01-15 | glsl_decompiler: Remove HNegate inlining | ReinUsesLisp | 1 | -10/+0 | |
2019-01-15 | shader_decode: Implement POPC | ReinUsesLisp | 4 | -1/+22 | |
2019-01-15 | shader_decode: Implement TLDS (untested) | ReinUsesLisp | 3 | -10/+92 | |
2019-01-15 | shader_decode: Update TLD4 reflecting #1862 changes | ReinUsesLisp | 2 | -52/+52 | |
2019-01-15 | shader_ir: Fixup TEX and TEXS and partially fix TLD4 decompiling | ReinUsesLisp | 3 | -60/+72 | |
2019-01-15 | shader_decode: Fixup FSET | ReinUsesLisp | 1 | -2/+2 | |
2019-01-15 | shader_decode: Implement IADD32I | ReinUsesLisp | 1 | -0/+11 | |
2019-01-15 | shader_decode: Fixup clang-format | ReinUsesLisp | 1 | -1/+1 | |
2019-01-15 | video_core: Return safe values after an assert hits | ReinUsesLisp | 8 | -8/+19 | |
2019-01-15 | shader_decode: Implement FFMA | ReinUsesLisp | 1 | -1/+36 | |
2019-01-15 | video_core: Address feedback | ReinUsesLisp | 4 | -13/+16 | |
2019-01-15 | shader_ir: Fixup file inclusions and clang-format | ReinUsesLisp | 3 | -2/+2 | |
2019-01-15 | shader_ir: Move comment node string | Mat M | 1 | -2/+2 | |
Co-Authored-By: ReinUsesLisp <reinuseslisp@airmail.cc> | |||||
2019-01-15 | shader_ir: Address feedback to avoid UB in bit casting | ReinUsesLisp | 1 | -2/+4 | |
2019-01-15 | shader_decode: Fixup clang-format | ReinUsesLisp | 2 | -3/+2 | |
2019-01-15 | shader_decode: Implement LEA | ReinUsesLisp | 1 | -0/+55 | |
2019-01-15 | shader_decode: Implement IADD3 | ReinUsesLisp | 1 | -0/+61 | |
2019-01-15 | shader_decode: Implement LOP3 | ReinUsesLisp | 2 | -0/+62 | |
2019-01-15 | shader_decode: Implement ST_L | ReinUsesLisp | 1 | -0/+17 | |
2019-01-15 | shader_decode: Implement LD_L | ReinUsesLisp | 1 | -0/+18 | |
2019-01-15 | shader_decode: Implement HSETP2 | ReinUsesLisp | 1 | -1/+37 | |
2019-01-15 | shader_decode: Implement HADD2 and HMUL2 | ReinUsesLisp | 1 | -1/+48 | |
2019-01-15 | shader_decode: Implement HADD2_IMM and HMUL2_IMM | ReinUsesLisp | 1 | -1/+28 | |
2019-01-15 | shader_decode: Implement MOV_SYS | ReinUsesLisp | 1 | -0/+27 | |
2019-01-15 | shader_decode: Implement IMNMX | ReinUsesLisp | 1 | -0/+16 | |
2019-01-15 | shader_decode: Implement F2F_C | ReinUsesLisp | 1 | -2/+10 | |
2019-01-15 | shader_decode: Implement I2I | ReinUsesLisp | 1 | -0/+26 | |
2019-01-15 | shader_decode: Implement BRA internal flag | ReinUsesLisp | 1 | -4/+8 | |
2019-01-15 | shader_decode: Implement ISCADD | ReinUsesLisp | 1 | -0/+15 | |
2019-01-15 | shader_decode: Implement XMAD | ReinUsesLisp | 1 | -1/+85 | |
2019-01-15 | shader_decode: Implement PBK and BRK | ReinUsesLisp | 1 | -1/+22 | |
2019-01-15 | shader_decode: Implement LOP | ReinUsesLisp | 1 | -0/+15 | |
2019-01-15 | shader_decode: Implement SEL | ReinUsesLisp | 1 | -0/+8 | |
2019-01-15 | shader_decode: Implement IADD | ReinUsesLisp | 1 | -1/+28 | |
2019-01-15 | shader_decode: Implement ISETP | ReinUsesLisp | 1 | -1/+30 | |
2019-01-15 | shader_decode: Implement BFI | ReinUsesLisp | 1 | -1/+22 | |
2019-01-15 | shader_decode: Implement ISET | ReinUsesLisp | 1 | -1/+27 | |
2019-01-15 | shader_decode: Implement LD_C | ReinUsesLisp | 1 | -0/+31 | |
2019-01-15 | shader_decode: Implement SHL | ReinUsesLisp | 1 | -0/+8 | |
2019-01-15 | shader_decode: Implement SHR | ReinUsesLisp | 1 | -1/+26 | |
2019-01-15 | shader_decode: Implement LOP32I | ReinUsesLisp | 2 | -1/+72 | |
2019-01-15 | shader_decode: Implement BFE | ReinUsesLisp | 1 | -1/+25 | |
2019-01-15 | shader_decode: Implement FSET | ReinUsesLisp | 1 | -1/+36 | |
2019-01-15 | shader_decode: Implement F2I | ReinUsesLisp | 1 | -0/+37 | |
2019-01-15 | shader_decode: Implement I2F | ReinUsesLisp | 1 | -0/+23 | |
2019-01-15 | shader_decode: Implement F2F | ReinUsesLisp | 1 | -1/+37 | |
2019-01-15 | shader_decode: Stub DEPBAR | ReinUsesLisp | 1 | -0/+4 | |
2019-01-15 | shader_decode: Implement SSY and SYNC | ReinUsesLisp | 1 | -0/+19 | |
2019-01-15 | shader_decode: Implement PSETP | ReinUsesLisp | 1 | -1/+21 | |
2019-01-15 | shader_decode: Implement TMML | ReinUsesLisp | 1 | -3/+45 | |
2019-01-15 | shader_decode: Implement TEX and TXQ | ReinUsesLisp | 2 | -0/+223 | |
2019-01-15 | shader_decode: Implement TEXS (F32) | ReinUsesLisp | 2 | -0/+217 | |
2019-01-15 | shader_decode: Implement FSETP | ReinUsesLisp | 1 | -1/+33 | |
2019-01-15 | shader_decode: Partially implement BRA | ReinUsesLisp | 1 | -0/+12 | |
2019-01-15 | shader_decode: Implement IPA | ReinUsesLisp | 1 | -0/+12 | |
2019-01-15 | shader_decode: Implement EXIT | ReinUsesLisp | 1 | -1/+32 | |
2019-01-15 | shader_decode: Implement ST_A | ReinUsesLisp | 1 | -0/+30 | |
2019-01-15 | shader_decode: Implement LD_A | ReinUsesLisp | 1 | -1/+39 | |
2019-01-15 | shader_decode: Implement FADD32I | ReinUsesLisp | 1 | -0/+12 | |
2019-01-15 | shader_decode: Implement FMUL32_IMM | ReinUsesLisp | 1 | -0/+10 | |
2019-01-15 | shader_decode: Implement MOV32_IMM | ReinUsesLisp | 1 | -1/+9 | |
2019-01-15 | shader_decode: Stub RRO_C, RRO_R and RRO_IMM | ReinUsesLisp | 1 | -0/+9 | |
2019-01-15 | shader_decode: Implement FMNMX_C, FMNMX_R and FMNMX_IMM | ReinUsesLisp | 1 | -0/+18 | |
2019-01-15 | shader_decode: Implement MUFU | ReinUsesLisp | 1 | -0/+29 | |
2019-01-15 | shader_decode: Implement FADD_C, FADD_R and FADD_IMM | ReinUsesLisp | 1 | -0/+15 | |
2019-01-15 | shader_decode: Implement FMUL_C, FMUL_R and FMUL_IMM | ReinUsesLisp | 1 | -0/+42 | |
2019-01-15 | shader_decode: Implement MOV_C and MOV_R | ReinUsesLisp | 1 | -1/+23 | |
2019-01-15 | video_core: Replace gl_shader_decompiler | ReinUsesLisp | 8 | -4185/+57 | |
2019-01-15 | glsl_decompiler: Implementation | ReinUsesLisp | 3 | -0/+1483 | |
2019-01-15 | shader_ir: Add condition code helper | ReinUsesLisp | 2 | -0/+13 | |
2019-01-15 | shader_ir: Add predicate combiner helper | ReinUsesLisp | 2 | -0/+15 | |
2019-01-15 | shader_ir: Add comparison helpers | ReinUsesLisp | 2 | -0/+106 | |
2019-01-15 | shader_ir: Add half float helpers | ReinUsesLisp | 2 | -0/+44 | |
2019-01-15 | shader_ir: Add integer helpers | ReinUsesLisp | 2 | -0/+40 | |
2019-01-15 | shader_ir: Add float helpers | ReinUsesLisp | 2 | -0/+24 | |
2019-01-15 | shader_ir: Add setters | ReinUsesLisp | 2 | -0/+24 | |
2019-01-15 | shader_ir: Add local memory getters | ReinUsesLisp | 2 | -0/+7 | |
2019-01-15 | shader_ir: Add internal flag getters | ReinUsesLisp | 2 | -0/+10 | |
2019-01-15 | shader_ir: Add attribute getters | ReinUsesLisp | 2 | -0/+26 | |
2019-01-15 | shader_ir: Add constant buffer getters | ReinUsesLisp | 2 | -0/+25 | |
2019-01-15 | shader_ir: Add register getter | ReinUsesLisp | 2 | -0/+9 | |
2019-01-15 | shader_ir: Add immediate node constructors | ReinUsesLisp | 2 | -1/+34 | |
2019-01-15 | shader_ir: Initial implementation | ReinUsesLisp | 30 | -0/+1573 | |
2019-01-15 | shader_bytecode: Fixup encoding | ReinUsesLisp | 1 | -1/+1 | |
2019-01-15 | shader_header: Make local memory size getter constant | ReinUsesLisp | 1 | -1/+1 | |
2019-01-09 | gl_rasterizer: Workaround Intel VAO DSA bug | ReinUsesLisp | 3 | -7/+16 | |
There is a bug in Intel's blob driver where it fails to properly build a vertex array object if it's not bound, even after creating it with glCreateVertexArrays. This workaround binds it after creating it to bypass the issue. | |||||
2019-01-08 | gl_global_cache: Add dummy global cache manager | ReinUsesLisp | 5 | -3/+96 | |
2019-01-07 | gl_rasterizer: Skip framebuffer configuration if rendertargets have not been changed | ReinUsesLisp | 2 | -1/+31 | |
2019-01-07 | gl_rasterizer_cache: Use dirty flags for the depth buffer | ReinUsesLisp | 4 | -3/+23 | |
2019-01-07 | gl_rasterizer_cache: Use dirty flags for color buffers | ReinUsesLisp | 4 | -4/+24 | |
2019-01-07 | gl_shader_cache: Use dirty flags for shaders | ReinUsesLisp | 5 | -2/+23 | |
2019-01-06 | gl_stream_buffer: Use DSA for buffer management | ReinUsesLisp | 3 | -17/+14 | |
2019-01-06 | gl_rasterizer: Use DSA for vertex array objects | ReinUsesLisp | 6 | -79/+53 | |
2019-01-06 | gl_state: Drop uniform buffer state tracking | ReinUsesLisp | 3 | -10/+0 | |
2019-01-05 | gl_rasterizer_cache: Use GL_STREAM_COPY for PBOs | ReinUsesLisp | 1 | -1/+1 | |
Since the data follows the path CPU -> GPU -> GPU, copy is the most appropriate hint. Using GL_STREAM_DRAW generated a performance warning on Nvidia's stack. Changing this hint removed the warning. | |||||
2018-12-30 | gl_rasterizer_cache: Texture view if shader samples array but OGL is not | ReinUsesLisp | 3 | -14/+74 | |
When a shader samples a texture array but that texture in OpenGL is created without layers, use a texture view to increase the texture hierarchy. For example, instead of binding a GL_TEXTURE_2D bind a GL_TEXTURE_2D_ARRAY view. | |||||
2018-12-28 | gpu: Remove PixelFormat G8R8U and G8R8S, as they do not seem to exist. | bunnei | 4 | -79/+46 | |
- Fixes UI rendering issues in The Legend of Zelda: Breath of the Wild. | |||||
2018-12-27 | Add missing uintBitsToFloat to SetRegisterToHalfFloat | Rodolfo Bogado | 1 | -2/+2 | |
2018-12-26 | renderer_opengl: Correct forward declaration of FramebufferLayout | Lioncash | 1 | -1/+1 | |
This is actually a struct, not a class, which can lead to compilation warnings. | |||||
2018-12-26 | Apply CC test to the final value to be stored in the register | Rodolfo Bogado | 1 | -9/+12 | |
2018-12-26 | Fixed shader linking error due to TLDS (#1934) | David | 1 | -1/+1 | |
* Fixed shader linking error due to TLDS coord should be coords * Fix remaining coords | |||||
2018-12-26 | shader_bytecode: Fixup TEXS.F16 encoding | ReinUsesLisp | 1 | -1/+1 | |
2018-12-22 | Include saturation in the evaluation of the control code | Rodolfo Bogado | 1 | -3/+4 | |
2018-12-22 | Handle RZ cases evaluating the expression instead of the register value. | Rodolfo Bogado | 1 | -14/+22 | |
2018-12-22 | complete emulation of ZeroFlag | Rodolfo Bogado | 1 | -100/+97 | |
2018-12-19 | hopefully fix clang format issue | David Marcec | 1 | -0/+1 | |
2018-12-19 | Fixed uninitialized memory due to missing returns in canary | David Marcec | 10 | -3/+29 | |
Functions which are supposed to crash on non-canary builds usually don't return anything, which led to uninitialized memory being used. | |||||
2018-12-18 | yuzu, video_core: Screenshot functionality | zhupengfei | 6 | -4/+95 | |
Allows capturing a screenshot at the current internal resolution (native for software renderer), but a setting is available to capture it in other resolutions. The screenshot is saved to a single PNG in the current layout. | |||||
2018-12-18 | Texture format fixes: Flag RGBA16UI as GL_RGBA_INTEGER format, and interpret R16U as Z16 when depth_compare is enabled. | heapo | 1 | -1/+11 | |
2018-12-18 | shader_bytecode: Fixup half float's operator B encoding | ReinUsesLisp | 1 | -1/+1 | |
2018-12-17 | Implement postfactor multiplication/division for fmul instructions | heapo | 2 | -5/+21 | |
2018-12-17 | Fix arrayed shadow sampler array slice/depth comparison ordering, as well as invalid GLSL LOD selection. | heapo | 1 | -16/+14 | |
2018-12-11 | gl_shader_cache: Dehardcode constant in CalculateProgramSize() | Lioncash | 1 | -2/+2 | |
This constant is related to the size of the instruction. | |||||
2018-12-11 | gl_shader_cache: Resolve truncation compiler warning | Lioncash | 1 | -1/+1 | |
The previous code would cause a warning, as it was truncating size_t (64-bit) to a u32 (32-bit) implicitly. | |||||
2018-12-10 | gl_shader_decompiler: IPA FrontFacing: the right value when it is the front face is 0xFFFFFFFF. | Marcos Vitali | 1 | -1/+1 | |
2018-12-09 | Implemented a shader unique identifier. | Fernando Sahmkow | 4 | -0/+57 | |
2018-12-09 | Add more info into textures' object labels | FernandoS27 | 2 | -2/+57 | |
2018-12-07 | gl_shader_decompiler: TLDS/TLD4/TLD4S Reworked reflecting the source registers, bugs fixed and modularize. | Marcos Vitali | 1 | -106/+134 | |
2018-12-05 | gl_shader_decompiler: Implement TEXS.F16 | ReinUsesLisp | 2 | -13/+51 | |
2018-12-05 | gl_shader_decompiler: Fixup inverted if | ReinUsesLisp | 1 | -6/+5 | |
2018-12-05 | Improve msvc codegen for hot-path array LUTs | heapo | 1 | -275/+277 | |
In some constexpr functions, msvc is building the LUT at runtime (pushing each element onto the stack) out of an abundance of caution. Moving the arrays to be file-scoped constexprs avoids this and turns the functions into simple look-ups as intended. | |||||
2018-12-04 | Rewrote TEX/TEXS (TEX Scalar). (#1826) | Marcos | 1 | -259/+177 | |
* Rewrote TEX/TEXS (TEX Scalar). * Style fixes. * Style issues. | |||||
2018-12-04 | Removed unused file. | Subv | 1 | -142/+0 | |
This is a leftover from #1792 | |||||
2018-12-04 | GPU: Don't try to route PFIFO methods (0-0x40) to the other engines. | Subv | 1 | -0/+6 | |
2018-12-01 | Fix debug build | Lioncash | 1 | -4/+2 | |
A non-existent parameter was left in some formatting calls (the logging macro for which only does anything meaningful on debug builds) | |||||
2018-11-30 | gl_rasterizer_cache: Update AccurateCopySurface to flush complete source surface. | bunnei | 1 | -1/+4 | |
- Fixes issues with Breath of the Wild with use_accurate_gpu_emulation setting. | |||||
2018-11-29 | gl_rasterizer: Enable clip distances when set in register and in shader | ReinUsesLisp | 5 | -13/+37 | |
2018-11-29 | gl_rasterizer: Implement a framebuffer cache | ReinUsesLisp | 2 | -40/+82 | |
2018-11-29 | gl_shader_manager: Update pipeline when programs have changed | ReinUsesLisp | 1 | -4/+17 | |
2018-11-29 | gl_rasterizer_cache: Remove BlitSurface and replace with more accurate copy. | bunnei | 1 | -144/+1 | |
- BlitSurface with different texture targets is inherently broken. - When target is the same, we can just use FastCopySurface. - Fixes rendering issues with Breath of the Wild. | |||||
2018-11-29 | gl_shader_decompiler: Remove texture temporal in TLD4 | ReinUsesLisp | 1 | -3/+1 | |
2018-11-29 | gl_shader_decompiler: Flip negated if else statement | ReinUsesLisp | 1 | -3/+3 | |
2018-11-29 | gl_shader_decompiler: Use GLSL scope on instructions unrelated to textures | ReinUsesLisp | 1 | -35/+10 | |
2018-11-29 | gl_shader_decompiler: Move texture code generation into lambdas | ReinUsesLisp | 1 | -97/+78 | |
2018-11-29 | gl_shader_decompiler: Clean up texture instructions | ReinUsesLisp | 1 | -87/+56 | |
2018-11-29 | gl_shader_decompiler: Scope GLSL variables with a scoped object | ReinUsesLisp | 1 | -32/+72 | |
2018-11-29 | gl_rasterizer: Signal UNIMPLEMENTED when rt_separate_frag_data is not zero | ReinUsesLisp | 1 | -1/+1 | |
2018-11-29 | gl_rasterizer_cache: Use brackets for two-line single-expression blocks | ReinUsesLisp | 1 | -1/+2 | |
2018-11-29 | gl_rasterizer: Remove unused struct declarations | ReinUsesLisp | 1 | -14/+0 | |
2018-11-29 | gl_rasterizer: Remove extension booleans | ReinUsesLisp | 2 | -16/+0 | |
2018-11-28 | dma_pushbuffer: Optimize to avoid loop and copy on Push. | bunnei | 2 | -5/+17 | |
2018-11-28 | gpu: Move command list profiling to DmaPusher::DispatchCalls. | bunnei | 2 | -5/+5 | |
2018-11-27 | gl_shader_decompiler: Fixup clip distance index | ReinUsesLisp | 1 | -1/+1 | |
2018-11-27 | gl_rasterizer: Fixup for #1723. | Markus Wick | 1 | -1/+1 | |
On invalidating the streaming buffer, we need to reupload all vertex buffers. But we don't need to reconfigure the vertex format. This was a (silly) mistake in #1723. Thanks to Rodrigo for discovering the issue. Fun fact: as configuring the vertex format also invalidates the vertex buffer, this mistake had no effect on the behavior. | |||||
2018-11-27 | gpu: Rewrite GPU command list processing with DmaPusher class. | bunnei | 17 | -105/+343 | |
- More accurate impl., fixes Undertale (among other games). | |||||
2018-11-27 | remove viewport_transform_enabled as it seems to be inactive when valid transforms are used. | Rodolfo Bogado | 1 | -12/+5 | |
2018-11-27 | morton: Fixup compiler warning | ReinUsesLisp | 1 | -1/+2 | |
2018-11-27 | Implement depth clamp | Rodolfo Bogado | 5 | -10/+58 | |
2018-11-27 | Add support for Clip Distance enabled register | Rodolfo Bogado | 3 | -3/+26 | |
2018-11-27 | GPU States: Implement Polygon Offset. This is used in SMO all the time. (#1784) | Marcos | 5 | -5/+107 | |
* GPU States: Implement Polygon Offset. This is used in SMO all the time. * Clang Format fixes. * Initialize polygon_offset in the constructor. | |||||
2018-11-26 | Implemented Tile Width Spacing | FernandoS27 | 8 | -36/+55 | |
2018-11-25 | Limit the amount of viewports tested for state changes only to the usable ones | Rodolfo Bogado | 1 | -2/+10 | |
2018-11-25 | gl_shader_decompiler: Implement S2R's Y_DIRECTION | ReinUsesLisp | 5 | -16/+26 | |
2018-11-25 | morton: Style changes | ReinUsesLisp | 1 | -12/+12 | |
2018-11-25 | video_core: Move morton functions to their own file | ReinUsesLisp | 6 | -345/+391 | |
2018-11-24 | Fix Texture Overlapping | FernandoS27 | 1 | -43/+70 | |
2018-11-24 | Implemented BRA CC conditional and FSET CC Setting | FernandoS27 | 1 | -4/+14 | |
2018-11-24 | Add support for viewport_transform_enable register | Rodolfo Bogado | 2 | -6/+22 | |
2018-11-24 | Add support for clear_flags register | Rodolfo Bogado | 5 | -28/+95 | |
2018-11-24 | Fix TEXS Instruction encodings | FernandoS27 | 1 | -22/+48 | |
2018-11-24 | Fix one encoding in TEX Instruction | FernandoS27 | 1 | -3/+3 | |
2018-11-24 | Corrected inputs indexing in TEX instruction | FernandoS27 | 1 | -66/+85 | |
2018-11-23 | memory_manager: Do not allow 0 to be a valid GPUVAddr. | bunnei | 2 | -1/+9 | |
- Fixes a bug with Undertale using 0 for a render target. | |||||
2018-11-23 | Added predicate comparison LessEqualWithNan (#1736) | Hexagon12 | 2 | -5/+13 | |
* Added predicate comparison LessEqualWithNan * oops * Clang fix | |||||
2018-11-23 | gl_shader_decompiler: Implement clip distances | ReinUsesLisp | 3 | -21/+58 | |
2018-11-22 | gl_shader_decompiler: Add a message for unimplemented cc generation | ReinUsesLisp | 1 | -23/+46 | |
2018-11-22 | macro_interpreter: Implement AddWithCarry and SubtractWithBorrow. | bunnei | 2 | -8/+25 | |
- Used by Undertale. | |||||
2018-11-22 | maxwell_3d: Implement alternate blend equations. | bunnei | 2 | -0/+12 | |
- Used by Undertale. | |||||
2018-11-22 | gl_shader_decompiler: Rename internal flag strings | ReinUsesLisp | 1 | -15/+20 | |
2018-11-22 | gl_shader_decompiler: Rename control codes to condition codes | ReinUsesLisp | 2 | -67/+50 | |
2018-11-22 | gl_shader_decompiler: Fix register overwriting on texture calls | ReinUsesLisp | 1 | -60/+78 | |
2018-11-21 | Properly Implemented TXQ Instruction | FernandoS27 | 1 | -2/+12 | |
2018-11-21 | gl_shader_decompiler: Implement BFI_IMM_R | ReinUsesLisp | 2 | -0/+23 | |
2018-11-21 | Removed pre 4.3 ARB extensions | FernandoS27 | 5 | -20/+13 | |
2018-11-21 | Use default values for unknown framebuffer pixel format | FernandoS27 | 2 | -0/+8 | |
2018-11-21 | gl_shader_decompiler: Implement R2P_IMM | ReinUsesLisp | 2 | -0/+42 | |
2018-11-21 | gl_shader_decompiler: Remove UNREACHABLE when setting RZ | ReinUsesLisp | 1 | -2/+1 | |
2018-11-21 | gl_shader_decompiler: Use UNIMPLEMENTED instead of LOG+UNREACHABLE when applicable | ReinUsesLisp | 1 | -371/+258 | |
2018-11-21 | maxwell_3d: Initialize rasterizer color mask registers as enabled. | bunnei | 1 | -0/+9 | |
- Fixes rendering regression with Sonic Mania. | |||||
2018-11-20 | shader_cache: Only lock covered instructions. | Markus Wick | 4 | -8/+24 | |
2018-11-20 | Implemented Fast Layered Copy | FernandoS27 | 2 | -2/+30 | |
2018-11-19 | Eliminated unnecessary memory allocation and copy (#1702) | Frederic L | 3 | -9/+20 | |
2018-11-19 | gl_rasterizer: Remove default clip distance | ReinUsesLisp | 1 | -2/+0 | |
2018-11-18 | drop support for non separate alpha as it seems to cause issues in some games | Rodolfo Bogado | 3 | -61/+35 | |
2018-11-17 | fix sampler configuration, thanks to Marcos for his investigation | Rodolfo Bogado | 3 | -19/+57 | |
2018-11-17 | small type fix | Rodolfo Bogado | 1 | -6/+6 | |
2018-11-17 | small fix for alphaToOne bit location | Rodolfo Bogado | 1 | -2/+2 | |
2018-11-17 | fix for gcc compilation | Rodolfo Bogado | 1 | -60/+61 | |
2018-11-17 | add AlphaToCoverage and AlphaToOne | Rodolfo Bogado | 5 | -1/+39 | |
2018-11-17 | add support for fragment_color_clamp | Rodolfo Bogado | 5 | -1/+24 | |
2018-11-17 | add missing MirrorOnceBorder support where supported | Rodolfo Bogado | 1 | -0/+6 | |
2018-11-17 | set border color not depending on the wrap mode | Rodolfo Bogado | 1 | -9/+9 | |
only enable color mask for the first framebuffer if independent blending is disabled | |||||
2018-11-17 | set default value for point size register | Rodolfo Bogado | 2 | -5/+4 | |
2018-11-17 | fix viewport and scissor behavior | Rodolfo Bogado | 6 | -64/+89 | |
2018-11-17 | gl_rasterizer: Skip VB upload if the state is clean. | Markus Wick | 9 | -6/+60 | |
2018-11-17 | textures/decoders: Replace magic numbers | Frederic Laing | 1 | -37/+33 | |
2018-11-15 | textures/decoders: Minor cleanup | Frederic Laing | 1 | -16/+16 | |
2018-11-15 | gl_rasterizer_cache: Minor cleanup | Frederic Laing | 1 | -3/+3 | |
2018-11-13 | video_core/renderer_base: Remove GL include from the renderer base class files | Lioncash | 1 | -1/+0 | |
Keeps the base class source files implementation-agnostic. | |||||
2018-11-13 | gl_rasterizer: Minor cleanup | Frederic L | 1 | -4/+2 | |
Minor code cleanup from unaddressed feedback in #1654 | |||||
2018-11-13 | gl_state: Amend compilation warnings | Lioncash | 2 | -3/+4 | |
Makes float -> integral conversions explicit via casts and also silences a sign conversion warning. | |||||
2018-11-13 | Implement ASTC_2D_10X8 & ASTC_2D_10X8_SRGB (#1666) | greggameplayer | 4 | -71/+101 | |
* Implement ASTC_2D_10X8 & ASTC_2D_10X8_SRGB ( needed by Mario+Rabbids Kingdom Battle ) * Small placement correction | |||||
2018-11-11 | Use core extensions when available to set max anisotropic filtering level | Rodolfo Bogado | 1 | -2/+7 | |
2018-11-11 | Improve state management by splitting some of the states into separate functions to avoid a full apply overhead | Rodolfo Bogado | 6 | -39/+40 | |
2018-11-11 | Try to fix problems with stencil test in some games, relax translation to opengl enums to avoid crashing and only generate logs of the errors. | Rodolfo Bogado | 4 | -37/+61 | |
2018-11-11 | set sampler max lod, min lod, lod bias and max anisotropy | Rodolfo Bogado | 3 | -13/+33 | |
2018-11-11 | Improved GPU Caches lookup Speed | FernandoS27 | 1 | -18/+17 | |
2018-11-10 | gl_shader_decompiler: Guard out of bound geometry shader input reads | ReinUsesLisp | 4 | -15/+24 | |
Geometry shaders follow a pattern that results in out of bound reads. This pattern is: - VSETP to predicate - Use that predicate to conditionally set a register to a big number - Use the register to access geometry shaders At the time of writing this commit I don't know what the intent of this number is. Some drivers argue about these out of bound reads. To avoid this issue, input reads are guarded, limiting reads to the highest possible vertex input of the current topology (e.g. points to 1 and triangles to 3). | |||||
2018-11-08 | gl_rasterizer_cache: Remove unnecessary memory allocation and copy in CopySurface | Frederic Laing | 1 | -10/+7 | |
2018-11-08 | gl_rasterizer: Fix compiler warnings | Frederic Laing | 1 | -2/+2 | |
2018-11-08 | rasterizer_cache: Remove reliance on the System singleton | Lioncash | 9 | -10/+25 | |
Rather than have a transparent dependency, we can make it explicit in the interface. This also gets rid of the need to put the core include in a header. | |||||
2018-11-08 | rasterizer_cache: Add missing virtual destructor to RasterizerCacheObject | Lioncash | 3 | -0/+10 | |
Ensures that destruction will always do the right thing in any context. | |||||
2018-11-08 | gl_resource_manager: Amend clang-format discrepancies | Lioncash | 1 | -4/+2 | |
Fixes the buildbot. | |||||
2018-11-08 | Correct issue where texturelod could not be applied to 2darrayshadow | FernandoS27 | 1 | -1/+5 | |
2018-11-07 | Implement 3 coordinate array in TEXS instruction | FernandoS27 | 1 | -6/+6 | |
2018-11-06 | gl_rasterizer: Skip VAO binding if the state is clean. | Markus Wick | 3 | -2/+21 | |
2018-11-06 | gl_rasterizer: Split VAO and VB setup functions. | Markus Wick | 2 | -5/+16 | |
2018-11-06 | gl_rasterizer_cache: Add profiles for Copy and Blit. | Markus Wick | 1 | -2/+6 | |
They were missed, and Copy is very high in profile here. It doesn't block the GPU, but it stalls the driver thread. So with our bad GL instructions, this might block quite a while. | |||||
2018-11-06 | gl_resource_manager: Profile creation and deletion. | Markus Wick | 1 | -0/+42 | |
2018-11-06 | gl_stream_buffer: Profile orphaning of stream buffer. | Markus Wick | 1 | -0/+5 | |
This serializes to the driver thread, so it may block for a while. If it is in the benchmark, we will notice if it happens too often. | |||||
2018-11-06 | gl_resource_manager: Split implementations in .cpp file. | Markus Wick | 5 | -114/+167 | |
Those implementations are quite costly, so there is no need to inline them into the caller. Resource deletion is often a performance bug, so this way we can add breakpoints to them. | |||||
2018-11-05 | Add support to color mask to avoid issues in blending caused by wrong values in the alpha channel in some render targets. | Rodolfo Bogado | 5 | -25/+79 | |
2018-11-05 | Implement multi-target viewports and blending | Rodolfo Bogado | 6 | -128/+259 | |
2018-11-02 | correct syntax | greggameplayer | 1 | -4/+3 | |
2018-11-02 | Fix ASTC Decompressor to support depth parameter | FernandoS27 | 6 | -62/+128 | |
2018-11-01 | memory_manager: Do not MapBufferEx over already in use memory. | bunnei | 2 | -31/+52 | |
- This fixes rendering when changing areas in Super Mario Odyssey. | |||||
2018-11-01 | Fix ASTC formats | FernandoS27 | 3 | -11/+20 | |
2018-11-01 | Implemented ASTC 5x5 | FernandoS27 | 1 | -1/+5 | |
2018-11-01 | Implement Cube Arrays | FernandoS27 | 4 | -0/+20 | |
2018-11-01 | maxwell_3d: Restructure macro upload to use a single macro code memory. | bunnei | 4 | -27/+55 | |
- Fixes an issue where macros could be skipped. - Fixes rendering of distant objects in Super Mario Odyssey. | |||||
2018-10-31 | Implement SurfaceTarget Texture2DArray | greggameplayer | 1 | -0/+1 | |
( needed by Mario+Rabbids Kingdom Battle ) | |||||
2018-10-31 | Improve OpenGL state handling | Rodolfo Bogado | 3 | -105/+158 | |
2018-10-30 | video_core: Move surface declarations out of gl_rasterizer_cache | ReinUsesLisp | 6 | -898/+954 | |
2018-10-30 | Assert Control Codes Generation | FernandoS27 | 2 | -1/+103 | |
2018-10-30 | global: Use std::optional instead of boost::optional (#1578) | Frederic L | 17 | -97/+107 | |
* get rid of boost::optional * Remove optional references * Use std::reference_wrapper for optional references * Fix clang format * Fix clang format part 2 * Addressed feedback * Fix clang format and MacOS build | |||||
2018-10-29 | video_core: Move OpenGL specific utils to its renderer | ReinUsesLisp | 6 | -30/+61 | |
2018-10-29 | renderer_opengl: Correct bpp value for ASTC_2D_8X5_SRGB | Rodolfo Bogado | 1 | -1/+1 | |
2018-10-29 | Assert Control Flow Instructions using Control Codes | FernandoS27 | 2 | -3/+28 | |
2018-10-29 | Fixed black textures, pixelation and we no longer require to auto-generate mipmaps | FernandoS27 | 1 | -14/+2 | |
2018-10-29 | Fixed mipmap block autosizing algorithm | FernandoS27 | 3 | -13/+25 | |
2018-10-29 | Fixed Invalid Image size and Mipmap calculation | FernandoS27 | 1 | -4/+7 | |
2018-10-29 | Fixed Block Resizing algorithm and Clang Format | FernandoS27 | 3 | -12/+19 | |
2018-10-29 | Implement Mip Filter | FernandoS27 | 4 | -10/+33 | |
2018-10-29 | Zero out memory region of recreated surface before flushing | FernandoS27 | 1 | -0/+2 | |
2018-10-28 | Implement Mipmaps | FernandoS27 | 2 | -101/+211 | |
2018-10-28 | Enable alpha channel for DXT1 texture format | Michael | 1 | -2/+2 | |
2018-10-28 | Correct bpp value for ASTC_2D_8X5 | Tobias | 1 | -1/+1 | |
2018-10-28 | Refactor precise usage and add FMNMX, MUFU, FMUL32 and FADD332 | FernandoS27 | 2 | -74/+37 | |
2018-10-28 | Implement sRGB Support, including workarounds for nvidia driver issues and QT sRGB support | Rodolfo Bogado | 8 | -40/+197 | |
2018-10-28 | Improved Shader accuracy on Vertex and Geometry Shaders with FFMA, FMUL and FADD | FernandoS27 | 2 | -6/+58 | |
2018-10-27 | Implement Default Block Height for each format | FernandoS27 | 1 | -0/+62 | |
2018-10-27 | gl_rasterizer_cache: Fix compiler warning | Frederic Laing | 1 | -2/+2 | |
2018-10-26 | gl_rasterizer: Implement primitive restart. | bunnei | 5 | -1/+40 | |
2018-10-26 | maxwell_3d: Add code for initializing register defaults. | bunnei | 2 | -1/+21 | |
2018-10-26 | gl_rasterizer: Implement depth range. | bunnei | 4 | -13/+20 | |
2018-10-24 | Implemented LD_L and ST_L | FernandoS27 | 3 | -12/+112 | |
2018-10-24 | Implement Shader Local Memory | FernandoS27 | 1 | -0/+37 | |
2018-10-24 | decoders: Remove unused variable within SwizzledData() | Lioncash | 1 | -1/+0 | |
2018-10-24 | maxwell_3d: Remove unused variable within ProcessQueryGet() | Lioncash | 1 | -1/+0 | |
2018-10-23 | Implement PointSize | FernandoS27 | 3 | -5/+28 | |
2018-10-23 | Fixed Layered Textures Loading and Cubemaps | FernandoS27 | 3 | -72/+109 | |
2018-10-23 | gl_shader_decompiler: Implement VSETP | ReinUsesLisp | 2 | -0/+26 | |
2018-10-23 | gl_shader_decompiler: Abstract VMAD into a video subset | ReinUsesLisp | 2 | -75/+82 | |
2018-10-23 | Added Saturation to FMUL32I | FernandoS27 | 2 | -3/+8 | |
2018-10-22 | Assert that multiple render targets are not set while alpha testing | FernandoS27 | 3 | -3/+17 | |
2018-10-22 | Use standard UBO and fix/stylize the code | FernandoS27 | 8 | -91/+51 | |
2018-10-22 | Cache uniform locations and restructure the implementation | FernandoS27 | 3 | -33/+29 | |
2018-10-22 | Remove SyncAlphaTest and clang format | FernandoS27 | 4 | -8/+9 | |
2018-10-22 | Added Alpha Func | FernandoS27 | 2 | -3/+43 | |
2018-10-22 | Implemented Alpha Testing | FernandoS27 | 6 | -3/+59 | |
2018-10-22 | Fixed FSETP and FSET | FernandoS27 | 2 | -30/+12 | |
2018-10-22 | Fixed VAOs Float types only returning GL_FLOAT in cases that they had to return GL_HALF_FLOAT | FernandoS27 | 1 | -2/+14 | |
2018-10-20 | engines/maxwell_*: Use nested namespace specifiers where applicable | Lioncash | 3 | -12/+6 | |
These three source files are the only ones within the engines directory that don't use nested namespaces. We may as well change these over to keep things consistent. | |||||
2018-10-20 | maxwell_dma: Make variables const where applicable within HandleCopy() | Lioncash | 1 | -3/+3 | |
These are never modified, so we can make that assumption explicit. | |||||
2018-10-20 | maxwell_dma: Make FlushAndInvalidate's size parameter a u64 | Lioncash | 1 | -1/+1 | |
This prevents truncation warnings at the lambda's usage sites. | |||||
2018-10-20 | maxwell_dma: Remove unused variables in HandleCopy() | Lioncash | 1 | -3/+0 | |
These pointer variables are never used, so we can get rid of them. | |||||
2018-10-20 | gl_shader_decompiler: Allow std::move to function in SetPredicate | Lioncash | 1 | -1/+1 | |
If the variable being moved is const, then std::move will always perform a copy (since it can't actually move the data). | |||||
2018-10-20 | gl_shader_decompiler: Get rid of variable shadowing warnings | Lioncash | 1 | -2/+2 | |
A variable with the same name was previously declared in an outer scope. | |||||
2018-10-20 | gl_shader_decompiler: Fix a few comment typos | Lioncash | 1 | -3/+4 | |
2018-10-20 | gl_shader_decompiler: Move position varying declaration back to gl_shader_gen | ReinUsesLisp | 3 | -13/+9 | |
The intention of declaring them in gl_shader_decompiler was to be able to use blocks to implement geometry shaders. But that wasn't needed in the end and it caused issues when both vertex stages were being used, resulting in a redeclaration of "position". | |||||
2018-10-19 | GPU: Improved implementation of maxwell DMA (Subv). | bunnei | 3 | -17/+66 | |
2018-10-19 | decoders: Introduce functions for un/swizzling subrects. | bunnei | 2 | -0/+49 | |
2018-10-19 | GPU: Invalidate destination address of kepler_memory writes. | bunnei | 3 | -3/+17 | |
2018-10-19 | fermi_2d: Add support for more accurate surface copies. | bunnei | 2 | -3/+12 | |
2018-10-18 | gl_shader_decompiler: Implement PBK and BRK | ReinUsesLisp | 2 | -22/+43 | |
2018-10-18 | Clang format and other fixes | FernandoS27 | 1 | -16/+0 | |
2018-10-18 | Implement Reinterpret Surface, to accurately blit 3D textures | FernandoS27 | 1 | -2/+4 | |
2018-10-18 | Implement GetInRange in the Rasterizer Cache | FernandoS27 | 1 | -0/+16 | |
2018-10-18 | Implement 3D Textures | FernandoS27 | 4 | -1/+10 | |
2018-10-18 | gl_rasterizer_cache: Remove unnecessary block_depth=1 on Flush. | bunnei | 1 | -1/+0 | |
2018-10-18 | gl_rasterizer_cache: Remove unnecessary temporary buffer with unswizzle. | bunnei | 1 | -5/+2 | |
2018-10-16 | gl_rasterizer_cache: Use AccurateCopySurface for use_accurate_gpu_emulation. | bunnei | 2 | -2/+18 | |
2018-10-16 | config: Rename use_accurate_framebuffers -> use_accurate_gpu_emulation. | bunnei | 3 | -6/+6 | |
- This will be used as a catch-all for slow-but-accurate GPU emulation paths. | |||||
2018-10-16 | rasterizer_cache: Refactor to support in-order flushing. | bunnei | 6 | -63/+116 | |
2018-10-16 | gl_rasterizer_cache: Refactor to only call GetRegionEnd on surface creation. | bunnei | 2 | -16/+23 | |
2018-10-16 | gl_rasterizer_cache: Only flush when use_accurate_framebuffers is enabled. | bunnei | 2 | -2/+13 | |
2018-10-16 | gl_rasterizer_cache: Separate guest and host surface size management. | bunnei | 2 | -92/+94 | |
2018-10-16 | gl_rasterizer_cache: Rename GetGLBytesPerPixel to GetBytesPerPixel. | bunnei | 2 | -17/+18 | |
- This does not really have anything to do with OpenGL. | |||||
2018-10-16 | gl_rasterizer_cache: Remove unused FlushSurface method. | bunnei | 2 | -7/+0 | |
2018-10-16 | gl_rasterizer: Implement flushing. | bunnei | 1 | -1/+25 | |
2018-10-16 | gl_rasterizer_cache: Remove usage of Memory::Read/Write functions. | bunnei | 1 | -13/+8 | |
- These cannot be used within the cache, as they change cache state. | |||||
2018-10-16 | gl_rasterizer_cache: Clamp cached surface size to mapped GPU region size. | bunnei | 2 | -19/+37 | |
2018-10-16 | memory_manager: Add a method for querying the end of a mapped GPU region. | bunnei | 2 | -0/+11 | |
2018-10-16 | rasterizer_cache: Reintroduce method for flushing. | bunnei | 3 | -0/+23 | |
2018-10-16 | gl_rasterizer_cache: Reintroduce code for handling swizzle and flush to guest RAM. | bunnei | 2 | -28/+119 | |
2018-10-15 | shader_bytecode: Add Control Code enum 0xf | ReinUsesLisp | 1 | -1/+1 | |
Control Code 0xf means to unconditionally execute the instruction. This value is passed to most BRA, EXIT and SYNC instructions (among others) but this may not always be the case. | |||||
2018-10-15 | gl_shader_decompiler: Fixup style inconsistencies | ReinUsesLisp | 1 | -5/+3 | |
2018-10-15 | gl_rasterizer: Silence implicit cast warning in glBindBufferRange | ReinUsesLisp | 1 | -1/+2 | |
2018-10-15 | gl_shader_decompiler: Implement HSET2_R | ReinUsesLisp | 2 | -0/+62 | |
2018-10-15 | gl_shader_decompiler: Implement HSETP2_R | ReinUsesLisp | 2 | -0/+65 | |
2018-10-15 | gl_shader_decompiler: Implement HFMA2 instructions | ReinUsesLisp | 2 | -0/+85 | |
2018-10-15 | gl_shader_decompiler: Implement HADD2_IMM and HMUL2_IMM | ReinUsesLisp | 2 | -0/+73 | |
2018-10-15 | gl_shader_decompiler: Implement non-immediate HADD2 and HMUL2 instructions | ReinUsesLisp | 2 | -0/+75 | |
2018-10-15 | gl_shader_decompiler: Setup base for half float unpacking and setting | ReinUsesLisp | 2 | -0/+98 | |
2018-10-14 | Implement Arrays on Tex Instruction | FernandoS27 | 1 | -14/+55 | |
2018-10-14 | Fix TLDS | FernandoS27 | 1 | -1/+5 | |
2018-10-14 | Shorten the implementation of 3D swizzle to only 3 functions | FernandoS27 | 1 | -70/+27 | |
2018-10-13 | Fix a Crash on Zelda BotW and Splatoon 2, and simplified LoadGLBuffer | FernandoS27 | 2 | -19/+2 | |
2018-10-13 | Propagate depth and depth_block on modules using decoders | FernandoS27 | 7 | -52/+64 | |
2018-10-13 | Remove old Swizzle algorithms and use 3d Swizzle | FernandoS27 | 1 | -93/+69 | |
2018-10-13 | Implement Precise 3D Swizzle | FernandoS27 | 1 | -3/+71 | |
2018-10-13 | Implement Fast 3D Swizzle | FernandoS27 | 1 | -2/+74 | |
2018-10-13 | Added ASTC 5x4; 8x5 | Hexagon12 | 3 | -6/+32 | |
2018-10-12 | Implemented helper function to correctly calculate a texture's size | FernandoS27 | 2 | -0/+22 | |
2018-10-11 | gl_shader_decompiler: Implement VMAD | ReinUsesLisp | 2 | -0/+118 | |
2018-10-10 | Add memory Layout to Render Targets and Depth Buffers | FernandoS27 | 3 | -21/+33 | |
2018-10-10 | Fixed block height settings for RenderTargets and Depth Buffers, and added block width and block depth | FernandoS27 | 5 | -12/+63 | |
2018-10-09 | gl_shader_decompiler: Remove unused variables in TMML's implementation | Lioncash | 1 | -7/+3 | |
Given "y" isn't always used, but "x" is, we can rearrange this to avoid unused variable warnings by changing the names of op_a and op_b | |||||
2018-10-09 | Implement Scissor Test | FernandoS27 | 1 | -4/+9 | |
2018-10-09 | Assert Scissor tests | FernandoS27 | 3 | -1/+31 | |
2018-10-07 | gl_shader_decompiler: Move position varying location from 15 to 0 and apply an offset | ReinUsesLisp | 1 | -6/+10 | |
2018-10-07 | gl_shader_decompiler: Implement geometry shaders | ReinUsesLisp | 10 | -107/+522 | |
2018-10-07 | video_core: Allow LabelGLObject to use extra info on any object | ReinUsesLisp | 1 | -10/+14 | |
2018-10-07 | gl_rasterizer: Fixup undefined behaviour in SetupDraw | ReinUsesLisp | 1 | -0/+1 | |
2018-10-06 | Implemented Depth Compare and Shadow Samplers | FernandoS27 | 6 | -65/+224 | |
2018-10-06 | fermi_2d: Implement simple copies with AccelerateSurfaceCopy. | bunnei | 3 | -24/+36 | |
2018-10-06 | gl_rasterizer: Add rasterizer cache code to handle accelerated fermi copies. | bunnei | 5 | -16/+60 | |
2018-10-06 | gl_rasterizer_cache: Implement a simpler surface copy using glCopyImageSubData. | bunnei | 1 | -0/+21 | |
2018-10-04 | gl_rasterizer: Implement quads topology | ReinUsesLisp | 8 | -46/+236 | |
2018-10-03 | Implemented Texture Processing Modes in TEXS and TLDS | FernandoS27 | 1 | -5/+42 | |
2018-10-01 | gl_rasterizer: Fixup unassigned point sizes | ReinUsesLisp | 1 | -1/+4 | |
2018-09-30 | gl_rasterizer_cache: Fixes to how we do render to cubemap. | bunnei | 2 | -32/+5 | |
- Fixes issues with Splatoon 2. | |||||
2018-09-30 | gl_rasterizer_cache: Add check for array rendering to cubemap texture. | bunnei | 1 | -0/+8 | |
2018-09-30 | gl_rasterizer_cache: Implement render to cubemap. | bunnei | 3 | -119/+218 | |
2018-09-30 | gl_shader_decompiler: TEXS: Implement TextureType::TextureCube. | bunnei | 1 | -0/+8 | |
2018-09-30 | gl_rasterizer_cache: Add support for SurfaceTarget::TextureCubemap. | bunnei | 2 | -1/+36 | |
2018-09-30 | gl_rasterizer_cache: Implement LoadGLBuffer for Texture2DArray. | bunnei | 1 | -0/+8 | |
2018-09-30 | gl_rasterizer_cache: Update BlitTextures to support non-Texture2D ColorTexture surfaces. | bunnei | 1 | -23/+88 | |
2018-09-30 | gl_rasterizer_cache: Track texture target and depth in the cache. | bunnei | 1 | -2/+3 | |
2018-09-30 | gl_rasterizer_cache: Workaround for Texture2D -> Texture2DArray scenario. | bunnei | 3 | -6/+21 | |
2018-09-30 | gl_rasterizer_cache: Keep track of surface 2D size separately from total size. | bunnei | 2 | -32/+46 | |
2018-09-30 | Fix trailing whitespace | raven02 | 1 | -1/+4 | |
2018-09-28 | video_core: Implement point_size and add point state sync | ReinUsesLisp | 5 | -1/+27 | |
2018-09-28 | gl_state: Pack sampler bindings into a single ARB_multi_bind | ReinUsesLisp | 5 | -8/+25 | |
2018-09-26 | video_core: Add asserts for CS, TFB and alpha testing | ReinUsesLisp | 5 | -3/+92 | |
Add asserts for compute shader dispatching, transform feedback being enabled and alpha testing. These have in common that they'll probably break rendering without logging. | |||||
2018-09-23 | Added glObjectLabels for renderdoc for textures and shader programs (#1384) | David | 4 | -0/+48 | |
* Added glObjectLabels for renderdoc for textures and shader programs * Changed hardcoded "Texture" name to reflect the texture type instead * Removed string initialize | |||||
2018-09-23 | correct BC6H | greggameplayer | 1 | -2/+2 | |
2018-09-22 | gl_state: Remove unused type alias | Lioncash | 2 | -4/+1 | |
This isn't used anywhere within the header, so we can remove it, along with the include that was previously necessary. This also uncovers an indirect include in the cpp file for the assertion macros. | |||||
2018-09-21 | shader_bytecode: Lay out the Ipa-related enums better | Lioncash | 1 | -2/+12 | |
This is more consistent with the surrounding enums. | |||||
2018-09-21 | shader_bytecode: Make operator== and operator!= of IpaMode const qualified | Lioncash | 1 | -6/+7 | |
These don't affect the state of the struct and can be const member functions. | |||||
2018-09-21 | Reverse stride align restriction on FastSwizzle due to lost performance | FernandoS27 | 1 | -3/+2 | |
2018-09-21 | Join both Swizzle methods within one interface function | FernandoS27 | 1 | -11/+19 | |
2018-09-21 | Standardized Legacy Swizzle to look like FastSwizzle and use a Swizzling Table instead | FernandoS27 | 1 | -42/+38 | |
2018-09-21 | Remove same output bpp restriction on FastSwizzle | FernandoS27 | 1 | -4/+5 | |
2018-09-21 | Improved Legacy Swizzler to be better documented and work better | FernandoS27 | 1 | -15/+21 | |
2018-09-21 | gl_stream_buffer: Fix use of bitwise OR instead of logical OR in Map() | Lioncash | 1 | -1/+1 | |
This was very likely intended to be a logical OR based off the conditioning and testing of inversion in one case. Even if this was intentional, this is the kind of non-obvious thing one should be clarifying with a comment. | |||||
2018-09-21 | RasterizerGL: Use the correct framebuffer when clearing via the CLEAR_BUFFERS register. | Subv | 1 | -1/+1 | |
Previously we were clearing the default backbuffer framebuffer. Found thanks to a Piglit test :) | |||||
2018-09-21 | Improved fast swizzle and removed restrictions to it | FernandoS27 | 1 | -7/+12 | |
2018-09-19 | gl_rasterizer: Fix StartAddress handling with indexed draw calls. | Markus Wick | 1 | -6/+7 | |
We uploaded the wrong data before. The offset on the host GPU pointer may work for the first vertices, but the last ones run out of bounds. Let's just offset the upload instead. | |||||
2018-09-18 | Implemented Internal Flags | FernandoS27 | 1 | -13/+35 | |
2018-09-18 | gl_shader_decompiler: Avoid truncation warnings within LD_A and ST_A code | Lioncash | 1 | -4/+4 | |
These are internally stored as u64 values, so using u32 here causes truncation warnings. Instead, we can just use u64 and preserve the bit width. | |||||
2018-09-17 | Implemented I2I.CC on the NEU control code, used by SMO | FernandoS27 | 2 | -14/+18 | |
2018-09-17 | Implemented CSETP | FernandoS27 | 2 | -14/+49 | |
2018-09-17 | Implemented Control Codes | FernandoS27 | 2 | -0/+51 | |
2018-09-17 | Added asserts for texture misc modes to texture instructions | FernandoS27 | 1 | -2/+45 | |
2018-09-17 | Added texture misc modes to texture instructions | FernandoS27 | 1 | -1/+147 | |
2018-09-17 | Add 1D sampler for TLDS - TexelFetch (Mario Rabbids) | raven02 | 1 | -7/+12 | |
2018-09-16 | Implement ASTC_2D_8X8 (Bayonetta 2) | raven02 | 3 | -6/+20 | |
2018-09-15 | Implement RenderTargetFormat::BGR5A1_UNORM (Pokken Tournament DX) | raven02 | 2 | -0/+4 | |
2018-09-15 | Shaders: Implemented multiple-word loads and stores to and from attribute memory. | Subv | 2 | -7/+58 | |
This seems to be an optimization performed by nouveau. | |||||
2018-09-15 | Port #4182 from Citra: "Prefix all size_t with std::" | fearlessTobi | 20 | -133/+138 | |
2018-09-14 | Optimized Texture Swizzling | FernandoS27 | 1 | -2/+49 | |
2018-09-14 | gl_shader_decompiler: Get rid of variable shadowing within LEA instructions | Lioncash | 1 | -2/+0 | |
These variables are already defined within an outer scope. | |||||
2018-09-13 | Use ARB_multi_bind for uniform buffers (#1287) | ReinUsesLisp | 2 | -3/+23 | |
* gl_rasterizer: use ARB_multi_bind for uniform buffers * address feedback | |||||
2018-09-13 | gl_rasterizer_cache: B5G6R5U should use GL_RGB8 as an internal format. | bunnei | 1 | -1/+1 | |
- Fixes a regression with Sonic Mania with ARB_texture_storage. | |||||
2018-09-12 | GPU: Basic implementation of the Kepler Inline Memory engine (p2mf). | Subv | 6 | -0/+146 | |
This engine writes data from a FIFO register into the configured address. | |||||
2018-09-12 | Implemented Texture Processing Modes | FernandoS27 | 2 | -1/+43 | |
2018-09-12 | gl_rasterizer_cache: Always blit on recreate, regardless of format. | bunnei | 1 | -6/+10 | |
- Fixes several rendering issues with Super Mario Odyssey. | |||||
2018-09-12 | gl_shader_cache: Remove cache_width/cache_height. | bunnei | 2 | -12/+2 | |
- This was once an optimization, but we no longer need it with the cache reserve. - This is also inaccurate. | |||||
2018-09-11 | gl_rasterizer: Use ARB_texture_storage. | Markus Wick | 1 | -11/+8 | |
It allows us to use texture views and it reduces the overhead within the GPU driver. It prevents us from reallocating the texture, but we don't do so anyway. In the end, it is the new way to allocate textures, so there is no need to use the old way. | |||||
2018-09-11 | Implemented LEA and PSET | FernandoS27 | 1 | -0/+91 | |
2018-09-11 | Implemented encodings for LEA and PSET | FernandoS27 | 1 | -0/+64 | |
2018-09-11 | Replace old FragmentHeader for the new Header | FernandoS27 | 2 | -31/+18 | |
2018-09-11 | Implemented (Partially) Shader Header | FernandoS27 | 3 | -2/+102 | |
2018-09-11 | Fixed renderdoc input/output textures not working due to render targets | David Marcec | 2 | -2/+9 | |
2018-09-10 | video_core: Refactor command_processor. | Markus Wick | 2 | -44/+42 | |
Inline the WriteReg helper as it is called ~20k times per frame. | |||||
2018-09-10 | video_core: Move command buffer loop. | Markus Wick | 3 | -46/+72 | |
This moves the hot loop into video_core. This refactoring shall reduce the CPU overhead of calling ProcessCommandList. | |||||
2018-09-10 | rasterizer: Drop unused handler. | Markus Wick | 4 | -8/+0 | |
This virtual function is called in a very hot spot, and it does nothing. If this kind of feature is required, please be more specific and add callbacks in the switch statement within Maxwell3D::WriteReg. There is no point in having another switch statement within the rasterizer. | |||||
2018-09-10 | gl_rasterizer_cache: Only use depth for applicable texture formats. | bunnei | 1 | -6/+22 | |
- Fixes an issue with Octopath Traveler leaving stale data here. | |||||
2018-09-10 | gl_rasterizer: Implement clear for non-zero render targets. | bunnei | 2 | -50/+66 | |
- Several misc. changes to ConfigureFramebuffers in support of this. | |||||
2018-09-10 | gl_rasterizer_cache: Implement RenderTargetFormat::BGRA8_SRGB. | bunnei | 3 | -0/+4 | |
- Used by Octopath Traveler (with multiple render targets). | |||||
2018-09-10 | gl_rasterizer: Implement multiple color attachments. | bunnei | 5 | -132/+95 | |
2018-09-10 | Implemented TMML | FernandoS27 | 2 | -5/+67 | |
2018-09-09 | Implemented TXQ dimension query type, used by SMO. | FernandoS27 | 2 | -1/+36 | |
2018-09-09 | video_core: fixed arithmetic overflow warnings & improved code style | Patrick Elsässer | 5 | -89/+101 | |
- Fixed all warnings, for renderer_opengl items, which were indicating a possible incorrect behavior from integral promotion rules and types larger than those in which arithmetic is typically performed. - Added const for variables where possible and meaningful. - Added constexpr where possible. | |||||
2018-09-09 | Port Citra #4047 & #4052: add change background color support | tech4me | 3 | -0/+8 | |
2018-09-09 | Change name of TEXQ to TXQ, in order to match NVIDIA's naming | FernandoS27 | 1 | -2/+2 | |
2018-09-08 | GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine. | Subv | 1 | -2/+10 | |
When not set, this tells the GPU to only use the X size when performing a DMA copy. This is only implemented for linear->linear and tiled->tiled copies. Conversion copies still retain the assert. This bit is unset by some games for various purposes, and by nouveau when copying the vertex buffers. | |||||
2018-09-08 | gl_rasterizer: Use baseInstance instead of moving the buffer points. | bunnei | 1 | -21/+25 | |
This hopefully helps our cache avoid redundantly uploading the vertex buffer. | |||||
2018-09-08 | video_core: Arithmetic overflow warning fix for gl_rasterizer (#1262) | Patrick Elsässer | 1 | -12/+14 | |
* video_core: Arithmetic overflow fix for gl_rasterizer - Fixed warnings, which were indicating incorrect behavior from integral promotion rules and types larger than those in which arithmetic is typically performed. - Added const for variables where possible and meaningful. * Changed the casts from C to C++ style Changed the C-style casts to C++ casts as proposed. Took also care about signed / unsigned behaviour. | |||||
2018-09-08 | gl_rasterizer_cache: Improve accuracy of RecreateSurface for non-2D textures. | bunnei | 2 | -27/+45 | |
2018-09-08 | maxwell_3d: Remove assert that no longer applies. | bunnei | 1 | -4/+0 | |
2018-09-08 | gl_rasterizer_cache: Partially implement several non-2D texture types. | bunnei | 1 | -30/+111 | |
2018-09-08 | gl_shader_decompiler: Partially implement several non-2D texture types (Subv). | bunnei | 2 | -32/+143 | |
2018-09-08 | gl_rasterizer: Implement texture wrap mode p. | bunnei | 2 | -2/+8 | |
2018-09-08 | gl_rasterizer_cache: Track texture depth. | bunnei | 3 | -4/+15 | |
2018-09-08 | gl_rasterizer_cache: Remove impl. of FlushGLBuffer. | bunnei | 1 | -34/+1 | |
- Will not work for non-2d textures, and was not used anyways. | |||||
2018-09-08 | gl_rasterizer_cache: Keep track of texture type per surface. | bunnei | 3 | -32/+84 | |
2018-09-08 | gl_rasterizer_cache: Remove unused DownloadGLTexture. | bunnei | 2 | -51/+0 | |
2018-09-08 | gl_state: Keep track of texture target. | bunnei | 5 | -26/+28 | |
2018-09-06 | gl_rasterizer: Call state.Apply only once on SetupShaders. | bunnei | 1 | -4/+2 | |
2018-09-06 | gl_shader_decompiler: Implement saturate mode for IPA. | bunnei | 1 | -1/+5 | |
2018-09-06 | gl_buffer_cache: Default initialize member variables | Lioncash | 1 | -3/+3 | |
Ensures that the cache always has a deterministic initial state. | |||||
2018-09-06 | gl_buffer_cache: Make GetHandle() a const member function | Lioncash | 2 | -2/+2 | |
GetHandle() internally calls GetHandle() on the stream_buffer instance, which is a const member function, so this can be made const as well. | |||||
2018-09-06 | gl_buffer_cache: Remove unnecessary includes | Lioncash | 2 | -2/+4 | |
2018-09-06 | gl_buffer_cache: Make constructor explicit | Lioncash | 1 | -1/+1 | |
Implicit conversions during construction isn't desirable here. | |||||
2018-09-06 | video_core/CMakeLists: Add missing gl_buffer_cache.h | Lioncash | 1 | -0/+1 | |
Without this, the header file won't show up by default within IDEs such as Visual Studio. | |||||
2018-09-06 | gl_shader_gen: Initialize position. | Markus Wick | 1 | -0/+1 | |
IMO the old code is fine, but nvidia raises shader compiler warnings. Trivial fix though... | |||||
2018-09-06 | Implemented IPA Properly | FernandoS27 | 2 | -47/+98 | |
2018-09-05 | gl_rasterizer: Skip TODO log. | Markus Wick | 1 | -1/+1 | |
This is called ~3k times per frame in SMO ingame. My laptop spends ~3ms per frame on allocating and freeing this string. Let's just stop printing this kind of redundant information. | |||||
2018-09-05 | gl_rasterizer: Implement a VAO cache. | Markus Wick | 3 | -53/+60 | |
This patch caches VAO objects instead of re-emitting all pointers per draw call. Configuring these pointers is known to be a fast task, but it yields too many GL calls. So for better performance, just bind the VAO instead of 16 pointers. | |||||
2018-09-05 | renderer_opengl: Implement a buffer cache. | Markus Wick | 5 | -86/+182 | |
The idea of this cache is to avoid redundant uploads. So we are going to cache the uploaded buffers within the stream_buffer and just reuse the old pointers. The next step is to implement a VBO cache on GPU memory, but for now, I want to check the overhead of the cache management. Fetching the buffer over PCI-E should be quite fast. | |||||
2018-09-04 | gl_shader_cache: Use an u32 for the binding point cache. | Markus Wick | 4 | -15/+23 | |
The std::string generation with its malloc and free requirement was a noticeable overhead. Also switch to an ordered_map to avoid the std::hash call. As those maps usually have a size of two elements, the lookup time shall not matter. | |||||
2018-09-04 | command_processor: Use std::array for bound_engines. | Markus Wick | 2 | -4/+4 | |
subchannel is a 3 bit field. So there must not be more than 8 bound engines. And using a hashmap for up to 8 values is a bit overpowered. | |||||
2018-09-04 | Update microprofile scopes. | Markus Wick | 1 | -3/+11 | |
Blame the subsystems which deserve the blame :) The updated list is not complete, just the ones I've spotted while randomly sampling the stack trace. | |||||
2018-09-02 | gl_shader_decompiler: Use used_shaders member variable directly within GenerateDeclarations() | Lioncash | 1 | -1/+1 | |
Using the getter function intended for external code here makes an unnecessary copy of the already-accessible used_shaders vector. | |||||
2018-09-01 | Removed saturate assert | David Marcec | 2 | -2/+0 | |
Unneeded as we already implement it | |||||
2018-09-01 | Removed saturate assert | David Marcec | 2 | -2/+0 | |
Saturate already implemented | |||||
2018-09-01 | Changed tab5980_0 default from 0 -> 1 | David Marcec | 1 | -2/+2 | |
2018-09-01 | Added FMUL asserts | David Marcec | 2 | -0/+15 | |
2018-09-01 | Added FFMA asserts | David Marcec | 2 | -0/+11 | |
2018-09-01 | Added assert for TEXS nodep | David Marcec | 2 | -0/+3 | |
2018-09-01 | Added better asserts to IPA, Renamed IPA modes to match mesa | David Marcec | 2 | -6/+13 | |
IpaMode is changed to IpaInterpMode. IpaMode is supposed to be 2 bits, not 3. Added IpaSampleMode. Added Saturate. Renamed modes based on https://github.com/mesa3d/mesa/blob/d27c7918916cdc8092959124955f887592e37d72/src/gallium/drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp#L2530 | |||||
2018-09-01 | maxwell_3d: Use CoreTiming for query timestamp | Zach Hilman | 1 | -2/+3 | |
2018-08-31 | core/core: Replace includes with forward declarations where applicable | Lioncash | 3 | -4/+4 | |
The follow-up to e2457418dae19b889b2ad85255bb95d4cd0e4bff, which replaces most of the includes in the core header with forward declarations. This makes it so that if any of the headers the core header was previously including change, then no one will need to rebuild the bulk of the core, due to core.h being quite a prevalent inclusion. This should make turnaround for changes much faster for developers. | |||||
2018-08-31 | gl_rasterizer_cache: Use accurate framebuffer setting for accurate copies. | bunnei | 2 | -73/+54 | |
2018-08-31 | gl_rasterizer_cache: Also use reserve cache for RecreateSurface. | bunnei | 2 | -24/+18 | |
2018-08-31 | rasterizer_cache: Use boost::interval_map for a more accurate cache. | bunnei | 1 | -33/+45 | |
2018-08-31 | gl_renderer: Cache textures, framebuffers, and shaders based on CPU address. | bunnei | 8 | -100/+53 | |
2018-08-31 | gl_rasterizer: Fix issues with the rasterizer cache. | bunnei | 4 | -46/+57 | |
- Use a single cached page map. - Fix calculation of ending page. | |||||
2018-08-31 | Implement BC6H_UF16 & BC6H_SF16 (#1092) | greggameplayer | 3 | -31/+55 | |
* Implement BC6H_UF16 & BC6H_SF16 Required by ARMS * correct coding style * correct coding style part 2 | |||||
2018-08-31 | core: Make the main System class use the PImpl idiom | Lioncash | 1 | -3/+4 | |
core.h is kind of a massive header in terms of what it includes within itself. It includes VFS utilities, kernel headers, file_sys header, ARM-related headers, etc. This means that changing anything in the headers included by core.h essentially requires you to rebuild almost all of core. Instead, we can modify the System class to use the PImpl idiom, which allows us to move all of those headers to the cpp file and forward declare the bulk of the types that would otherwise be included, reducing compile times. This change specifically only performs the PImpl portion. | |||||
2018-08-31 | Report correct shader size. | Markus Wick | 1 | -1/+1 | |
Seems like this was an oversight with regard to 1fd979f50a9f4c21fa8cafba7268d959e3076924. That commit changed GLShader::ProgramCode to a std::vector, so sizeof is wrong. | |||||
2018-08-31 | Added predicate comparison GreaterEqualWithNan | Hexagon12 | 2 | -3/+4 | |
2018-08-31 | gl_shader_decompiler: Implement POPC (#1203) | Laku | 2 | -0/+19 | |
* Implement POPC * implement invert | |||||
2018-08-29 | Shaders: Implemented IADD3 | tech4me | 2 | -1/+84 | |
2018-08-29 | gl_shader_decompiler: Improve IPA for Pass mode with Position attribute. | bunnei | 2 | -1/+39 | |
2018-08-28 | gl_shader_cache: Remove unused program_code vector in GetShaderAddress() | Lioncash | 1 | -2/+1 | |
Given std::vector is a type with a non-trivial destructor, this variable cannot be optimized away by the compiler, even if unused. Because of that, something that was intended to be fairly lightweight, was actually allocating 32KB and deallocating it at the end of the function. | |||||
2018-08-28 | gpu: Make memory_manager private | Lioncash | 4 | -16/+30 | |
Makes the class interface consistent and provides accessors for obtaining a reference to the memory manager instance. Given we also return references, this makes our more flimsy uses of const apparent, given const doesn't propagate through pointers in the way one would typically expect. This makes our mutable state more apparent in some places. | |||||
2018-08-28 | gl_rasterizer: Remove unused variables | Lioncash | 1 | -2/+0 | |
2018-08-28 | renderer_opengl: Implement a new shader cache. | bunnei | 9 | -285/+250 | |
2018-08-28 | gl_rasterizer_cache: Update to use RasterizerCache base class. | bunnei | 3 | -132/+20 | |
2018-08-28 | video_core: Add RasterizerCache class for common cache management code. | bunnei | 2 | -0/+117 | |
2018-08-25 | debug_utils: Remove unused includes | Lioncash | 2 | -23/+0 | |
Quite a bit of these aren't necessary directly within the debug_utils header and can be removed or included where actually necessary. | |||||
2018-08-25 | debug_utils: Make BreakpointObserver class' constructor explicit | Lioncash | 1 | -1/+1 | |
Avoids implicit conversions. | |||||
2018-08-25 | debug_utils: Initialize active_breakpoint member of DebugContext | Lioncash | 1 | -2/+2 | |
Ensures that all class members are initialized. | |||||
2018-08-25 | maxwell3d: Move FinishedPrimitiveBatch event after AcceleratedDrawBatch() | Lioncash | 1 | -4/+4 | |
The start and finish events should likely not be right after one another like this, otherwise the batch will appear to complete immediately | |||||
2018-08-24 | fix SEL_IMM bitstring | Laku | 1 | -1/+1 | |
2018-08-24 | gl_rasterizer: Correct assertion condition in SyncLogicOpState() | Lioncash | 1 | -1/+2 | |
Previously the assert would always be hit, since it was the equivalent of: array == nullptr, which is never true. | |||||
2018-08-23 | Shaders: Added decodings for IADD3 instructions | tech4me | 1 | -0/+6 | |
2018-08-23 | gl_rasterizer_cache: Blit when possible on RecreateSurface. | bunnei | 1 | -5/+12 | |
2018-08-23 | gl_rasterizer_cache: Reserve surfaces that have already been created for later use. | bunnei | 2 | -3/+61 | |
2018-08-23 | gl_rasterizer_cache: Remove assert for RecreateSurface type. | bunnei | 1 | -1/+0 | |
2018-08-23 | gl_rasterizer_cache: Implement compressed texture copies. | bunnei | 1 | -8/+18 | |
2018-08-23 | gl_rasterizer: Implement stencil test. | bunnei | 3 | -4/+58 | |
- Used by Splatoon 2. | |||||
2018-08-23 | gl_rasterizer: Implement partial color clear and stencil clear. | bunnei | 1 | -12/+42 | |
2018-08-23 | maxwell_3d: Update to include additional stencil registers. | bunnei | 1 | -20/+50 | |
2018-08-23 | gl_state: Update to handle stencil front/back face separately. | bunnei | 2 | -33/+38 | |
2018-08-22 | gl_shader_gen: Make ShaderSetup's constructor explicit | Lioncash | 1 | -1/+1 | |
Prevents implicit conversions. | |||||
2018-08-22 | gl_shader_gen: Use a std::vector to represent program code instead of std::array | Lioncash | 2 | -11/+16 | |
While convenient as a std::array, it's also quite a large set of data as well (32KB). It being an array also means data cannot be std::moved. Any situation where the code is being set or relocated means that a full copy of that 32KB data must be done. If we use a std::vector we do need to allocate on the heap, however, it does allow us to std::move the data we have within the std::vector into another std::vector instance, eliminating the need to always copy the program data (as std::move in this case would just transfer the pointers and bare necessities over to the new vector instance). | |||||
2018-08-22 | more fixes | Laku | 1 | -6/+7 | |
2018-08-22 | fixes | Laku | 1 | -6/+12 | |
2018-08-22 | renderer_opengl: Namespace OpenGL code | Lioncash | 21 | -23/+70 | |
Namespaces all OpenGL code under the OpenGL namespace. Prevents polluting the global namespace and allows clear distinction between other renderers' code in the future. | |||||
2018-08-22 | remove debug logging | Laku | 1 | -2/+0 | |
2018-08-22 | implement lop3 | Laku | 2 | -0/+55 | |
2018-08-22 | maxwell_to_gl: Implement PrimitiveTopology::Lines | OatmealDome | 1 | -0/+2 | |
Used by Splatoon 2's debug menu. | |||||
2018-08-22 | Revert "Shader: Use the right sampler type in the TEX, TEXS and TLDS instructions." | bunnei | 2 | -153/+31 | |
- This reverts commit 3ef4b3d4b445960576f10d1ba6521580d03e3da8. - This commit had broken a lot of games. We really should do a full implementation of this in one change. | |||||
2018-08-21 | shader_bytecode: Parenthesize conditional expression within GetTextureType() | Lioncash | 1 | -1/+1 | |
Resolves a -Wlogical-op-parentheses warning. | |||||
2018-08-21 | renderer_opengl: Use LOG_DEBUG for GL_DEBUG_SEVERITY_NOTIFICATION and GL_DEBUG_SEVERITY_LOW logs | Lioncash | 1 | -1/+1 | |
LOG_TRACE is only enabled on debug builds which can be quite slow when trying to debug graphics issues. Instead we can log the messages to the debug log, which is available on both release and debug builds. | |||||
2018-08-21 | gl_stream_buffer: Add missing header guard | Lioncash | 1 | -0/+2 | |
Prevents potential compilation errors from occurring due to multiple inclusions. | |||||
2018-08-21 | Shaders: Implement depth writing in fragment shaders. | Subv | 1 | -1/+6 | |
We'll write <last color output reg + 2> to gl_FragDepth. | |||||
2018-08-21 | shader_bytecode: Replace some UNIMPLEMENTED logs. | bunnei | 1 | -2/+6 | |
2018-08-21 | gl_shader_decompiler: Implement Texture3D for TEXS. | bunnei | 1 | -0/+7 | |
2018-08-21 | gl_shader_decompiler: Implement TextureCube for TEX. | bunnei | 1 | -0/+8 | |
2018-08-21 | Shaders: Fixed the coords in TEX with Texture2D. | Subv | 1 | -1/+1 | |
The X and Y coordinates should be in gpr8 and gpr8+1, respectively. This fixes the cutscene rendering in Sonic Mania. | |||||
2018-08-21 | Shaders: Log and crash when using an unimplemented texture type in a texture sampling instruction. | Subv | 1 | -5/+14 | |
2018-08-21 | GPU: Implemented the logic op functionality of the GPU. | Subv | 3 | -0/+61 | |
This will ASSERT if blending is enabled at the same time as logic ops. | |||||
2018-08-21 | GLState: Allow enabling/disabling GL_COLOR_LOGIC_OP independently from blending. | Subv | 2 | -6/+19 | |
2018-08-21 | rasterizer_interface: Remove ScreenInfo from AccelerateDraw()'s signature | Lioncash | 5 | -17/+14 | |
This is an OpenGL renderer-specific data type. Given that, this type shouldn't be used within the base interface for the rasterizer. Instead, we can pass this information to the rasterizer via reference. | |||||
2018-08-21 | GPU: Added registers for the logicop functionality. | Subv | 1 | -1/+28 | |
2018-08-21 | renderer_base: Make creation of the rasterizer, the responsibility of the renderers themselves | Lioncash | 4 | -14/+12 | |
Given we use a base-class type within the renderer for the rasterizer (RasterizerInterface), we want to allow renderers to perform more complex initialization if they need to do such a thing. This makes it important to preserve type information. Given the OpenGL renderer is quite simple settings-wise, this is just a simple shuffling of the initialization code. For something like Vulkan however this might involve doing something like: // Initialize and call rasterizer-specific function that requires // the full type of the instance created. auto raster = std::make_unique<VulkanRasterizer>(some, params); raster->CallSomeVulkanRasterizerSpecificFunction(); // Assign to base class variable rasterizer = std::move(raster); | |||||
2018-08-21 | Port #3353 from Citra | fearlessTobi | 1 | -1/+1 | |
2018-08-21 | Shaders: Write all the enabled color outputs when a fragment shader exits. | Subv | 2 | -6/+45 | |
We were only writing to the first render target before. Note that this is only the GLSL side of the implementation, supporting multiple render targets requires more changes in the OpenGL renderer. Dual Source blending is not implemented and stuff that uses it might not work at all. | |||||
2018-08-20 | Rasterizer: Reinterpret the raw texture bytes instead of blitting (and thus doing format conversion) to a new texture when a game requests an old texture address with a different format. | Subv | 1 | -3/+49 | |
2018-08-20 | Rasterizer: Don't attempt to copy over the old texture's data when doing a format reinterpretation if we're only going to clear the framebuffer. | Subv | 4 | -13/+21 | |
2018-08-20 | Implemented RGBA8_UINT | David Marcec | 4 | -45/+58 | |
Needed by Kirby | |||||
2018-08-20 | Shaders/TEXS: Fixed the component mask in the TEXS instruction. | Subv | 1 | -18/+18 | |
Previously we could end up with a TEXS that didn't write any outputs, this was wrong. | |||||
2018-08-19 | Shaders/TEXS: Fixed the component mask in the TEXS instruction. | Subv | 1 | -6/+11 | |
Previously we could end up with a TEXS that didn't write any outputs, this was wrong. | |||||
2018-08-19 | Shader: Implemented the TLD4 and TLD4S opcodes using GLSL's textureGather. | Subv | 1 | -0/+51 | |
It is unknown how TLD4S determines the sampler type, more research is needed. | |||||
2018-08-19 | Shader: Use the right sampler type in the TEX, TEXS and TLDS instructions. | Subv | 2 | -29/+127 | |
Different sampler types have their parameters in different registers. | |||||
2018-08-19 | Shader: Added bitfields for the texture type of the various sampling instructions. | Subv | 1 | -1/+65 | |
2018-08-19 | Shaders: Added decodings for TLD4 and TLD4S | Subv | 1 | -3/+7 | |
2018-08-19 | Shaders: Added decodings for the LDG and STG instructions. | Subv | 1 | -0/+4 | |
2018-08-19 | Shaders: Implemented the gl_FrontFacing input attribute (attr 63). | Subv | 2 | -0/+7 | |
2018-08-18 | Shader: Remove an unneeded assert, the negate bit is implemented for conversion instructions. | Subv | 1 | -2/+0 | |
2018-08-18 | GLRasterizer: Implemented instanced vertex arrays. | Subv | 2 | -4/+30 | |
Before each draw call, for every enabled vertex array configured as instanced, we take the current instance id and divide it by its configured divisor, then we multiply that by the corresponding stride and increment the start address by the resulting amount. This way we can simulate the vertex array being incremented once per instance without actually using OpenGL's instancing functions. | |||||
2018-08-18 | Shader: Implemented the predicate and mode arguments of LOP. | Subv | 2 | -11/+39 | |
The mode can be used to set the predicate to true depending on the result of the logic operation. In some cases, this means discarding the result (writing it to register 0xFF (Zero)). This is used by Super Mario Odyssey. | |||||
2018-08-18 | Added WrapMode MirrorOnceClampToEdge | David Marcec | 1 | -0/+2 | |
Used by Splatoon 2 | |||||
2018-08-18 | Shaders: Implemented a stack for the SSY/SYNC instructions. | Subv | 1 | -3/+36 | |
The SSY instruction pushes an address onto the stack, and the SYNC instruction pops it. The current stack depth is 20; we should figure out if this is enough or not. | |||||
2018-08-18 | Shaders: Corrected the 'abs' and 'neg' bit usage in the float arithmetic instructions. | Subv | 2 | -16/+38 | |
We should definitely audit our shader generator for more errors like this. | |||||
2018-08-18 | Added PredCondition GreaterThanWithNan | David Marcec | 2 | -5/+8 | |
2018-08-17 | gl_rasterizer_cache: Remove asserts for supported blits. | bunnei | 1 | -2/+0 | |
2018-08-17 | renderer_opengl: Treat OpenGL errors as critical. | bunnei | 1 | -1/+1 | |
2018-08-16 | gl_rasterizer_cache: Treat Depth formats differently from DepthStencil. | bunnei | 2 | -16/+26 | |
2018-08-15 | Shader/Conversion: Implemented the negate bit in F2F and I2I instructions. | Subv | 1 | -4/+12 | |
2018-08-15 | Shader/I2F: Implemented the negate I2F_C instruction variant. | Subv | 1 | -7/+23 | |
2018-08-15 | Shader/F2I: Implemented the negate bit in the I2F instruction | Subv | 1 | -0/+4 | |
2018-08-15 | Shader/F2I: Implemented the F2I_C instruction variant. | Subv | 1 | -2/+10 | |
2018-08-15 | Shader/F2I: Implemented the negate bit in the F2I instruction. | Subv | 1 | -0/+4 | |
2018-08-15 | gl_rasterizer_cache: Cleanup some PixelFormat names and logging. | bunnei | 2 | -41/+71 | |
2018-08-15 | Rasterizer: Implemented instanced rendering. | Subv | 7 | -5/+28 | |
We keep track of the current instance and update a uniform in the shaders to let them know which instance they are. Instanced vertex arrays are not yet implemented. | |||||
2018-08-15 | gl_rasterizer_cache: Add RGBA16U to PixelFormatFromTextureFormat. | bunnei | 1 | -1/+9 | |
- Used by Breath of the Wild. | |||||
2018-08-15 | Implement Z16_UNORM in PixelFormatFromTextureFormat function | greggameplayer | 1 | -0/+2 | |
Required by Zelda: Breath of the Wild | |||||
2018-08-15 | gl_shader_decompiler: Several fixes for indirect constant buffer loads. | bunnei | 1 | -13/+22 | |
2018-08-15 | gl_rasterizer: Fix upload size for constant buffers. | bunnei | 1 | -3/+3 | |
2018-08-15 | maxwell_to_gl: Properly handle UnsignedInt/SignedInt sizes. | bunnei | 1 | -5/+20 | |
2018-08-15 | gl_rasterizer_cache: Implement G8R8S format. | bunnei | 2 | -34/+49 | |
- Used by Super Mario Odyssey. | |||||
2018-08-14 | Fix BC7U | greggameplayer | 1 | -1/+1 | |
2018-08-14 | renderer_opengl: Implement RenderTargetFormat::RGBA16_UNORM. | bunnei | 4 | -37/+48 | |
- Used by Breath of the Wild. | |||||
2018-08-13 | Implement RG32UI and R32UI | David Marcec | 4 | -7/+45 | |
Needed for Xenoblade | |||||
2018-08-13 | maxwell_to_gl: Implement VertexAttribute::Size::Size_8. | bunnei | 1 | -0/+1 | |
- Used by Breath of the Wild. | |||||
2018-08-13 | renderer_opengl: Implement RenderTargetFormat::RGBA16_UINT. | bunnei | 4 | -34/+45 | |
- Used by Breath of the Wild. | |||||
2018-08-13 | maxwell_to_gl: Implement PrimitiveTopology::LineStrip. | bunnei | 1 | -0/+2 | |
- Used by Breath of the Wild. | |||||
2018-08-13 | renderer_opengl: Implement RenderTargetFormat::RG8_UNORM. | bunnei | 4 | -26/+61 | |
- Used by Breath of the Wild. | |||||
2018-08-13 | gl_shader_decompiler: Implement XMAD instruction. | bunnei | 2 | -4/+120 | |
2018-08-12 | gl_rasterizer: Use a shared helper to upload from CPU memory. | Markus Wick | 2 | -28/+33 | |
2018-08-12 | gl_state: Don't track constant buffer mappings. | Markus Wick | 3 | -41/+3 | |
2018-08-12 | gl_rasterizer: Use the stream buffer for constant buffers. | Markus Wick | 4 | -29/+32 | |
2018-08-12 | gl_rasterizer: Use the streaming buffer itself for the constant buffer. | Markus Wick | 2 | -33/+15 | |
Don't emit copies, especially not for data that is used only once. They just result in huge GPU overhead. | |||||
2018-08-12 | gl_rasterizer: Use a helper for aligning the buffer. | Markus Wick | 2 | -15/+22 | |
2018-08-12 | Update the stream_buffer helper from Citra. | Markus Wick | 4 | -184/+98 | |
Please see https://github.com/citra-emu/citra/pull/3666 for more details. | |||||
2018-08-12 | gl_shader_decompiler: Fix SetOutputAttributeToRegister empty check. | bunnei | 1 | -2/+2 | |
2018-08-12 | gl_shader_decompiler: Fix GLSL compiler error with KIL instruction. | bunnei | 1 | -0/+8 | |
2018-08-12 | GPU/Maxwell3D: Implemented an alternative set of blend factors. | Subv | 2 | -0/+40 | |
These are used by nouveau and some games like SMO. | |||||
2018-08-12 | Implement R8_UINT RenderTargetFormat & PixelFormat (#1014) | greggameplayer | 4 | -55/+74 | |
- Used by Go Vacation | |||||
2018-08-12 | RasterizerGL: Ignore invalid/unset vertex attributes. | Subv | 2 | -1/+11 | |
This should make the es2gears example not crash anymore. | |||||
2018-08-12 | gl_rasterizer: Silence implicit truncation warning in SetupShaders() | Lioncash | 1 | -1/+1 | |
Previously this would warn of truncating a std::size_t to a u32. This is safe because we'll obviously never have more than UINT32_MAX amount of uniform buffers. | |||||
2018-08-12 | core: Namespace EmuWindow | Lioncash | 8 | -15/+26 | |
Gets the class out of the global namespace. | |||||
2018-08-12 | gl_shader_decompiler: Improve handling of unknown input/output attributes. | bunnei | 2 | -10/+11 | |
2018-08-12 | gl_rasterizer: Implement render target format RG8_SNORM. | bunnei | 4 | -8/+18 | |
- Used by Super Mario Odyssey. | |||||
2018-08-12 | gl_rasterizer: Implement render target format RGBA8_SNORM. | bunnei | 4 | -64/+83 | |
- Used by Super Mario Odyssey. | |||||
2018-08-11 | GPU/Shader: Don't predicate instructions that don't have a predicate field (SSY). | Subv | 2 | -2/+13 | |
2018-08-11 | GPU/Shaders: Implemented SSY and SYNC as a way to modify control flow during shader execution. | Subv | 1 | -6/+25 | |
SSY sets the target label to jump to when the SYNC instruction is executed. | |||||
2018-08-11 | Implement R16S & R16UI & R16I RenderTargetFormats & PixelFormats and more (R16_UNORM needed by Fate Extella) (#848) | greggameplayer | 4 | -19/+92 | |
* Implement R16S & R16UI & R16I RenderTargetFormats & PixelFormats; add a separate function to get the bytes per pixel of a DepthFormat; apply the new function in gpu.h; delete unneeded white space * Correct merging error | |||||
2018-08-11 | video_core: Get rid of global g_toggle_framelimit_enabled variable | Lioncash | 6 | -25/+42 | |
Instead, we make a struct for renderer settings and allow the renderer to update all of these settings, getting rid of the need for global-scoped variables. This also uncovered a few indirect inclusions for certain headers, which this commit also fixes. | |||||
2018-08-11 | renderer_base: Remove unused kFramebuffer enumeration | Lioncash | 1 | -3/+0 | |
This is entirely unused and can be removed. | |||||
2018-08-11 | video_core: Remove unused Renderer enumeration | Lioncash | 1 | -2/+0 | |
Currently we only have an OpenGL renderer, so this is unused in code (and occupies the Renderer identifier in the VideoCore namespace). | |||||
2018-08-10 | maxwell_to_gl: Implement VertexAttribute::Size::Size_8_8. | bunnei | 1 | -0/+1 | |
- Used by Super Mario Odyssey. | |||||
2018-08-10 | maxwell_to_gl: Implement VertexAttribute::Size::Size_32_32_32. | bunnei | 1 | -0/+2 | |
- Used by Super Mario Odyssey. | |||||
2018-08-10 | Revert "gl_state: Temporarily disable culling and depth test." | bunnei | 1 | -3/+1 | |
2018-08-10 | gl_rasterizer_cache: Remove unused viewport parameter of GetFramebufferSurfaces() | Lioncash | 3 | -8/+6 | |
2018-08-10 | video_core: Use variable template variants of type_traits interfaces where applicable | Lioncash | 2 | -4/+2 | |
2018-08-10 | textures: Refactor out for Texture/Depth FormatFromPixelFormat. | bunnei | 4 | -179/+27 | |
2018-08-10 | gl_rasterizer_cache: Add bounds checking for gl_buffer copies. | bunnei | 1 | -10/+12 | |
2018-08-10 | Implement SNORM for BC5/DXN2 (#998) | Khangaroo | 2 | -38/+55 | |
* Implement BC5/DXN2 (#996) - Used by Kirby Star Allies. * Implement BC5/DXN2 SNORM: UNORM for Kirby Star Allies, SNORM for Super Mario Odyssey | |||||
2018-08-09 | gl_shader_decompiler: Reserve element memory beforehand in BuildRegisterList() | Lioncash | 1 | -0/+2 | |
Avoids potentially performing multiple reallocations when we know the total amount of memory we need beforehand. | |||||
2018-08-09 | gl_rasterizer_cache: Avoid iterator invalidation issues within InvalidateRegion() | Lioncash | 1 | -2/+4 | |
A range-based for loop can't be used when the container being iterated is also being erased from. | |||||
2018-08-09 | Implement BC5/DXN2 (#996) | Khangaroo | 3 | -33/+45 | |
- Used by Kirby Star Allies. | |||||
2018-08-09 | gl_rasterizer_cache: Invert conditional in LoadGLBuffer() | Lioncash | 1 | -5/+5 | |
It's generally easier to follow code using conditionals that operate in terms of the true case followed by the false case (no chance of overlooking the exclamation mark). | |||||
2018-08-09 | gl_rasterizer_cache: Use std::vector::assign in LoadGLBuffer() for the non-tiled case | Lioncash | 1 | -4/+6 | |
resize() causes the vector to expand and zero out the added members to the vector, however we can avoid this zeroing by using assign(). Given we have the pointer to the data we want to copy, we can calculate the end pointer and directly copy the range of data without the need to perform the resize() beforehand. | |||||
2018-08-09 | maxwell_to_gl: Implement VertexAttribute::Size::Size_16_16_16_16. | bunnei | 1 | -0/+1 | |
- Used by Super Mario Odyssey (in game). | |||||
2018-08-09 | maxwell_to_gl: Implement PrimitiveTopology::Points. | bunnei | 1 | -0/+2 | |
- Used by Super Mario Odyssey (in game). | |||||
2018-08-09 | gl_shader_decompiler: Declare predicates on use. | bunnei | 1 | -4/+5 | |
- Used by Super Mario Odyssey (when going in game). | |||||
2018-08-09 | maxwell_3d: Ignore macros that have not been uploaded yet. | bunnei | 1 | -4/+9 | |
- Used by Super Mario Odyssey (in game). | |||||
2018-08-09 | gl_rasterizer_cache: Make pointer const in LoadGLBuffer() | Lioncash | 1 | -1/+1 | |
This is only ever read from, so we can make the data it's pointing to const. | |||||
2018-08-09 | gl_rasterizer: Do not render when no render target is configured. | bunnei | 1 | -0/+5 | |
- Used by Super Mario Odyssey. | |||||
2018-08-08 | gpu: Add R11G11B10_FLOAT to RenderTargetBytesPerPixel. | bunnei | 1 | -0/+1 | |
- Used by Super Mario Odyssey. | |||||
2018-08-08 | gl_shader_decompiler: Stub input attribute Unknown_63. | bunnei | 2 | -0/+9 | |
2018-08-08 | maxwell_3d: Use correct const buffer size and check bounds. | bunnei | 4 | -3/+12 | |
- Fixes mem corruption with Super Mario Odyssey and Pokkén Tournament DX. | |||||
2018-08-08 | renderer_opengl: Use trace log in a few places. | bunnei | 2 | -2/+2 | |
2018-08-08 | maxwell_to_gl: Implement VertexAttribute::Size::Size_8_8. | bunnei | 1 | -0/+1 | |
2018-08-08 | gl_rasterizer_cache: Implement RenderTargetFormat::B5G6R5_UNORM. | bunnei | 2 | -0/+4 | |
- Used by Super Mario Odyssey. | |||||
2018-08-08 | gl_shader_decompiler: Let OpenGL interpret floats. | bunnei | 2 | -11/+6 | |
- Accuracy is lost in translation to string, e.g. with NaN. - Needed for Super Mario Odyssey. | |||||
2018-08-08 | Fixed the sRGB pixel format (#963) | Hexagon12 | 1 | -1/+2 | |
* Changed the sRGB pixel format return * Add a message about SRGBA -> RGBA conversion | |||||
2018-08-07 | Lowered down the logging for methods | Hexagon12 | 1 | -4/+4 | |
2018-08-06 | maxwell_3d: Remove outdated assert. | bunnei | 1 | -2/+0 | |
2018-08-06 | gl_rasterizer_cache: Avoid superfluous surface copies. | bunnei | 2 | -4/+21 | |
2018-08-05 | gl_shader_decompiler: Fix TEXS mask and dest. | bunnei | 1 | -2/+5 | |
2018-08-05 | added braces for conditions | David Marcec | 1 | -2/+3 | |
2018-08-05 | fix the attrib format for ints | David Marcec | 1 | -2/+7 | |
2018-08-04 | gl_shader_manager: Invert conditional in SetShaderUniformBlockBinding() | Lioncash | 1 | -7/+9 | |
This lets us indent the majority of the code and places the error case first. | |||||
2018-08-04 | gl_shader_manager: Amend sign differences in an assertion comparison in SetShaderUniformBlockBinding() | Lioncash | 1 | -3/+2 | |
Ensures both operands have the same sign in the comparison. While we're at it, we can get rid of the redundant casting of ub_size to an int. This type will always be trivial and alias a built-in type (not doing so would break backwards compatibility at a standard level). | |||||
2018-08-04 | renderer_base: Make Rasterizer() return the rasterizer by reference | Lioncash | 2 | -4/+8 | |
All calling code assumes that the rasterizer will be in a valid state, which is a totally fine assumption. The only way the rasterizer wouldn't be is if initialization is done incorrectly or fails, which is checked against in System::Init(). | |||||
2018-08-04 | video_core: Eliminate the g_renderer global variable | Lioncash | 10 | -47/+43 | |
We move the initialization of the renderer to the core class, while keeping the creation of it and any other specifics in video_core. This way we can ensure that the renderer is initialized and doesn't give unfettered access to the renderer. This also makes dependencies on types more explicit. For example, the GPU class doesn't need to depend on the existence of a renderer, it only needs to care about whether or not it has a rasterizer, but since it was accessing the global variable, it was also making the renderer a part of its dependency chain. By adjusting the interface, we can get rid of this dependency. | |||||
2018-08-03 | video_core: Remove unimplemented Start() function prototype | Lioncash | 1 | -3/+0 | |
Given this has no definition, we can just remove it entirely. | |||||
2018-08-03 | gl_shader_decompiler: Remove unused variable in GenerateDeclarations() | Lioncash | 1 | -2/+0 | |
This variable was being incremented, but we were never actually using it. | |||||
2018-08-03 | gl_shader_manager: Make ProgramManager's GetCurrentProgramStage() a const member function | Lioncash | 1 | -1/+1 | |
This function doesn't modify class state, so it can be made const. | |||||
2018-08-02 | Implement RGB32F PixelFormat (#886) (used by Go Vacation) | greggameplayer | 3 | -9/+23 | |
2018-08-02 | gl_state: Make texture_units a std::array | Lioncash | 1 | -2/+3 | |
Gets rid of the use of a raw C array. | |||||
2018-08-02 | gl_shader_manager: Take ShaderSetup instances by const reference in UseProgrammableVertexShader() and UseProgrammableFragmentShader() | Lioncash | 1 | -2/+2 | |
Avoids performing unnecessary copies of 65560 byte sized ShaderSetup instances, considering it's only used as part of lookup and not modified. Given the parameters were already const, it's likely taking these parameters by reference was intended but the ampersand was forgotten. | |||||
2018-08-02 | video_core: Make global EmuWindow instance part of the base renderer class | Lioncash | 8 | -51/+41 | |
Makes the global a member of the RendererBase class. We also change this to be a reference. Passing any form of null pointer to these functions is incorrect entirely, especially given the code itself assumes that the pointer would always be in a valid state. This also makes it easier to follow the lifecycle of instances being used, as we explicitly interact the renderer with the rasterizer, rather than it just operating on a global pointer. | |||||
2018-08-01 | Implement R32_FLOAT RenderTargetFormat | Unknown | 3 | -0/+5 | |
2018-07-31 | MacroInterpreter: Avoid left shifting negative values. | Subv | 2 | -2/+6 | |
The branch target is signed, so multiply by 4 instead of left shifting by 2 | |||||
2018-07-26 | GPU: Allow using R16F as a render target format. | Subv | 2 | -1/+4 | |
2018-07-26 | Implement R16_G16 | Unknown | 4 | -19/+100 | |
Correct trailing white spaces; delete tabs; correct placement; add RG16F & RG16UI & RG16I & RG16S PixelFormats; return correct data according to the changes done previously; correct PixelFormat declaration; correct coding style error; correct coding style error, part 2; correct RG16S declaration error; correct alignment | |||||
2018-07-25 | GPU: Use the right texture format for sRGBA framebuffers. | Subv | 2 | -9/+17 | |
2018-07-25 | GPU: Allow the use of Z24S8 as a texture format. | Subv | 1 | -0/+4 | |
2018-07-25 | GPU: Implemented the Z32_S8_X24 depth buffer format. | Subv | 4 | -1/+16 | |
2018-07-25 | GPU: Allow using Z32 as a texture format. | Subv | 1 | -0/+4 | |
2018-07-25 | GPU: Allow the usage of R8 as a render target format. | Subv | 2 | -0/+4 | |
2018-07-24 | GPU: Remove the assert that required the CODE_ADDRESS to be 0. | Subv | 1 | -8/+0 | |
Games usually just leave it at 0 but nouveau sets it to something else. This already works fine, the assert is useless. | |||||
2018-07-24 | GPU: Implemented the R16 and R16F texture formats. | Subv | 3 | -5/+32 | |
2018-07-24 | gl_rasterizer: Replace magic number with GL_INVALID_INDEX in SetupConstBuffers() | Lioncash | 1 | -3/+5 | |
This is just the named constant that OpenGL provides, so we can use that instead of using a literal -1 | |||||
2018-07-24 | gl_rasterizer: Use std::string_view instead of std::string when checking for extensions | Lioncash | 1 | -1/+3 | |
We can avoid heap allocations here by just using a std::string_view instead of performing unnecessary copying of the string data. | |||||
2018-07-24 | gl_rasterizer: Use in-class member initializers where applicable | Lioncash | 2 | -12/+5 | |
We can just assign to the members directly in these cases. | |||||
2018-07-24 | video_core/memory_manager: Replace a loop with std::array's fill() function in PageSlot() | Lioncash | 1 | -3/+1 | |
We already have a function that does what this code was doing, so let's use that instead. | |||||
2018-07-24 | video_core/memory_manager: Avoid repeated unnecessary page slot lookups | Lioncash | 1 | -11/+21 | |
We don't need to keep calling the same function over and over again in a loop, especially when the behavior is slightly non-trivial. We can just keep a reference to the looked up location and do all the checking and assignments based off it instead. | |||||
2018-07-24 | gl_rasterizer: Implement texture border color. | bunnei | 3 | -11/+11 | |
2018-07-24 | maxwell_to_gl: Implement Texture::WrapMode::Border. | bunnei | 1 | -0/+2 | |
2018-07-24 | GPU: Implement texture format R32F. | Subv | 3 | -6/+19 | |
2018-07-24 | maxwell_to_gl: Implement VertexAttribute::Type::UnsignedInt. | bunnei | 1 | -0/+3 | |
2018-07-24 | gl_shader_decompiler: Correct return value of WriteTexsInstruction() | Lioncash | 1 | -2/+2 | |
This should be returning void, not a std::string | |||||
2018-07-24 | gl_shader_decompiler: Implement shader instruction TLDS. | bunnei | 1 | -29/+43 | |
2018-07-24 | gl_rasterizer_cache: Implement RenderTargetFormat RG32_FLOAT. | bunnei | 5 | -7/+25 | |
2018-07-24 | gl_rasterizer_cache: Implement RenderTargetFormat RGBA32_FLOAT. | bunnei | 2 | -10/+34 | |
2018-07-24 | gl_rasterizer_cache: Implement RenderTargetFormat BGRA8_UNORM. | bunnei | 4 | -8/+22 | |
2018-07-24 | gl_rasterizer_cache: Add missing log statements. | bunnei | 1 | -0/+2 | |
2018-07-24 | gl_shader_decompiler: Print instruction value in shader comments. | bunnei | 1 | -1/+2 | |
2018-07-24 | gl_shader_decompiler: Check if SetRegister result is ZeroIndex. | bunnei | 1 | -0/+6 | |
2018-07-23 | gl_shader_decompiler: Simplify GetCommonDeclarations() | Lioncash | 1 | -5/+5 | |
2018-07-22 | gl_shader_decompiler: Remove redundant Subroutine construction in AddSubroutine() | Lioncash | 1 | -4/+8 | |
We don't need to toss away the Subroutine instance after the find() call and reconstruct another instance with the same data right after it, particularly given that Subroutine contains a std::set. | |||||
2018-07-22 | shader_bytecode: Implement other TEXS masks. | bunnei | 1 | -5/+9 | |
2018-07-22 | gl_shader_decompiler: Remove unused state tracking and minor cleanup. | bunnei | 1 | -78/+15 | |
2018-07-22 | gl_shader_decompiler: Implement SEL instruction. | bunnei | 2 | -0/+20 | |
2018-07-22 | gl_rasterizer_cache: Blit surfaces on recreation instead of flush and load. | bunnei | 2 | -2/+86 | |
2018-07-22 | gl_rasterizer_cache: Use GPUVAddr as cache key, not parameter set. | bunnei | 3 | -57/+46 | |
2018-07-22 | gl_rasterizer_cache: Use zeta_width and zeta_height registers for depth buffer. | bunnei | 2 | -11/+11 | |
2018-07-22 | gl_rasterizer: Use zeta_enable register to enable depth buffer. | bunnei | 1 | -2/+2 | |
2018-07-22 | maxwell_3d: Add depth buffer enable, width, and height registers. | bunnei | 1 | -2/+14 | |
2018-07-21 | gl_shader_manager: Replace unimplemented function prototype | Lioncash | 2 | -3/+3 | |
This was just a linker error waiting to happen. | |||||
2018-07-21 | gpu: Rename Get3DEngine() to Maxwell3D() | Lioncash | 3 | -11/+14 | |
This makes it match its const qualified equivalent. | |||||
2018-07-21 | video_core: Use nested namespaces where applicable | Lioncash | 11 | -48/+24 | |
Compresses a few namespace specifiers to be more compact. | |||||
2018-07-20 | gl_state: Make references const where applicable in Apply() | Lioncash | 1 | -2/+3 | |
2018-07-20 | gl_state: Get rid of mismatched sign conversions | Lioncash | 1 | -14/+17 | |
While we're at it, amend the loop variable type to be the same width as that returned by the .size() call. | |||||
2018-07-20 | maxwell_3d: Remove unused variable within GetStageTextures() | Lioncash | 1 | -2/+0 | |
2018-07-20 | gl_shader_decompiler: Eliminate variable and declaration shadowing | Lioncash | 1 | -6/+4 | |
Ensures that no identifiers are being hidden, which also reduces compiler warnings. | |||||
2018-07-20 | gl_shader_decompiler: Remove unnecessary const from return values | Lioncash | 1 | -2/+2 | |
This adds nothing from a behavioral point of view, and can inhibit the move constructor/RVO | |||||
2018-07-19 | gl_state: Temporarily disable culling and depth test. | bunnei | 1 | -1/+3 | |
2018-07-19 | decoders: Fix calc of swizzle image_width_in_gobs. | bunnei | 1 | -1/+4 | |
2018-07-19 | core: Don't construct instance of Core::System, just to access its live instance | Lioncash | 3 | -15/+15 | |
This would result in a lot of allocations and related object construction, just to toss it all away immediately after the call. These are definitely not intentional, and it was intended that all of these should have been accessing the static function GetInstance() through the name itself, not constructed instances. | |||||
2018-07-18 | astc: Initialize vector size directly in Decompress | Lioncash | 1 | -2/+1 | |
There's no need to perform a separate resize. | |||||
2018-07-18 | astc: Mark functions as internally linked where applicable | Lioncash | 1 | -17/+20 | |
2018-07-18 | astc: const-correctness changes where applicable | Lioncash | 1 | -14/+13 | |
A few member functions didn't actually modify class state, so these can be amended as necessary. | |||||
2018-07-18 | astc: Delete Bits' copy constructor and assignment operator | Lioncash | 1 | -8/+6 | |
This also potentially avoids warnings, considering the copy assignment operator is supposed to have a return value. | |||||
2018-07-18 | astc: In-class initialize member variables where appropriate | Lioncash | 1 | -39/+22 | |
2018-07-18 | vi: Partially implement buffer crop parameters. | bunnei | 3 | -4/+20 | |
2018-07-17 | GPU: Added register definitions for the stencil parameters. | Subv | 1 | -2/+25 | |
2018-07-15 | gl_rasterizer_cache: Implement texture format G8R8. | bunnei | 3 | -9/+40 | |
2018-07-15 | gl_rasterizer_cache: Fix incorrect offset in ConvertS8Z24ToZ24S8. | bunnei | 1 | -1/+2 | |
2018-07-15 | gl_rasterizer_cache: Implement depth format Z16_UNORM. | bunnei | 3 | -1/+15 | |
2018-07-14 | OpenGL: Use MakeCurrent/DoneCurrent for multithreaded rendering. | bunnei | 3 | -1/+27 | |
2018-07-14 | GPU: Always enable the depth write when clearing the depth buffer. | Subv | 1 | -3/+8 | |
The GPU ignores that register when clearing, but OpenGL obeys the glDepthMask parameter, so we set the depth mask to GL_TRUE when clearing the depth buffer. It will be restored to the correct value automatically on the next draw call. | |||||
2018-07-13 | gl_rasterizer: Fix check for if a shader stage is enabled. | bunnei | 3 | -35/+11 | |
2018-07-13 | gl_shader_gen: Implement dual vertex shader mode. | bunnei | 5 | -55/+139 | |
- When VertexA shader stage is enabled, we combine with VertexB program to make a single Vertex Shader stage. | |||||
2018-07-13 | gl_shader_decompiler: Implement PredCondition::LessThanWithNan. | bunnei | 2 | -5/+7 | |
2018-07-13 | gl_shader_decompiler: Use FlowCondition field in EXIT instruction. | bunnei | 2 | -8/+34 | |
2018-07-12 | GPU: Implement the FADD32I shader instruction. | Subv | 2 | -0/+32 | |
2018-07-12 | GPU: Corrected the decoding of FFMA for immediate operands. | Subv | 1 | -1/+1 | |
2018-07-08 | gl_rasterizer: Flip triangles when regs.viewport_transform[0].scale_y is negative. | bunnei | 1 | -1/+4 | |
- Fixes a regression with Binding of Isaac. | |||||
2018-07-07 | GPU: Implemented the BC7U texture format. | Subv | 3 | -7/+21 | |
Note: Our version of glad exports GL_COMPRESSED_RGBA_BPTC_UNORM as GL_COMPRESSED_RGBA_BPTC_UNORM_ARB, maybe it's time we update it. | |||||
2018-07-05 | GPU: Allow using the old NV04 values for the depth test function. | Subv | 2 | -9/+29 | |
These seem to be just as valid as the GL token values. Thanks @ReinUsesLisp. This restores graphical output to Disgaea 5. | |||||
2018-07-04 | GPU: Implemented the IMNMX shader instruction. | Subv | 2 | -3/+31 | |
It's similar to the FMNMX instruction but it works on integers. | |||||
2018-07-04 | GPU: Implemented the F2F 'round' rounding mode. | Subv | 1 | -0/+3 | |
It's implemented via the GLSL 'roundEven()' function. | |||||
2018-07-04 | GPU: Stub the shader SYNC and DEPBAR instructions. | Subv | 2 | -0/+12 | |
It is unknown at this moment if we actually need to do something with these instructions or if the GLSL compiler takes care of that for us. | |||||
2018-07-04 | GPU: Implement the Size_16_16 and Size_10_10_10_2 vertex attribute types. | Subv | 1 | -0/+8 | |
Both signed and unsigned variants. | |||||
2018-07-04 | GPU: Ignore textures that the GLSL compiler deemed unused when binding textures to the shaders. | Subv | 1 | -1/+4 | |
2018-07-04 | GPU: Corrected the decoding for the TEX shader instruction. | Subv | 1 | -1/+1 | |
2018-07-04 | GPU: Implemented the PSETP shader instruction. | Subv | 2 | -0/+43 | |
It's similar to the isetp and fsetp instructions but it works on predicates instead. | |||||
2018-07-04 | GPU: Implemented the 32 bit float depth buffer format. | Subv | 3 | -2/+15 | |
2018-07-04 | GPU: Flip the triangle front face winding if the GPU is configured to not flip the triangles. | Subv | 2 | -3/+29 | |
OpenGL's default behavior is already correct when the GPU is configured to flip the triangles. This fixes 1-2 Switch's splash screen. | |||||
2018-07-04 | GPU: Only configure the used framebuffers during clear. | Subv | 4 | -17/+48 | |
Don't try to configure the color buffer if it is not being cleared, it may not be completely valid at this point. | |||||
2018-07-03 | GPU: Factor out the framebuffer configuration code for both Clear and Draw commands. | Subv | 2 | -72/+39 | |
2018-07-03 | GPU: Support clears that don't clear the color buffer. | Subv | 2 | -6/+17 | |
2018-07-03 | GPU: Bind and clear the render target when the CLEAR_BUFFERS register is written to. | Subv | 4 | -0/+86 | |
2018-07-03 | GPU: Added registers for the CLEAR_BUFFERS and CLEAR_COLOR methods. | Subv | 1 | -2/+27 | |
2018-07-03 | gl_rasterizer_cache: Implement PixelFormat S8Z24. | bunnei | 3 | -11/+83 | |
2018-07-03 | gl_rasterizer: Only set cull mode and front face if enabled. | bunnei | 1 | -2/+5 | |
2018-07-03 | GPU: Use only the least significant 3 bits when reading the depth test func. | Subv | 1 | -9/+9 | |
Some games set the full GL define value here (including nouveau), but others just seem to set those last 3 bits. | |||||
2018-07-03 | GPU: Don't try to parse the depth test function if the depth test is disabled. | Subv | 1 | -0/+4 | |
2018-07-03 | Update clang format | James Rowe | 7 | -21/+20 | |
2018-07-03 | Rename logging macro back to LOG_* | James Rowe | 13 | -70/+70 | |
2018-07-03 | GPU: Set up the culling configuration on each draw. | Subv | 1 | -6/+8 | |
2018-07-03 | GPU: Implemented MUFU suboperation 8, sqrt. | Subv | 2 | -0/+5 | |
2018-07-02 | GPU: Set up the depth test state on every draw. | Subv | 2 | -0/+14 | |
2018-07-02 | MaxwellToGL: Added conversion functions for depth test and cull mode. | Subv | 1 | -0/+50 | |
2018-07-02 | GPU: Added registers for depth test and cull mode. | Subv | 1 | -3/+51 | |
2018-07-02 | GPU: Implemented the Z24S8 depth format and load the depth framebuffer. | Subv | 7 | -24/+124 | |
2018-07-02 | GPU: Implement offsetted rendering when using non-indexed drawing. | Subv | 1 | -1/+1 | |
2018-07-02 | GPU: Fixed the index offset rendering, and implemented the base vertex functionality. | Subv | 1 | -6/+8 | |
This fixes Stardew Valley. | |||||
2018-07-02 | GPU: Added register definitions for the vertex buffer base element. | Subv | 1 | -1/+6 | |
2018-07-02 | GPU: Directly copy the pixels when performing a same-layout DMA. | Subv | 1 | -1/+5 | |
2018-07-02 | GPU: Ignore disabled textures and textures with an invalid address. | Subv | 2 | -1/+10 | |
2018-07-02 | GPU: Allow GpuToCpuAddress to return boost::none for unmapped addresses. | Subv | 1 | -2/+2 | |
2018-06-30 | GPU: Corrected the size of the MUFU subop field, and removed incorrect "min" operation. | Subv | 2 | -6/+1 | |
2018-06-30 | GPU: Implemented the RGBA32_UINT rendertarget format. | Subv | 4 | -9/+28 | |
2018-06-30 | GLCache: Specify the component type along the texture type in the format tuple. | Subv | 1 | -17/+21 | |
2018-06-30 | gl_shader_decompiler: Implement predicate NotEqualWithNan. | bunnei | 2 | -17/+24 | |
2018-06-29 | gl_rasterizer_cache: Only dereference color_surface/depth_surface if valid. | bunnei | 1 | -2/+6 | |
2018-06-27 | gl_shader_decompiler: Add a return path for unknown instructions. | bunnei | 1 | -0/+1 | |
2018-06-27 | gl_rasterizer_cache: Implement caching for texture and framebuffer surfaces. | bunnei | 3 | -16/+168 | |
gl_rasterizer_cache: Improved cache management based on Citra's implementation. gl_surface_cache: Add some docstrings. | |||||
2018-06-27 | gl_rasterizer_cache: Various fixes for ASTC handling. | bunnei | 2 | -35/+39 | |
2018-06-27 | gl_rasterizer_cache: Use SurfaceParams as a key for surface caching. | bunnei | 2 | -43/+72 | |
2018-06-27 | maxwell_3d: Add a struct for RenderTargetConfig. | bunnei | 1 | -17/+19 | |
2018-06-27 | gl_rasterizer: Implement AccelerateDisplay to forward textures to framebuffers. | bunnei | 6 | -8/+62 | |
2018-06-27 | gl_rasterizer_cache: Cache size_in_bytes as a const per surface. | bunnei | 2 | -9/+13 | |
2018-06-27 | gl_rasterizer_cache: Refactor to make SurfaceParams members const. | bunnei | 2 | -52/+37 | |
2018-06-27 | gl_rasterizer_cache: Remove Citra's rasterizer cache, always load/flush surfaces. | bunnei | 4 | -1494/+210 | |
2018-06-27 | gl_rasterizer: Workaround for when exceeding max UBO size. | bunnei | 2 | -1/+7 | |
2018-06-26 | gl_state: Fix state management for texture swizzle. | bunnei | 5 | -12/+20 | |
2018-06-26 | gl_state: Remove unused state management from 3DS. | bunnei | 2 | -94/+0 | |
2018-06-26 | gl_rasterizer_cache: Fix inverted B5G6R5 format. | bunnei | 1 | -1/+1 | |
2018-06-25 | Fix crash at exit | mailwl | 1 | -2/+4 | |
2018-06-20 | Build: Fixed some MSVC warnings in various parts of the code. | Subv | 7 | -12/+13 | |
2018-06-19 | GPU: Perform negation after absolute value in the float shader instructions. | Subv | 1 | -7/+14 | |
2018-06-19 | GPU: Don't mark uniform buffers and registers as used for instructions which don't have them. | Subv | 2 | -14/+18 | |
Like the MOV32I and FMUL32I instructions. This fixes a potential crash when using these instructions. | |||||
2018-06-18 | gl_rasterizer: Get loose on independent blending. | Jules Blok | 1 | -1/+1 | |
2018-06-18 | gl_rasterizer: Implement texture format ASTC_2D_4X4. | bunnei | 6 | -1/+1709 | |
2018-06-18 | gl_rasterizer_cache: Loosen things up a bit. | bunnei | 1 | -26/+8 | |
2018-06-17 | gl_shader_decompiler: Implement LOP instructions. | bunnei | 2 | -6/+42 | |
2018-06-17 | gl_shader_decompiler: Refactor LOP32I instruction a bit in support of LOP. | bunnei | 2 | -57/+42 | |
2018-06-16 | gl_shader_decompiler: Implement integer size conversions for I2I/I2F/F2I. | bunnei | 2 | -14/+43 | |
2018-06-16 | gl_shader_gen: Set position.w to 1. | bunnei | 1 | -0/+4 | |
2018-06-16 | gl_shader_decompiler: Implement LOP32I LogicOperation PassB. | bunnei | 1 | -6/+12 | |
2018-06-12 | GPU: Implemented the iadd32i shader instruction. | Subv | 2 | -2/+31 | |
2018-06-12 | GPU: Partially implemented the Maxwell DMA engine. | Subv | 7 | -1/+237 | |
Only tiled->linear and linear->tiled copies without an offset are supported for now. Queries and swizzled copies are not supported. | |||||
2018-06-12 | gl_shader_decompiler: Implement saturate for float instructions. | bunnei | 2 | -39/+32 | |
2018-06-10 | GPU: Convert the gl_InstanceID and gl_VertexID variables to floats when reading from them. | Subv | 1 | -1/+1 | |
This corrects the invalid position values in some games when doing attribute-less rendering. | |||||
2018-06-10 | Rasterizer: Use UBOs instead of SSBOs for uploading const buffers. | Subv | 4 | -18/+39 | |
This should help a bit with GPU performance once we're GPU-bound. | |||||
2018-06-09 | GPU: Implement the iset family of shader instructions. | Subv | 2 | -2/+46 | |
2018-06-09 | GPU: Added decodings for the ISET family of instructions. | Subv | 1 | -0/+7 | |
2018-06-09 | gl_shader_decompiler: Implement SHR instruction. | bunnei | 2 | -0/+17 | |
2018-06-09 | GPU: Stub the SSY shader instruction. | Subv | 2 | -0/+7 | |
This instruction tells the GPU where the flow reconverges in a non-uniform control flow scenario; we can ignore it when generating GLSL code. | |||||
2018-06-09 | gl_shader_decompiler: Implement IADD instruction. | bunnei | 2 | -11/+37 | |
2018-06-09 | gl_shader_decompiler: Add missing asserts for saturate_a instructions. | bunnei | 2 | -8/+18 | |
2018-06-09 | GPU: Synchronize the blend state on every draw call. | Subv | 2 | -16/+20 | |
Only independent blending on render target 0 is implemented for now. This fixes the elongated squids in Splatoon 2's boot screen. | |||||
2018-06-09 | GPU: Added registers for normal and independent blending. | Subv | 2 | -31/+27 | |
2018-06-08 | GLCache: Align compressed texture sizes to their compression ratio, and then align that compressed size to the block height for tiled textures. | Subv | 1 | -2/+7 | |
This fixes issues with retrieving non-block-aligned tiled compressed textures from the cache. | |||||
2018-06-08 | Rasterizer: Flush the written region when writing shader uniform data before copying it to the uniform buffers. | Subv | 1 | -0/+3 | |
This fixes the flip_viewport uniform having invalid values when drawing. | |||||
2018-06-07 | GLRenderer: Write the shader stage configuration UBO data *before* copying it to the GPU. | Subv | 1 | -3/+4 | |
This should fix the bug with the vs_config UBO being uninitialized during shader execution. | |||||
2018-06-07 | gl_shader_decompiler: Implement BFE_IMM instruction. | bunnei | 2 | -7/+44 | |
2018-06-07 | GLCache: Use the full uncompressed size when blitting from one texture to another. | Subv | 1 | -3/+6 | |
This avoids the problem of only copying a tiny piece of the textures when they are compressed. | |||||
2018-06-07 | GLCache: Simplify the logic to copy from one texture to another in BlitTextures. | Subv | 1 | -53/+3 | |
We now use glCopyImageSubData, which should avoid errors when trying to attach a compressed texture as a framebuffer's color attachment and then blitting to it. In the future this could be changed to glCopyTextureSubImage, which only requires GL_ARB_direct_state_access. | |||||
2018-06-07 | gl_shader_decompiler: F2F: Implement rounding modes. | bunnei | 2 | -10/+35 | |
2018-06-07 | gl_shader_decompiler: Remove some attribute stuff that has nothing to do with TEX/TEXS. | bunnei | 1 | -8/+4 | |
2018-06-07 | shader_bytecode: Add instruction decodings for BFE, IMNMX, and XMAD. | bunnei | 1 | -0/+20 | |
2018-06-07 | gl_shader_decompiler: Implement ISETP_IMM instruction. | bunnei | 1 | -8/+9 | |
2018-06-07 | GPU: Support changing the texture swizzles for Maxwell textures. | Subv | 3 | -0/+45 | |
2018-06-07 | GLState: Support changing the GL_TEXTURE_SWIZZLE parameter of each texture unit. | Subv | 3 | -0/+20 | |
2018-06-07 | gl_shader_decompiler: Implement LD_C instruction. | bunnei | 2 | -0/+43 | |
2018-06-07 | gl_shader_gen: Add uniform handling for indirect const buffer access. | bunnei | 3 | -4/+40 | |
2018-06-06 | gl_shader_decompiler: Refactor uniform handling to allow different decodings. | bunnei | 2 | -26/+29 | |
2018-06-06 | GPU: Implement sampling multiple textures in the generated glsl shaders. | Subv | 9 | -69/+172 | |
All tested games that use a single texture show no regression. Only Texture2D textures are supported right now. Each shader gets its own "tex_fs/vs/gs" sampler array to maintain independent textures between shader stages; the textures themselves are reused if possible. | |||||
2018-06-06 | gl_shader_decompiler: Fix un/signed mismatch with SHL. | bunnei | 1 | -1/+1 | |
2018-06-06 | maxwell_to_gl: Implement WrapMode Mirror. | bunnei | 1 | -0/+2 | |
2018-06-06 | GPU: Allow the usage of RGBA16_FLOAT in the texture copy engine. | Subv | 1 | -0/+2 | |
2018-06-06 | GPU: Implemented the R11FG11FB10F texture and rendertarget formats. | Subv | 4 | -11/+30 | |
2018-06-06 | GPU: Fixed the compression factor for RGBA16F textures. | Subv | 1 | -1/+1 | |
They're not compressed. | |||||
2018-06-06 | GPU: Allow the usage of RGBA32_FLOAT in the texture copy engine. | Subv | 2 | -0/+3 | |
2018-06-05 | GPU: Corrected the branch targets for the shader bra instruction. | Subv | 1 | -4/+5 | |
2018-06-05 | GPU: Implemented the F2I_R shader instruction. | Subv | 2 | -7/+64 | |
2018-06-05 | gl_shader_decompiler: Fix typo with ISCADD instruction. | bunnei | 1 | -1/+1 | |
2018-06-05 | gl_shader_decompiler: Implement SHL instruction. | bunnei | 2 | -14/+47 | |
2018-06-05 | gl_shader_decompiler: Implement PredCondition::NotEqual. | bunnei | 1 | -3/+3 | |
2018-06-05 | GPU: Implement the ISCADD shader instructions. | Subv | 2 | -0/+40 | |
2018-06-05 | GPU: Added decodings for the ISCADD instructions. | Subv | 1 | -0/+7 | |
2018-06-05 | GPU: Implement predicated exit instructions in the shader programs. | Subv | 1 | -4/+6 | |
2018-06-05 | GPU: Take into account predicated exits when performing shader control flow analysis. | Subv | 1 | -1/+10 | |
2018-06-04 | GPU: Use the bf bit in FSET to determine whether to write 0xFFFFFFFF or 1.0f. | Subv | 2 | -2/+7 | |
2018-06-04 | GPU: Corrected the I2F_R implementation. | Subv | 1 | -2/+12 | |
2018-06-04 | GPU: Calculate the correct viewport dimensions based on the scale and translate registers. | Subv | 2 | -14/+30 | |
This is how nouveau calculates the viewport width and height. For some reason some games set 0xFFFF in the VIEWPORT_HORIZ and VIEWPORT_VERT registers, maybe those are a misnomer and actually refer to something else? | |||||
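A minimal sketch of the scale/translate calculation, assuming the usual viewport transform `window = scale * ndc + translate` with `ndc` in [-1, 1]. The struct, field, and function names below are hypothetical and are not taken from the commit.

```cpp
// Hypothetical sketch: recover a viewport rectangle from scale/translate values.
// Assumes window = scale * ndc + translate, with ndc ranging over [-1, 1].
struct ViewportTransform {
    float scale_x;
    float scale_y;
    float translate_x;
    float translate_y;
};

struct ViewportRect {
    float x;
    float y;
    float width;
    float height;
};

// std::fabs is not constexpr in C++17, so use a small local helper.
constexpr float Abs(float v) {
    return v < 0.0f ? -v : v;
}

constexpr ViewportRect ViewportFromTransform(const ViewportTransform& t) {
    // ndc = -1 maps to translate - |scale|; ndc = +1 maps to translate + |scale|.
    return ViewportRect{
        t.translate_x - Abs(t.scale_x),
        t.translate_y - Abs(t.scale_y),
        2.0f * Abs(t.scale_x),
        2.0f * Abs(t.scale_y),
    };
}
```

Under these assumptions a 1280x720 viewport at the origin has scale (640, 360) and translate (640, 360); a negative scale (a flipped axis) yields the same rectangle because only the magnitude is used.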
2018-06-04 | GPU: Implemented the LOP32I instruction. | Subv | 2 | -1/+58 | |
2018-06-04 | GLCache: Corrected a mismatch between storing compressed sizes and verifying the uncompressed alignment in GetSurface. | Subv | 1 | -1/+2 | |
2018-06-04 | GPU: Use explicit types when retrieving the uniform values for fsetp/fset and isetp instead of the type of an invalid output register. | Subv | 1 | -9/+18 | |
2018-06-04 | GPU: Implemented the ISETP_R and ISETP_C shader instructions. | Subv | 2 | -0/+48 | |
2018-06-04 | GPU: Partially implemented the shader BRA instruction. | Subv | 2 | -1/+43 | |
2018-06-04 | GPU: Added decoding for the BRA instruction. | Subv | 1 | -0/+2 | |
2018-06-04 | GPU: Partial implementation of long GPU queries. | Subv | 1 | -9/+24 | |
Long queries write a 128-bit result value to memory, which consists of a 64-bit query value and a 64-bit timestamp. In this implementation, only select=Zero of the Crop unit is implemented. It writes the query sequence as a 64-bit value and a 0u64 value for the timestamp, since we emulate an infinitely fast GPU. This specific type was hwtested, but more rigorous tests should be performed in the future for the other types. | |||||
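The 128-bit result described above could be laid out roughly as follows. This is a sketch only: the struct name, the function name, and the field order (value first, timestamp second) are assumptions, not details confirmed by the commit.

```cpp
#include <cstdint>

// Hypothetical layout of a long query result: a 64-bit value and a 64-bit timestamp.
// The field order is an assumption made for illustration.
struct LongQueryResult {
    std::uint64_t value;
    std::uint64_t timestamp;
};
static_assert(sizeof(LongQueryResult) == 16, "long queries are 128 bits wide");

// Builds the result for the select=Zero case: the query sequence is the value,
// and the timestamp stays zero because the emulated GPU is treated as infinitely fast.
constexpr LongQueryResult MakeZeroSelectResult(std::uint64_t sequence) {
    return LongQueryResult{sequence, 0};
}
```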
2018-06-03 | gl_shader_decompiler: Implement TEXS component mask. | bunnei | 2 | -9/+26 | |
2018-06-03 | gl_shader_decompiler: Implement RRO as a register move. | bunnei | 2 | -9/+18 | |
2018-06-02 | GPU: Implemented the DXN1 (BC4) texture format. | Subv | 3 | -3/+16 | |
2018-06-01 | gl_shader_decompiler: Implement TEX instruction. | bunnei | 2 | -1/+36 | |
2018-06-01 | gl_shader_decompiler: Support multi-destination for TEXS. | bunnei | 2 | -2/+23 | |
2018-05-31 | gl_rasterizer_cache: Assert that component type is UNorm or format is RGBA16F. | bunnei | 1 | -1/+2 | |
2018-05-31 | gl_rasterizer_cache: Implement PixelFormat RGBA16F. | bunnei | 3 | -6/+22 | |
2018-05-30 | Shaders: Implemented reading the gl_InstanceID and gl_VertexID variables in the vertex shader. | Subv | 2 | -1/+11 | |
2018-05-30 | gl_shader_decompiler: F2F_R instruction: Implement abs. | bunnei | 1 | -1/+7 | |
2018-05-30 | gl_shader_decompiler: Partially implement F2F_R instruction. | bunnei | 2 | -4/+9 | |
2018-05-30 | GPU: Implemented the R8 texture format (0x1D) | Subv | 3 | -5/+18 | |
2018-05-30 | gl_rasterizer_cache: Invert order of tex format RGB565. | bunnei | 1 | -1/+1 | |
2018-05-29 | add all the known TextureFormat (#474) | greggameplayer | 1 | -2/+71 | |
2018-05-27 | GPU: Implemented the A1B5G5R5 texture format (0x14) | Subv | 4 | -5/+21 | |
2018-05-26 | gl_shader_decompiler: Implement GetPredicateComparison GreaterEqual. | bunnei | 1 | -4/+3 | |
2018-05-26 | shader_bytecode: Implement other variants of FMNMX. | bunnei | 2 | -4/+10 | |
2018-05-25 | Shader: Implemented compound predicates in fset. | Subv | 1 | -28/+12 | |
You can specify a predicate in the fset instruction: Result = ((Value1 Comp Value2) OP P0) ? 1.0 : 0.0; | |||||
2018-05-25 | GPU: Allow command lists to rebind a channel to another engine in the middle of the command list. | Subv | 1 | -1/+0 | |
2018-05-25 | Shader: Implemented compound predicates in fsetp. | Subv | 1 | -19/+55 | |
You can specify three predicates in an fsetp instruction: P1 = (Value1 Comp Value2) OP P0; P2 = !(Value1 Comp Value2) OP P0; | |||||
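The two formulas above can be written out as a small evaluator. This is a sketch under assumptions: the `PredOperation` enum and its And/Or/Xor members, and every other name here, are hypothetical; only the P1/P2 formulas come from the commit message.

```cpp
// Hypothetical combining operations; the commit only refers to them as "OP".
enum class PredOperation { And, Or, Xor };

constexpr bool Combine(bool lhs, PredOperation op, bool rhs) {
    switch (op) {
    case PredOperation::And:
        return lhs && rhs;
    case PredOperation::Or:
        return lhs || rhs;
    case PredOperation::Xor:
        return lhs != rhs;
    }
    return false;
}

struct FsetpResult {
    bool p1;
    bool p2;
};

// comparison is the outcome of (Value1 Comp Value2); p0 is the input predicate.
constexpr FsetpResult EvaluateFsetp(bool comparison, PredOperation op, bool p0) {
    return FsetpResult{
        Combine(comparison, op, p0),   // P1 = (Value1 Comp Value2) OP P0
        Combine(!comparison, op, p0),  // P2 = !(Value1 Comp Value2) OP P0
    };
}
```

With `op` set to And and `p0` true, P1 and P2 are simply the comparison and its negation.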
2018-05-21 | Shaders: Implemented the FMNMX shader instruction. | Subv | 2 | -6/+26 | |
2018-05-20 | GPU: Implemented nvhost-as-gpu's UnmapBuffer ioctl. | Subv | 2 | -0/+20 | |
It removes a mapping previously created with the MapBufferEx ioctl. | |||||
2018-05-19 | ShadersDecompiler: Added decoding for the PSETP instruction. | Subv | 1 | -0/+3 | |
2018-05-19 | GLRenderer: Remove unused hw_vao_enabled_attributes variable. | Subv | 2 | -4/+0 | |
2018-05-19 | GLRenderer: Remove unused vertex buffer and increase the size of the stream buffer to 128 MB. | Subv | 2 | -9/+3 | |
The stream buffer is where all the vertex data is copied. Some games require it to be much bigger than the 4 MB we used to have. | |||||
2018-05-19 | GLRenderer: Log the shader source code when program linking fails. | Subv | 1 | -0/+27 | |
2018-05-02 | general: Make formatting of logged hex values more straightforward | Lioncash | 1 | -1/+1 | |
This makes the formatting expectations more obvious (e.g. any zero padding specified is padding that's entirely dedicated to the value being printed, not any pretty-printing that also gets tacked on). | |||||
2018-04-29 | maxwell_3d: Reset vertex counts after drawing. | bunnei | 1 | -0/+10 | |
2018-04-29 | gl_shader_decompiler: Implement MOV_R. | bunnei | 1 | -1/+2 | |
2018-04-29 | maxwell_to_gl: Implement type SignedNorm, Size_8_8_8_8. | bunnei | 1 | -0/+12 | |
2018-04-29 | shader_bytecode: Add decoding for FMNMX instruction. | bunnei | 1 | -0/+2 | |
2018-04-29 | Shaders: Implemented predicate condition 3 (LessEqual) in the fset and fsetp instructions. | Subv | 1 | -0/+7 | |
2018-04-29 | gl_shader_decompiler: Implement MOV_C. | bunnei | 1 | -0/+5 | |
2018-04-29 | fermi_2d: Fix surface copy block height. | bunnei | 2 | -2/+7 | |
2018-04-29 | gl_shader_decompiler: Partially implement I2I_R, and I2F_R. | bunnei | 2 | -8/+34 | |
2018-04-29 | gl_shader_decompiler: More cleanups, etc. with how we handle register types. | bunnei | 1 | -44/+120 | |
2018-04-29 | GLSLRegister: Simplify register declarations, etc. | bunnei | 1 | -63/+31 | |
2018-04-29 | shader_bytecode: Add decodings for i2i instructions. | bunnei | 1 | -3/+20 | |
2018-04-29 | gl_shader_decompiler: Implement MOV32_IMM instruction. | bunnei | 2 | -2/+7 | |
2018-04-27 | renderer_opengl: Replace usages of LOG_GENERIC with fmt-capable equivalents | Lioncash | 1 | -6/+7 | |
2018-04-27 | gl_shader_decompiler: Add GLSLRegisterManager class to track register state. | bunnei | 1 | -154/+262 | |
2018-04-27 | general: Convert assertion macros over to be fmt-compatible | Lioncash | 4 | -7/+7 | |
2018-04-26 | gl_shader_decompiler: Boilerplate for handling integer instructions. | bunnei | 2 | -6/+111 | |
2018-04-26 | gl_shader_decompiler: Move color output to EXIT instruction. | bunnei | 1 | -6/+12 | |
2018-04-25 | GPU: Partially implemented the Fermi2D surface copy operation. | Subv | 2 | -0/+59 | |
The hardware allows some rather complicated operations to be performed on the data during the copy; these are not implemented. Only same-format same-size raw copies are implemented for now. | |||||
2018-04-25 | Shaders: Added bit decodings for the I2I instruction. | Subv | 1 | -0/+6 | |
2018-04-25 | Shaders: Implemented the FSET instruction. | Subv | 1 | -0/+53 | |
This instruction is similar to the FSETP instruction, but instead of setting a predicate it sets the destination register to 1.0 if the condition holds, and 0 otherwise. | |||||
2018-04-25 | GPU: Make the Textures::CopySwizzledData function accessible from the outside of the file. | Subv | 2 | -3/+6 | |
2018-04-25 | GPU: Added a function to retrieve the bytes per pixel of the render target formats. | Subv | 2 | -0/+15 | |
2018-04-25 | GPU: Added surface copy registers to Fermi2D | Subv | 1 | -1/+57 | |
2018-04-25 | GPU: Added boilerplate code for the Fermi2D engine | Subv | 3 | -3/+34 | |
2018-04-25 | GPU: Reduce the number of registers of Maxwell3D to 0xE00. | Subv | 2 | -5/+5 | |
The rest are just macro shim registers. | |||||
2018-04-25 | GPU: Move the Maxwell3D macro uploading code to the inside of the Maxwell3D processor. | Subv | 4 | -40/+23 | |
It doesn't belong in the PFIFO handler. | |||||
2018-04-25 | GPU: Corrected the upper bound of the PFIFO method ids in the command processor. | Subv | 1 | -1/+1 | |
2018-04-25 | video-core: Move logging macros over to new fmt-capable ones | Lioncash | 5 | -18/+20 | |
2018-04-25 | Shaders: Added decodings for the FSET instructions. | Subv | 2 | -9/+30 | |
2018-04-25 | renderer_opengl: Use correct byte order for framebuffer pixel format ABGR8. | bunnei | 1 | -2/+1 | |
2018-04-25 | gl_rasterizer_cache: Use CHAR_BIT for bpp conversions instead of 8. | bunnei | 2 | -4/+4 | |
2018-04-25 | gl_rasterizer_cache: Use GPU PAGE_BITS/SIZE, not CPU. | bunnei | 1 | -5/+5 | |
2018-04-25 | gl_rasterizer_cache: Use new logger. | bunnei | 1 | -4/+4 | |
2018-04-25 | gl_rasterizer_cache: Add a function for finding framebuffer GPU address. | bunnei | 3 | -0/+31 | |
2018-04-25 | gl_rasterizer_cache: Handle compressed texture sizes. | bunnei | 2 | -24/+65 | |
2018-04-25 | gl_rasterizer_cache: Update to be based on GPU addresses, not CPU addresses. | bunnei | 8 | -50/+72 | |
2018-04-24 | memory_manager: Implement CpuToGpuAddress. | bunnei | 2 | -0/+27 | |
2018-04-24 | memory_manager: Make GpuToCpuAddress return an optional. | bunnei | 6 | -24/+33 | |
2018-04-24 | memory_manager: Use GPUVAddr, not PAddr, for GPU addresses. | bunnei | 6 | -58/+55 | |
2018-04-24 | renderer_opengl: Silence a -Wdangling-else warning in DrawScreenTriangles() | Lioncash | 1 | -1/+2 | |
2018-04-24 | GPU: Added asserts to our code for handling the QUERY_GET GPU command. | Subv | 2 | -2/+53 | |
This is based on research from nouveau. Many things are currently unknown and will require hwtests in the future. This commit also stubs QueryMode::Write2 to do the same as Write. Nouveau code treats them interchangeably; it is currently unknown what the difference is. | |||||
2018-04-23 | GPU: Support multiple enabled vertex arrays. | Subv | 3 | -43/+89 | |
The vertex arrays will be copied to the stream buffer one after the other, and the attributes will be set using the ARB_vertex_attrib_binding extension. yuzu now thus requires OpenGL 4.3 or the ARB_vertex_attrib_binding extension. | |||||
2018-04-23 | GPU: Make the GPU virtual memory manager use 16 page bits and 10 page table bits. | Subv | 2 | -34/+25 | |
Also removed some dead code and added memory map consistency asserts. | |||||
2018-04-23 | GPU: Implement the RGB10_A2 RenderTarget format, it will use the same format as the A2BGR10 texture format. | Subv | 1 | -0/+2 | |
2018-04-22 | GPU: Implement the A2BGR10 texture format. | Subv | 4 | -6/+18 | |
2018-04-21 | gl_shader_decompiler: Skip RRO instruction. | bunnei | 1 | -0/+4 | |
2018-04-21 | gl_shader_decompiler: Cleanup error logging. | bunnei | 1 | -14/+6 | |
2018-04-21 | shader_bytecode: Add several more instruction decodings. | bunnei | 1 | -5/+52 | |
2018-04-21 | shader_bytecode: Decode instructions based on bit strings. | bunnei | 2 | -205/+201 | |
2018-04-21 | ShaderGen: Implemented the KIL instruction, which is equivalent to 'discard'. | Subv | 1 | -1/+7 | |
2018-04-21 | ShaderGen: Implemented predicated instruction execution. | Subv | 2 | -1/+40 | |
Each predicated instruction will be wrapped in an `if (predicate) { instruction_body; }` in the GLSL, where `predicate` is one of the predicate boolean variables previously set by fsetp. | |||||
2018-04-21 | ShaderGen: Implemented the fsetp instruction. | Subv | 2 | -3/+112 | |
Predicate variables are now added to the generated shader code in the form of 'pX' where X is the predicate id. These predicate variables are initialized to false on shader startup and are set via the fsetp instructions. TODO: * Not all the comparison types are implemented. * Only the single-predicate version is implemented. | |||||
2018-04-21 | opengl: Remove unnecessary header inclusions | Lioncash | 4 | -11/+0 | |
2018-04-21 | gl_resource_manager: Add missing noexcept specifiers to move constructors and assignment operators | Lioncash | 1 | -20/+19 | |
Standard library containers may use std::move_if_noexcept to perform move operations. If a move cannot be performed under these circumstances, then a copy is attempted. Given we only intend for these types to be move-only this can be somewhat problematic. By defining these to be noexcept we prevent cases where copies may be attempted. | |||||
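The effect of the noexcept specifiers can be checked at compile time. The class below is a hypothetical move-only handle wrapper written for illustration; it is not the project's actual resource type and does not call any OpenGL function.

```cpp
#include <type_traits>
#include <utility>

// Hypothetical move-only handle wrapper, loosely modelled on an owner of a GL object id.
class ResourceHandle {
public:
    ResourceHandle() = default;
    explicit ResourceHandle(unsigned int id) : handle{id} {}

    // Copying is disabled: the type is meant to be move-only.
    ResourceHandle(const ResourceHandle&) = delete;
    ResourceHandle& operator=(const ResourceHandle&) = delete;

    // noexcept lets std::move_if_noexcept pick the move instead of falling back to a copy.
    ResourceHandle(ResourceHandle&& other) noexcept : handle{std::exchange(other.handle, 0u)} {}
    ResourceHandle& operator=(ResourceHandle&& other) noexcept {
        // A real wrapper would also release the previously held object here.
        handle = std::exchange(other.handle, 0u);
        return *this;
    }

    unsigned int Get() const {
        return handle;
    }

private:
    unsigned int handle = 0;
};

static_assert(std::is_nothrow_move_constructible_v<ResourceHandle>);
static_assert(!std::is_copy_constructible_v<ResourceHandle>);
```

Because the type cannot be copied and its move constructor cannot throw, a container such as `std::vector<ResourceHandle>` can move elements during reallocation while keeping its usual exception guarantees.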
2018-04-21 | gl_rasterizer_cache: Make MatchFlags an enum class | Lioncash | 1 | -4/+9 | |
Prevents implicit conversions and scope pollution. | |||||
2018-04-20 | ShaderGen: Register id 255 is special and is hardcoded to return 0 (SR_ZERO). | Subv | 2 | -0/+5 | |
2018-04-20 | ShaderGen: Ignore the 'sched' instruction when generating shaders. | Subv | 1 | -0/+16 | |
The 'sched' instruction has a very convoluted encoding, but fortunately it seems to only appear on a fixed interval (once every 4 instructions). | |||||
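A fixed interval makes the skip test a simple modulo check. The sketch below assumes the scheduling word occupies the first slot of every group of four instructions, counted from the start of the program; the commit only states the interval, so the slot position and all names here are assumptions.

```cpp
#include <cstddef>

// Assumption: one scheduling word leads every group of four instruction slots.
constexpr std::size_t SchedPeriod = 4;

// offset and main_offset are instruction indices; offset must not be below main_offset.
constexpr bool IsSchedInstruction(std::size_t offset, std::size_t main_offset) {
    return (offset - main_offset) % SchedPeriod == 0;
}
```

A decoder loop would call this for each instruction index and skip the word when it returns true.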
2018-04-20 | math_util: Remove the Clamp() function | Lioncash | 2 | -16/+17 | |
C++17 adds clamp() to the standard library, so we can remove ours in favor of it. | |||||
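For reference, a short sketch of the standard replacement. `ClampToRange` is a hypothetical helper name used only for this example; `std::clamp` itself is declared in `<algorithm>`.

```cpp
#include <algorithm>

// Hypothetical helper: keep a value inside [0, limit] using the C++17 standard function.
constexpr int ClampToRange(int value, int limit) {
    return std::clamp(value, 0, limit);
}
```

`std::clamp` takes the value first, then the lower and upper bounds, so call sites written for a project-specific helper may need their argument order checked.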
2018-04-20 | common_funcs: Remove ARRAY_SIZE macro | Lioncash | 1 | -2/+2 | |
C++17 has non-member size() which we can just call where necessary. | |||||
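A short sketch of the standard replacement for an array-size macro. The table and function names are hypothetical; `std::size` is declared in `<iterator>`.

```cpp
#include <cstddef>
#include <iterator>

// Hypothetical table; only the element count matters here.
constexpr int SampleTable[] = {1, 2, 3, 4, 5};

constexpr std::size_t SampleTableCount() {
    // Unlike a sizeof-division macro, std::size fails to compile when given a pointer.
    return std::size(SampleTable);
}
```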
2018-04-20 | renderer_opengl: Add missing header guards | Lioncash | 2 | -0/+4 | |
2018-04-20 | glsl_shader_decompiler: Use std::string_view instead of std::string for AddLine() | Lioncash | 1 | -1/+2 | |
This function doesn't need to take ownership of the string data being given to it, considering all we do is append the characters to the internal string instance. Instead, use a string view to simply reference the string data without any potential heap allocation. Now anything that is a raw const char* won't need to be converted to a std::string before appending. | |||||
2018-04-20 | glsl_shader_decompiler: Add AddNewLine() function to ShaderWriter | Lioncash | 1 | -6/+12 | |
Avoids constructing a std::string just to append a newline character | |||||
2018-04-20 | glsl_shader_decompiler: Add char overload for ShaderWriter's AddLine() | Lioncash | 1 | -4/+11 | |
Avoids constructing a std::string just to append a character. | |||||
2018-04-20 | glsl_shader_decompiler: Append indentation without constructing a separate std::string | Lioncash | 1 | -1/+5 | |
The interface of std::string already lets us append N copies of a character to an existing string. | |||||
2018-04-19 | ShaderGen: Implemented the fmul32i shader instruction. | Subv | 2 | -9/+30 | |
2018-04-19 | ShaderGen: Fixed a case where the TEXS instruction would use the same registers for the input and the output. | Subv | 1 | -2/+9 | |
It will now save the coords before writing the outputs in a subscope. | |||||
2018-04-19 | GPU: Add support for the DXT23 and DXT45 compressed texture formats. | Subv | 3 | -28/+35 | |
2018-04-19 | GPU: Implemented the B5G6R5 format. | Subv | 4 | -8/+28 | |
2018-04-18 | gl_shader_gen: Support vertical/horizontal viewport flipping. (#347) | bunnei | 4 | -5/+29 | |
* gl_shader_gen: Support vertical/horizontal viewport flipping. * fixup! gl_shader_gen: Support vertical/horizontal viewport flipping. | |||||
2018-04-18 | GLCache: Added boilerplate code to support configurable texture component types. | Subv | 3 | -9/+69 | |
For now only the UNORM type is supported. | |||||
2018-04-18 | GLCache: Unify texture and framebuffer formats when converting to OpenGL. | Subv | 2 | -26/+13 | |
2018-04-18 | GPU: Texture format 8 and framebuffer format 0xD5 are actually ABGR8. | Subv | 2 | -10/+10 | |
2018-04-18 | GPU: Pitch textures are now supported, don't assert when encountering them. | Subv | 1 | -2/+3 | |
2018-04-18 | GLCache: Take into account the texture's block height when caching and unswizzling. | Subv | 3 | -43/+43 | |
2018-04-18 | GLCache: Added a function to convert cached PixelFormats back to texture formats. | Subv | 1 | -0/+12 | |
TODO: The way we handle cached formats must change. Framebuffer and texture formats are too different to keep them in the same place. | |||||
2018-04-18 | GPU: Allow using a configurable block height when unswizzling textures. | Subv | 4 | -7/+23 | |
2018-04-18 | GPU/TIC: Added the pitch and block height fields to the TIC structure. | Subv | 1 | -1/+16 | |
2018-04-18 | gl_rasterizer_cache: Add missing LOG statements. | bunnei | 1 | -0/+3 | |
2018-04-18 | texture: Add missing formats. | bunnei | 1 | -1/+3 | |
2018-04-18 | gpu: Add several framebuffer formats to RenderTargetFormat. | bunnei | 1 | -0/+3 | |
2018-04-18 | maxwell3d: Allow Texture2DNoMipmap as Texture2D. | bunnei | 1 | -1/+2 | |
2018-04-18 | shader_bytecode: Make ctor's constexpr and explicit. | bunnei | 1 | -7/+7 | |
2018-04-18 | renderer_opengl: Implement BlendEquation and BlendFunc. | bunnei | 6 | -7/+140 | |
2018-04-17 | gl_shader_decompiler: Fix warnings with MarkAsUsed. | bunnei | 1 | -1/+2 | |
2018-04-17 | gl_shader_decompiler: Cleanup logging, updating to NGLOG_*. | bunnei | 1 | -24/+22 | |
2018-04-17 | gl_shader_decompiler: Implement several MUFU subops and abs_d. | bunnei | 1 | -7/+21 | |
2018-04-17 | gl_shader_decompiler: Fix swizzle in GetRegister. | bunnei | 1 | -1/+1 | |
2018-04-17 | gl_shader_decompiler: Implement FMUL/FADD/FFMA immediate instructions. | bunnei | 2 | -12/+53 | |
2018-04-17 | gl_shader_decompiler: Allow vertex position to be used in fragment shader. | bunnei | 2 | -16/+18 | |
2018-04-17 | gl_shader_decompiler: Implement IPA instruction. | bunnei | 1 | -0/+11 | |
2018-04-17 | gl_shader_decompiler: Add support for TEXS instruction. | bunnei | 2 | -12/+43 | |
2018-04-17 | gl_shader_decompiler: Use fragment output color for GPR 0-3. | bunnei | 1 | -0/+5 | |
2018-04-17 | gl_shader_decompiler: Partially implement MUFU. | bunnei | 1 | -2/+11 | |
2018-04-17 | MaxwellToGL: Implemented tex wrap mode 1 (Wrap, GL_REPEAT). | Subv | 1 | -0/+2 | |
2018-04-17 | MaxwellToGL: Added a TODO and partial implementation of maxwell wrap mode 4 (Clamp, GL_CLAMP). | Subv | 1 | -0/+5 | |
This clamp mode was removed from OpenGL as of 3.1. We can emulate it by using GL_CLAMP_TO_BORDER to get the border color of the texture, and then manually sampling the edge to mix them in the fragment shader. | |||||
2018-04-17 | gl_rendering: Use NGLOG* for changed code. | bunnei | 2 | -10/+11 | |
2018-04-17 | gl_rasterizer: Implement indexed vertex mode. | bunnei | 5 | -23/+92 | |
2018-04-15 | GPU: Use the same buffer names in the generated GLSL and the buffer uploading code. | Subv | 4 | -17/+24 | |
2018-04-15 | GPU: Don't use explicit binding points when uploading the constbuffers to opengl. | Subv | 3 | -7/+47 | |
The bindpoints will now be dynamically calculated based on the number of buffers used by the previous shader stage. | |||||
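One way to read this is as a running sum over the stages: each stage starts binding where the previous one stopped. The sketch below is an interpretation under that assumption; the stage count and all names are hypothetical.

```cpp
#include <array>
#include <cstddef>

constexpr std::size_t NumStages = 5;  // hypothetical number of shader stages

// buffers_used[i] is the number of const buffers that stage i actually reads.
// Returns the first bind point assigned to each stage.
constexpr std::array<std::size_t, NumStages> ComputeBaseBindings(
    const std::array<std::size_t, NumStages>& buffers_used) {
    std::array<std::size_t, NumStages> base{};
    std::size_t next = 0;
    for (std::size_t stage = 0; stage < NumStages; ++stage) {
        base[stage] = next;           // this stage starts where the previous one ended
        next += buffers_used[stage];  // reserve one bind point per used buffer
    }
    return base;
}
```

Stages that use no buffers consume no bind points, so the total stays as small as the shaders allow.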
2018-04-15 | GPU: Don't use GetPointer when uploading the constbuffer data to the GPU. | Subv | 1 | -3/+4 | |
2018-04-15 | GPU: Use the buffer hints from the shader decompiler to upload only the necessary const buffers for each shader stage. | Subv | 3 | -31/+41 | |
2018-04-15 | shaders: Expose hints about used const buffers. | bunnei | 5 | -31/+146 | |
2018-04-15 | GPU: Upload the entirety of each constbuffer for each shader stage as SSBOs. | Subv | 4 | -14/+48 | |
We're going to need the shader generator to give us a mapping of the actual used const buffers to properly bind them to the shader. | |||||
2018-04-15 | GPU: Allow configuring ssbos in the opengl state manager. | Subv | 4 | -0/+30 | |
2018-04-15 | GPU: Added a function to determine whether a shader stage is enabled or not. | Subv | 3 | -3/+27 | |
2018-04-15 | shaders: Add NumTextureSamplers const, remove unused #pragma. | bunnei | 4 | -4/+5 | |
2018-04-14 | shaders: Address PR review feedback. | bunnei | 2 | -7/+9 | |
2018-04-14 | gl_shader_decompiler: Cleanup log statements. | bunnei | 1 | -15/+15 | |
2018-04-14 | shaders: Fix GCC and clang build issues. | bunnei | 3 | -5/+5 | |
2018-04-14 | gl_shader_decompiler: Implement negate, abs, etc. and lots of cleanup. | bunnei | 2 | -40/+96 | |
2018-04-14 | shader_bytecode: Add FSETP and KIL to GetInfo. | bunnei | 1 | -0/+3 | |
2018-04-14 | shader_bytecode: Add SubOp decoding. | bunnei | 1 | -0/+10 | |
2018-04-14 | gl_shader_decompiler: Add shader stage hint. | bunnei | 2 | -5/+12 | |
2018-04-14 | renderer_opengl: Fix Morton copy byteswap, etc. | bunnei | 2 | -6/+6 | |
2018-04-14 | gl_shader_manager: Implement SetShaderSamplerBindings. | bunnei | 1 | -0/+8 | |
2018-04-14 | gl_rasterizer: Generate shaders and upload uniforms. | bunnei | 2 | -32/+77 | |
2018-04-14 | gl_shader_decompiler: Basic impl. for very simple vertex shaders. | bunnei | 2 | -16/+311 | |
- Tested with Puyo Puyo Tetris and Cave Story+ | |||||
2018-04-14 | gl_shader_manager: Cleanup and consolidate uniform handling. | bunnei | 2 | -26/+24 | |
2018-04-14 | maxwell_3d: Make memory_manager public. | bunnei | 1 | -2/+1 | |
2018-04-14 | maxwell_3d: Fix shader_config decodings. | bunnei | 1 | -6/+3 | |
2018-04-14 | gl_rasterizer: Use shader program manager, remove test shader. | bunnei | 2 | -196/+31 | |
2018-04-14 | renderer_opengl: Add gl_shader_manager class. | bunnei | 3 | -0/+209 | |
2018-04-14 | maxwell_to_gl: Add a few types, etc. | bunnei | 1 | -0/+10 | |
2018-04-14 | gl_shader_gen: Add hashable setup/config structs. | bunnei | 2 | -29/+50 | |
2018-04-14 | gl_shader_util: Add missing includes. | bunnei | 1 | -0/+2 | |
2018-04-14 | renderer_opengl: Use OGLProgram instead of OGLShader. | bunnei | 6 | -6/+6 | |
2018-04-14 | gl_shader_util: Grab latest upstream. | bunnei | 2 | -149/+74 | |
2018-04-14 | gl_resource_manager: Grab latest upstream. | bunnei | 1 | -30/+86 | |
2018-04-14 | gl_shader_decompiler: Add skeleton code from Citra for shader analysis. | bunnei | 2 | -44/+142 | |
2018-04-14 | shader_bytecode: Add initial module for shader decoding. | bunnei | 2 | -0/+298 | |
2018-04-07 | Fix clang format issues | James Rowe | 1 | -1/+1 | |
2018-04-07 | GPU: Assert when finding a texture with a format type other than UNORM. | Subv | 2 | -4/+16 | |
2018-04-07 | GL: Set up the textures used for each draw call. | Subv | 2 | -2/+39 | |
Each Maxwell shader stage can have an arbitrary number of textures, but we're limited to a certain number in OpenGL. We try to use the minimum number of host textures by not keeping a 1:1 relation between guest texture ids and host texture ids; i.e., guest texture id 8 can be host texture id 0 if it's the only texture used in the guest shader program. This mapping will have to be passed to the shader decompiler so it can rewrite the texture accesses. | |||||
2018-04-07 | GL: Bind the textures to the shaders used for drawing. | Subv | 1 | -2/+11 | |
2018-04-07 | GLCache: Specialize the MortonCopy function for the DXT1 texture format. | Subv | 1 | -1/+15 | |
It will now use the UnswizzleTexture function instead of the MortonCopyPixels128, which doesn't seem to work for textures. | |||||
2018-04-07 | GLCache: Implemented GetTextureSurface. | Subv | 1 | -3/+28 | |
2018-04-07 | GLCache: Support uploading compressed textures to the GPU. | Subv | 1 | -5/+17 | |
Compressed texture formats like DXT1, DXT2, DXT3, etc will use this to ease the load on the CPU. | |||||
2018-04-07 | GL: Remove remaining references to 3DS-specific pixel formats | Subv | 1 | -83/+22 | |
2018-04-07 | RasterizerCache: Remove 3DS-specific pixel formats. | Subv | 2 | -71/+32 | |
We're only left with RGB8 and DXT1 for now. More will be added as they are needed. | |||||
2018-04-07 | GL: Create the sampler objects when starting up the GL rasterizer. | Subv | 1 | -0/+6 | |
2018-04-07 | GL: Ported the SamplerInfo struct from citra. | Subv | 2 | -1/+59 | |
2018-04-07 | GL: Rename PicaTexture to MaxwellTexture. | Subv | 2 | -2/+2 | |
2018-04-07 | GL: Added functions to convert Maxwell tex filters and wrap modes to OpenGL. | Subv | 1 | -0/+23 | |
2018-04-07 | Textures: Added a helper function to know if a texture is blocklinear or pitch. | Subv | 1 | -0/+5 | |
2018-04-04 | rasterizer_interface.h: Update from citra to yuzu | N00byKing | 1 | -3/+3 | |
2018-04-04 | gl_rasterizer_cache.cpp: Update from citra to yuzu | N00byKing | 1 | -1/+1 | |
2018-04-04 | gl_rasterizer_cache.h: Update from citra to yuzu | N00byKing | 1 | -3/+3 | |
2018-04-04 | renderer_opengl.h: Update from citra to yuzu | N00byKing | 1 | -2/+2 | |
2018-04-01 | GPU: Use the MacroInterpreter class to execute the GPU macros instead of HLEing them. | Subv | 2 | -121/+13 | |
2018-04-01 | GPU: Implemented a gpu macro interpreter. | Subv | 5 | -0/+431 | |
The Ryujinx macro interpreter and envydis were used as reference. Macros are programs that are uploaded by the games during boot and can later be called by writing to their method id in a GPU command buffer. | |||||
2018-03-27 | renderer_opengl: Use better naming for DrawScreens and DrawSingleScreen. | bunnei | 2 | -8/+8 | |
2018-03-27 | gl_rasterizer: Move code to bind framebuffer surfaces before draw to its own function. | bunnei | 2 | -22/+31 | |
2018-03-27 | gl_rasterizer: Add a SyncViewport method. | bunnei | 3 | -18/+30 | |
2018-03-27 | gl_rasterizer: Move PrimitiveTopology check to MaxwellToGL. | bunnei | 2 | -11/+12 | |
2018-03-27 | graphics_surface: Fix merge conflicts. | bunnei | 1 | -0/+1 | |
2018-03-27 | gl_rasterizer: Use ReadBlock instead of GetPointer for SetupVertexArray. | bunnei | 1 | -1/+1 | |
2018-03-27 | gl_rasterizer: Normalize vertex array data as appropriate. | bunnei | 2 | -1/+5 | |
2018-03-27 | maxwell_to_gl: Fix string formatting in log statements. | bunnei | 1 | -2/+2 | |
2018-03-27 | rasterizer: Rename DrawTriangles to DrawArrays. | bunnei | 3 | -5/+5 | |
2018-03-27 | gl_rasterizer: Use passthrough shader for SetupVertexShader. | bunnei | 1 | -1/+2 | |
2018-03-27 | renderer_opengl: Logging, etc. cleanup. | bunnei | 6 | -33/+34 | |
2018-03-27 | renderer_opengl: Remove framebuffer RasterizerFlushVirtualRegion hack. | bunnei | 1 | -5/+0 | |
2018-03-27 | gl_rasterizer_cache: Implement UpdatePagesCachedCount. | bunnei | 2 | -8/+37 | |
2018-03-27 | gl_rasterizer: Implement SetupVertexArray. | bunnei | 1 | -20/+38 | |
2018-03-27 | gl_rasterizer_cache: Fix an ASSERT_MSG. | bunnei | 1 | -1/+1 | |
2018-03-27 | maxwell_to_gl: Add module and function for decoding VertexType. | bunnei | 2 | -0/+41 | |
2018-03-27 | maxwell_3d: Use names that match envytools for VertexType. | bunnei | 1 | -8/+8 | |
2018-03-27 | maxwell_3d: Add VertexAttribute struct and cleanup. | bunnei | 1 | -121/+160 | |
2018-03-27 | gl_rasterizer: Use 32 texture units instead of 3. | bunnei | 3 | -2/+3 | |
2018-03-27 | gl_rasterizer: Implement DrawTriangles. | bunnei | 1 | -1/+194 | |
2018-03-27 | Maxwell3D: Call AccelerateDrawBatch on DrawArrays. | bunnei | 1 | -1/+8 | |
2018-03-27 | gl_rasterizer: Implement AnalyzeVertexArray. | bunnei | 2 | -1/+56 | |
2018-03-27 | gl_rasterizer_cache: MortonCopy Switch-style. | bunnei | 1 | -72/+32 | |
2018-03-27 | gl_rasterizer_cache: Implement GetFramebufferSurfaces. | bunnei | 2 | -4/+104 | |
2018-03-27 | maxwell: Add RenderTargetFormat enum. | bunnei | 2 | -4/+5 | |
2018-03-27 | renderer_opengl: Only draw the screen if a framebuffer is specified. | bunnei | 1 | -6/+7 | |
2018-03-26 | GPU: Load the sampler info (TSC) when retrieving active textures. | Subv | 2 | -21/+67 | |
2018-03-26 | GPU: Added the TSC structure. It contains information about the sampler. | Subv | 1 | -0/+50 | |
2018-03-26 | GPU: Added more fields to the TIC structure. | Subv | 1 | -4/+30 | |
2018-03-25 | GPU: Make the debug_context variable a member of the frontend instead of a global. | Subv | 3 | -15/+13 | |
2018-03-24 | GPU: Added a function to retrieve the active textures for a shader stage. | Subv | 2 | -50/+59 | |
TODO: A shader may not use all of these textures at the same time, shader analysis should be performed to determine which textures are actually sampled. | |||||
2018-03-24 | Frontend: Updated the surface view debug widget to work with Maxwell surfaces. | Subv | 2 | -0/+15 | |
2018-03-24 | GPU: Implement the Incoming/FinishedPrimitiveBatch debug breakpoints. | Subv | 1 | -0/+7 | |
2018-03-24 | GPU: Implement the MaxwellCommandLoaded/Processed debug breakpoints. | Subv | 1 | -0/+10 | |
2018-03-24 | Frontend: Ported the GPU breakpoints and surface viewer widgets from citra. | Subv | 5 | -0/+242 | |
2018-03-24 | GPU: Added a method to unswizzle a texture without decoding it. | Subv | 4 | -5/+95 | |
Allow unswizzling of DXT1 textures. | |||||
2018-03-24 | GPU: Preliminary work for texture decoding. | Subv | 5 | -0/+139 | |
2018-03-24 | GPU: Added viewport registers to Maxwell3D's reg structure. | Subv | 1 | -1/+18 | |
2018-03-24 | gl_rasterizer: Fake render in green, because it's cooler. | bunnei | 1 | -1/+1 | |
2018-03-24 | gl_rasterizer: Log warning instead of sync'ing unimplemented funcs. | bunnei | 1 | -7/+1 | |
2018-03-23 | gl_rasterizer_cache: Add missing include for vm_manager. | bunnei | 1 | -0/+1 | |
2018-03-23 | renderer_opengl: Only invalidate the framebuffer region, not flush. | bunnei | 1 | -4/+3 | |
2018-03-23 | renderer_opengl: Fixes for properly flushing & rendering the framebuffer. | bunnei | 1 | -6/+12 | |
2018-03-23 | RasterizerCacheOpenGL: FlushAll should flush full memory region. | bunnei | 1 | -1/+1 | |
2018-03-23 | rasterizer: Flush and invalidate regions should be 64-bit. | bunnei | 3 | -9/+9 | |
2018-03-23 | renderer_opengl: Add framebuffer_transform_flags member variable. | bunnei | 1 | -2/+2 | |
2018-03-23 | renderer_opengl: Better handling of framebuffer transform flags. | bunnei | 2 | -3/+20 | |
2018-03-23 | renderer_opengl: Use accelerated framebuffer load with LoadFBToScreenInfo. | bunnei | 1 | -31/+25 | |
2018-03-23 | gl_rasterizer: Implement AccelerateDisplay method from Citra. | bunnei | 2 | -2/+44 | |
2018-03-23 | LoadGLBuffer: Use bytes_per_pixel, not bits. | bunnei | 1 | -1/+2 | |
2018-03-23 | gl_rasterizer_cache: LoadGLBuffer should do a morton copy. | bunnei | 1 | -16/+5 | |
2018-03-23 | video_core: Move MortonCopyPixels128 to utils header. | bunnei | 2 | -111/+113 | |
2018-03-23 | video_core: Remove usage of PAddr and replace with VAddr. | bunnei | 5 | -39/+39 | |
2018-03-23 | video_core: Move FramebufferInfo to FramebufferConfig in GPU. | bunnei | 7 | -66/+74 | |
2018-03-23 | gl_rasterizer: Replace a bunch of UNIMPLEMENTED with ASSERT. | bunnei | 2 | -20/+20 | |
2018-03-23 | gl_rasterizer: Add a simple passthrough shader in lieu of shader generation. | bunnei | 2 | -5/+68 | |
2018-03-23 | gpu: Expose Maxwell3D engine. | bunnei | 1 | -0/+4 | |
2018-03-23 | maxwell_3d: Add some format decodings and string helper functions. | bunnei | 1 | -3/+107 | |
2018-03-23 | renderer: Create rasterizer and cleanup. | bunnei | 4 | -4/+16 | |
2018-03-21 | GPU: Added vertex attribute format registers. | Subv | 1 | -1/+14 | |
2018-03-21 | GPU: Added registers for the number of vertices to render. | Subv | 1 | -2/+13 | |
2018-03-20 | renderer_gl: Port boilerplate rasterizer code over from Citra. | bunnei | 5 | -1/+495 | |
2018-03-20 | gl_shader_util: Sync latest version with Citra. | bunnei | 3 | -46/+116 | |
2018-03-20 | renderer_gl: Port over gl_shader_gen module from Citra. | bunnei | 3 | -0/+88 | |
2018-03-20 | renderer_gl: Port over gl_shader_decompiler module from Citra. | bunnei | 3 | -0/+87 | |
2018-03-20 | renderer_gl: Port over gl_rasterizer_cache module from Citra. | bunnei | 3 | -0/+1714 | |
2018-03-20 | gl_resource_manager: Sync latest version with Citra. | bunnei | 1 | -8/+77 | |
2018-03-20 | renderer_gl: Port over gl_stream_buffer module from Citra. | bunnei | 3 | -0/+218 | |
2018-03-20 | gl_state: Sync latest version with Citra. | bunnei | 2 | -47/+111 | |
2018-03-19 | GPU: Added Z buffer registers to Maxwell3D's reg structure. | Subv | 1 | -1/+17 | |
2018-03-19 | GPU: Added the render target (RT) registers to Maxwell3D's reg structure. | Subv | 1 | -1/+32 | |
2018-03-19 | Clang Fixes | N00byKing | 1 | -1/+2 | |
2018-03-19 | Clean Warnings (?) | N00byKing | 1 | -1/+1 | |
2018-03-19 | GPU: Added the TSC registers to the Maxwell3D register structure. | Subv | 1 | -1/+15 | |
2018-03-19 | GPU: Added the TIC registers to the Maxwell3D register structure. | Subv | 1 | -1/+16 | |
2018-03-19 | GPU: Implement macro 0xE1A BindTextureInfoBuffer in HLE. | Subv | 2 | -1/+29 | |
This macro simply sets the current CB_ADDRESS to the texture buffer address for the input shader stage. | |||||
2018-03-18 | GPU: Implement the BindStorageBuffer macro method in HLE. | Subv | 2 | -1/+36 | |
This macro binds the SSBO Info Buffer as the current ConstBuffer. This buffer is usually bound to c0 during shader execution. Games seem to use this macro instead of directly writing the address for some reason. | |||||
2018-03-18 | GPU: Handle writes to the CB_DATA method. | Subv | 2 | -0/+39 | |
Writing to this method will cause the written value to be stored in the currently-set ConstBuffer plus CB_POS. This method is usually used to upload uniforms or other shader-visible data. | |||||
2018-03-18 | GPU: Move the GPU's class constructor and destructors to a cpp file. | Subv | 3 | -10/+30 | |
This should reduce recompile times when editing the Maxwell3D register structure. | |||||
2018-03-18 | GPU: Store uploaded GPU macros and keep track of the number of method parameters. | Subv | 4 | -27/+74 | |
2018-03-18 | GPU: Macros are specific to the Maxwell3D engine, so handle them internally. | Subv | 8 | -63/+55 | |
2018-03-18 | GPU: Renamed ShaderType to ShaderStage as that is less confusing. | Subv | 2 | -19/+19 | |
2018-03-18 | GPU: Store shader constbuffer bindings in the GPU state. | Subv | 2 | -5/+61 | |
2018-03-18 | GPU: Corrected some register offsets and removed superfluous macro registers. | Subv | 1 | -9/+3 | |
2018-03-18 | GPU: Make the SetShader macro call do the same as the real macro's code. | Subv | 2 | -3/+44 | |
It'll now set the CB_SIZE, CB_ADDRESS and CB_BIND registers when it's called. Presumably this SetShader function is binding the constant shader uniforms to buffer 1 (c1[]). | |||||
2018-03-17 | GPU: Corrected the parameter documentation for the SetShader macro call. | Subv | 2 | -11/+12 | |
Register 0xE24 is actually a macro that sets some shader parameters in the register structure. Macros are uploaded to the GPU at startup and have their own ISA, we'll probably write an interpreter for this in the future. | |||||
2018-03-17 | GPU: Handle the SetShader method call (0xE24) and store the shader config. | Subv | 2 | -4/+38 | |
2018-03-17 | GPU: Added the vertex array registers. | Subv | 1 | -2/+33 | |
2018-03-17 | GPU: Process command mode 5 (IncreaseOnce) differently from other commands. | Subv | 9 | -8/+97 | |
Accumulate all arguments before calling the desired method. Note: Maybe we should do the same for the NonIncreasing mode? | |||||
2018-03-17 | GPU: Assert that we get a 0 CODE_ADDRESS register in the 3D engine. | Subv | 1 | -0/+8 | |
Shader address calculation depends on this value to some extent; we do not currently know what it being 0 entails. | |||||
2018-03-17 | GPU: Added Maxwell registers for Shader Program control. | Subv | 1 | -2/+55 | |
2018-03-05 | GPU: Intercept writes to the VERTEX_END_GL register. | Subv | 2 | -1/+18 | |
This is the register that gets written after a game calls DrawArrays(). We should collect all GPU state and draw using our graphics API here. | |||||
2018-02-14 | maxwell_3d: Make constructor explicit | Lioncash | 1 | -1/+1 | |
2018-02-12 | GPU: Partially implemented the QUERY_* registers in the Maxwell3D engine. | Subv | 3 | -3/+95 | |
Only QueryMode::Write is supported at the moment. | |||||
2018-02-12 | Make a GPU class in VideoCore to contain the GPU state. | Subv | 12 | -44/+252 | |
Also moved the GPU MemoryManager class to video_core since it makes more sense for it to be there. | |||||
2018-02-12 | GPU: Added a command processor to decode the GPU pushbuffers and forward the commands to their respective engines. | Subv | 9 | -0/+280 | |
2018-02-12 | renderer_opengl: Support framebuffer flip vertical. | bunnei | 3 | -5/+13 | |
2018-01-27 | memory: Replace all memory hooking with Special regions | MerryMage | 1 | -1/+1 | |
2018-01-21 | Format: Run the new clang format on everything | James Rowe | 4 | -4/+4 | |
2018-01-18 | CMakeLists: Derive the source directory grouping from targets themselves | Lioncash | 1 | -19/+15 | |
Removes the need to store to separate SRC and HEADER variables, and then construct the target in most cases. | |||||
2018-01-16 | clang-format | MerryMage | 1 | -1/+2 | |
2018-01-15 | renderer_gl: Clear screen to black before rendering framebuffer. | bunnei | 2 | -5/+8 | |
2018-01-15 | renderer: Render previous frame when no new one is available. | bunnei | 3 | -16/+18 | |
2018-01-13 | Fix build on macOS and linux | MerryMage | 1 | -0/+1 | |
2018-01-13 | Remove gpu debugger and get yuzu qt to compile | James Rowe | 2 | -5/+0 | |
2018-01-13 | Remove references to PICA and rasterizers in video_core | James Rowe | 64 | -14952/+3 | |
2018-01-12 | renderer_opengl: Fix LOG_TRACE in LoadFBToScreenInfo. | bunnei | 1 | -1/+1 | |
2018-01-11 | renderer_opengl: Support rendering Switch framebuffer. | bunnei | 3 | -138/+83 | |
2018-01-11 | render_base: Add a struct describing framebuffer metadata. | bunnei | 1 | -0/+26 | |
2018-01-11 | renderer_opengl: Add MortonCopyPixels function for Switch framebuffer. | bunnei | 1 | -0/+111 | |
2018-01-11 | renderer_opengl: Update DrawScreens for Switch. | bunnei | 2 | -23/+11 | |
2018-01-01 | core/video_core: Fix a bunch of u64 -> u32 warnings. | bunnei | 4 | -8/+8 | |
2017-10-15 | hle: Initial implementation of NX service framework and IPC. | bunnei | 1 | -1/+1 | |
2017-10-04 | Extracted the attribute setup and draw commands into their own functions | Huw Pascoe | 1 | -217/+222 | |
2017-09-30 | Fixed type conversion ambiguity | Huw Pascoe | 2 | -3/+3 | |
2017-09-27 | Disable unary operator- on Math::Vec2/Vec3/Vec4 for unsigned types. | Subv | 1 | -1/+1 | |
It is unlikely we will ever use this without first doing a Cast to a signed type. Fixes 9 "unary minus operator applied to unsigned type, result still unsigned" warnings on MSVC2017.3 | |||||
2017-09-25 | Optimized Float<M,E> multiplication | Huw Pascoe | 1 | -11/+7 | |
Before: `ucomiss xmm1, xmm1; jp .L9; pxor xmm2, xmm2; mov edx, 1; ucomiss xmm0, xmm2; setp al; cmovne eax, edx; test al, al; jne .L9; .L3: movaps xmm0, xmm2; ret; .L9: ucomiss xmm0, xmm0; jp .L10; pxor xmm2, xmm2; mov edx, 1; ucomiss xmm1, xmm2; setp al; cmovne eax, edx; test al, al; je .L3` After: `movaps xmm2, xmm1; mulss xmm2, xmm0; ucomiss xmm2, xmm2; jnp .L3; ucomiss xmm1, xmm0; jnp .L11; .L3: movaps xmm0, xmm2; ret; .L11: pxor xmm2, xmm2; jmp .L3` | |||||
2017-09-24 | Optimized Morton | Huw Pascoe | 1 | -10/+4 | |
2017-09-23 | Remove pipeline.gpu_mode and fix minor issues | James Rowe | 1 | -12/+2 | |
2017-09-17 | Improved performance of FromAttributeBuffer | Huw Pascoe | 1 | -1/+2 | |
Ternary operator is optimized by the compiler whereas std::min() is meant to return a value. I've noticed a 5%-10% emulation speed increase. | |||||
2017-09-17 | Fixed framebuffer warning | Huw Pascoe | 1 | -7/+18 | |
2017-09-11 | GPU: Add draw for immediate and batch modes | James Rowe | 1 | -2/+17 | |
PR #1461 introduced a regression where some games would change configuration even while in the poorly named "drawing" mode, which broke the heuristic citra was using to determine when to draw the batch. This change adds back in a draw call for batching, and also adds in a draw call in immediate mode each time it adds a triangle. | |||||
2017-09-03 | pica/lighting: only apply Fresnel factor for the last light | wwylele | 2 | -7/+9 | |
2017-08-31 | video_core: report telemetry for gas mode | wwylele | 1 | -0/+6 | |
2017-08-26 | Warnings: Fixed a few missing-return warnings in video_core. | Subv | 3 | -6/+10 | |
2017-08-25 | SwRasterizer/Clipper: flip the sign convention to match PICA and OpenGL | wwylele | 1 | -9/+9 | |
2017-08-25 | gl_rasterizer: implement custom clip plane | wwylele | 3 | -34/+83 | |
2017-08-24 | SwRasterizer: implement custom clip plane | wwylele | 2 | -4/+25 | |
2017-08-22 | gl_rasterizer/lighting: more accurate CP formula | wwylele | 1 | -2/+2 | |
2017-08-22 | SwRasterizer/Lighting: implement LUT input CP | wwylele | 1 | -0/+11 | |
2017-08-22 | SwRasterizer/Lighting: implement bump mapping | wwylele | 3 | -8/+27 | |
2017-08-21 | swrasterizer: remove invalid TODO | wwylele | 1 | -4/+2 | |
This function is called in clipping, before the perspective divide, and is not used in later rasterization. Thus it doesn't need perspective correction. | |||||
2017-08-21 | swrasterizer/clipper: remove tested TODO | wwylele | 1 | -4/+0 | |
hwtested. Current implementation is the correct behavior | |||||
2017-08-21 | gl_shader_gen: simplify and clarify the depth transformation between vertex shader and fragment shader | wwylele | 1 | -2/+5 | |
2017-08-21 | gl_rasterizer: add clipping plane z<=0 defined in PICA | wwylele | 4 | -0/+21 | |
2017-08-19 | pica/command_processor: build geometry pipeline and run geometry shader | wwylele | 6 | -28/+383 | |
The geometry pipeline manages data transfer between VS, GS and primitive assembler. It has four known modes: - no GS mode: sends VS output directly to the primitive assembler (what citra currently does) - GS mode 0: sends VS output to GS input registers, and sends GS output to primitive assembler - GS mode 1: sends VS output to GS uniform registers, and sends GS output to primitive assembler. It also takes an index from the index buffer at the beginning of each primitive to determine the primitive size. - GS mode 2: similar to mode 1, but doesn't take the index and uses a fixed primitive size. hwtest shows that immediate mode also supports GS (at least for mode 0), so the geometry pipeline gets refactored into its own class to support both drawing modes. In the immediate mode, some games don't set the pipeline registers to a valid value until the first attribute input, so a geometry pipeline reset flag is set in the `pipeline.vs_default_attributes_setup.index` trigger, and the actual pipeline reconfiguration is triggered in the first attribute input. In the normal drawing mode with index buffer, the vertex cache is slightly modified to support the geometry pipeline. Instead of OutputVertex, it now holds AttributeBuffer, which is the input to the geometry pipeline. The AttributeBuffer->OutputVertex conversion is done inside the pipeline vertex handler. The actual hardware vertex cache is believed to be implemented in a similar way (because this is the only way that makes sense). Both the geometry pipeline and the GS unit rely on state preservation across draw calls, so they are put into the global state. In the future, the other three vertex shader units should also be placed in the global state, and a scheduler should be implemented on top of the four units. Note that the current gs_unit already allows running VS on it in the future. | |||||
2017-08-19 | pica/shader/jit: implement SETEMIT and EMIT | wwylele | 2 | -2/+49 | |
2017-08-19 | pica/primitive_assembly: Handle winding for GS primitive | wwylele | 2 | -3/+19 | |
hwtest shows that, although GS always emits a group of three vertices as one primitive, it still respects the topology type, as if the three vertices were input into the primitive assembler independently and sequentially. It is also shown that the winding flag in SETEMIT only takes effect for the Shader topology type, which is believed to be the actual difference between List and Shader (hence the TODO was removed). However, only the Shader topology type is observed in official games when GS is in use, so the other modes seem to be just unintended usage. | |||||
2017-08-19 | correct constness | wwylele | 2 | -2/+4 | |
2017-08-19 | pica/shader/interpreter: implement SETEMIT and EMIT | wwylele | 1 | -0/+16 | |
2017-08-19 | pica/shader: extend UnitState for GS | wwylele | 2 | -0/+84 | |
Among the four shader units in PICA, a special unit can be configured to run both VS and GS programs. GSUnitState represents this unit, which extends UnitState (which represents the other three normal units) with extra state for primitive emitting. It uses lots of raw pointers to represent internal structure in order to keep it a standard-layout type for the JIT to access. This unit doesn't handle triangle winding (inverting) itself; instead, it calls a WindingSetter handler. This will be explained in the following commits | |||||
2017-08-11 | gl_shader_gen: don't call SampleTexture when bump map is not used | wwylele | 1 | -4/+5 | |
2017-08-11 | SwRasterizer/Lighting: implement spot light | wwylele | 1 | -3/+19 | |
2017-08-11 | SwRasterizer/Lighting: implement geometric factor | wwylele | 1 | -4/+16 | |
2017-08-10 | SwRasterizer/Lighting: use make_tuple instead of constructor | wwylele | 1 | -1/+1 | |
implicit tuple constructor is a c++17 thing, which is not supported by some not-so-old libraries. Play safe for now | |||||
2017-08-10 | pica/regs: layout geometry shader configuration regs | wwylele | 2 | -2/+39 | |
All the register meanings are derived from ctrulib (3dbrew is outdated for most of them) | |||||
2017-08-07 | pica: upload shared shader code to both unit | wwylele | 2 | -26/+45 | |
2017-08-03 | SwRasterizer/Lighting: shorten file name | wwylele | 4 | -4/+4 | |
2017-08-02 | SwRasterizer/Lighting: move to its own file | wwylele | 4 | -240/+271 | |
2017-08-02 | SwRasterizer/Lighting: reduce confusion | wwylele | 1 | -1/+1 | |
2017-08-02 | SwRasterizer/Lighting: move quaternion normalization to the caller | wwylele | 1 | -3/+3 | |
2017-07-27 | pica/shader_interpreter: fix off-by-one in LOOP | wwylele | 1 | -1/+1 | |
2017-07-18 | telemetry: Log performance, configuration, and system data. | bunnei | 2 | -6/+16 | |
2017-07-11 | SwRasterizer/Lighting: dist atten lut input need to be clamp | wwylele | 1 | -1/+1 | |
2017-07-11 | SwRasterizer/Lighting: unify float suffix | wwylele | 1 | -11/+13 | |
2017-07-11 | SwRasterizer/Lighting: get rid of nested return | wwylele | 1 | -10/+11 | |
2017-07-11 | SwRasterizer/Lighting: refactor GetLutValue into a function. | wwylele | 1 | -83/+27 | |
merging similar pattern. Also makes the code more similar to the gl one | |||||
2017-07-11 | SwRasterizer: only interpolate quat and view when lighting is enabled | wwylele | 1 | -14/+14 | |
2017-07-11 | SwRasterizer/Lighting: pass lighting state as parameter | wwylele | 1 | -13/+13 | |
2017-07-11 | SwRasterizer/Lighting: Move the clamp highlight calculation to the end of the per-light loop body. | Subv | 1 | -17/+17 | |
2017-07-11 | SwRasterizer/Lighting: Move the lighting enable check outside the ComputeFragmentsColors function. | Subv | 1 | -7/+6 | |
2017-07-11 | SwRasterizer/Lighting: Do not use global registers state in ComputeFragmentsColors. | Subv | 1 | -3/+3 | |
2017-07-11 | SwRasterizer/Lighting: Do not use global state in LookupLightingLut. | Subv | 2 | -13/+22 | |
2017-07-11 | SwRasterizer/Lighting: Fixed a bug where the distance attenuation bias was being set to the dist atten scale. | Subv | 1 | -3/+2 | |
2017-07-11 | SwRasterizer: Fixed a few conversion warnings and moved per-light values into the per-light loop. | Subv | 1 | -5/+6 | |
2017-07-11 | SwRasterizer: Run clang-format | Subv | 1 | -45/+83 | |
2017-07-11 | SwRasterizer: Flip the vertex quaternions before clipping (if necessary). | Subv | 2 | -20/+15 | |
2017-07-11 | SwRasterizer: Corrected the light LUT lookups. | Subv | 1 | -6/+7 | |
2017-07-11 | SwRasterizer: Corrected the light LUT lookups. | Subv | 1 | -33/+43 | |
2017-07-11 | SwRasterizer: Fixed the lighting lut lookup function. | Subv | 1 | -2/+4 | |
2017-07-11 | SwRasterizer: Calculate fresnel for fragment lighting. | Subv | 1 | -1/+25 | |
2017-07-11 | SwRasterizer: Calculate specular_1 for fragment lighting. | Subv | 1 | -3/+59 | |
2017-07-11 | SwRasterizer: Calculate specular_0 for fragment lighting. | Subv | 1 | -13/+94 | |
2017-07-11 | SwRasterizer: Implement primary fragment color. | Subv | 1 | -4/+113 | |
2017-07-01 | gl_rasterizer: use texture buffer for proctex LUT | wwylele | 5 | -70/+80 | |
2017-06-22 | gl_rasterizer: use texture buffer for fog LUT | wwylele | 7 | -29/+32 | |
2017-06-22 | gl_rasterizer: create the texture before applying the state | wwylele | 1 | -2/+2 | |
This is a rebasing error from #2792. It doesn't affect much though, because a later Apply() call fixes/hides it | |||||
2017-06-21 | gl_state: reset 1d textures | wwylele | 1 | -0/+14 | |
2017-06-21 | gl_rasterizer: fix glGetUniformLocation type | wwylele | 1 | -8/+8 | |
2017-06-21 | gl_rasterizer: manage texture ids in one place | wwylele | 3 | -31/+55 | |
2017-06-21 | gl_rasterizer/lighting: fix LUT interpolation | wwylele | 7 | -116/+102 | |
2017-06-18 | gl_rasterizer/lighting: use the formula from the paper for geometric factor | wwylele | 1 | -8/+8 | |
2017-06-17 | Stop using reserved operator names (and/or/xor) with Xbyak | Yuri Kunde Schlesner | 1 | -13/+13 | |
Also has the Dynarmic upgrade with the same change | |||||
2017-06-15 | gl_rasterizer/lighting: implement geometric factor | wwylele | 3 | -1/+20 | |
2017-06-11 | gl_rasterizer/lighting: Implement tangent mapping | wwylele | 1 | -7/+12 | |
2017-06-11 | gl_rasterizer/lighting: implement lut input 5 (CP) | wwylele | 2 | -3/+26 | |
2017-06-10 | gl_rasterizer_cache: depth write is disabled if allow_depth_stencil_write is false | wwylele | 1 | -4/+5 | |
2017-06-10 | OpenGL: Update comment on AreQuaternionsOpposite with new information | Yuri Kunde Schlesner | 1 | -8/+11 | |
While debugging the software renderer implementation, it was noticed that this is actually exactly what the hardware does, upgrading the status of this "hack" to being a proper implementation. And there was much rejoicing. | |||||
2017-06-04 | pica/rasterizer: implement/stub texture wrap mode 4-7 | wwylele | 4 | -12/+48 | |
2017-05-30 | gl_rasterizer: implement spot light | wwylele | 1 | -6/+24 | |
2017-05-30 | gl_rasterizer: sync spot light status | wwylele | 4 | -2/+61 | |
2017-05-30 | pica: prepare registers for spotlight | wwylele | 1 | -20/+43 | |
2017-05-29 | swrasterizer: implement TextureCube | wwylele | 1 | -2/+51 | |
2017-05-29 | pica: add registers for texture cube | wwylele | 1 | -1/+26 | |
2017-05-28 | CMake: Create INTERFACE targets for microprofile and nihstro | Yuri Kunde Schlesner | 1 | -1/+1 | |
2017-05-28 | CMake: Use IMPORTED target for libpng | Yuri Kunde Schlesner | 1 | -3/+2 | |
2017-05-28 | CMake: Correct inter-module dependencies and library visibility | Yuri Kunde Schlesner | 1 | -5/+7 | |
Modules didn't correctly define their dependencies before, which relied on the frontends implicitly including every module for linking to succeed. Also changed every target_link_libraries call to specify visibility of dependencies to avoid leaking definitions to dependents when not necessary. | |||||
2017-05-28 | Move screen size constants from video_core to core | Yuri Kunde Schlesner | 2 | -27/+8 | |
video_core didn't even properly use them, and they were the source of many otherwise-unnecessary dependencies from core to video_core. | |||||
2017-05-28 | OpenGL: Remove unused RendererOpenGL fields | Yuri Kunde Schlesner | 2 | -11/+2 | |
2017-05-27 | OpenGL: Improve accuracy of quaternion interpolation | Yuri Kunde Schlesner | 1 | -3/+5 | |
Current order of operations (rotate then normalize) seems to produce a lot more distortion than normalizing and then rotating. This makes Citra results match pretty closely with hardware, and indicates that hardware may also be using lerp instead of slerp to interpolate the quaternions. | |||||
2017-05-27 | gl_shader: refactor texture sampler into its own function | wwylele | 1 | -40/+39 | |
2017-05-21 | swrasterizer: add missing tc0_w and fragment lighting attribute processing | wwylele | 2 | -5/+8 | |
2017-05-20 | gl_rasterizer: implement procedural texture | wwylele | 6 | -7/+600 | |
2017-05-20 | pica/swrasterizer: implement procedural texture | wwylele | 8 | -4/+438 | |
2017-05-17 | pica: use correct register value for shader bool_uniforms | wwylele | 1 | -2/+2 | |
variable value is not masked. the masked and combined register value should be used instead | |||||
2017-05-16 | pica: correct bit field length for some registers | wwylele | 4 | -17/+25 | |
2017-05-12 | Pica: Write GS registers | Jannik Vogel | 1 | -0/+52 | |
This adds the handlers for the geometry shader register writes which will call the functions from the previous commit to update registers for the GS. | |||||
2017-05-12 | Pica: Write shader registers in functions | Jannik Vogel | 1 | -57/+103 | |
The commit after this one adds GS register writes, so this moves the VS handlers into functions so they can be re-used and extended more easily. | |||||
2017-05-11 | Pica: Set program code / swizzle data limit to 4096 | Jannik Vogel | 5 | -13/+16 | |
One of the later commits will enable writing to GS regs. It turns out that on startup, most games will write 4096 GS program words. The current limit of 1024 would hence result in 3072 (4096 - 1024) error messages: ``` HW.GPU <Error> video_core/shader/shader.cpp:WriteProgramCode:229: Invalid GS program offset 1024 ``` New constants have been introduced to represent these limits. The swizzle data size has also been raised. This matches the given field sizes of [GPUREG_SH_OPDESCS_INDEX](https://3dbrew.org/wiki/GPU/Internal_Registers#GPUREG_SH_OPDESCS_INDEX) and [GPUREG_SH_CODETRANSFER_INDEX](https://www.3dbrew.org/wiki/GPU/Internal_Registers#GPUREG_SH_CODETRANSFER_INDEX) (12 bit = [0; 4095]). | |||||
2017-05-05 | pica: shader_dirty if texture2 coord changed | wwylele | 5 | -7/+12 | |
2017-05-03 | pica: use correct coordinates for texture 2 | wwylele | 4 | -5/+22 | |
2017-04-20 | gl_shader_gen: remove TODO about Lerp behaviour verification. The implementation is verified against hardware | wwylele | 1 | -2/+0 | |
2017-04-19 | rasterizer: implement combiner operation 7 (Dot3_RGBA) | wwylele | 4 | -20/+39 | |
2017-04-17 | OpenGL: Pass Pica regs via parameter | Yuri Kunde Schlesner | 3 | -7/+5 | |
2017-04-17 | OpenGL: Move PicaShaderConfig to gl_shader_gen.h | Yuri Kunde Schlesner | 4 | -202/+206 | |
Also move the implementation of CurrentConfig to the cpp file. | |||||
2017-04-17 | OpenGL: Move Attributes enum to a more appropriate file | Yuri Kunde Schlesner | 3 | -12/+11 | |
2017-04-08 | Pica/Regs: Correct bit width for blend-equations | Jannik Vogel | 1 | -2/+2 | |
2017-03-01 | Input: remove unused stuff & clean up | wwylele | 1 | -0/+1 | |
1. removed zl, zr and c-stick from HID::PadState. They are handled by IR, not HID 2. removed button handling in EmuWindow 3. removed key_map 4. cleanup #include | |||||
2017-02-27 | Doxygen: Amend minor issues (#2593) | Mat M | 3 | -3/+5 | |
Corrects a few issues with regards to Doxygen documentation, for example: - Incorrect parameter referencing. - Missing @param tags. - Typos in @param tags. and a few minor other issues. | |||||
2017-02-27 | Core: Re-write frame limiter | Yuri Kunde Schlesner | 1 | -3/+3 | |
Now based on std::chrono, and also works in terms of emulated time instead of frames, so we can in the future frame-limit even when the display is disabled, etc. The frame limiter can also be enabled along with v-sync now, which should be useful for those with displays running at more than 60 Hz. | |||||
2017-02-27 | Core: Make PerfStats internally locked | Yuri Kunde Schlesner | 1 | -8/+2 | |
More ergonomic to use and will be required for upcoming changes. | |||||
2017-02-27 | Remove built-in (non-Microprofile) profiler | Yuri Kunde Schlesner | 1 | -8/+0 | |
2017-02-27 | Add performance statistics to status bar | Yuri Kunde Schlesner | 1 | -0/+9 | |
2017-02-18 | OpenGL: Check if uniform block exists before updating it (#2581) | Jannik Vogel | 1 | -29/+30 | |
2017-02-15 | video_core: remove #pragma once in cpp file (#2570) | Weiyi Wang | 2 | -4/+0 | |
2017-02-13 | SWRasterizer: Move more framebuffer functions to file | Yuri Kunde Schlesner | 3 | -100/+105 | |
2017-02-13 | SWRasterizer: Move texturing functions to their own file | Yuri Kunde Schlesner | 4 | -210/+259 | |
2017-02-13 | SWRasterizer: Convert large no-capture lambdas to standalone functions | Yuri Kunde Schlesner | 1 | -315/+310 | |
2017-02-13 | SWRasterizer: Move framebuffer operation functions to their own file | Yuri Kunde Schlesner | 4 | -236/+285 | |
2017-02-13 | VideoCore: Move software rasterizer files to sub-directory | Yuri Kunde Schlesner | 8 | -12/+12 | |
2017-02-12 | video_core/shader: Document sanitized MUL operation | Yuri Kunde Schlesner | 1 | -0/+8 | |
2017-02-11 | video_core: Fix benign out-of-bounds indexing of array (#2553) | Yuri Kunde Schlesner | 1 | -2/+1 | |
The resulting pointer wasn't written to unless the index was verified as valid, but that's still UB and triggered debug checks in MSVC. Reported by garrettboast on IRC | |||||
2017-02-09 | VideoCore: Split u64 Pica reg unions into 2 separate u32 unions | Yuri Kunde Schlesner | 1 | -36/+42 | |
This eliminates UB when aliasing it with the array of u32 regs, and is compatible with non-LE architectures. | |||||
2017-02-09 | VideoCore: Force enum sizes to u32 in LightingRegs | Yuri Kunde Schlesner | 1 | -4/+4 | |
All enums that are used with BitField must have their type forced to u32 to ensure correctness. | |||||
2017-02-09 | OpenGL: Remove unused duplicate of IsPassThroughTevStage | Yuri Kunde Schlesner | 1 | -12/+0 | |
This copy was left behind when the shader generation code was moved to a separate file. | |||||
2017-02-09 | VideoCore: Split regs.h inclusions | Yuri Kunde Schlesner | 13 | -24/+45 | |
2017-02-09 | Pica/Regs: Use binary search to look up reg names | Yuri Kunde Schlesner | 2 | -15/+10 | |
This gets rid of the static unordered_map. Also changes the return type to const char*, avoiding unnecessary allocations (the result was only used by calling .c_str() on it.) | |||||
2017-02-09 | VideoCore: Use union to index into Regs struct | Yuri Kunde Schlesner | 2 | -46/+28 | |
Also remove some unused members. | |||||
2017-02-05 | Use std::array<u8,2> instead of u8[2] to fix MSVC build | Lectem | 1 | -1/+1 | |
2017-02-04 | VideoCore: Move Regs to its own file | Yuri Kunde Schlesner | 22 | -658/+679 | |
2017-02-04 | VideoCore: Split shader regs from Regs struct | Yuri Kunde Schlesner | 9 | -102/+116 | |
2017-02-04 | VideoCore: Split geometry pipeline regs from Regs struct | Yuri Kunde Schlesner | 9 | -264/+292 | |
2017-02-04 | VideoCore: Split lighting regs from Regs struct | Yuri Kunde Schlesner | 6 | -312/+341 | |
2017-02-04 | VideoCore: Split framebuffer regs from Regs struct | Yuri Kunde Schlesner | 10 | -445/+491 | |
2017-02-04 | VideoCore: Split texturing regs from Regs struct | Yuri Kunde Schlesner | 15 | -494/+532 | |
2017-02-04 | VideoCore: Split rasterizer regs from Regs struct | Yuri Kunde Schlesner | 13 | -187/+218 | |
2017-02-04 | Pica/Texture: Move part of ETC1 decoding to new file and cleanups | Yuri Kunde Schlesner | 4 | -110/+159 | |
2017-02-04 | Pica/Texture: Simplify/cleanup texture tile addressing | Yuri Kunde Schlesner | 4 | -37/+111 | |
2017-02-04 | VideoCore: Move LookupTexture out of debug_utils.h | Yuri Kunde Schlesner | 7 | -301/+340 | |
2017-02-03 | ShaderJIT: add 16 dummy bytes at the bottom of the stack | wwylele | 1 | -2/+5 | |
2017-01-31 | Common/x64: remove legacy emitter and abi (#2504) | Weiyi Wang | 1 | -1/+0 | |
These are not used any more since we moved shader JIT to xbyak. | |||||
2017-01-31 | shader_jit_x64_compiler: esi and edi should be persistent (#2500) | Merry | 1 | -0/+2 | |
2017-01-30 | VideoCore: Make PrimitiveAssembler const-correct | Yuri Kunde Schlesner | 2 | -3/+4 | |
2017-01-30 | VideoCore: Extract swrast-specific data from OutputVertex | Yuri Kunde Schlesner | 5 | -58/+64 | |
2017-01-30 | VideoCore/Shader: Clean up OutputVertex::FromAttributeBuffer | Yuri Kunde Schlesner | 2 | -10/+16 | |
This also fixes a long-standing but nevertheless harmless memory corruption bug, where the padding of the OutputVertex struct would get corrupted by unused attributes. | |||||
2017-01-30 | VideoCore: Split shader output writing from semantic loading | Yuri Kunde Schlesner | 3 | -24/+24 | |
2017-01-30 | VideoCore: Consistently use shader configuration to load attributes | Yuri Kunde Schlesner | 6 | -44/+23 | |
2017-01-30 | VideoCore: Use correct register for immediate mode attribute count | Yuri Kunde Schlesner | 2 | -7/+13 | |
2017-01-30 | VideoCore: Rename some types to more accurate names | Yuri Kunde Schlesner | 8 | -18/+18 | |
2017-01-30 | VideoCore: Change misleading register names | Yuri Kunde Schlesner | 4 | -8/+9 | |
A few registers had names such as "count" or "number" when they actually contained the maximum (that is, count - 1). This can easily lead to hard to notice off by one errors. | |||||
2017-01-30 | video_core: gl_rasterizer_cache.cpp removed unused type alias | Kloen | 1 | -1/+0 | |
2017-01-30 | video_core: gl_rasterizer.cpp removed unused type alias | Kloen | 1 | -2/+0 | |
2017-01-29 | video_core: silence unused-local-typedef boost related warning on GCC | Kloen | 1 | -0/+7 | |
2017-01-26 | VideoCore/Shader: Move entry_point to SetupBatch | Yuri Kunde Schlesner | 6 | -26/+27 | |
2017-01-26 | VideoCore/Shader: Move per-batch ShaderEngine state into ShaderSetup | Yuri Kunde Schlesner | 6 | -44/+40 | |
2017-01-26 | Shader: Remove OutputRegisters struct | Yuri Kunde Schlesner | 4 | -22/+17 | |
2017-01-26 | Shader: Initialize conditional_code in interpreter | Yuri Kunde Schlesner | 2 | -3/+3 | |
This doesn't belong in LoadInputVertex because it also happens for non-VS invocations. Since it's not used by the JIT it seems adequate to initialize it in the interpreter which is the only thing that cares about them. | |||||
2017-01-26 | Shader: Don't read ShaderSetup from global state | Yuri Kunde Schlesner | 1 | -3/+3 | |
2017-01-26 | shader_jit_x64: Don't read program from global state | Yuri Kunde Schlesner | 3 | -22/+22 | |
2017-01-26 | VideoCore/Shader: Move ProduceDebugInfo to InterpreterEngine | Yuri Kunde Schlesner | 4 | -19/+10 | |
2017-01-26 | VideoCore/Shader: Split interpreter and JIT into separate ShaderEngines | Yuri Kunde Schlesner | 8 | -97/+153 | |
2017-01-26 | VideoCore/Shader: Rename shader_jit_x64{ => _compiler}.{cpp,h} | Yuri Kunde Schlesner | 4 | -4/+4 | |
2017-01-26 | VideoCore/Shader: Split shader uniform state and shader engine | Yuri Kunde Schlesner | 4 | -21/+54 | |
Currently there's only a single dummy implementation, which will be split in a following commit. | |||||
2017-01-26 | VideoCore/Shader: Add constness to methods | Yuri Kunde Schlesner | 2 | -4/+4 | |
2017-01-26 | VideoCore/Shader: Use only entry_point as ShaderSetup param | Yuri Kunde Schlesner | 3 | -11/+13 | |
This removes all implicit dependency of ShaderState on global PICA state. | |||||
2017-01-26 | VideoCore/Shader: Use self instead of g_state.vs in ShaderSetup | Yuri Kunde Schlesner | 2 | -11/+8 | |
2017-01-26 | VideoCore/Shader: Extract input vertex loading code into function | Yuri Kunde Schlesner | 3 | -22/+26 | |
2017-01-23 | video_core: fix shader.cpp signed / unsigned warning | Kloen | 1 | -2/+2 | |
2017-01-23 | video_core: gl_rasterizer float to int warning | Kloen | 1 | -1/+2 | |
2017-01-23 | video_core: fix gl_rasterizer warning on MSVC | Kloen | 1 | -1/+1 | |
2017-01-07 | config: Add option for specifying screen resolution scale factor. | bunnei | 3 | -5/+10 | |
2017-01-04 | Fix some warnings (#2399) | Jonathan Hao | 1 | -2/+0 | |
2016-12-25 | Minor cleanup in GLSL code | Jannik Vogel | 1 | -3/+2 | |
2016-12-25 | Offset lighting LUT samples correctly | Jannik Vogel | 1 | -7/+7 | |
2016-12-23 | core: Move emu_window and key_map into core | MerryMage | 2 | -2/+2 | |
* Removes circular dependences (common should not depend on core) | |||||
2016-12-19 | Use GL_TRUE when setting color_mask | Albin Bernhardsson | 1 | -4/+4 | |
2016-12-16 | VideoCore/Shader: Extract DebugData out from UnitState | Yuri Kunde Schlesner | 8 | -103/+99 | |
2016-12-16 | Remove unnecessary cast | Yuri Kunde Schlesner | 1 | -3/+1 | |
2016-12-16 | VideoCore/Shader: Extract evaluate_condition lambda to function scope | Yuri Kunde Schlesner | 1 | -26/+24 | |
2016-12-16 | VideoCore/Shader: Extract call lambda up a scope and remove unused param | Yuri Kunde Schlesner | 1 | -21/+17 | |
2016-12-16 | VideoCore/Shader: Remove dynamic control flow in (Get)UniformOffset | Yuri Kunde Schlesner | 2 | -18/+11 | |
2016-12-16 | VideoCore/Shader: Move DebugData to a separate file | Yuri Kunde Schlesner | 4 | -172/+189 | |
2016-12-15 | shader_jit_x64: Use LOOPCOUNT_REG as a 64-bit reg when indexing | Yuri Kunde Schlesner | 1 | -1/+1 | |
2016-12-15 | VideoCore: Make profiling scope more representative | Yuri Kunde Schlesner | 2 | -0/+15 | |
2016-12-15 | VideoCore: Inline IsPicaTracing | Yuri Kunde Schlesner | 3 | -16/+15 | |
Speeds up ALBW main menu slightly (~3%) | |||||
2016-12-15 | VideoCore: Eliminate an unnecessary copy in the drawcall loop | Yuri Kunde Schlesner | 3 | -5/+3 | |
2016-12-15 | shader_jit_x64: Use Reg32 for LOOP* registers, eliminating casts | Yuri Kunde Schlesner | 1 | -16/+16 | |
2016-12-15 | VideoCore: Convert x64 shader JIT to use Xbyak for assembly | Yuri Kunde Schlesner | 3 | -223/+228 | |
2016-12-11 | Add all services to the Service namespace | Lioncash | 2 | -6/+7 | |
Previously there was a split where some of the services were in the Service namespace and others were not. | |||||
2016-12-07 | OpenGL: Drop framebuffer completeness check. | Markus Wick | 5 | -47/+8 | |
This OpenGL call synchronizes the worker thread of the nvidia blob. It can be verified on linux with the __GL_THREADED_OPTIMIZATIONS=1 environment variable. Those errors should not happen on tested drivers. It was used as a workaround for https://bugs.freedesktop.org/show_bug.cgi?id=94148 | |||||
2016-12-06 | Implement Frame rate limiter (#2223) | emmauss | 2 | -0/+2 | |
* implement frame limiter * fixes | |||||
2016-12-05 | ASSERT that shader was linked successfully | Jannik Vogel | 1 | -0/+2 | |
2016-12-05 | Report shader uniform block size in case of mismatch | Jannik Vogel | 1 | -1/+3 | |
2016-12-05 | Print broken shader code to log | Jannik Vogel | 1 | -3/+9 | |
2016-12-04 | OpenGL: Non-zero stride only makes sense for linear buffers | Yuri Kunde Schlesner | 3 | -7/+11 | |
2016-12-04 | OpenGL: Ensure framebuffer binding is restored if completion check fails | Yuri Kunde Schlesner | 1 | -10/+7 | |
2016-12-04 | OpenGL: Fix DisplayTransfer accel when input width != output width | Yuri Kunde Schlesner | 1 | -1/+10 | |
Fixes #2246, #2261 | |||||
2016-12-04 | shader_jit: Fix non-SSE4.1 path where FLR would not truncate | Jannik Vogel | 1 | -1/+1 | |
2016-12-03 | clang-format: Fix coding style | Yuri Kunde Schlesner | 1 | -1/+1 | |
2016-12-02 | shader_jit: Load LOOPCOUNT_REG and LOOPINC 4 bit left-shifted | Jannik Vogel | 1 | -6/+9 | |
2016-11-30 | ClangFormat: Fixed the clang-format errors | Subv | 2 | -6/+10 | |
2016-11-29 | Build: Fixed a few warnings. | Subv | 2 | -7/+7 | |
2016-11-27 | GPU: Remove the broken frame_skip option. | Emmanuel Gil Peyrot | 1 | -4/+0 | |
Fixes #1960. | |||||
2016-11-27 | RasterizerGL: Use GL_TRUE and 0xFF in the stencil and depth masks instead of simply true and -1 | Subv | 2 | -4/+4 | |
2016-11-27 | Rasterizer/Memfill: Set the correct stencil write mask when clearing the stencil buffer. | Subv | 1 | -1/+1 | |
2016-11-24 | Cache Vertices instead of Output registers (#2165) | jphalimi | 1 | -6/+7 | |
This patch brings +3% performance improvement on average. It removes ToVertex() as an important hotspot of the emulator. | |||||
2016-11-22 | Fix format error from #2195 | wwylele | 1 | -1/+1 | |
2016-11-20 | GPU/CiTrace: Avoid calling GetTextures() when not necessary. | Subv | 1 | -6/+5 | |
2016-11-19 | Minor formatting change | James Rowe | 1 | -1/+1 | |
2016-11-05 | Add default hotkey to swap primary screens. | James Rowe | 1 | -3/+2 | |
Also minor style changes | |||||
2016-11-05 | Support additional screen layouts. | James Rowe | 1 | -6/+12 | |
Allows users to choose a single screen layout or a large screen layout. Adds a configuration option to change the prominent screen. | |||||
2016-10-20 | Fix typos | Ricardo de Almeida Gonzaga | 1 | -1/+1 | |
2016-09-30 | VideoCore: Shader interpreter cleanups | Yuri Kunde Schlesner | 1 | -32/+42 | |
2016-09-30 | VideoCore: Fix out-of-bounds read in ShaderSetup::ProduceDebugInfo | Yuri Kunde Schlesner | 1 | -3/+1 | |
As far as I can tell, memset was replaced by a fill without correcting the parameter type, causing an out-of-bounds array read in the Vec4 constructor. | |||||
2016-09-30 | OpenGL: Take cached viewport sub-rect into account for scissor | Yuri Kunde Schlesner | 3 | -29/+25 | |
Fixes #1938 | |||||
2016-09-29 | rasterizer: separate TextureCopy from DisplayTransfer | wwylele | 3 | -6/+12 | |
2016-09-21 | Remove special rules for Windows.h and library includes | Yuri Kunde Schlesner | 1 | -1/+1 | |
2016-09-21 | Use negative priorities to avoid special-casing the self-include | Yuri Kunde Schlesner | 18 | -18/+18 | |
2016-09-21 | Remove empty newlines in #include blocks. | Emmanuel Gil Peyrot | 35 | -105/+17 | |
This makes clang-format useful on those. Also add a bunch of forgotten transitive includes, which otherwise prevented compilation. | |||||
2016-09-19 | Manually tweak source formatting and then re-run clang-format | Yuri Kunde Schlesner | 23 | -125/+119 | |
2016-09-18 | Sources: Run clang-format on everything. | Emmanuel Gil Peyrot | 42 | -2532/+2943 | |
2016-09-16 | VideoCore: Fix dangling lambda context in shader interpreter | Yuri Kunde Schlesner | 1 | -1/+1 | |
The static meant that after the first execution, these lambda contexts would be pointing to a random location on the stack. Fixes a random crash when using the interpreter. | |||||
2016-08-30 | OpenGL: Avoid error on unsupported lighting LUT | Jannik Vogel | 1 | -0/+1 | |
2016-08-30 | config: Add a setting for graphics V-Sync. | bunnei | 1 | -0/+1 | |
2016-06-28 | OpenGL: Add scaled resolution support to scissor | Yuri Kunde Schlesner | 4 | -3/+16 | |
2016-06-28 | PICA: Scissor fixes and cleanups | Yuri Kunde Schlesner | 5 | -45/+39 | |
2016-06-28 | PICA: Implement scissor test | Subv | 5 | -3/+105 | |
2016-06-25 | Remove superfluous std::move in return std::move(local_var) | scurest | 1 | -1/+1 | |
2016-06-07 | OpenGL: Implement fog | Jannik Vogel | 5 | -7/+124 | |
2016-06-07 | Rasterizer: Implement fog | Jannik Vogel | 1 | -21/+52 | |
2016-06-07 | Pica: Add fog state | Jannik Vogel | 3 | -14/+69 | |
2016-06-07 | OpenGL: Avoid undefined behaviour for UNIFORM_BLOCK_DATA_SIZE | Jannik Vogel | 2 | -6/+8 | |
2016-06-01 | gsp::gpu: Reset g_thread_id in UnregisterInterruptRelayQueue | mailwl | 1 | -1/+1 | |
2016-05-23 | OpenGL: Set shader_dirty on lighting changes | Jannik Vogel | 1 | -0/+23 | |
2016-05-23 | Pica: Name LightSrc.config register | Jannik Vogel | 2 | -17/+15 | |
2016-05-23 | Pica: Name lighting.config0 and .config1 registers | Jannik Vogel | 2 | -18/+18 | |
2016-05-23 | OpenGL: Use uniforms for dist_atten_bias and dist_atten_scale | Jannik Vogel | 3 | -8/+84 | |
2016-05-21 | Refactor Tev stage dumper | Jannik Vogel | 2 | -115/+114 | |
2016-05-21 | Extend Tev stage dumper | Jannik Vogel | 1 | -14/+38 | |
2016-05-16 | Retrieve shader result from new OutputRegisters-type | Jannik Vogel | 4 | -64/+81 | |
2016-05-14 | OpenGL: Only update depth uniforms if the depth changed | Jannik Vogel | 2 | -9/+22 | |
2016-05-14 | OpenGL: value-initialize variables which cause uninitialised access otherwise | Jannik Vogel | 1 | -2/+2 | |
2016-05-13 | Use new shader-jit signature for interpreter | Jannik Vogel | 3 | -8/+8 | |
2016-05-13 | Refactor access to state in shader-jit | Jannik Vogel | 4 | -24/+42 | |
2016-05-12 | OpenGL: Support blend equation | Jannik Vogel | 4 | -0/+31 | |
2016-05-12 | Move program_counter and call_stack from UnitState to interpreter | Jannik Vogel | 3 | -45/+42 | |
2016-05-12 | Move default_attributes into Pica state | Jannik Vogel | 4 | -4/+4 | |
2016-05-11 | Turn ShaderSetup into struct | Jannik Vogel | 4 | -57/+58 | |
2016-05-11 | OpenGL: Implement texture type 3 | Jannik Vogel | 4 | -35/+67 | |
2016-05-11 | Rasterizer: Implement texture type 3 | Jannik Vogel | 1 | -2/+27 | |
2016-05-11 | Pica: Add tc0.w to OutputVertex | Jannik Vogel | 1 | -1/+2 | |
2016-05-11 | Pica: Add texture type to state | Jannik Vogel | 1 | -0/+10 | |
2016-05-10 | gl_rasterizer: Fix compilation for debug builds | Lioncash | 1 | -1/+1 | |
2016-05-10 | OpenGL: Implement W-Buffers and fix depth-mapping | Jannik Vogel | 3 | -4/+23 | |
2016-05-10 | Pica: Implement W-Buffer in SW rasterizer | Jannik Vogel | 4 | -11/+43 | |
2016-05-09 | vertex_loader: Correct forward declaration of InputVertex | Lioncash | 1 | -1/+1 | |
It's actually a struct, not a class. | |||||
2016-05-09 | vertex_loader: Provide an assertion for ensuring the loader has been setup | Lioncash | 2 | -0/+7 | |
Also adds an assert to ensure that Setup is not called more than once during a VertexLoader's lifetime. | |||||
2016-05-09 | vertex_loader: Add constructors to facilitate immediate and two-step initialization | Lioncash | 2 | -2/+6 | |
2016-05-09 | vertex_loader: initialize_num_total_attributes. | Lioncash | 1 | -1/+1 | |
Keeps the public API sane. | |||||
2016-05-09 | vertex_loader: Use std::array instead of raw C arrays | Lioncash | 1 | -6/+7 | |
2016-05-09 | vertex_loader: Correct header ordering | Lioncash | 1 | -1/+1 | |
2016-05-07 | fixup simple type conversions where possible | Alexander Laties | 4 | -7/+8 | |
2016-05-06 | Frontends, VideoCore: Move glad initialisation to the frontend | Emmanuel Gil Peyrot | 1 | -6/+0 | |
On SDL2 this allows it to use SDL_GL_GetProcAddress() instead of the default function loader, and fixes a crash when using apitrace with an EGL context. On Qt we will need to migrate from QGLWidget to QOpenGLWidget and QOpenGLContext before we can use gladLoadGLLoader() instead of gladLoadGL(), since the former doesn’t expose a function loader. | |||||
2016-05-04 | Pica: Rename VertexLoaded breakpoint to VertexShaderInvocation | Jannik Vogel | 2 | -7/+5 | |
2016-05-03 | Pica: Use a union for PicaShaderConfig | Jannik Vogel | 3 | -125/+139 | |
2016-05-03 | Pica: Add TevStageConfigRaw to PicaShaderConfig (MSVC workaround) | Jannik Vogel | 2 | -2/+23 | |
2016-05-03 | Pica: Make PicaShaderConfig trivially_copyable and clear it before use | Jannik Vogel | 1 | -21/+28 | |
2016-05-03 | OpenGL: Don't copy const_color (Reverts #1745) | Jannik Vogel | 1 | -2/+3 | |
2016-05-03 | Pica: Replace logic in shader.cpp with loop | Jannik Vogel | 1 | -34/+4 | |
2016-05-01 | OpenGL: Copy TevStageConfig using a loop. Fixes bug: const_color not copied | Jannik Vogel | 1 | -30/+11 | |
2016-04-30 | OpenGL: border_color was never set. Fixed. (#1740) | Jannik Vogel | 1 | -0/+1 | |
2016-04-30 | VideoCore: Run include-what-you-use and fix most includes. | Emmanuel Gil Peyrot | 34 | -79/+212 | |
2016-04-30 | Remove TGA dumper | Jannik Vogel | 3 | -62/+0 | |
2016-04-29 | Common: Remove section measurement from profiler (#1731) | Yuri Kunde Schlesner | 4 | -11/+0 | |
This has been entirely superseded by MicroProfile. The rest of the code can go when a simpler frametime/FPS meter is added to the GUI. | |||||
2016-04-29 | Move and rename the MemoryAccesses class to MemoryAccessTracker. | Henrik Rydgard | 4 | -32/+35 | |
2016-04-28 | Debugger fix | Henrik Rydgard | 1 | -2/+2 | |
2016-04-28 | Optimize the vertex loader, nearly doubling its speed. | Henrik Rydgard | 2 | -32/+54 | |
2016-04-28 | Don't keep base_address in the loader, it doesn't belong there (with it, the loader can't be cached). | Henrik Rydgard | 3 | -11/+10 | |
2016-04-28 | Move "&" to their proper place, add missing includes and make some properly relative. | Henrik Rydgard | 2 | -8/+11 | |
2016-04-28 | Refactor: Extract VertexLoader from command_processor.cpp. | Henrik Rydgard | 5 | -125/+185 | |
Preparation for a similar concept to Dolphin or PPSSPP. These can be JIT-ed and cached. | |||||
2016-04-28 | Remove late accesses to attribute_config | Henrik Rydgard | 1 | -5/+7 | |
2016-04-24 | shader: Shader size is long uint, not uint. | Sam Spilsbury | 1 | -1/+1 | |
2016-04-24 | shader: Handle non-CALL opcodes with a break | Sam Spilsbury | 1 | -0/+2 | |
2016-04-24 | shader: Format string must be provided inline and not as a variable | Sam Spilsbury | 1 | -1/+1 | |
2016-04-24 | Replace std::map with std::array for graphics event breakpoints, and allow the compiler to inline. Saves 1%+ in vertex heavy situations. | Henrik Rydgard | 2 | -7/+14 | |
2016-04-23 | pica: Handle default lighting case | Sam Spilsbury | 1 | -1/+6 | |
2016-04-22 | HWRasterizer: reorder declarations to match defs | tfarley | 1 | -9/+9 | |
2016-04-22 | HWRasterizer: sync specular uniform for new shaders | tfarley | 1 | -0/+2 | |
2016-04-21 | HWRasterizer: Texture forwarding | tfarley | 13 | -759/+1371 | |
2016-04-21 | Config: Add scaled resolution option | tfarley | 2 | -0/+2 | |
2016-04-17 | Rasterizer: Allow all blend factors for alpha blend-func | Jannik Vogel | 1 | -57/+42 | |
2016-04-15 | debug_utils: use std::make_unique for initializing PicaTrace | Lioncash | 1 | -1/+1 | |
2016-04-14 | shader_jit_x64: Rename RuntimeAssert to Compile_Assert. | bunnei | 2 | -5/+5 | |
2016-04-14 | shader_jit_x64.cpp: Rename JitCompiler to JitShader. | bunnei | 3 | -92/+92 | |
2016-04-14 | shader_jit_x64: Free memory that's no longer needed after compilation. | bunnei | 1 | -0/+6 | |
2016-04-14 | shader_jit_x64: Use a sorted vector instead of a set for keeping track of return addresses. | bunnei | 2 | -5/+8 | |
2016-04-14 | shader_jit_x64: Use CALL/RET instead of JMP for subroutines. | bunnei | 1 | -17/+7 | |
2016-04-14 | shader_jit_x64: Separate initialization and code generation for readability. | bunnei | 1 | -9/+8 | |
2016-04-14 | shader_jit_x64: Get rid of unnecessary last_program_counter variable. | bunnei | 2 | -6/+2 | |
2016-04-14 | shader_jit_x64: Execute certain asserts at runtime. | bunnei | 2 | -5/+19 | |
- This is because we compile the full shader code space, and therefore it's common to compile malformed instructions. | |||||
2016-04-14 | shader: Remove unused 'state' argument from 'Setup' function. | bunnei | 3 | -5/+4 | |
2016-04-14 | shader_jit_x64: Specify shader main offset at runtime. | bunnei | 3 | -10/+6 | |
2016-04-14 | shader_jit_x64: Allocate each program independently and persist for emu session. | bunnei | 3 | -38/+28 | |
2016-04-14 | shader_jit_x64: Rewrite flow control to support arbitrary CALL and JMP instructions. | bunnei | 2 | -35/+119 | |
2016-04-14 | shader_jit_x64: Fix strict memory aliasing issues. | bunnei | 1 | -1/+3 | |
2016-04-14 | file_util: Don't expose IOFile internals through the API | Lioncash | 1 | -1/+16 | |
2016-04-10 | Pica: Remove geometry dumper (PICA_DUMP_GEOMETRY) | Jannik Vogel | 4 | -71/+0 | |
2016-04-10 | OpenGL: Implement color combiner Operation::Dot3_RGB | Jannik Vogel | 1 | -0/+3 | |
2016-04-08 | OpenGL: Respect buffer-write allow registers | Jannik Vogel | 1 | -6/+28 | |
2016-04-08 | OpenGL: Split buffer-write mask sync into separate functions | Jannik Vogel | 2 | -8/+39 | |
2016-04-08 | Rasterizer: Respect buffer-write allow registers | Jannik Vogel | 2 | -4/+16 | |
2016-04-08 | OpenGL: Keep stencil-test and framebuffer.depth_format in sync | Jannik Vogel | 1 | -0/+1 | |
2016-04-05 | Common: Remove Common::make_unique, use std::make_unique | MerryMage | 5 | -11/+7 | |
2016-04-03 | OpenGL: Fix a double framebuffer completeness checks. | Emmanuel Gil Peyrot | 1 | -4/+6 | |
2016-04-03 | OpenGL: Check for framebuffer completeness | Jannik Vogel | 1 | -0/+3 | |
2016-04-01 | Avoid warnings by casting to size_t for ARRAY_SIZE() comparisons | Jannik Vogel | 1 | -6/+6 | |
2016-03-24 | Pica: Improve accuracy of immediate-mode support | Yuri Kunde Schlesner | 5 | -29/+56 | |
This partially fixes Etrian Odyssey IV. | |||||
2016-03-24 | OpenGL: Don't attempt to draw empty triangle batches | Yuri Kunde Schlesner | 1 | -0/+3 | |
Our code did not handle this well, causing random crashes in some situations. | |||||
2016-03-17 | video_core: Don't cast away const | Lioncash | 3 | -18/+19 | |
2016-03-17 | shader_interpreter: use std::inner_product for the dot product | Lioncash | 1 | -5/+3 | |
Same thing, less code. | |||||
2016-03-17 | core/video_core: Make NumIds functions constexpr | Lioncash | 1 | -1/+1 | |
2016-03-17 | core/video_core: Don't cast away const in subscript operators | Lioncash | 1 | -3/+3 | |
Not to say these subscript operators aren't totally ugly as is. | |||||
2016-03-17 | PICA: Alignment happens locally in vertex | Jannik Vogel | 1 | -6/+6 | |
2016-03-15 | PICA: Fix MAD/MADI encoding | Jannik Vogel | 2 | -29/+33 | |
2016-03-14 | PICA: Fix viewport offset | Jannik Vogel | 1 | -2/+2 | |
2016-03-14 | Respect vs output map | Jannik Vogel | 2 | -7/+19 | |
2016-03-13 | PICA: Align vertex attributes | Jannik Vogel | 1 | -1/+5 | |
2016-03-12 | shader_jit_x64: Clear cache after code space fills up. | bunnei | 3 | -2/+19 | |
2016-03-12 | shader_jit_x64: Make assert outputs more useful & cleanup formatting. | bunnei | 1 | -4/+7 | |
2016-03-12 | shader: Update log message to use proper log class. | bunnei | 1 | -1/+1 | |
2016-03-09 | Common: Get rid of alignment macros | Lioncash | 1 | -4/+4 | |
The gl rasterizer already uses alignas, so we may as well move everything over. | |||||
2016-03-09 | renderer_base: In-class initialize variables | Lioncash | 1 | -5/+2 | |
2016-03-09 | render_base: Clarify/normalize getter functions | Lioncash | 1 | -2/+2 | |
2016-03-09 | renderer_base: Don't directly expose the rasterizer unique_ptr | Lioncash | 3 | -8/+11 | |
There's no reason to allow direct access to the unique_ptr instance. Only its contained pointer. | |||||
2016-03-08 | Improve error report from Init() functions | LittleWhite | 5 | -8/+18 | |
Add error popup when citra initialization failed | |||||
2016-03-06 | Pica: Write depth value even when depth test is disabled | Yuri Kunde Schlesner | 2 | -10/+12 | |
This has been confirmed on hardware. Fixes Etrian Odyssey IV. | |||||
2016-03-03 | Add immediate mode vertex submission | Dwayne Slater | 17 | -60/+172 | |
2016-02-26 | renderer_opengl: Initialise fragment shader LUT textures | MerryMage | 1 | -0/+4 | |
2016-02-21 | Fix out of bounds array access when loading a component >= 12 | Dwayne Slater | 1 | -1/+4 | |
2016-02-21 | Add support for padding vertex attributes | Dwayne Slater | 1 | -6/+13 | |
2016-02-12 | BitField: Make trivially copyable and remove assignment operator | MerryMage | 2 | -6/+6 | |
2016-02-05 | pica: Cleanup lighting register definitions and documentation. | bunnei | 2 | -48/+51 | |
2016-02-05 | gl_rasterizer: Use alignas(16) instead of explicit padding. | bunnei | 1 | -13/+6 | |
2016-02-05 | renderer_opengl: Use GLvec3/GLvec4 aliases for commonly used types. | bunnei | 4 | -14/+18 | |
2016-02-05 | gl_rasterizer: Fix issue with interpolation of opposite quaternions. | bunnei | 2 | -4/+32 | |
2016-02-05 | pica_types: Fix typo in docstring. | bunnei | 1 | -1/+1 | |
2016-02-05 | pica_types: Replace float24/20/16 with a template class. | bunnei | 5 | -116/+82 | |
2016-02-05 | command_processor: Add an assertion to ensure LUTs are not written past their boundaries. | bunnei | 1 | -0/+3 | |
2016-02-05 | gl_rasterizer: Remove unnecessary casts. | bunnei | 1 | -6/+6 | |
2016-02-05 | gl_rasterizer: Fix PicaShaderConfig on GCC. | bunnei | 1 | -29/+27 | |
2016-02-05 | gl_rasterizer: Initial implementation of bump mapping. | bunnei | 3 | -5/+42 | |
2016-02-05 | gl_shader_gen: Fix bug in LUT range (should be within range [0, 255] not [0, 256]). | bunnei | 1 | -3/+3 | |
2016-02-05 | gl_shader_gen: Implement lighting red, green, and blue reflection. | bunnei | 3 | -21/+77 | |
2016-02-05 | gl_shader_gen: View should be normalized. | bunnei | 1 | -2/+2 | |
2016-02-05 | gl_shader_gen: Implement fragment lighting fresnel effect. | bunnei | 3 | -9/+38 | |
2016-02-05 | gl_shader_gen: Implement fragment lighting specular 1 component. | bunnei | 3 | -11/+41 | |
2016-02-05 | gl_shader_gen: Add support for D0 LUT scaling. | bunnei | 3 | -3/+71 | |
2016-02-05 | gl_shader_gen: Refactor lighting config to match Pica register naming. | bunnei | 3 | -42/+50 | |
- Also implement D0 LUT enable. | |||||
2016-02-05 | pica: Cleanup and add some comments to lighting registers. | bunnei | 2 | -19/+19 | |
2016-02-05 | gl_rasterizer: Minor naming refactor on Pica register naming. | bunnei | 2 | -20/+23 | |
2016-02-05 | gl_shader_gen: Reorganize and cleanup lighting code. | bunnei | 1 | -100/+107 | |
- No functional difference. | |||||
2016-02-05 | gl_shader_gen: Fix directional lights. | bunnei | 1 | -1/+1 | |
2016-02-05 | gl_shader_gen: Fix bug with lighting where clamp highlights was only applied to last light. | bunnei | 1 | -6/+6 | |
2016-02-05 | gl_shader_gen: View vector needs to be normalized when computing half angle vector. | bunnei | 1 | -3/+4 | |
2016-02-05 | renderer_opengl: Use textures for fragment shader LUTs instead of UBOs. | bunnei | 5 | -27/+64 | |
- Gets us LUT interpolation for free. - Some older Intel GPU drivers did not support the big UBOs needed to store the LUTs. | |||||
2016-02-05 | renderer_opengl: Initial implementation of basic specular lighting. | bunnei | 4 | -13/+165 | |
2016-02-05 | renderer_opengl: Implement HW fragment lighting distance attenuation. | bunnei | 2 | -17/+38 | |
2016-02-05 | renderer_opengl: Implement HW fragment lighting LUTs within our default UBO. | bunnei | 4 | -16/+67 | |
2016-02-05 | renderer_opengl: Implement diffuse component of HW fragment lighting. | bunnei | 6 | -15/+270 | |
2016-02-05 | pica: Implement decoding of basic fragment lighting components. | bunnei | 5 | -15/+120 | |
- Diffuse - Distance attenuation - float16/float20 types - Vertex Shader 'view' output | |||||
2016-02-05 | pica: Implement fragment lighting LUTs. | bunnei | 2 | -0/+34 | |
2016-02-05 | pica: Add decodings for distance attenuation and LUT registers. | bunnei | 1 | -1/+104 | |
2016-02-05 | pica: Add pica_types module and move float24 definition. | bunnei | 3 | -112/+127 | |
2016-02-03 | hwrasterizer: Use proper cached fb addr/size | tfarley | 2 | -42/+34 | |
2016-02-03 | OpenGL: Downgrade GL_DEBUG_SEVERITY_NOTIFICATION to Debug logging level | Yuri Kunde Schlesner | 1 | -2/+0 | |
The nVidia driver is *extremely* spammy on this category, sending a message on every buffer or texture upload, slowing down the emulator and making the log useless. | |||||
2016-01-25 | Debugger: Use 3dbrew names for GPU registers | Yuri Kunde Schlesner | 1 | -57/+465 | |
This list was imported from the 3dbrew wiki page and is pretty much complete. | |||||
2016-01-25 | Shader: Implement "invert condition" feature of IFU instruction | Yuri Kunde Schlesner | 2 | -2/+5 | |
If the bit 0 of the JMPU instruction is set, then the jump condition will be inverted. That is, a jump will happen when the boolean is false instead of when it is true. | |||||
2016-01-24 | Shader JIT: Fix off-by-one error when compiling JMPs | Yuri Kunde Schlesner | 2 | -6/+6 | |
There was a mistake in the JMP code which meant that one instruction at the destination would be skipped when the jump was taken. This commit also changes the meaning of the culprit parameter to make it less confusing and avoid similar mistakes in the future. | |||||
2016-01-21 | hwrasterizer: Use depth offset | tfarley | 3 | -2/+24 | |
2016-01-17 | command_processor: Get rid of variable shadowing | Lioncash | 1 | -2/+1 | |
2015-12-30 | video_core: Make the renderer global a unique_ptr | Lioncash | 2 | -6/+10 | |
2015-12-30 | swrasterizer: Add missing override specifier | Lioncash | 1 | -1/+1 | |
2015-12-21 | VideoCore: Sync state after changing rasterizers | Yuri Kunde Schlesner | 1 | -0/+1 | |
This fixes various bugs that appear in the HW rasterizer after switching between it and the SW one during emulation. | |||||
2015-12-08 | VideoCore: Unify interface to OpenGL and SW rasterizers | Yuri Kunde Schlesner | 13 | -67/+105 | |
This removes explicit checks sprinkled all over the codebase to instead just have the SW rasterizer expose an implementation with no-ops for most operations. | |||||
2015-12-07 | VideoCore: Rename HWRasterizer methods to be less confusing | Yuri Kunde Schlesner | 4 | -12/+12 | |
2015-12-07 | OpenGL: Rename cache functions to better match what they actually do | Yuri Kunde Schlesner | 3 | -12/+11 | |
2015-12-06 | GPU/PrimitiveAssembler: Fixed drawing triangle fans. | Subv | 1 | -5/+4 | |
It was skipping the second vertex assignment and using uninitialized garbage when assembling the corresponding triangle. | |||||
2015-12-05 | OpenGL: Flip framebuffers during transfer rather than when rendering | Yuri Kunde Schlesner | 2 | -12/+11 | |
2015-12-05 | OpenGL: Add support for glFrontFace in the state tracker | Yuri Kunde Schlesner | 2 | -0/+6 | |
2015-12-01 | PICA: Properly emulate 1-stage delay in the combiner buffer | Yuri Kunde Schlesner | 2 | -12/+19 | |
This was discovered and verified by @fincs. The tev combiner buffer actually lags behind by one stage, meaning stage 1 reads the initial color, stage 2 reads stage 0's output, and so on. Fixes character portraits in Fire Emblem: Awakening and world textures in Zelda: ALBW. Closes #1140. | |||||
2015-11-26 | renderer_opengl: Fix uniform issues introduced with kemenaran/avoid-explicit-uniform-location. | bunnei | 2 | -6/+8 | |
2015-11-25 | Use regular uniform location | Pierre de La Morinerie | 3 | -15/+5 | |
The support for GL_ARB_explicit_uniform_location is not that good (53% according to http://feedback.wildfiregames.com/report/opengl/feature/GL_ARB_explicit_uniform_location). This fixes the shader compilation on Intel HD 4000 (#1222). | |||||
2015-11-19 | FragShader: Use an UBO instead of several individual uniforms | Subv | 6 | -13/+67 | |
2015-11-10 | GPU/Loaders: Log an error when a loader tries to load from a component beyond the available ones (12). | Subv | 1 | -0/+2 | |
Related to #1170 | |||||
2015-10-24 | OpenGL: Log GL_KHR_debug messages we receive | Emmanuel Gil Peyrot | 1 | -0/+57 | |
This allows the driver to communicate errors, warnings and improvement suggestions about our usage of the API. | |||||
2015-10-22 | gl_shader_gen: Use explicit locations for vertex shader attributes. | bunnei | 2 | -15/+9 | |
2015-10-22 | gl_shader_gen: Optimize code for AppendAlphaTestCondition. | bunnei | 1 | -16/+11 | |
- Also add a comment to AppendColorCombiner. | |||||
2015-10-22 | gl_rasterizer: Define enum types for each vertex texcoord attribute. | bunnei | 3 | -12/+14 | |
2015-10-22 | gl_shader_gen: Various cleanups to shader generation. | bunnei | 3 | -48/+52 | |
2015-10-22 | gl_rasterizer: Use MMH3 hash for shader cache key. | bunnei | 4 | -83/+63 | |
- Includes a check to confirm no hash collisions. | |||||
2015-10-22 | gl_shader_gen: Require explicit uniform locations. | bunnei | 3 | -56/+34 | |
- Fixes uniform issue on AMD. | |||||
2015-10-22 | gl_shader_gen: Rename 'o' to 'attr' in vertex/fragment shaders. | bunnei | 1 | -11/+11 | |
2015-10-22 | gl_shader_gen: AppendAlphaModifier default should be 0.0, not vec4(0.0). | bunnei | 1 | -1/+1 | |
2015-10-22 | gl_shader_gen: Fix bug where TEV stage outputs should be clamped. | bunnei | 1 | -3/+3 | |
2015-10-22 | gl_rasterizer: Add documentation to ShaderCacheKey. | bunnei | 1 | -0/+16 | |
2015-10-22 | gl_shader_gen: Add additional function documentation. | bunnei | 2 | -0/+18 | |
2015-10-22 | gl_shader_util: Cleanup header file + add docstring. | bunnei | 1 | -1/+7 | |
2015-10-22 | gl_shader_gen: Various cleanups + moved TEV stage generation to its own function. | bunnei | 1 | -161/+170 | |
2015-10-22 | renderer_opengl: Refactor shader generation/caching to be more organized + various cleanups. | bunnei | 10 | -788/+509 | |
2015-10-22 | gl_rasterizer: Move logic for creating ShaderCacheKey to a static function. | bunnei | 3 | -22/+50 | |
2015-10-22 | gl_shader_util: Use vec3 constants for AppendColorCombiner. | bunnei | 1 | -6/+6 | |
2015-10-22 | gl_rasterizer: Fix typo in uploading TEV const color uniforms. | bunnei | 1 | -5/+5 | |
2015-10-22 | gl_shader_util: Fix precision bug with alpha testing. | bunnei | 2 | -9/+9 | |
- Alpha testing is not done with float32 precision, this makes the HW renderer match the SW renderer. | |||||
2015-10-22 | Initial implementation of fragment shader generation with caching. | Subv | 7 | -261/+568 | |
2015-10-09 | CitraQt, SkyEye, Loader, VideoCore: Remove newlines in LOG_* calls. | Emmanuel Gil Peyrot | 2 | -7/+7 | |
The LOG_* function itself already appends one. | |||||
2015-10-07 | Silence -Wsign-compare warnings. | Rohit Nirmal | 1 | -3/+3 | |
2015-09-29 | fix some xcode 7.0 warnings | Martin Lindhe | 3 | -2/+4 | |
2015-09-16 | general: Silence some warnings when using clang | Lioncash | 3 | -7/+7 | |
2015-09-11 | video_core: Reorganize headers | Lioncash | 19 | -62/+56 | |
2015-09-11 | video_core: Remove unnecessary includes from headers | Lioncash | 5 | -13/+3 | |
2015-09-10 | renderer_opengl: Remove unimplemented function declaration | Lioncash | 1 | -3/+0 | |
2015-09-10 | video_core: Remove unused variables | Lioncash | 3 | -4/+0 | |
2015-09-10 | gl_rasterizer: Replace push_back calls with emplace_back in AddTriangle | Lioncash | 1 | -3/+3 | |
2015-09-07 | Shader JIT: Use SCALE constant from emitter | aroulin | 1 | -4/+4 | |
2015-09-07 | Shader: Fix size_t to int casts of register offsets | aroulin | 2 | -15/+21 | |
2015-09-03 | OpenGL: Use Sampler Objects to decouple sampler config from textures | Yuri Kunde Schlesner | 4 | -21/+76 | |
Fixes #978 | |||||
2015-09-03 | OpenGL: Remove ugly and endian-unsafe color pointer casts | Yuri Kunde Schlesner | 4 | -9/+13 | |
2015-09-03 | OpenGL: Add support for Sampler Objects to state tracker | Yuri Kunde Schlesner | 3 | -4/+42 | |
2015-09-02 | video_core: Fix format specifiers warnings | aroulin | 2 | -2/+3 | |
2015-09-01 | x64: Proper stack alignment in shader JIT function calls | aroulin | 2 | -28/+18 | |
Import Dolphin stack handling and register saving routines. Also removes the x86 parts from abi files. | |||||
2015-08-31 | Pica: Added the primitive_restart register (0x25f) to the registers map. | Subv | 2 | -1/+5 | |
2015-08-31 | Pica: Add the vertex_offset register to the Pica registers map. | Subv | 2 | -0/+2 | |
2015-08-31 | Shader JIT: Fix SGE/SGEI NaN behavior | aroulin | 1 | -3/+3 | |
SGE was incorrectly emulated w.r.t. NaN behavior as the CMPSS SSE instruction was used with NLT | |||||
2015-08-30 | GPU: Implemented register 0x22A. | Subv | 2 | -2/+8 | |
This is the equivalent of the "first" parameter in glDrawArrays, it tells the GPU the vertex index at which to start rendering. Register 0x22A doesn't affect indexed rendering. | |||||
2015-08-30 | Replace the previous OpenGL loader with a glad-generated 3.3 one | Yuri Kunde Schlesner | 11 | -2812/+12 | |
The main advantage of switching to glad from glLoadGen is that, apart from being actively maintained, it supports a customizable entrypoint loader function, which makes it possible to also support OpenGL ES. | |||||
2015-08-28 | gl_rasterizer_cache: Detect and ignore unnecessary texture flushes. | bunnei | 3 | -8/+18 | |
2015-08-27 | Shader JIT: Fix float to integer rounding in MOVA | aroulin | 1 | -2/+2 | |
MOVA converts new address register values from floats to integers using truncation | |||||
2015-08-27 | Shader JIT: ifdef out reference to ifdef'd out shader_map | archshift | 1 | -0/+2 | |
shader_map was only defined on x86 architectures, but was cleared on shutdown with no ifdef protection. Ifdef this out so non-x86 architectures can be built. | |||||
2015-08-25 | Integrate the MicroProfile profiling library | Yuri Kunde Schlesner | 5 | -0/+25 | |
This brings goodies such as a configurable user interface and multi-threaded timeline view. | |||||
2015-08-24 | HWRenderer: Added a workaround for the Intel Windows driver bug that causes glTexSubImage2D to not change the stencil buffer. | Subv | 1 | -2/+9 | |
Reported here https://communities.intel.com/message/324464 | |||||
2015-08-24 | fixup! Shaders: Fix multiplications between 0.0 and inf | Yuri Kunde Schlesner | 1 | -4/+4 | |
2015-08-24 | Shader JIT: Tiny micro-optimization in DPH | Yuri Kunde Schlesner | 1 | -4/+4 | |
2015-08-24 | Shaders: Fix multiplications between 0.0 and inf | Yuri Kunde Schlesner | 3 | -40/+58 | |
The PICA200 semantics for multiplication are such that when multiplying inf by exactly 0.0, the result is 0.0, instead of NaN, as defined by IEEE. This is relied upon by games. Fixes #1024 (missing OoT interface items) | |||||
2015-08-24 | Shaders: Explicitly conform to PICA semantics in MAX/MIN | Yuri Kunde Schlesner | 2 | -2/+10 | |
2015-08-24 | Shader JIT: Add name to second scratch register (XMM4) | Yuri Kunde Schlesner | 1 | -3/+5 | |
2015-08-24 | shader_jit: Replace two MDisp usages with MatR | Lioncash | 1 | -2/+2 | |
2015-08-24 | Shader JIT: Fix CMP NaN behavior to match hardware | Yuri Kunde Schlesner | 1 | -8/+23 | |
2015-08-23 | HWRenderer: Only reload the framebuffer from gpu memory if the hw renderer is in use during a breakpoint. | Subv | 1 | -2/+6 | |
2015-08-23 | Shader: Use std::sqrt for float instead of sqrt | aroulin | 1 | -1/+1 | |
2015-08-23 | Shader: RCP and RSQ computes only the 1st component | aroulin | 2 | -10/+10 | |
2015-08-22 | Shader: implement DPH/DPHI in JIT | aroulin | 2 | -2/+36 | |
2015-08-22 | Shader: implement DPH/DPHI in interpreter | aroulin | 1 | -1/+8 | |
Tests revealed that the component with w=1 is SRC1 and not SRC2; it is now fixed on 3dbrew. | |||||
2015-08-21 | HWRasterizer: Implemented stencil ops 6 and 7. | Subv | 1 | -1/+3 | |
2015-08-21 | SWRasterizer: Implemented stencil ops 6 and 7. | Subv | 2 | -6/+14 | |
IncrementWrap and DecrementWrap, verified with hwtests. | |||||
2015-08-21 | HWRasterizer: Implemented stencil op 1 (GL_ZERO) | Subv | 1 | -1/+1 | |
2015-08-21 | SWRasterizer: Implemented stencil action 1 (GL_ZERO). | Subv | 2 | -1/+4 | |
Verified with hwtests. | |||||
2015-08-21 | SWRasterizer: Removed a todo. Verified with hwtests. | Subv | 1 | -1/+0 | |
2015-08-21 | SWRenderer: The stencil depth_pass action is executed even if depth testing is disabled. | Subv | 1 | -7/+5 | |
The HW renderer already did this. | |||||
2015-08-21 | Rasterizer: Abstract duplicated stencil code into a lambda. | Subv | 1 | -6/+9 | |
2015-08-20 | GLRasterizer: Implemented stencil testing in the hw renderer. | Subv | 4 | -2/+44 | |
2015-08-20 | GPU/Rasterizer: Corrected the stencil implementation. | Subv | 2 | -18/+39 | |
Verified the behavior with hardware tests. | |||||
2015-08-19 | Shader: implement SGE, SGEI and SLT in JIT | aroulin | 2 | -15/+36 | |
2015-08-19 | Shader: implement SGE, SGEI in interpreter | aroulin | 1 | -0/+14 | |
2015-08-19 | Shader: Save caller-saved registers in JIT before a CALL | aroulin | 2 | -0/+33 | |
2015-08-17 | Shader: implement EX2 and LG2 in JIT | aroulin | 2 | -2/+22 | |
2015-08-16 | Fix Linux GCC 4.9 build (complaining about undeclared memset) | LittleWhite | 1 | -1/+2 | |
2015-08-16 | Shader: implement EX2 and LG2 in interpreter | aroulin | 1 | -0/+36 | |
2015-08-16 | Build fix for Debug configurations. | Tony Wasserka | 1 | -1/+1 | |
2015-08-16 | Introduce a shader tracer to allow inspection of input/output values for each processed instruction. | Tony Wasserka | 8 | -41/+326 | |
2015-08-16 | Pica/DebugUtils: Include uniform information into shader dumps. | Tony Wasserka | 2 | -11/+51 | |
2015-08-16 | citra-qt: Improve shader debugger. | Tony Wasserka | 4 | -13/+28 | |
Now supports dumping the current shader and recognizes a larger number of output semantics. | |||||
2015-08-16 | videocore: Added RG8 texture support | Patrick Martin | 2 | -1/+8 | |
2015-08-16 | Shader: Use a POD struct for registers. | bunnei | 5 | -40/+43 | |
2015-08-16 | Rename ARCHITECTURE_X64 definition to ARCHITECTURE_x86_64. | bunnei | 2 | -7/+6 | |
2015-08-16 | Common: Cleanup CPU capability detection code. | bunnei | 1 | -5/+5 | |
2015-08-16 | Common: Move cpu_detect to x64 directory. | bunnei | 1 | -2/+1 | |
2015-08-16 | x64: Refactor to remove fake interfaces and general cleanups. | bunnei | 6 | -150/+26 | |
2015-08-16 | JIT: Support negative address offsets. | bunnei | 1 | -26/+25 | |
2015-08-16 | Shader: Initial implementation of x86_x64 JIT compiler for Pica vertex shaders. | bunnei | 10 | -3/+940 | |
- Config: Add an option for selecting to use shader JIT or interpreter. - Qt: Add a menu option for enabling/disabling the shader JIT. | |||||
2015-08-15 | Common: Added MurmurHash3 hash function for general-purpose use. | bunnei | 1 | -1/+1 | |
2015-08-15 | Shader: Define a common interface for running vertex shader programs. | bunnei | 7 | -186/+289 | |
2015-08-15 | Shader: Move shader code to its own subdirectory, "shader". | bunnei | 9 | -12/+12 | |
2015-08-15 | GPU: Refactor "VertexShader" namespace to "Shader". | bunnei | 13 | -50/+48 | |
- Also renames "vertex_shader.*" to "shader_interpreter.*" | |||||
2015-08-11 | ARM Core, Video Core, CitraQt, Citrace: Use CommonTypes types instead of the standard u?int*_t types. | Emmanuel Gil Peyrot | 1 | -1/+2 | |
2015-08-06 | OpenGL: Fix state tracking in situations with reused object handles | Yuri Kunde Schlesner | 4 | -0/+45 | |
If an OpenGL object is created, bound to a binding using the state tracker, and then destroyed, a newly created object can be assigned the same numeric handle by OpenGL. However, even though it is a new object, and thus needs to be bound to the binding again, the state tracker compared the current and previous handles and concluded that no change needed to be made, leading to failure to bind objects in certain cases. This manifested as broken text in VVVVVV, which this commit fixes along with similar texturing problems in other games. | |||||
2015-08-06 | OpenGL: Remove redundant texture.enable_2d field from OpenGLState | Yuri Kunde Schlesner | 4 | -26/+3 | |
All uses of this field where it's false can just set the texture id to 0 instead. | |||||
2015-08-05 | Videocore: Implement simple vertex caching | Yuri Kunde Schlesner | 1 | -62/+89 | |
This gives a ~2/3 reduction in the number of vertices that need to be processed through the vertex loaders and the vertex shader, yielding a good speedup. | |||||
2015-07-28 | OpenGL: Add a profiler category measuring framebuffer readback | Yuri Kunde Schlesner | 1 | -0/+7 | |
2015-07-26 | citra-qt/debug_utils: Use lock_guard everywhere | Lectem | 1 | -6/+5 | |
unique_lock were being used as lock_guards. Also replaced manual lock/unlock by lock_guard for harmonization. | |||||
2015-07-26 | citra-qt/command list: Add mask column | Lectem | 3 | -25/+24 | |
2015-07-26 | OpenGL: Make OpenGL object resource wrappers fully inline | Yuri Kunde Schlesner | 3 | -143/+79 | |
The functions are so simple that having them separate only bloats the code and hinders optimization. | |||||
2015-07-26 | Videocore: Don't reinitialize register name map on every query | Yuri Kunde Schlesner | 2 | -65/+72 | |
This greatly speeds up the command list debug widget. | |||||
2015-07-26 | Videocore: Simplify variables in vertex shader interpreter | Yuri Kunde Schlesner | 1 | -24/+21 | |
Simplifies the code and gives a tiny speed-up. | |||||
2015-07-26 | Videocore: Replace std::stack in shader interpreter with static_vector | Yuri Kunde Schlesner | 1 | -6/+6 | |
Shaves off 1/3rd of the vertex shader time in Fire Emblem | |||||
2015-07-26 | VideoCore: #ifdef out some debugging routines | Yuri Kunde Schlesner | 5 | -13/+18 | |
Some disabled debugging functionality was being called from rendering routines in VideoCore. Although disabled, many of them still allocated memory or did some extra work that was enough to show up in a profiler. Gives a slight (~2ms) speedup. | |||||
2015-07-25 | Address error that remained in last merge | Yuri Kunde Schlesner | 1 | -1/+1 | |
2015-07-23 | VideoCore: Fix values of unset components in input attribute arrays | Yuri Kunde Schlesner | 1 | -42/+38 | |
If an input attribute array had a field with less than 4 components, the remaining components were left unset if not specified by a default vertex attribute. If neither mechanism would set a component, it would assume a garbage value. It has been verified that the hardware behavior is instead to set the missing components from the fixed default of (0 0 0 1). The default vertex attribute values aren't used at all if a vertex array is specified for that attribute. Fixes UI graphics on Fire Emblem: Awakening, a small texturing glitch when selecting a character in Cubic Ninja, as well as eliminating the unset-W hack which was required for Ocarina of Time to not have garbled triangles. This change has been tested against hardware. | |||||
2015-07-23 | VideoCore: Saturate vertex colors before interpolating | Yuri Kunde Schlesner | 1 | -0/+6 | |
During testing, it was discovered that hardware does not interpolate colors output by the vertex shader as-is. Rather, it drops the sign and saturates the value to 1.0. This is done before interpolation, such that (e.g.) interpolating outputs 1.5 and -0.5 is equivalent to the shader having output the values 1.0 and 0.5 instead, with the interpolated value never crossing 0.0. This change has been tested against hardware. | |||||
2015-07-23 | Qt/GPU Breakpoints: Added three more breakpoint types: | Subv | 2 | -4/+7 | |
* IncomingDisplayTransfer: Triggered just before a display transfer is performed. * GSPCommandProcessed: Triggered right after a GSP command is processed. * BufferSwapped: Triggered when the frames flip | |||||
2015-07-23 | Rasterizer/GL: Set the border color when binding a texture. | Subv | 1 | -2/+9 | |
2015-07-22 | GL Renderer: Remove erroneous glEnable(GL_TEXTURE_2D) calls | Yuri Kunde Schlesner | 1 | -8/+5 | |
In OpenGL 3, texturing is always enabled, and this call is invalid. While it produced no effect in the rest of the execution, it wouldn't have the intended effect of disabling texturing for that unit. Instead bind a null texture to the unit. | |||||
2015-07-21 | GPU: Added registers for min and mag texture filters and implemented them in the hw renderer. | Subv | 4 | -3/+37 | |
2015-07-20 | Pica: Correct switched S/T texture wrapping registers | Yuri Kunde Schlesner | 1 | -2/+2 | |
This was found and hwtested by Lectem | |||||
2015-07-20 | Pica: Fix DP3 instruction, which wasn't assigning to the w component | Yuri Kunde Schlesner | 1 | -1/+1 | |
2015-07-19 | GLRasterizer: Don't try to get a pointer to the depth buffer if it doesn't exist. | Subv | 1 | -3/+7 | |
2015-07-19 | Rasterizer/Textures: Fixed a bug where the I4 format would get twice the real stride. | Subv | 1 | -0/+1 | |
Also added its name to the texture viewer widget | |||||
2015-07-19 | Vertex Shader : Undo casting | zawata | 1 | -1/+1 | |
2015-07-19 | Video_Core : Type fixes | zawata | 2 | -2/+2 | |
2015-07-19 | Video_Core: Finally fix pesky warning | zawata | 1 | -1/+1 | |
2015-07-19 | Video_Core : Change Tabs to Spaces | zawata | 1 | -0/+15 | |
This really should be universalized, I keep getting errors creating commits because lines I've edited use tabs instead of spaces(and yes I did read the contributing guide and i know they are supposed to be spaces) | |||||
2015-07-19 | Video_Core : Fix Conversion Warnings | zawata | 3 | -18/+3 | |
2015-07-15 | Pica/Shader: Add geometry shader definitions. | Tony Wasserka | 5 | -149/+162 | |
2015-07-15 | Pica/CommandProcessor: Move default attribute setup to the proper position. | Tony Wasserka | 1 | -40/+40 | |
2015-07-15 | Pica/Clipper: Output proper number of triangles in debugging logs. | Tony Wasserka | 1 | -1/+1 | |
2015-07-14 | VideoCore: Implement the DOT3_RGB combiner | Lectem | 2 | -1/+13 | |
2015-07-13 | Pica: Implement stencil testing. | Tony Wasserka | 2 | -12/+173 | |
2015-07-13 | Clean up command_processor.cpp. | Tony Wasserka | 1 | -22/+27 | |
2015-07-13 | Add CiTrace recording support. | Tony Wasserka | 3 | -2/+63 | |
This is exposed in the GUI as a new "CiTrace Recording" widget. Playback is implemented by a standalone 3DS homebrew application (which only runs reliably within Citra currently; on an actual 3DS it will often crash still). | |||||
2015-07-09 | Added GL_CLAMP_TO_BORDER support | Lectem | 3 | -13/+28 | |
2015-06-28 | Core: Cleanup hw includes. | Emmanuel Gil Peyrot | 5 | -4/+13 | |
2015-06-28 | Core, VideoCore: Replace or fix exit() calls. | Emmanuel Gil Peyrot | 1 | -6/+9 | |
2015-06-28 | CitraQt: Cleanup includes. | Emmanuel Gil Peyrot | 3 | -5/+10 | |
2015-06-28 | Common: Cleanup emu_window includes. | Emmanuel Gil Peyrot | 3 | -10/+8 | |
2015-06-28 | Common: Cleanup key_map includes. | Emmanuel Gil Peyrot | 2 | -3/+9 | |
2015-06-27 | VideoCore: Fix floating point warning | zawata | 1 | -1/+1 | |
2015-06-16 | VideoCore: Log the GL driver’s vendor and renderer. | Emmanuel Gil Peyrot | 1 | -0/+2 | |
2015-06-14 | video_core: add extra braces around initializer | Yuri Kunde Schlesner | 1 | -3/+3 | |
Trivial change and fixes several warnings in the clang build. | |||||
2015-06-09 | Renderer formatting edits | tfarley | 2 | -26/+29 | |
2015-06-09 | Render-to-texture flush, interval math fix | tfarley | 1 | -1/+13 | |
2015-06-09 | Liberal texture unbind (clout menu) | tfarley | 2 | -4/+40 | |
2015-06-09 | Depth format fix (crush3d intro/black screens) | tfarley | 1 | -46/+46 | |
2015-06-09 | Implemented glColorMask | tfarley | 3 | -0/+24 | |
2015-05-31 | Pica: Use zero for the SecondaryFragmentColor source. | bunnei | 3 | -11/+21 | |
- This is a workaround until we support fragment lighting. | |||||
2015-05-31 | rasterizer: Remove unnecessary 'using' for BlendEquation. | bunnei | 1 | -2/+1 | |
2015-05-31 | Pica: Implement LogicOp function. | bunnei | 7 | -8/+135 | |
2015-05-31 | rasterizer: Implement AddSigned combiner function for alpha channel. | bunnei | 1 | -0/+7 | |
2015-05-31 | vertex_shader: Use address offset on src2 in inverted mode. | bunnei | 1 | -3/+3 | |
2015-05-31 | Pica: Implement command buffer execution registers. | bunnei | 2 | -44/+76 | |
2015-05-31 | vertex_shader: Implement SLT/SLTI instructions. | bunnei | 1 | -4/+10 | |
2015-05-31 | vertex_shader: Implement MIN instruction. | bunnei | 1 | -0/+9 | |
2015-05-30 | Move video_core/color.h to common/color.h | archshift | 5 | -218/+4 | |
2015-05-30 | Move video_core/math.h to common/vector_math.h | archshift | 7 | -648/+6 | |
The file only contained vector manipulation code, and such widely-useable code doesn't belong in video_core. | |||||
2015-05-29 | Remove every trailing whitespace from the project (but externals). | Emmanuel Gil Peyrot | 11 | -25/+25 | |
2015-05-23 | gl_state: Remove unnecessary const specifier on Apply | Lioncash | 2 | -2/+2 | |
2015-05-23 | video_core/utils: Remove unused variables in GetMortonOffset | Lioncash | 1 | -3/+0 | |
2015-05-23 | Pica: Create 'State' structure and move state memory there. | bunnei | 12 | -428/+451 | |
2015-05-23 | gl_state: Fix a condition typo in Apply | Lioncash | 1 | -1/+1 | |
2015-05-23 | OpenGL renderer | tfarley | 21 | -44/+2196 | |
2015-05-17 | GPU/DefaultAttributes: Clear up a comment in command_processor | Subv | 1 | -2/+2 | |
2015-05-17 | GPU/DefaultAttributes: Let the attribute data from the loaders overwrite the default attributes, if set. | Subv | 1 | -21/+23 | |
closes #735 | |||||
2015-05-15 | Memmap: Re-organize memory function in two files | Yuri Kunde Schlesner | 4 | -5/+3 | |
memory.cpp/h contains definitions related to accessing memory and configuring the address space. mem_map.cpp/h contains higher-level definitions related to configuring the address space according to the kernel and allocating memory. | |||||
2015-05-14 | pica: Add the ULL specifier in IsDefaultAttribute | Lioncash | 1 | -1/+1 | |
This is necessary; otherwise there are warnings about a 32-bit result being cast to a 64-bit value. | |||||
2015-05-12 | GPU: Add more fine grained profiling for vertex shader and rasterization | Yuri Kunde Schlesner | 2 | -0/+10 | |
2015-05-11 | Implement I4 texture format | archshift | 2 | -1/+12 | |
@neobrain, could you confirm that this is correct? It's been tested with various different games and fixes different textures, including in Animal Crossing, Kirby Triple Deluxe, and SMB3D. | |||||
2015-05-10 | rasterizer: Implemented combiner output scaling. | bunnei | 2 | -2/+16 | |
2015-05-10 | rasterizer: Implemented AddSigned combiner op. | bunnei | 1 | -0/+10 | |
2015-05-10 | rasterizer: Fixed a depth testing bug. | bunnei | 2 | -6/+19 | |
2015-05-10 | rasterizer: Implement combiner buffer input. | bunnei | 2 | -4/+53 | |
2015-05-10 | rasterizer: Return zero'd vectors on error conditions. | bunnei | 1 | -3/+3 | |
2015-05-10 | vertex_shader: Implement FLR instruction. | bunnei | 1 | -0/+9 | |
2015-05-10 | vertex_shader: Implement MADI instruction. | bunnei | 1 | -4/+7 | |
nihstro: Update submodule to latest upstream/master to support MADI instruction decoding. | |||||
2015-05-09 | Memory: Add GetPhysicalPointer helper function | Yuri Kunde Schlesner | 3 | -11/+11 | |
2015-05-09 | Memory: Support more regions in the VAddr-PAddr translation functions | Yuri Kunde Schlesner | 3 | -18/+7 | |
Also adds better documentation and removes the one-off reimplementation of the function in pica.h. | |||||
2015-05-09 | Memory: Re-organize and rename memory area address constants | Yuri Kunde Schlesner | 1 | -1/+1 | |
2015-05-07 | Common: Remove common.h | Yuri Kunde Schlesner | 6 | -3/+8 | |
2015-05-07 | GPU: Implemented default vertex shader attributes. | Subv | 4 | -68/+137 | |
Fixes some games crashing. | |||||
2015-04-29 | VideoCore: Remove a superfluous auto variable declaration in debug_utils. | Emmanuel Gil Peyrot | 1 | -1/+1 | |
2015-04-10 | Silence some -Wsign-compare warnings. | Rohit Nirmal | 1 | -2/+2 | |
2015-04-05 | Changed occurrences of colour to color for consistency | Gareth Higgins | 2 | -4/+4 | |
2015-04-04 | Allow the user to set the background clear color during emulation | archshift | 1 | -1/+2 | |
The background color can be seen at the sides of the bottom screen or when the window is wider than normal. | |||||
2015-03-16 | VideoCore: Add static_cast around expressions where the compiler doesn’t deduce the right type. | Emmanuel Gil Peyrot | 2 | -4/+4 | |
2015-03-12 | Pica/VertexShader: Fix a bug caused due to incorrect assumptions of consecutive output register tables. | Tony Wasserka | 1 | -20/+24 | |
We now create a temporary buffer for output registers and copy all of them to the actual output vertex structure after the shader has run. This is technically not necessary, but it's easier to vectorize in the future. | |||||
2015-03-10 | GPU: Added the stencil test structure to the Pica Regs struct. | Subv | 3 | -50/+65 | |
2015-03-10 | GPU: Implemented more depth buffer formats. | Subv | 3 | -9/+115 | |
This fixes the horizontal lines in Picross E, Cubic Ninja, Cave Story 3D and possibly others | |||||
2015-03-09 | Added LCD registers, and implementation for color filling in OGL code. | archshift | 2 | -11/+48 | |
2015-03-09 | Pica/PrimitiveAssembly: Fix triangle strips and fans being generated with incorrect winding order. | Tony Wasserka | 1 | -6/+3 | |
2015-03-08 | Update nihstro submodule to the initial release version. | archshift | 1 | -37/+38 | |
Includes more opcodes to implement in the future. | |||||
2015-03-07 | Set framebuffer layout from EmuWindow. | bunnei | 3 | -43/+9 | |
2015-03-07 | GPU/Textures: Fixed ETC texture decoding. | Subv | 1 | -1/+1 | |
2015-03-04 | GPU: Added RGB565/RGB8 framebuffer support and various cleanups. | bunnei | 5 | -85/+155 | |
- Centralizes color format encode/decode functions. - Fixes endianness issues. - Implements remaining framebuffer formats in the debugger. | |||||
2015-03-02 | Add profiling infrastructure and widget | Yuri Kunde Schlesner | 2 | -0/+18 | |
2015-02-28 | Added RGBA5551 compatibility in the rasterizer | archshift | 3 | -2/+41 | |
This allows Virtual Console games to display properly. | |||||
2015-02-27 | GPU: Implemented bits 3 and 1 from the display transfer flags. | Subv | 3 | -54/+91 | |
Bit 3 is used to specify a raw copy, where no processing is done to the data; it seems to behave exactly as a DMA. Bit 1 is used to specify whether to convert from a tiled format to a linear format or vice versa. | |||||
2015-02-26 | Video core: Fix A4 texture decoding | Yuri Kunde Schlesner | 1 | -2/+2 | |
It was trying to take the LSB from `coarse_x`, which would always be 0 and thus would always return the same texel from each byte. To add insult to injury, the conditional was actually the wrong way around too. Fixes blocky text in OoT. | |||||
2015-02-26 | Video core: Fix pixelation/blockiness in textures. | Yuri Kunde Schlesner | 1 | -3/+3 | |
This was caused during morton decoding by me not masking the bits of each coordinate before merging them, so the bits from x could set bits in y if it was >255. | |||||
2015-02-25 | Rasterizer: Add support for RGBA4 framebuffer format. | bunnei | 1 | -0/+21 | |
2015-02-22 | Rasterize with the correct color component order. | bunnei | 1 | -11/+24 | |
- Fixes a regression with #594. | |||||
2015-02-21 | Pica/VertexShader: Fixed LOOP with more than one iteration. | Subv | 1 | -1/+4 | |
Previously it wouldn't jump back to the start of the loop code once it reached the end of the block. Fixes the texture problems in a lot of games. | |||||
2015-02-20 | Remove duplication of INSERT_PADDING_WORDS between pica.h and gpu.h | archshift | 1 | -11/+0 | |
2015-02-19 | Rasterizer: Fixed a warning in GetWrappedTexCoord. | Subv | 1 | -4/+4 | |
Redeclaring the variable inside the switch was causing weird behavior. | |||||
2015-02-18 | Pica/Rasterizer: Replace exit() calls with UNIMPLEMENTED(). | Tony Wasserka | 1 | -5/+5 | |
2015-02-18 | Pica/Rasterizer: Make some local lambdas static. | Tony Wasserka | 1 | -8/+8 | |
2015-02-18 | Pica/BlendUnit: Implement separate color/alpha blend equations. | Tony Wasserka | 2 | -65/+59 | |
2015-02-18 | Pica/TextureEnvironment: Add a note. | Tony Wasserka | 1 | -0/+4 | |
2015-02-18 | Pica/TextureEnvironment: Treat texture combiner source 1 as the PrimaryColor. | Tony Wasserka | 2 | -0/+4 | |
Not really sure where the difference is, but some applications seem to use this 1:1 the same way... | |||||
2015-02-18 | Pica/TextureEnvironment: Add support for the MAD-like texture combiners and clean up texture environment logic. | Tony Wasserka | 2 | -0/+28 | |
2015-02-18 | Pica/OutputMerger: Fix flipped framebuffers. | Tony Wasserka | 1 | -0/+10 | |
2015-02-18 | Pica/TextureUnit: Implement mirrored repeating texture wrapping. | Tony Wasserka | 2 | -3/+12 | |
2015-02-18 | Pica: Fix a bug in the register definitions, relating to texture wrapping. | Tony Wasserka | 2 | -2/+2 | |
2015-02-18 | Pica/OutputMerger: Implement color format checking. | Tony Wasserka | 2 | -4/+13 | |
2015-02-18 | Pica/Rasterizer: Rasterize actual pixel centers instead of pixel corners. | Tony Wasserka | 1 | -2/+3 | |
2015-02-18 | Pica/Rasterizer: Fix garbage pixels at triangle borders. | Tony Wasserka | 1 | -1/+3 | |
2015-02-18 | Pica/Rasterizer: Clean up and fix backface culling. | Tony Wasserka | 1 | -11/+27 | |
2015-02-18 | Pica: Cleanup clipping code and change screenspace z to range from -1..0. | Tony Wasserka | 2 | -53/+42 | |
The change in depth range seems to better reflect what applications are expecting, and makes for cleaner code overall (hence is more likely to reflect hardware behavior). | |||||
2015-02-18 | Pica/VertexShader: Implement the LOOP instruction. | Tony Wasserka | 1 | -14/+36 | |
2015-02-18 | Pica/CommandProcessor: Properly implement shader load destination offset registers. | Tony Wasserka | 2 | -20/+10 | |
2015-02-18 | Pica/CommandProcessor: Work around initialized vertex attributes some more. | Tony Wasserka | 1 | -2/+8 | |
2015-02-17 | core/video_core: Use in-place construction where possible | Lioncash | 2 | -4/+4 | |
2015-02-16 | VideoCore: Fix a typo in Vec4 MakeVec(T, Vec3<T>), where the second argument was Vec2<T> instead. | Emmanuel Gil Peyrot | 1 | -1/+1 | |
2015-02-15 | video_core: Implement the remaining framebuffer formats in the OpenGL renderer. | Emmanuel Gil Peyrot | 2 | -12/+67 | |
2015-02-12 | Build: Fixed some warnings | Subv | 2 | -3/+3 | |
2015-02-11 | Fix Min and Max blend equations | Darius Goad | 1 | -6/+8 | |
2015-02-11 | Asserts: break/crash program, fit to style guide; log.h->assert.h | archshift | 8 | -23/+18 | |
Involves making asserts use printf instead of the log functions (log functions are asynchronous and, as such, the log won't be printed in time). As such, the log type argument was removed (printf obviously can't use it, and it's made obsolete by the file and line printing). Also removed some GEKKO cruft. | |||||
2015-02-10 | Add more blend equations from 3dbrew | Darius Goad | 2 | -2/+49 | |
2015-02-05 | Rasterizer: Implement the other color and alpha modifiers. | bunnei | 2 | -58/+69 | |
2015-02-05 | VideoCore: Added same-component swizzlers to math utility functions. | bunnei | 1 | -16/+35 | |
2015-01-31 | Pica: Implement blend factors. | bunnei | 2 | -10/+67 | |
2015-01-28 | Pica: Implement color/alpha channel enable. | bunnei | 2 | -1/+12 | |
2015-01-27 | Rasterizer: Implemented alpha testing. | bunnei | 2 | -7/+52 | |
2015-01-26 | GPU: Implement the remaining depth testing functions. | bunnei | 2 | -3/+28 | |
2015-01-14 | GSP: Update framebuffer info on all interrupts | Yuri Kunde Schlesner | 1 | -3/+1 | |
Hardware testing determined that the GSP processes shared memory framebuffer update info even when no memory transfer or filling GX commands are used. They are now updated on every interrupt, which isn't confirmed correct but matches hardware behaviour more closely. This also reverts the hack introduced in #404. It made a few games behave better, but I believe it's incorrect and also breaks other games. | |||||
2015-01-13 | Pica/Rasterizer: Add ETC1 texture decompression support. | Tony Wasserka | 2 | -14/+142 | |
2015-01-13 | Pica/VertexShader: Implement JMPC/JMPU/CALLC/CALLU. | Tony Wasserka | 1 | -23/+52 | |
2015-01-13 | Pica/VertexShader: Implement the MAD instruction. | Tony Wasserka | 1 | -0/+69 | |
2015-01-08 | GSP: Toggle active framebuffer each frame | bunnei | 1 | -1/+4 | |
2014-12-31 | Pica/Rasterizer: Remove some redundant casts. | Tony Wasserka | 1 | -3/+3 | |
2014-12-31 | Pica/Rasterizer: Make orient2d a free function and rename it to SignedArea. | Tony Wasserka | 1 | -31/+38 | |
2014-12-31 | Pica: Cleanup color conversion. | Tony Wasserka | 2 | -18/+46 | |
2014-12-31 | VideoCore: Remove some unused functions. | Tony Wasserka | 1 | -26/+0 | |
2014-12-31 | Pica/Rasterizer: Fix a bug related to multitexturing and texture wrapping. | Tony Wasserka | 1 | -2/+2 | |
2014-12-31 | Pica/Rasterizer: Clean up long code lines. | Tony Wasserka | 1 | -4/+8 | |
2014-12-31 | Pica/VertexShader: Coding style fixes. | Tony Wasserka | 1 | -16/+8 | |
2014-12-31 | Pica/CommandProcessor: Cleanups. | Tony Wasserka | 1 | -3/+4 | |
2014-12-31 | Pica/CommandProcessor: Workaround games not setting the input position's w component. | Tony Wasserka | 1 | -0/+14 | |
2014-12-31 | Pica/Rasterizer: Implement backface culling. | Tony Wasserka | 2 | -10/+36 | |
2014-12-31 | Pica/Rasterizer: Textures seem to be laid out flipped vertically. | Tony Wasserka | 1 | -1/+1 | |
Not sure if this is a correct fix. Probably should instead change the decoding logic itself. | |||||
2014-12-31 | Pica/DebugUtils: Fix a bug in RGBA4 texture decoding. | Tony Wasserka | 1 | -2/+2 | |
2014-12-31 | Pica/Rasterizer: Implement alpha blending. | Tony Wasserka | 1 | -0/+84 | |
2014-12-31 | Pica/Rasterizer: Implement depth testing. | Tony Wasserka | 2 | -6/+34 | |
2014-12-31 | Pica/Rasterizer: Further enhance Tev support. | Tony Wasserka | 1 | -4/+19 | |
2014-12-31 | Pica: Add output merger definitions. | Tony Wasserka | 1 | -1/+56 | |
2014-12-31 | Pica: Fix A4, IA4 and IA8 texture formats. | Tony Wasserka | 1 | -13/+7 | |
Both IA4 and IA8 had their component order mixed up. Additionally, IA4 used the wrong number of nibbles per texel. A4 skipped every second texel. | |||||
2014-12-31 | Pica/CommandProcessor: Add support for integer uniforms. | Tony Wasserka | 4 | -1/+30 | |
2014-12-29 | Rasterizer: Pre-divide vertex attributes by W | Yuri Kunde Schlesner | 3 | -8/+32 | |
Execute the division-by-W for perspective-correct interpolation of values in the clipper, moving them out of the rasterization inner loop. | |||||
2014-12-29 | GPU: Bitwise texture swizzling | Yuri Kunde Schlesner | 1 | -27/+24 | |
Replace the loop-based texture address swizzling code by a bit-twiddling implementation, providing a very small speed up. Also simplify addressing code. | |||||
2014-12-29 | Rasterizer: Common sub-expression elimination | Yuri Kunde Schlesner | 1 | -14/+17 | |
Move the computation of some values out of loops so that they're not constantly recalculated even when they don't change. | |||||
2014-12-29 | Clipper: Compact buffers on each clipping pass | Yuri Kunde Schlesner | 1 | -28/+27 | |
Use a new buffer management scheme in the clipper that allows using a bounded minimal amount of buffer space. Even though it copies more data, it is still slightly faster, likely due to using less cache. | |||||
2014-12-29 | Clipper: Avoid dynamic allocations | Yuri Kunde Schlesner | 1 | -10/+7 | |
The triangle clipper was allocating its temporary input, output and work buffers using a std::vector. Since this is a hot path, it's desirable to use stack allocation instead. | |||||
2014-12-29 | Vertex Shader: Zero OutputVertex to avoid denormals | Yuri Kunde Schlesner | 1 | -0/+4 | |
Unused OutputVertex attributes were being left un-initialized. The leftover garbage sometimes decoded as floating-point denormalized values, causing fallbacks to microcode and massive slowdowns in the rest of the rasterization pipeline even though the results were unused. By zeroing the structure we ensure these attributes only contain harmless zeros. | |||||
2014-12-29 | GPU: Implement frameskip and remove forced framebuffer swap hack. | bunnei | 1 | -0/+5 | |
2014-12-21 | Fix visual studio ambiguous symbol error | Apology11 | 1 | -4/+4 | |
2014-12-21 | More warning cleanups | Chin | 2 | -7/+7 | |
2014-12-21 | License change | purpasmart96 | 23 | -23/+23 | |
2014-12-20 | Pica/VertexShader: Promote a log message to critical status. | Tony Wasserka | 1 | -1/+1 | |
2014-12-20 | Pica/VertexShader: Small optimization. | Tony Wasserka | 1 | -7/+7 | |
2014-12-20 | Pica/VertexShader: Be robust against invalid inputs. | Tony Wasserka | 1 | -2/+9 | |
More specifically, this also fixes crashes by Citra trying to load a src2 register even if the current instruction does not use that. | |||||
2014-12-20 | Pica/VertexShader: Clarify a comment. | Tony Wasserka | 1 | -1/+3 | |
2014-12-20 | Pica/DebugUtils: Further cleanups to LookupTexture. | Tony Wasserka | 1 | -7/+7 | |
2014-12-20 | Pica/DebugUtils: Fix two warnings. | Tony Wasserka | 1 | -2/+2 | |
2014-12-20 | Pica/DebugUtils: Better document LookupTexture. | Tony Wasserka | 2 | -7/+16 | |
2014-12-20 | Pica/Rasterizer: Get rid of C-style casts. | Tony Wasserka | 1 | -4/+4 | |
2014-12-20 | Pica/DebugUtils: Make a number of variables static. | Tony Wasserka | 1 | -13/+13 | |
Makes for cleaner and faster code. | |||||
2014-12-20 | Pica/VertexShader: Cleanup flow control logic and implement CMP/IFU instructions. | Tony Wasserka | 1 | -50/+56 | |
2014-12-20 | Pica/VertexShader: Run instruction handlers according to the effective opcode. | Tony Wasserka | 1 | -1/+1 | |
This allows for proper emulation of the different CMP/LRP/MAD instructions. | |||||
2014-12-20 | Pica/VertexShader: Implement MAX instructions. | Tony Wasserka | 1 | -0/+9 | |
2014-12-20 | Pica: Add support for boolean uniforms. | Tony Wasserka | 4 | -2/+21 | |
2014-12-20 | Pica/VertexShader: Add support for MOVA, CMP and IFC. | Tony Wasserka | 2 | -7/+138 | |
2014-12-20 | Pica/VertexShader: Move code around a bit. | Tony Wasserka | 1 | -42/+58 | |
2014-12-20 | Pica/VertexShader: Some cleanups using std::array. | Tony Wasserka | 2 | -5/+19 | |
2014-12-20 | Pica/VertexShader: Support negating src2. | Tony Wasserka | 2 | -3/+9 | |
2014-12-20 | Pica/DebugUtils: Replace duplicated SHBIN structures in favor of nihstro's ones. | Tony Wasserka | 1 | -61/+8 | |
2014-12-20 | Pica/VertexShader: Remove (now) duplicated shader bytecode definitions in favor of nihstro's ones. | Tony Wasserka | 2 | -222/+30 | |
2014-12-20 | Pica/DebugUtils: Add an event triggered after loading a vertex. | Tony Wasserka | 2 | -0/+4 | |
2014-12-20 | Pica/PrimitiveAssembly: Implement triangle strips. | Tony Wasserka | 2 | -8/+16 | |
2014-12-20 | Pica/CommandProcessor: Add a safety check for invalid (?) GPU configurations. | Tony Wasserka | 1 | -0/+7 | |
2014-12-20 | Pica/CommandProcessor: Fix vertex decoding if multiple memory areas are accessed for different attributes. | Tony Wasserka | 1 | -7/+8 | |
2014-12-20 | Add support for a ridiculous number of texture formats. | Tony Wasserka | 2 | -7/+80 | |
2014-12-20 | Pica: Unify ugly address translation hacks. | Tony Wasserka | 5 | -16/+25 | |
2014-12-20 | Pica: Further improve Tev emulation. | Tony Wasserka | 3 | -12/+51 | |
2014-12-20 | Pica: Merge texture lookup logic for DebugUtils and Rasterizer. | Tony Wasserka | 3 | -55/+41 | |
This effectively adds support for a lot of texture formats in the rasterizer. | |||||
2014-12-20 | Pica: Implement texture wrapping. | Tony Wasserka | 2 | -2/+31 | |
2014-12-20 | Pica/DebugUtils: Add support for RGBA8, RGBA5551, RGBA4 and A8 texture formats. | Tony Wasserka | 2 | -3/+48 | |
2014-12-20 | Pica: Initial support for multitexturing. | Tony Wasserka | 3 | -24/+83 | |
2014-12-20 | Clean up some warnings | Chin | 1 | -2/+2 | |
2014-12-19 | Properly erase/remove an observer | chinhodado | 1 | -1/+1 | |
2014-12-13 | Convert old logging calls to new logging macros | Yuri Kunde Schlesner | 10 | -38/+50 | |
2014-12-12 | MemMap: Renamed "GSP" heap to "linear", as this is not specific to GSP. | bunnei | 1 | -2/+2 | |
- Linear simply indicates that the mapped physical address is always MappedVAddr+0x0C000000, thus this memory can be used for hardware devices' DMA (such as the GPU). | |||||
2014-12-10 | GSP: Trigger GPU interrupts at more accurate locations. | bunnei | 2 | -1/+15 | |
2014-12-10 | GPU: Fixed bug in command list size decoding. | bunnei | 1 | -1/+2 | |
2014-12-09 | Pica: Re-enable command names on MSVC. | Tony Wasserka | 1 | -5/+0 | |
The affected code is no longer limited by compiler support on that platform. | |||||
2014-12-09 | More coding style fixes. | Tony Wasserka | 1 | -6/+12 | |
2014-12-09 | Some code cleanup. | Tony Wasserka | 1 | -3/+1 | |
2014-12-09 | citra_qt: Add enhanced texture debugging widgets. | Tony Wasserka | 3 | -1/+30 | |
Double-clicking a texture parameter command in the pica command lists will spawn these as a new tab in the pica command list dock area. | |||||
2014-12-09 | citra-qt: Add texture viewer to Pica command list. | Tony Wasserka | 2 | -21/+45 | |
The texture viewer is enabled when selecting a write command to one of the texture config registers. | |||||
2014-12-09 | Pica/DebugUtils: Add breakpoint functionality. | Tony Wasserka | 3 | -0/+189 | |
2014-12-09 | Build fix for something which shouldn't have compiled successfully to begin with. | Tony Wasserka | 1 | -1/+1 | |
2014-12-07 | Integrate Boost into build system and perform a trivial cleanup in vertex_shader.cpp. | Tony Wasserka | 1 | -6/+10 | |
2014-12-03 | Change NULLs to nullptrs. | Rohit Nirmal | 2 | -7/+7 | |
2014-12-01 | Silence a few -Wsign-compare warnings. | Rohit Nirmal | 3 | -6/+6 | |
2014-11-30 | Fixed viewport error caused by rounding | vaguilar | 1 | -2/+2 | |
2014-11-19 | Remove tabs in all files except in skyeye imports and in generated GL code | Emmanuel Gil Peyrot | 4 | -14/+14 | |
2014-11-19 | Remove trailing spaces in every file but the ones imported from SkyEye, AOSP or generated | Emmanuel Gil Peyrot | 2 | -3/+3 | |
2014-11-18 | OpenGL Renderer: Cleanup viewport extent calculation. | Tony Wasserka | 2 | -44/+29 | |
2014-11-18 | Fixup EmuWindow interface and implementations thereof. | Tony Wasserka | 1 | -3/+3 | |
2014-11-18 | Viewport scaling and display density independence | Kevin Hartman | 2 | -1/+50 | |
The view is scaled to be as large as possible, without changing the aspect, within the bounds of the window. On "retina" displays, or other displays where window units != pixels, the view should no longer draw incorrectly. | |||||
2014-11-16 | vertex_shader: Fix control reaches end of function warning | Lioncash | 1 | -1/+1 | |
2014-11-14 | Fix two format strings. | Lioncash | 1 | -2/+2 | |
2014-10-30 | Fix some warnings | Sean | 2 | -3/+3 | |
2014-10-29 | Renamed source files of services to match port names | Gareth Poole | 1 | -1/+1 | |
2014-10-26 | Add `override` keyword through the code. | Yuri Kunde Schlesner | 1 | -4/+4 | |
This was automated using `clang-modernize`. | |||||
2014-10-21 | Only check OpenGL shader log if size is >1. | Yuri Kunde Schlesner | 1 | -9/+6 | |
This prevents a crash when the buffer size returned by the driver is 0, in which case no space is allocated to store even the NULL byte and glGetShaderInfoLog errors out. Thanks to @Relys for the bug report. | |||||
2014-10-12 | Rework OpenGL renderer. | Yuri Kunde Schlesner | 4 | -233/+193 | |
The OpenGL renderer has been revised, with the following changes: - Initialization and rendering have been refactored to reduce the number of redundant objects used. - Framebuffer rotation is now done directly, using texture mapping. - Vertex coordinates are now given in pixels, and the projection matrix isn't hardcoded anymore. | |||||
2014-10-12 | OpenGL renderer: Shuffle initialization code around and rename functions. | Yuri Kunde Schlesner | 2 | -25/+18 | |
2014-10-12 | Remove virtual inheritance from RendererOpenGL | Yuri Kunde Schlesner | 2 | -3/+3 | |
Also make destructor virtual so that instances are properly destructed. | |||||
2014-10-08 | Fix warnings in video_core | Lioncash | 7 | -23/+23 | |
2014-09-17 | Common: Rename the File namespace to FileUtil, to match the filename and prevent collisions. | Emmanuel Gil Peyrot | 1 | -1/+1 | |
2014-09-14 | Core: Fix warnings in gpu.cpp | Lioncash | 1 | -1/+1 | |
2014-09-12 | Added support for multiple input device types for KeyMap and connected Qt. | Kevin Hartman | 1 | -0/+1 | |
2014-09-09 | Moved common_types::Rect from common to Common namespace | archshift | 2 | -3/+3 | |
2014-09-07 | renderer_opengl.cpp: improved alignment for readability | archshift | 1 | -16/+16 | |
2014-09-07 | Dead code removal: video_core.cpp, load_symbol_map.cpp | archshift | 1 | -7/+0 | |
2014-09-07 | utils: cleaned up DumpTGA, removing redundancies | archshift | 2 | -21/+13 | |
2014-09-01 | Remove hand-crafted Visual Studio solution. | Yuri Kunde Schlesner | 2 | -217/+0 | |
2014-09-01 | CMake cleanup | Yuri Kunde Schlesner | 1 | -13/+26 | |
Several cleanups to the buildsystem: - Do better factoring of common libs between platforms. - Add support to building on Windows. - Remove Qt4 support. - Re-sort file lists and add missing headers. | |||||
2014-09-01 | Replace GLEW with a glLoadGen loader. | Yuri Kunde Schlesner | 10 | -13/+2819 | |
This should fix the GL loading errors that occur in some drivers due to the use of deprecated functions by GLEW. Side benefits are more accurate auto-completion (deprecated functions and symbols don't exist) and faster pointer loading (fewer entry points to load). In addition it removes an external library dependency, simplifying the build system a bit and eliminating one set of binary libraries for Windows. | |||||
2014-08-28 | Downgrade GLSL version to 1.50 (compatible with GL 3.2) | Yuri Kunde Schlesner | 3 | -10/+15 | |
2014-08-26 | VideoCore: Fixes rendering issues on Qt and corrects framebuffer output size. | bunnei | 4 | -8/+15 | |
2014-08-26 | Rewrite of OpenGL renderer, including OS X support | Kevin Hartman | 8 | -211/+340 | |
Screen contents are now displayed using textured quads. This can be updated to expose an FBO once an OpenGL backend for Pica rendering is being worked on. That FBO's texture can then be applied to the quads. Previously, FBO blitting was used in order to display screen contents, which did not work on OS X. The new textured quad approach is less of a compatibility risk. | |||||
2014-08-25 | Pica/Rasterizer: Clarify a TODO. | Tony Wasserka | 1 | -1/+3 | |
2014-08-25 | Pica/VertexShader: Fix a bug in the call stack handling. | Tony Wasserka | 1 | -2/+3 | |
2014-08-25 | Math: Warning fixes. | Tony Wasserka | 1 | -14/+23 | |
2014-08-25 | Pica: Consolidate the primitive assembly code in PrimitiveAssembly and GeometryDumper. | Tony Wasserka | 5 | -46/+74 | |
2014-08-25 | Pica/Rasterizer: Add texturing support. | Tony Wasserka | 3 | -18/+69 | |
2014-08-25 | Pica/DebugUtils: Add convenient tev setup printer. | Tony Wasserka | 3 | -0/+101 | |
2014-08-25 | Pica/Rasterizer: Add initial implementation of texture combiners. | Tony Wasserka | 2 | -2/+225 | |
2014-08-25 | Pica: Add support for dumping textures. | Tony Wasserka | 3 | -1/+177 | |
2014-08-25 | Pica/Math: Improved the design of the Vec2/Vec3/Vec4 classes and simplified rasterizer code accordingly. | Tony Wasserka | 3 | -98/+133 | |
- Swizzlers now return const objects so that things like "first_vec4.xyz() = some_vec3" now will fail to compile (ideally we should support some vector holding references to make this actually work). - The methods "InsertBeforeX/Y/Z" and "Append" have been replaced by more versions of MakeVec, which now also supports building new vectors from vectors. - Vector library now follows C++ type promotion rules (hence, the result of Vec2<u8> with another Vec2<u8> is now a Vec2<int>). | |||||
2014-08-25 | Pica/VertexShader: Fix a bug in the bitfield definitions and add the "negate" field for swizzlers. | Tony Wasserka | 2 | -14/+92 | |
2014-08-25 | Pica/citra-qt: Replace command list view and command list debugging code with something more sophisticated. | Tony Wasserka | 4 | -63/+78 | |
2014-08-25 | Pica/CommandProcessor: Implement parameter masking. | Tony Wasserka | 2 | -6/+25 | |
2014-08-25 | Pica: Add debug utilities for dumping shaders. | Tony Wasserka | 4 | -1/+227 | |
2014-08-25 | Pica: Add debug utility functions for dumping geometry data. | Tony Wasserka | 6 | -4/+123 | |
2014-08-24 | Fix the threading for GL Context in Qt5. | Sacha | 1 | -1/+0 | |
Connect the emu_thread start/finish to a moveContext slot. | |||||
2014-08-13 | float24: Remove private default constructor | archshift | 1 | -2/+0 | |
Fixes building with clang. | |||||
2014-08-12 | Use glewExperimental on Linux in order to fix GLFW-mode | archshift | 1 | -3/+2 | |
2014-08-12 | Pica: Add basic rasterizer. | Tony Wasserka | 7 | -2/+260 | |
2014-08-12 | Pica: Add triangle clipper. | Tony Wasserka | 7 | -8/+230 | |
2014-08-12 | Pica: Add primitive assembly stage. | Tony Wasserka | 7 | -2/+95 | |
2014-08-12 | Pica: Add vertex shader implementation. | Tony Wasserka | 7 | -10/+722 | |
2014-08-12 | Pica: Implement vertex loading. | Tony Wasserka | 2 | -8/+102 | |
2014-08-12 | Pica: Add register definition for vertex loading and rendering. | Tony Wasserka | 1 | -33/+128 | |
2014-08-12 | Pica: Add command processor. | Tony Wasserka | 7 | -5/+107 | |
2014-08-12 | Pica: Add float24 structure. | Tony Wasserka | 1 | -0/+75 | |
24-bit floating-point values are used internally for calculations on the GPU; however, the current code will still emulate that with 32-bit floating-point values. In the future we might want to accurately perform the calculations with correct bitness, but for now we just wrap the calculations around this class. | |||||
2014-08-12 | Video core: Add utility class for vector operations. | Tony Wasserka | 4 | -1/+582 | |
I wrote most of this for ppsspp, so I hold full copyright over it. In addition to the original release in ppsspp, this provides functionality to easily extend e.g. two-dimensional vectors to three-dimensional vectors. | |||||
2014-08-12 | Pica/GPU: Change hardware registers to use physical addresses rather than virtual ones. | Tony Wasserka | 2 | -8/+8 | |
This cleans up the mess that address reading/writing had become and makes the code a *lot* more sensible. This adds a physical<->virtual address converter to mem_map.h. For further accuracy, we will want to properly extend this to support a wider range of address regions. For now, this makes simple homebrew applications work in a good manner though. | |||||
2014-08-12 | Remove the fancy RegisterSet class introduced in 4c2bff61e. | Tony Wasserka | 2 | -100/+146 | |
While it was some nice and fancy template usage, it ultimately had many practical issues regarding length of involved expressions under regular usage as well as common code completion tools not being able to handle the structures. Instead, we now use a more conventional approach which is a lot cleaner to use. | |||||
2014-08-06 | GSP: Removed dumb GX prefixes to functions/structs in GSP namespace. | bunnei | 1 | -6/+6 | |
- Various other cleanups. | |||||
2014-07-23 | Use uniform formatting when printing hexadecimal numbers. | Tony Wasserka | 1 | -1/+1 | |
2014-07-23 | GSP: Clean up GX command processing a lot and treat command id as a u8 rather than a u32. | Tony Wasserka | 1 | -3/+2 | |
Anonymous structs are not standard C++, hence don't use them. | |||||
2014-07-23 | RegisterSet: Simplify code by using structs for register definition instead of unions. | Tony Wasserka | 1 | -9/+9 | |
2014-07-23 | GPU: Make use of RegisterSet. | Tony Wasserka | 1 | -26/+28 | |
2014-07-23 | Renderer: Fix component order in bottom framebuffer. | Tony Wasserka | 2 | -5/+4 | |
2014-07-23 | Renderer: Respect the active_fb GPU register. | Tony Wasserka | 1 | -2/+9 | |
2014-07-23 | Renderer: Add a few TODOs. | Tony Wasserka | 1 | -3/+10 | |
2014-07-22 | GPU debugger: Don't keep track of debugging data if no debugger views are active. | Tony Wasserka | 1 | -0/+6 | |
2014-06-12 | GPU debugger: Const correctness and build fix. | Tony Wasserka | 1 | -3/+3 | |
2014-06-12 | Preprocessor: #if's out OSX-specific GL changes on other platforms | archshift | 1 | -0/+3 | |
2014-06-12 | Pica: Use some template magic to define register structures efficiently. | Tony Wasserka | 1 | -25/+102 | |
2014-06-12 | Further refine GPU command list debugging. | Tony Wasserka | 2 | -0/+17 | |
2014-06-12 | Refine command list debugging functionality and its qt interface. | Tony Wasserka | 2 | -8/+17 | |
2014-06-12 | citra-qt: Add command list view. | Tony Wasserka | 1 | -2/+2 | |
2014-06-12 | GPU debugger: Add functionality to inspect command lists. | Tony Wasserka | 1 | -1/+53 | |
2014-06-12 | video core: added PICA definitions file. | Tony Wasserka | 3 | -0/+37 | |
2014-06-12 | Rename LCD to GPU. | Tony Wasserka | 1 | -3/+3 | |
2014-06-12 | Add initial graphics debugger interface. | Tony Wasserka | 3 | -3/+102 | |
2014-05-20 | common_types: Changed BasicRect back to Rect, in the common namespace | archshift | 2 | -3/+3 | |
Only Rect is in the namespace for now; the rest of common should be added in the future | |||||
2014-05-20 | Improved clarity and whitespace | archshift | 2 | -3/+4 | |
Changed QGL version to 3,2 in order to be less restrictive, yet it should still change up to 4,1 on OSX on Qt5. | |||||
2014-05-20 | CMakeLists: rename HEADS, improved comments | archshift | 1 | -2/+2 | |
Changes for clarity of comments, removed redundant compiler flags. | |||||
2014-05-19 | Indent fixes | archshift | 1 | -31/+31 | |
2014-05-08 | Update FlipFramebuffer | Sethpaien | 1 | -7/+6 | |
Fewer calculations + fix | |||||
2014-05-01 | Fixed indents | archshift | 2 | -37/+35 | |
2014-05-01 | Reverse debugging changes | archshift | 1 | -2/+0 | |
2014-05-01 | Unintended change reversal | archshift | 1 | -36/+36 | |
2014-05-01 | TGA dumps work, courtesy of @bunnei | archshift | 2 | -36/+38 | |
2014-05-01 | OpenGL 3+ on OSX with GLFW | archshift | 1 | -0/+2 | |
2014-04-29 | IT'S ALIVE! | archshift | 1 | -1/+6 | |
2014-04-28 | Xcode complains that the class name is redundant. | archshift | 1 | -1/+1 | |
2014-04-28 | Rect to BasicRect | archshift | 2 | -3/+3 | |
Somewhere along the line an OSX header had already taken the name Rect. | |||||
2014-04-28 | removed DISALLOW_COPY_AND_ASSIGN in favor of NonCopyable class | bunnei | 2 | -4/+1 | |
2014-04-27 | fixed renderer to use correct framebuffer location | bunnei | 2 | -8/+7 | |
2014-04-22 | fixed GL context acquire (applies to Qt GUI only) | bunnei | 1 | -0/+2 | |
2014-04-18 | renamed hw_lcd module to just lcd | bunnei | 1 | -1/+1 | |
2014-04-17 | fixed framebuffer color order | bunnei | 1 | -1/+1 | |
2014-04-17 | removed hard-coded framebuffer addresses from renderer_opengl.cpp | bunnei | 1 | -2/+4 | |
2014-04-11 | cleaned up some logging messages | bunnei | 1 | -1/+2 | |
2014-04-10 | fixed a bunch of errors in CMakeLists | bunnei | 1 | -3/+3 | |
2014-04-10 | updated CMakeLists | bunnei | 1 | -16/+2 | |
2014-04-09 | fixed project includes to use new directory structure | bunnei | 9 | -38/+39 | |
2014-04-09 | got rid of 'src' folders in each sub-project | bunnei | 7 | -0/+0 | |
2014-04-09 | fixed license header in video_core | bunnei | 7 | -182/+23 | |
2014-04-09 | - removed lots of unused code from gekko | bunnei | 3 | -367/+110 | |
- updated code style/naming conventions | |||||
2014-04-07 | added support for rendering the external framebuffers | bunnei | 2 | -20/+147 | |
2014-04-07 | added external framebuffer GL handles | bunnei | 1 | -2/+11 | |
2014-04-06 | added initial renderer code | bunnei | 7 | -19/+534 | |
2014-04-05 | added video_core project to solution | bunnei | 8 | -0/+567 | |