summaryrefslogtreecommitdiffstats
path: root/src/video_core/host_shaders (follow)
Commit message (Collapse)AuthorAgeFilesLines
* video_core: Integrate SMAALiam2022-12-089-5/+1564
| | | | | Co-authored-by: goldenx86 <goldenx86@users.noreply.github.com> Co-authored-by: BreadFish64 <breadfish64@users.noreply.github.com>
* video_core: Modify astc texture decode error fill valueFengChen2022-09-151-1/+1
|
* chore: make yuzu REUSE compliantAndrea Pappacoda2022-07-275-0/+15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | [REUSE] is a specification that aims at making file copyright information consistent, so that it can be both human and machine readable. It basically requires that all files have a header containing copyright and licensing information. When this isn't possible, like when dealing with binary assets, generated files or embedded third-party dependencies, it is permitted to insert copyright information in the `.reuse/dep5` file. Oh, and it also requires that all the licenses used in the project are present in the `LICENSES` folder, that's why the diff is so huge. This can be done automatically with `reuse download --all`. The `reuse` tool also contains a handy subcommand that analyzes the project and tells whether or not the project is (still) compliant, `reuse lint`. Following REUSE has a few advantages over the current approach: - Copyright information is easy to access for users / downstream - Files like `dist/license.md` do not need to exist anymore, as `.reuse/dep5` is used instead - `reuse lint` makes it easy to ensure that copyright information of files like binary assets / images is always accurate and up to date To add copyright information of files that didn't have it I looked up who committed what and when, for each file. As yuzu contributors do not have to sign a CLA or similar I couldn't assume that copyright ownership was of the "yuzu Emulator Project", so I used the name and/or email of the commit author instead. [REUSE]: https://reuse.software Follow-up to 01cf05bc75b1e47beb08937439f3ed9339e7b254
* chore: add missing SPDX tagsAndrea Pappacoda2022-04-281-21/+2
| | | | Follow-up to 99ceb03a1cfcf35968cab589ea188a8c406cda52
* general: Convert source file copyright comments over to SPDXMorph2022-04-2329-87/+58
| | | | | This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later.
* OpenGL: fix S8D24 to ABGR8 conversionsLiam2022-04-072-0/+19
|
* Address review commentsLiam2022-03-162-2/+2
|
* Vulkan: convert S8D24 <-> ABGR8Liam2022-03-162-0/+24
|
* astc_decoder: Combine FastReplicate functions to work around new NV driver bugameerj2022-01-161-34/+46
| | | | | | The new Nvidia drivers have a bug where the FastReplicateTo6 function produces a lookup into the REPLICATE_TO_8 table rather than the REPLICATE_TO_6 table. This seems to be an optimization gone wrong. Combining the logic of the FastReplicate functions seems to address the bug.
* Texture Cache: Correct conversion shaders.Fernando Sahmkow2021-11-222-2/+2
|
* TextureCache: Simplify blitting of D24S8 formats and fix bugs.Fernando Sahmkow2021-11-225-104/+0
|
* HostShaders: Fix D24S8 convertion shaders.Fernando Sahmkow2021-11-216-23/+47
|
* TextureCache: Assure full conversions on depth/stencil write shaders.Fernando Sahmkow2021-11-203-6/+6
|
* TextureCache: Add R16G16 to D24S8 converter.Fernando Sahmkow2021-11-202-0/+19
|
* TextureCache: Add B10G11R11 to D24S8 converter.Fernando Sahmkow2021-11-192-0/+20
|
* TextureCache: Implement additional D24S8 convertions.Fernando Sahmkow2021-11-193-0/+44
|
* Vulkan: implement D24S8 <-> RGBA8 convertions.Fernando Sahmkow2021-11-193-0/+40
|
* HostShader: fix Gaussian filter.FernandoS272021-11-161-2/+2
|
* host_shaders: Misc copyright/style changesameerj2021-11-164-10/+12
|
* Presentation: Only use FP16 in scaling shaders on supported devices in VulkanMarshall Mohror2021-11-169-95/+147
|
* HostShader: Fix gaussian and add attribution.Fernando Sahmkow2021-11-161-23/+19
|
* Vulkan: Implement FXAAFernandoS272021-11-161-1/+1
|
* OpenGL: Implement FXAAMarshall Mohror2021-11-163-0/+114
|
* VideoCore: Add gaussian filtering.FernandoS272021-11-162-0/+75
|
* Update scaleforce to use FP16Marshall Mohror2021-11-161-88/+55
|
* vulkan: Implement FidelityFX Super ResolutionMarshall Mohror2021-11-164-2/+155
|
* Renderers: Unify post processing filter shadersameerj2021-11-165-202/+28
|
* Renderer: Implement Bicubic and ScaleForce filters.Fernando Sahmkow2021-11-165-0/+388
|
* host_shaders: Remove opengl_copy_bgra.compameerj2021-09-172-16/+0
|
* astc_decoder: Reduce workgroup sizeameerj2021-08-011-1/+1
| | | | This reduces the amount of over dispatching when there are odd dimensions (i.e. ASTC 8x5), which rarely evenly divide into 32x32.
* astc_decoder: Compute offset swizzles in-shaderameerj2021-08-011-33/+13
| | | | Alleviates the dependency on the swizzle table and a uniform which is constant for all ASTC texture sizes.
* astc_decoder: Make use of uvec4 for payload dataameerj2021-08-011-79/+43
|
* astc_decoder: Simplify Select2DPartitionameerj2021-08-011-38/+19
|
* astc_decoder: Optimize the use EncodingDataameerj2021-08-011-25/+25
| | | | | | | This buffer was a list of EncodingData structures sorted by their bit length, with some duplication from the cpu decoder implementation. We can take advantage of its sorted property to optimize its usage in the shader. Thanks to wwylele for the optimization idea.
* Merge pull request #6459 from lat9nq/ubuntu-fixesAmeer J2021-07-011-1/+4
|\ | | | | cmake: Improve Linux dependency checking for externals
| * cmake: Fix find_program usage for 3.15lat9nq2021-06-131-1/+4
| | | | | | | | | | | | | | | | | | | | yuzu requires CMake 3.15 yet find_program was using REQUIRED, which is only available on 3.18 and later. Instead, we check for "<VAR>-NOTFOUND". In addition, check for additional requirements before building libusb or FFmpeg with autotools. Otherwise, CMake configuration will pass yet compilation will fail.
* | astc_decoder.comp: Remove unnecessary LUT SSBOsameerj2021-06-191-19/+16
| | | | | | | | We can move them to instead be compile time constants within the shader.
* | astc: Various robustness enhancements for the gpu decoderameerj2021-06-191-9/+6
| | | | | | | | | | | | These changes should help in reducing crashes/drivers panics that may occur due to synchronization issues between the shader completion and later access of the decoded texture.
* | astc_decoder: Fix LDR CEM1 endpoint calculationameerj2021-06-161-1/+1
|/ | | | | | | | Per the spec, L1 is clamped to the value 0xff if it is greater than 0xff. An oversight caused us to take the maximum of L1 and 0xff, rather than the minimum. Huge thanks to wwylele for finding this. Co-Authored-By: Weiyi Wang <wwylele@gmail.com>
* astc_decoder: Refactor for style and more efficient memory useameerj2021-03-251-262/+307
|
* astc_decoder: Reimplement LayersRodrigo Locatti2021-03-131-18/+15
| | | | Reimplements the approach to decoding layers in the compute shader. Fixes multilayer astc decoding when using Vulkan.
* astc_decoder: Fix out of bounds memory accessameerj2021-03-131-2/+10
| | | | resolves a crash with some anamolous textures found in Astral Chain.
* renderer_vulkan: Accelerate ASTC decodingameerj2021-03-131-21/+22
| | | | Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc>
* host_shaders: Modify shader cmake integration to allow for larger shadersameerj2021-03-133-2/+25
| | | | using a raw string to encapsulate the entire shader code limits us to shaders of size less than 2KB. This change overcomes this limitation.
* renderer_opengl: Accelerate ASTC texture decoding with a compute shaderameerj2021-03-131-0/+1288
| | | | | | ASTC texture decoding is currently handled by a CPU decoder for GPU's without native ASTC decoding support (most desktop GPUs). This is the cause for noticeable performance degradation in titles which use the format extensively. This commit adds support to accelerate ASTC decoding using a compute shader on OpenGL for GPUs without native support.
* renderer_opengl: Swizzle BGR textures on copyameerj2021-03-042-0/+16
| | | | | | OpenGL does not natively support BGR internal formats, which causes many BGR textures to render incorrectly, with Red and Blue channels swapped. This commit aims to address this by swizzling the blue and red channels on texture copies when a BGR format is encountered.
* video_core: Reimplement the buffer cacheReinUsesLisp2021-02-133-30/+8
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits.
* video_core: host_shaders: Don't pass --quiet to glslangValidator if unavailablelat9nq2021-02-021-1/+19
| | | | | Prevents CMake from calling `glslangValidator` with `--quiet` when it is not available, i.e. on older downstream versions from Ubuntu.
* host_shaders/cmake: Pass --quiet to glslang to keep it quietReinUsesLisp2021-01-241-1/+1
| | | | Silences noisy builds on toolchains.
* host_shaders: Add Vulkan assembler compute shadersReinUsesLisp2020-12-304-0/+96
|
* host_shaders: Add helper to blit depth stencil fragment shaderReinUsesLisp2020-12-302-0/+17
|
* host_shaders: Add texture color blit fragment shaderReinUsesLisp2020-12-302-0/+15
|
* host_shaders: Add shaders to present to the swapchainReinUsesLisp2020-12-303-0/+36
|
* host_shaders: Add shaders to convert between depth and color imagesReinUsesLisp2020-12-303-0/+28
|
* host_shaders: Add compute shader to copy BC4 as RG32UI to RGBA8ReinUsesLisp2020-12-302-0/+71
|
* host_shaders: Add shader to render a full screen triangleReinUsesLisp2020-12-302-0/+30
|
* host_shaders: Add pitch linear upload compute shaderReinUsesLisp2020-12-302-0/+87
|
* host_shaders: Add block linear upload compute shadersReinUsesLisp2020-12-303-0/+249
|
* host_shaders: Add copyright headers to OpenGL present shadersReinUsesLisp2020-12-302-0/+8
|
* video_core/host_shaders: Add support for prebuilt SPIR-V shadersReinUsesLisp2020-12-301-16/+37
| | | | | Add support for building SPIR-V shaders from GLSL and generating headers to include the text of those same GLSL shaders to consume from OpenGL.
* video_core: Fix instances where msbuild always regenerated host shadersReinUsesLisp2020-09-242-12/+7
| | | | | | When HEADER_GENERATOR was included in the DEPENDS section of custom commands, msbuild assumed this was always modified. Changing this file is not common so we can remove it from there.
* video_core/host_shaders: Add CMake integration for string shadersReinUsesLisp2020-08-245-0/+97
Add the necessary CMake code to copy the contents in a string source shader (GLSL or GLASM) to a header file then consumed by video_core files. This allows editting GLSL in its own files without having to maintain them in source files. For now, only OpenGL presentation shaders are moved, but we can add GLASM presentation shaders and static SPIR-V generation through glslangValidator in the future.