path: root/src/video_core/engines/maxwell_dma.cpp
Commit message (Author, Age, Files, Lines)
* engines: Remove unnecessary casts (Lioncash, 2022-11-29, 1 file, -7/+7)
|   In a few cases we have some casts that can be trivially removed.
* MaxwellDMA: Implement BlockLinear to BlockLinear copies. (Fernando Sahmkow, 2022-11-24, 1 file, -1/+66)
|
* Merge pull request #9229 from Docteh/achy_breaky_heart (Morph, 2022-11-18, 1 file, -0/+1)
|\
| |   Add break for default cases
| * Add break for default cases (Kyle Kienapfel, 2022-11-14, 1 file, -0/+1)
| |   Visual Studio has an option to search all files in a solution, so I did a search in there for "default:" looking for any missing break statements.
| |
| |   I've left out default statements that return something, and that throw something, even if via ThrowInvalidType.
| |
| |   UNREACHABLE leads towards throw
| |   R_THROW macro leads towards a return
* | video_core: Fix dma copy 1D random crash (FengChen, 2022-11-10, 1 file, -17/+20)
|/
* video_core: implement 1D copies based on VMM 'kind' (FengChen, 2022-10-17, 1 file, -54/+73)
|
* maxwell_dma: remove warnings from implemented functionality (Liam, 2022-10-06, 1 file, -2/+0)
|
* NVDRV: Further improvements. (Fernando Sahmkow, 2022-10-06, 1 file, -5/+13)
|
* DMA & InlineToMemory Engines Rework. (bunnei, 2022-10-06, 1 file, -33/+58)
|
* common: Change semantics of UNREACHABLE to unconditionally crash (Liam, 2022-06-14, 1 file, -2/+2)
|
* maxwell_dma: use fallback if remapping is enabled (Liam, 2022-05-11, 1 file, -3/+6)
|
* maxwell_dma: fix bytes per pixel (Liam, 2022-05-07, 1 file, -3/+3)
|
* general: Convert source file copyright comments over to SPDX (Morph, 2022-04-23, 1 file, -3/+2)
|   This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later.
* MaxwellDMA: Implement semaphore operations (Lody, 2022-03-07, 1 file, -1/+19)
|
* maxwell_dma: Minor refactoring (ameerj, 2021-09-20, 1 file, -32/+32)
|
* DMA: Restrict optimised path for BlockToLinear further. (FernandoS27, 2021-07-23, 1 file, -1/+2)
|
* Buffer cache: Fixes, Clang and Feedback. (Fernando Sahmkow, 2021-07-15, 1 file, -0/+5)
|
* DMAEngine: Revert flushing from Pitch to BlpockLinear. (Fernando Sahmkow, 2021-07-14, 1 file, -2/+7)
|
* DMAEngine: Accelerate BufferClear (Fernando Sahmkow, 2021-07-13, 1 file, -2/+4)
|
* accelerateDMA: Accelerate Buffer Copies. (Fernando Sahmkow, 2021-07-11, 1 file, -9/+27)
|
* common: Move settings to common from core. (bunnei, 2021-04-15, 1 file, -1/+1)
|   - Removes a dependency on core and input_common from common.
* video_core: Reimplement the buffer cache (ReinUsesLisp, 2021-02-13, 1 file, -3/+0)
|   Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache.
|
|   - Bindings are cached, allowing to skip work when the game changes few bits between draws.
|   - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation.
|   - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData.
|   - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot).
|   - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecessary work.
|
|   This commit adds the necessary infrastructure to use Vulkan object from OpenGL.
|
|   Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors; these are planned to be fixed in later commits.
* video_core: Rewrite the texture cache (ReinUsesLisp, 2020-12-30, 1 file, -0/+3)
|   The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage.
|
|   This commit aims to address those issues.
* video_core: Resolve more variable shadowing scenarios (Lioncash, 2020-12-04, 1 file, -2/+4)
|   Resolves variable shadowing scenarios up to the end of the OpenGL code to make it nicer to review. The rest will be resolved in a following commit.
* video_core: Enforce -Wunused-variable and -Wunused-but-set-variable (ReinUsesLisp, 2020-10-03, 1 file, -2/+0)
|
* textures/decoders: Fix block linear to pitch copies (ReinUsesLisp, 2020-08-11, 1 file, -13/+8)
|   There were two issues with block linear copies. First the swizzling was wrong and this commit reimplements them. The other issue was that these copies are generally used to download render targets from the GPU and yuzu was not downloading them from host GPU memory unless the extreme GPU accuracy setting was selected. This commit enables cached memory reads for all accuracy levels.
|
|   - Fixes level thumbnails in Super Mario Maker 2.
* video_core/textures: Add and use SwizzleSliceToVoxel, and minor style changes (ReinUsesLisp, 2020-07-10, 1 file, -13/+17)
|   Change GOB sizes from free-functions to constexpr constants.
|
|   Add SwizzleSliceToVoxel, a function that swizzles a 2D array of pixels into a 3D texture and use it for 3D copies.
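As an illustration of the "free-functions to constexpr constants" change described in the commit above, here is a minimal sketch. The names and the helper are hypothetical, not taken from the yuzu source, and the values assume a GOB of 64 bytes by 8 rows by 1 slice (512 bytes).

```cpp
#include <cstdint>

// Before (hypothetical): a free function that has to be called at every use site.
constexpr std::uint32_t GetGobSizeX() {
    return 64;
}

// After (hypothetical names): plain constexpr constants.
// Assumed GOB geometry: 64 bytes wide, 8 rows tall, 1 slice deep.
inline constexpr std::uint32_t GOB_SIZE_X = 64;
inline constexpr std::uint32_t GOB_SIZE_Y = 8;
inline constexpr std::uint32_t GOB_SIZE_Z = 1;
inline constexpr std::uint32_t GOB_SIZE = GOB_SIZE_X * GOB_SIZE_Y * GOB_SIZE_Z;

// Both spellings are usable in constant expressions; the constant is just terser.
static_assert(GetGobSizeX() == GOB_SIZE_X);
static_assert(GOB_SIZE == 512);

// Illustrative helper: number of GOBs needed to cover a row of width_bytes bytes.
constexpr std::uint32_t GobsPerRow(std::uint32_t width_bytes) {
    return (width_bytes + GOB_SIZE_X - 1) / GOB_SIZE_X;
}
```

The constants can be used directly in array sizes, template arguments and other constants, which is presumably the convenience the commit is after.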
* maxwell_dma: Rename registers to match official docs and reorder (ReinUsesLisp, 2020-07-08, 1 file, -143/+151)
|   Rename registers in the MaxwellDMA class to match Nvidia's official documentation. This one can be found here: https://github.com/NVIDIA/open-gpu-doc/blob/master/classes/dma-copy/clb0b5.h
|
|   While we are at it, reorganize the code in MaxwellDMA to be separated in different functions.
* MaxwellDMA: Optimize micro copies. (Fernando Sahmkow, 2020-04-28, 1 file, -0/+40)
|
* VideoCore/Engines: Refactor Engines CallMethod. (Fernando Sahmkow, 2020-04-28, 1 file, -5/+5)
|
* Clang Format. (Fernando Sahmkow, 2020-04-23, 1 file, -1/+2)
|
* DMAPusher: Propagate multimethod writes into the engines. (Fernando Sahmkow, 2020-04-23, 1 file, -0/+6)
|
* MaxwellDMA: Correct copying on accuracy level. (Fernando Sahmkow, 2020-04-22, 1 file, -2/+7)
|
* UI: Replasce accurate GPU option for GPU Accuracy Level (Fernando Sahmkow, 2020-04-22, 1 file, -1/+1)
|
* video_core: Reintroduce dirty flags infrastructure (ReinUsesLisp, 2020-02-28, 1 file, -0/+3)
|
* gl_rasterizer: Remove dirty flags (ReinUsesLisp, 2020-02-28, 1 file, -3/+0)
|
* Merge pull request #2765 from FernandoS27/dma-fix (bunnei, 2019-09-01, 1 file, -16/+26)
|\
| |   MaxwellDMA: Fixes, corrections and relaxations.
| * MaxwellDMA: Fixes, corrections and relaxations. (Fernando Sahmkow, 2019-07-26, 1 file, -16/+26)
| |   This commit fixes offsets on Linear -> Tiled copies, corrects z pos for tiled->linear copies, corrects bytes_per_pixel calculation in tiled -> linear copies and relaxes some limitations set by latest dma fixes refactors.
* | video_core: Silent miscellaneous warnings (#2820) (Rodrigo Locatti, 2019-08-30, 1 file, -4/+2)
| |   * texture_cache/surface_params: Remove unused local variable
| |   * rasterizer_interface: Add missing documentation commentary
| |   * maxwell_dma: Remove unused rasterizer reference
| |   * video_core/gpu: Sort member declaration order to silent -Wreorder warning
| |   * fermi_2d: Remove unused MemoryManager reference
| |   * video_core: Silent unused variable warnings
| |   * buffer_cache: Silent -Wreorder warnings
| |   * kepler_memory: Remove unused MemoryManager reference
| |   * gl_texture_cache: Add missing override
| |   * buffer_cache: Add missing include
| |   * shader/decode: Remove unused variables
* | Merge pull request #2743 from FernandoS27/surpress-assert (bunnei, 2019-07-25, 1 file, -1/+1)
|\ \
| |/
|/|   Downgrade and suppress a series of GPU asserts and debug messages.
| * MaxwellDMA/KeplerCopy: Downgrade DMA log message to Trace. (Fernando Sahmkow, 2019-07-18, 1 file, -1/+1)
| |   This log was just to know which games used DMA. It's no longer important.
* | Maxwell3D: Rework the dirty system to be more consistant and scaleable (Fernando Sahmkow, 2019-07-17, 1 file, -1/+1)
|/
* surface: Correct format S8Z24 (Fernando Sahmkow, 2019-06-21, 1 file, -1/+1)
|
* decoders: correct block calculation (Fernando Sahmkow, 2019-06-21, 1 file, -1/+1)
|
* texture_cache: loose TryReconstructSurface when accurate GPU is not on. (Fernando Sahmkow, 2019-06-21, 1 file, -1/+1)
|   Also corrects some asserts.
* Fixes and Corrections to DMA Engine (Fernando Sahmkow, 2019-04-23, 1 file, -37/+46)
|
* video_core/engines: Make memory manager members private (Lioncash, 2019-04-06, 1 file, -1/+1)
|   These aren't used externally by anything, so they can be made private data members.
* video_core/engines: Remove unnecessary inclusions where applicable (Lioncash, 2019-04-06, 1 file, -1/+1)
|   Replaces header inclusions with forward declarations where applicable and also removes unused headers within the cpp file. This reduces a few more dependencies on core/memory.h
* maxwell_dma: Check for valid source in destination before copy. (bunnei, 2019-03-21, 1 file, -0/+10)
|   - Avoid a crash in Octopath Traveler.
* video_core: Refactor to use MemoryManager interface for all memory access. (bunnei, 2019-03-16, 1 file, -16/+12)
|   # Conflicts:
|   #   src/video_core/engines/kepler_memory.cpp
|   #   src/video_core/engines/maxwell_3d.cpp
|   #   src/video_core/morton.cpp
|   #   src/video_core/morton.h
|   #   src/video_core/renderer_opengl/gl_global_cache.cpp
|   #   src/video_core/renderer_opengl/gl_global_cache.h
|   #   src/video_core/renderer_opengl/gl_rasterizer_cache.cpp
* gpu: Use host address for caching instead of guest address. (bunnei, 2019-03-15, 1 file, -2/+5)
|
* gpu: Move command processing to another thread. (bunnei, 2019-03-07, 1 file, -2/+2)
|
* video_core/engines: Remove unnecessary includes (Lioncash, 2019-03-06, 1 file, -0/+1)
|   Removes a few unnecessary dependencies on core-related machinery, such as the core.h and memory.h, which reduces the amount of rebuilding necessary if those files change.
|
|   This also uncovered some indirect dependencies within other source files. This also fixes those.
* video_core: Remove usages of System::GetInstance() within the engines (Lioncash, 2019-02-16, 1 file, -3/+5)
|   Avoids the use of the global accessor in favor of explicitly making the system a dependency within the interface.
* video_core: Assert on invalid GPU to CPU address queries (ReinUsesLisp, 2019-02-03, 1 file, -10/+12)
|
* gpu: Rewrite GPU command list processing with DmaPusher class. (bunnei, 2018-11-27, 1 file, -4/+4)
|   - More accurate impl., fixes Undertale (among other games).
* gl_rasterizer: Skip VB upload if the state is clean. (Markus Wick, 2018-11-17, 1 file, -0/+5)
|
* engines/maxwell_*: Use nested namespace specifiers where applicable (Lioncash, 2018-10-20, 1 file, -4/+2)
|   These three source files are the only ones within the engines directory that don't use nested namespaces. We may as well change these over to keep things consistent.
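For readers unfamiliar with the C++17 feature this commit refers to, the sketch below contrasts the two spellings. The namespace and function names are placeholders, not the ones used in the engines directory.

```cpp
#include <cstdint>

// Pre-C++17 spelling: one block per namespace level.
namespace Outer {
namespace Inner {
constexpr std::uint32_t OldStyle() {
    return 1;
}
} // namespace Inner
} // namespace Outer

// C++17 nested namespace specifier: same namespace, a single block.
namespace Outer::Inner {
constexpr std::uint32_t NewStyle() {
    return 2;
}
} // namespace Outer::Inner

// Both functions live in the same namespace, Outer::Inner.
static_assert(Outer::Inner::OldStyle() + Outer::Inner::NewStyle() == 3);
```

The change is purely syntactic, which fits the -4/+2 line count of the commit: two opening and two closing lines collapse into one of each.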
* maxwell_dma: Make variables const where applicable within HandleCopy() (Lioncash, 2018-10-20, 1 file, -3/+3)
|   These are never modified, so we can make that assumption explicit.
* maxwell_dma: Make FlushAndInvalidate's size parameter a u64 (Lioncash, 2018-10-20, 1 file, -1/+1)
|   This prevents truncation warnings at the lambda's usage sites.
* maxwell_dma: Remove unused variables in HandleCopy() (Lioncash, 2018-10-20, 1 file, -3/+0)
|   These pointer variables are never used, so we can get rid of them.
* GPU: Improved implementation of maxwell DMA (Subv). (bunnei, 2018-10-19, 1 file, -15/+58)
|
* Propagate depth and depth_block on modules using decoders (FernandoS27, 2018-10-13, 1 file, -4/+6)
|
* Port #4182 from Citra: "Prefix all size_t with std::" (fearlessTobi, 2018-09-15, 1 file, -1/+1)
|
* GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine. (Subv, 2018-09-08, 1 file, -2/+10)
|   When not set, this tells the GPU to only use the X size when performing a DMA copy. This is only implemented for linear->linear and tiled->tiled copies. Conversion copies still retain the assert.
|
|   This bit is unset by some games for various purposes, and by nouveau when copying the vertex buffers.
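The behaviour described for 'enable_2d' can be sketched for the linear->linear case as follows. This is a simplified illustration under stated assumptions, not the code from this commit: the struct, its field names and the function are hypothetical, and tiled layouts are not handled.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical parameter block; field names are illustrative only.
struct CopyParams {
    bool enable_2d;          // when false, only the X size is used
    std::uint32_t x_count;   // bytes per line
    std::uint32_t y_count;   // number of lines (ignored when enable_2d is false)
    std::uint32_t src_pitch; // bytes between consecutive source lines
    std::uint32_t dst_pitch; // bytes between consecutive destination lines
};

// Linear -> linear copy honouring the enable_2d bit.
void LinearCopy(const CopyParams& p, const std::uint8_t* src, std::uint8_t* dst) {
    if (!p.enable_2d) {
        // 1D copy: a single run of x_count bytes; y_count and pitches are ignored.
        std::memcpy(dst, src, p.x_count);
        return;
    }
    // 2D copy: x_count bytes per line, y_count lines, each advanced by its pitch.
    assert(p.src_pitch >= p.x_count && p.dst_pitch >= p.x_count);
    for (std::uint32_t line = 0; line < p.y_count; ++line) {
        std::memcpy(dst + static_cast<std::size_t>(line) * p.dst_pitch,
                    src + static_cast<std::size_t>(line) * p.src_pitch, p.x_count);
    }
}
```

With the bit cleared, a caller that programs a non-trivial Y size still gets a single-line copy, which matches the vertex-buffer case mentioned for nouveau.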
* Rename logging macro back to LOG_* (James Rowe, 2018-07-03, 1 file, -1/+1)
|
* GPU: Directly copy the pixels when performing a same-layout DMA. (Subv, 2018-07-02, 1 file, -1/+5)
|
* GPU: Partially implemented the Maxwell DMA engine. (Subv, 2018-06-12, 1 file, -0/+69)
    Only tiled->linear and linear->tiled copies that aren't offsetted are supported for now.
    Queries are not supported.
    Swizzled copies are not supported.