summaryrefslogtreecommitdiffstats
path: root/src/video_core/engines/maxwell_dma.cpp (unfollow)
Commit message (Collapse)AuthorFilesLines
2024-01-19Core: Eliminate core/memory dependancies.Fernando Sahmkow1-1/+0
2024-01-19SMMU: Initial adaptation to video_core.Fernando Sahmkow1-12/+13
2023-12-31MaxwellDMA: Don't flush the outputs of a dma copy.Fernando Sahmkow1-2/+2
2023-09-23Query Cachge: Fully rework Vulkan's query cacheFernando Sahmkow1-8/+4
2023-09-21Fix DMA engine register offsetsKelebek11-2/+3
2023-08-10general: fix apple clang buildLiam1-0/+1
2023-07-03Use spans over guest memory where possible instead of copying data.Kelebek11-46/+41
2023-07-01maxwell_dma: Specify dst_operand.pitch instead of a temp varMorph1-4/+3
2023-06-26Use safe reads in DMA engineKelebek11-5/+6
2023-06-22Remove memory allocations in some hot pathsKelebek11-16/+19
2023-05-07Texture cache: Only force flush the dma downloadsFernando Sahmkow1-1/+1
2023-05-07GPU: Add Reactive flushingFernando Sahmkow1-6/+2
2023-03-05Engines: Implement Accelerate DMA Texture.Fernando Sahmkow1-48/+59
2023-01-05BufferBase: Don't ignore GPU pages.Fernando Sahmkow1-2/+2
2023-01-05video_core: Cache GPU internal writes.Fernando Sahmkow1-8/+9
2023-01-01DMAPusher: Improve collection of non executing methodsFernando Sahmkow1-1/+11
2022-12-20scratch_buffer: Explicitly defing resize and resize_destructive functionsameerj1-9/+9
resize keeps previous data intact when the buffer grows resize_destructive destroys the previous data when the buffer grows
2022-12-20video_core: Add usages of ScratchBufferameerj1-24/+8
2022-11-29engines: Remove unnecessary castsLioncash1-7/+7
In a few cases we have some casts that can be trivially removed.
2022-11-24MaxwellDMA: Implement BlockLinear to BlockLinear copies.Fernando Sahmkow1-1/+66
2022-11-14Add break for default casesKyle Kienapfel1-0/+1
Visual Studio has an option to search all files in a solution, so I did a search in there for "default:" looking for any missing break statements. I've left out default statements that return something, and that throw something, even if via ThrowInvalidType. UNREACHABLE leads towards throw R_THROW macro leads towards a return
2022-11-10video_core: Fix dma copy 1D random crashFengChen1-17/+20
2022-10-17video_core: implement 1D copies based on VMM 'kind'FengChen1-54/+73
2022-10-06maxwell_dma: remove warnings from implemented functionalityLiam1-2/+0
2022-10-06NVDRV: Further improvements.Fernando Sahmkow1-5/+13
2022-10-06DMA & InlineToMemory Engines Rework.bunnei1-33/+58
2022-06-14common: Change semantics of UNREACHABLE to unconditionally crashLiam1-2/+2
2022-05-11maxwell_dma: use fallback if remapping is enabledLiam1-3/+6
2022-05-07maxwell_dma: fix bytes per pixelLiam1-3/+3
2022-04-23general: Convert source file copyright comments over to SPDXMorph1-3/+2
This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later.
2022-03-07MaxwellDMA: Implement semaphore operationsLody1-1/+19
2021-09-20maxwell_dma: Minor refactoringameerj1-32/+32
2021-07-23DMA: Restrict optimised path for BlockToLinear further.FernandoS271-1/+2
2021-07-15Buffer cache: Fixes, Clang and Feedback.Fernando Sahmkow1-0/+5
2021-07-14DMAEngine: Revert flushing from Pitch to BlpockLinear.Fernando Sahmkow1-2/+7
2021-07-13DMAEngine: Accelerate BufferClearFernando Sahmkow1-2/+4
2021-07-11accelerateDMA: Accelerate Buffer Copies.Fernando Sahmkow1-9/+27
2021-04-15common: Move settings to common from core.bunnei1-1/+1
- Removes a dependency on core and input_common from common.
2021-02-13video_core: Reimplement the buffer cacheReinUsesLisp1-3/+0
Reimplement the buffer cache using cached bindings and page level granularity for modification tracking. This also drops the usage of shared pointers and virtual functions from the cache. - Bindings are cached, allowing to skip work when the game changes few bits between draws. - OpenGL Assembly shaders no longer copy when a region has been modified from the GPU to emulate constant buffers, instead GL_EXT_memory_object is used to alias sub-buffers within the same allocation. - OpenGL Assembly shaders stream constant buffer data using glProgramBufferParametersIuivNV, from NV_parameter_buffer_object. In theory this should save one hash table resolve inside the driver compared to glBufferSubData. - A new OpenGL stream buffer is implemented based on fences for drivers that are not Nvidia's proprietary, due to their low performance on partial glBufferSubData calls synchronized with 3D rendering (that some games use a lot). - Most optimizations are shared between APIs now, allowing Vulkan to cache more bindings than before, skipping unnecesarry work. This commit adds the necessary infrastructure to use Vulkan object from OpenGL. Overall, it improves performance and fixes some bugs present on the old cache. There are still some edge cases hit by some games that harm performance on some vendors, this are planned to be fixed in later commits.
2020-12-30video_core: Rewrite the texture cacheReinUsesLisp1-0/+3
The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage.The current texture cache has several points that hurt maintainability and performance. It's easy to break unrelated parts of the cache when doing minor changes. The cache can easily forget valuable information about the cached textures by CPU writes or simply by its normal usage. This commit aims to address those issues.
2020-12-04video_core: Resolve more variable shadowing scenariosLioncash1-2/+4
Resolves variable shadowing scenarios up to the end of the OpenGL code to make it nicer to review. The rest will be resolved in a following commit.
2020-10-03video_core: Enforce -Wunused-variable and -Wunused-but-set-variableReinUsesLisp1-2/+0
2020-08-11textures/decoders: Fix block linear to pitch copiesReinUsesLisp1-13/+8
There were two issues with block linear copies. First the swizzling was wrong and this commit reimplements them. The other issue was that these copies are generally used to download render targets from the GPU and yuzu was not downloading them from host GPU memory unless the extreme GPU accuracy setting was selected. This commit enables cached memory reads for all accuracy levels. - Fixes level thumbnails in Super Mario Maker 2.
2020-07-10video_core/textures: Add and use SwizzleSliceToVoxel, and minor style changesReinUsesLisp1-13/+17
Change GOB sizes from free-functions to constexpr constants. Add SwizzleSliceToVoxel, a function that swizzles a 2D array of pixels into a 3D texture and use it for 3D copies.
2020-07-08maxwell_dma: Rename registers to match official docs and reorderReinUsesLisp1-143/+151
Rename registers in the MaxwellDMA class to match Nvidia's official documentation. This one can be found here: https://github.com/NVIDIA/open-gpu-doc/blob/master/classes/dma-copy/clb0b5.h While we are at it, reorganize the code in MaxwellDMA to be separated in different functions.
2020-04-28MaxwellDMA: Optimize micro copies.Fernando Sahmkow1-0/+40
2020-04-28VideoCore/Engines: Refactor Engines CallMethod.Fernando Sahmkow1-5/+5
2020-04-23Clang Format.Fernando Sahmkow1-1/+2
2020-04-23DMAPusher: Propagate multimethod writes into the engines.Fernando Sahmkow1-0/+6
2020-04-22MaxwellDMA: Correct copying on accuracy level.Fernando Sahmkow1-2/+7
2020-04-22UI: Replasce accurate GPU option for GPU Accuracy LevelFernando Sahmkow1-1/+1
2020-02-28video_core: Reintroduce dirty flags infrastructureReinUsesLisp1-0/+3
2020-02-28gl_rasterizer: Remove dirty flagsReinUsesLisp1-3/+0
2019-08-30video_core: Silent miscellaneous warnings (#2820)Rodrigo Locatti1-4/+2
* texture_cache/surface_params: Remove unused local variable * rasterizer_interface: Add missing documentation commentary * maxwell_dma: Remove unused rasterizer reference * video_core/gpu: Sort member declaration order to silent -Wreorder warning * fermi_2d: Remove unused MemoryManager reference * video_core: Silent unused variable warnings * buffer_cache: Silent -Wreorder warnings * kepler_memory: Remove unused MemoryManager reference * gl_texture_cache: Add missing override * buffer_cache: Add missing include * shader/decode: Remove unused variables
2019-07-26MaxwellDMA: Fixes, corrections and relaxations.Fernando Sahmkow1-16/+26
This commit fixes offsets on Linear -> Tiled copies, corrects z pos fortiled->linear copies, corrects bytes_per_pixel calculation in tiled -> linear copies and relaxes some limitations set by latest dma fixes refactors.
2019-07-18MaxwellDMA/KeplerCopy: Downgrade DMA log message to Trace.Fernando Sahmkow1-1/+1
This log was just to know which games used DMA. It's no longer important.
2019-07-17Maxwell3D: Rework the dirty system to be more consistant and scaleableFernando Sahmkow1-1/+1
2019-06-21surface: Correct format S8Z24Fernando Sahmkow1-1/+1
2019-06-21decoders: correct block calculationFernando Sahmkow1-1/+1
2019-06-21texture_cache: loose TryReconstructSurface when accurate GPU is not on.Fernando Sahmkow1-1/+1
Also corrects some asserts.
2019-04-23Fixes and Corrections to DMA EngineFernando Sahmkow1-37/+46
2019-04-06video_core/engines: Make memory manager members privateLioncash1-1/+1
These aren't used externally by anything, so they can be made private data members.
2019-04-06video_core/engines: Remove unnecessary inclusions where applicableLioncash1-1/+1
Replaces header inclusions with forward declarations where applicable and also removes unused headers within the cpp file. This reduces a few more dependencies on core/memory.h
2019-03-21maxwell_dma: Check for valid source in destination before copy.bunnei1-0/+10
- Avoid a crash in Octopath Traveler.
2019-03-16video_core: Refactor to use MemoryManager interface for all memory access.bunnei1-16/+12
# Conflicts: # src/video_core/engines/kepler_memory.cpp # src/video_core/engines/maxwell_3d.cpp # src/video_core/morton.cpp # src/video_core/morton.h # src/video_core/renderer_opengl/gl_global_cache.cpp # src/video_core/renderer_opengl/gl_global_cache.h # src/video_core/renderer_opengl/gl_rasterizer_cache.cpp
2019-03-15gpu: Use host address for caching instead of guest address.bunnei1-2/+5
2019-03-07gpu: Move command processing to another thread.bunnei1-2/+2
2019-03-06video_core/engines: Remove unnecessary includesLioncash1-0/+1
Removes a few unnecessary dependencies on core-related machinery, such as the core.h and memory.h, which reduces the amount of rebuilding necessary if those files change. This also uncovered some indirect dependencies within other source files. This also fixes those.
2019-02-16video_core: Remove usages of System::GetInstance() within the enginesLioncash1-3/+5
Avoids the use of the global accessor in favor of explicitly making the system a dependency within the interface.
2019-02-03video_core: Assert on invalid GPU to CPU address queriesReinUsesLisp1-10/+12
2018-11-27gpu: Rewrite GPU command list processing with DmaPusher class.bunnei1-4/+4
- More accurate impl., fixes Undertale (among other games).
2018-11-17gl_rasterizer: Skip VB upload if the state is clean.Markus Wick1-0/+5
2018-10-20engines/maxwell_*: Use nested namespace specifiers where applicableLioncash1-4/+2
These three source files are the only ones within the engines directory that don't use nested namespaces. We may as well change these over to keep things consistent.
2018-10-20maxwell_dma: Make variables const where applicable within HandleCopy()Lioncash1-3/+3
These are never modified, so we can make that assumption explicit.
2018-10-20maxwell_dma: Make FlushAndInvalidate's size parameter a u64Lioncash1-1/+1
This prevents truncation warnings at the lambda's usage sites.
2018-10-20maxwell_dma: Remove unused variables in HandleCopy()Lioncash1-3/+0
These pointer variables are never used, so we can get rid of them.
2018-10-19GPU: Improved implementation of maxwell DMA (Subv).bunnei1-15/+58
2018-10-13Propagate depth and depth_block on modules using decodersFernandoS271-4/+6
2018-09-15Port #4182 from Citra: "Prefix all size_t with std::"fearlessTobi1-1/+1
2018-09-08GPU/DMA: Partially implemented the 'enable_2d' bit in the DMA engine.Subv1-2/+10
When not set, this tells the GPU to only use the X size when performing a DMA copy. This is only implemented for linear->linear and tiled->tiled copies. Conversion copies still retain the assert. This bit is unset by some games for various purposes, and by nouveau when copying the vertex buffers.
2018-07-03Rename logging macro back to LOG_*James Rowe1-1/+1
2018-07-02GPU: Directly copy the pixels when performing a same-layout DMA.Subv1-1/+5
2018-06-12GPU: Partially implemented the Maxwell DMA engine.Subv1-0/+69
Only tiled->linear and linear->tiled copies that aren't offsetted are supported for now. Queries are not supported. Swizzled copies are not supported.