mirror of
https://github.com/GPUOpen-LibrariesAndSDKs/VulkanMemoryAllocator
synced 2024-11-05 04:10:06 +00:00
Made virtual allocations not applying VMA_DEBUG_MARGIN
Improved and regenerated documentation.
This commit is contained in:
parent
bcd5f57deb
commit
e1813512c5
@ -151,7 +151,7 @@ Linear allocation algorithm</h1>
|
|||||||
<p ><img src="../gfx/Linear_allocator_1_algo_default.png" alt="Default allocation algorithm" class="inline"/></p>
|
<p ><img src="../gfx/Linear_allocator_1_algo_default.png" alt="Default allocation algorithm" class="inline"/></p>
|
||||||
<p >Sometimes there is a need to use simpler, linear allocation algorithm. You can create custom pool that uses such algorithm by adding flag <a class="el" href="vk__mem__alloc_8h.html#a9a7c45f9c863695d98c83fa5ac940fe7a13c8a444197c67866be9cb05599fc726" title="Enables alternative, linear allocation algorithm in this pool.">VMA_POOL_CREATE_LINEAR_ALGORITHM_BIT</a> to <a class="el" href="struct_vma_pool_create_info.html#a8405139f63d078340ae74513a59f5446" title="Use combination of VmaPoolCreateFlagBits.">VmaPoolCreateInfo::flags</a> while creating <a class="el" href="struct_vma_pool.html" title="Represents custom memory pool.">VmaPool</a> object. Then an alternative metadata management is used. It always creates new allocations after last one and doesn't reuse free regions after allocations freed in the middle. It results in better allocation performance and less memory consumed by metadata.</p>
|
<p >Sometimes there is a need to use simpler, linear allocation algorithm. You can create custom pool that uses such algorithm by adding flag <a class="el" href="vk__mem__alloc_8h.html#a9a7c45f9c863695d98c83fa5ac940fe7a13c8a444197c67866be9cb05599fc726" title="Enables alternative, linear allocation algorithm in this pool.">VMA_POOL_CREATE_LINEAR_ALGORITHM_BIT</a> to <a class="el" href="struct_vma_pool_create_info.html#a8405139f63d078340ae74513a59f5446" title="Use combination of VmaPoolCreateFlagBits.">VmaPoolCreateInfo::flags</a> while creating <a class="el" href="struct_vma_pool.html" title="Represents custom memory pool.">VmaPool</a> object. Then an alternative metadata management is used. It always creates new allocations after last one and doesn't reuse free regions after allocations freed in the middle. It results in better allocation performance and less memory consumed by metadata.</p>
|
||||||
<p ><img src="../gfx/Linear_allocator_2_algo_linear.png" alt="Linear allocation algorithm" class="inline"/></p>
|
<p ><img src="../gfx/Linear_allocator_2_algo_linear.png" alt="Linear allocation algorithm" class="inline"/></p>
|
||||||
<p >With this one flag, you can create a custom pool that can be used in many ways: free-at-once, stack, double stack, and ring buffer. See below for details.</p>
|
<p >With this one flag, you can create a custom pool that can be used in many ways: free-at-once, stack, double stack, and ring buffer. See below for details. You don't need to specify explicitly which of these options you are going to use - it is detected automatically.</p>
|
||||||
<h2><a class="anchor" id="linear_algorithm_free_at_once"></a>
|
<h2><a class="anchor" id="linear_algorithm_free_at_once"></a>
|
||||||
Free-at-once</h2>
|
Free-at-once</h2>
|
||||||
<p >In a pool that uses linear algorithm, you still need to free all the allocations individually, e.g. by using <a class="el" href="vk__mem__alloc_8h.html#a5fea5518972ae9094b1526cbcb19b05f" title="Frees memory previously allocated using vmaAllocateMemory(), vmaAllocateMemoryForBuffer(),...">vmaFreeMemory()</a> or <a class="el" href="vk__mem__alloc_8h.html#a0d9f4e4ba5bf9aab1f1c746387753d77" title="Destroys Vulkan buffer and frees allocated memory.">vmaDestroyBuffer()</a>. You can free them in any order. New allocations are always made after last one - free space in the middle is not reused. However, when you release all the allocation and the pool becomes empty, allocation starts from the beginning again. This way you can use linear algorithm to speed up creation of allocations that you are going to release all at once.</p>
|
<p >In a pool that uses linear algorithm, you still need to free all the allocations individually, e.g. by using <a class="el" href="vk__mem__alloc_8h.html#a5fea5518972ae9094b1526cbcb19b05f" title="Frees memory previously allocated using vmaAllocateMemory(), vmaAllocateMemoryForBuffer(),...">vmaFreeMemory()</a> or <a class="el" href="vk__mem__alloc_8h.html#a0d9f4e4ba5bf9aab1f1c746387753d77" title="Destroys Vulkan buffer and frees allocated memory.">vmaDestroyBuffer()</a>. You can free them in any order. New allocations are always made after last one - free space in the middle is not reused. However, when you release all the allocation and the pool becomes empty, allocation starts from the beginning again. This way you can use linear algorithm to speed up creation of allocations that you are going to release all at once.</p>
|
||||||
@ -169,7 +169,7 @@ Double stack</h2>
|
|||||||
<li>First, default one, growing up from offset 0.</li>
|
<li>First, default one, growing up from offset 0.</li>
|
||||||
<li>Second, "upper" one, growing down from the end towards lower offsets.</li>
|
<li>Second, "upper" one, growing down from the end towards lower offsets.</li>
|
||||||
</ul>
|
</ul>
|
||||||
<p >To make allocation from upper stack, add flag <a class="el" href="vk__mem__alloc_8h.html#ad9889c10c798b040d59c92f257cae597a42ba3a2d2c7117953210b7c3ef8da0df">VMA_ALLOCATION_CREATE_UPPER_ADDRESS_BIT</a> to <a class="el" href="struct_vma_allocation_create_info.html#add09658ac14fe290ace25470ddd6d41b" title="Use VmaAllocationCreateFlagBits enum.">VmaAllocationCreateInfo::flags</a>.</p>
|
<p >To make allocation from the upper stack, add flag <a class="el" href="vk__mem__alloc_8h.html#ad9889c10c798b040d59c92f257cae597a42ba3a2d2c7117953210b7c3ef8da0df">VMA_ALLOCATION_CREATE_UPPER_ADDRESS_BIT</a> to <a class="el" href="struct_vma_allocation_create_info.html#add09658ac14fe290ace25470ddd6d41b" title="Use VmaAllocationCreateFlagBits enum.">VmaAllocationCreateInfo::flags</a>.</p>
|
||||||
<p ><img src="../gfx/Linear_allocator_7_double_stack.png" alt="Double stack" class="inline"/></p>
|
<p ><img src="../gfx/Linear_allocator_7_double_stack.png" alt="Double stack" class="inline"/></p>
|
||||||
<p >Double stack is available only in pools with one memory block - <a class="el" href="struct_vma_pool_create_info.html#ae41142f2834fcdc82baa4883c187b75c" title="Maximum number of blocks that can be allocated in this pool. Optional.">VmaPoolCreateInfo::maxBlockCount</a> must be 1. Otherwise behavior is undefined.</p>
|
<p >Double stack is available only in pools with one memory block - <a class="el" href="struct_vma_pool_create_info.html#ae41142f2834fcdc82baa4883c187b75c" title="Maximum number of blocks that can be allocated in this pool. Optional.">VmaPoolCreateInfo::maxBlockCount</a> must be 1. Otherwise behavior is undefined.</p>
|
||||||
<p >When the two stacks' ends meet so there is not enough space between them for a new allocation, such allocation fails with usual <code>VK_ERROR_OUT_OF_DEVICE_MEMORY</code> error.</p>
|
<p >When the two stacks' ends meet so there is not enough space between them for a new allocation, such allocation fails with usual <code>VK_ERROR_OUT_OF_DEVICE_MEMORY</code> error.</p>
|
||||||
@ -182,10 +182,9 @@ Ring buffer</h2>
|
|||||||
<p >Ring buffer is available only in pools with one memory block - <a class="el" href="struct_vma_pool_create_info.html#ae41142f2834fcdc82baa4883c187b75c" title="Maximum number of blocks that can be allocated in this pool. Optional.">VmaPoolCreateInfo::maxBlockCount</a> must be 1. Otherwise behavior is undefined.</p>
|
<p >Ring buffer is available only in pools with one memory block - <a class="el" href="struct_vma_pool_create_info.html#ae41142f2834fcdc82baa4883c187b75c" title="Maximum number of blocks that can be allocated in this pool. Optional.">VmaPoolCreateInfo::maxBlockCount</a> must be 1. Otherwise behavior is undefined.</p>
|
||||||
<h1><a class="anchor" id="buddy_algorithm"></a>
|
<h1><a class="anchor" id="buddy_algorithm"></a>
|
||||||
Buddy allocation algorithm</h1>
|
Buddy allocation algorithm</h1>
|
||||||
<p >There is another allocation algorithm that can be used with custom pools, called "buddy". Its internal data structure is based on a tree of blocks, each having size that is a power of two and a half of its parent's size. When you want to allocate memory of certain size, a free node in the tree is located. If it is too large, it is recursively split into two halves (called "buddies"). However, if requested allocation size is not a power of two, the size of a tree node is aligned up to the nearest power of two and the remaining space is wasted. When two buddy nodes become free, they are merged back into one larger node.</p>
|
<p >There is another allocation algorithm that can be used with custom pools, called "buddy". Its internal data structure is based on a binary tree of blocks, each having size that is a power of two and a half of its parent's size. When you want to allocate memory of certain size, a free node in the tree is located. If it is too large, it is recursively split into two halves (called "buddies"). However, if requested allocation size is not a power of two, the size of the allocation is aligned up to the nearest power of two and the remaining space is wasted. When two buddy nodes become free, they are merged back into one larger node.</p>
|
||||||
<p ><img src="../gfx/Buddy_allocator.png" alt="Buddy allocator" class="inline"/></p>
|
<p ><img src="../gfx/Buddy_allocator.png" alt="Buddy allocator" class="inline"/></p>
|
||||||
<p >The advantage of buddy allocation algorithm over default algorithm is faster allocation and deallocation, as well as smaller external fragmentation. The disadvantage is more wasted space (internal fragmentation).</p>
|
<p >The advantage of buddy allocation algorithm over default algorithm is faster allocation and deallocation, as well as smaller external fragmentation. The disadvantage is more wasted space (internal fragmentation). For more information, please search the Internet for "Buddy memory allocation" - sources that describe this concept in general.</p>
|
||||||
<p >For more information, please search the Internet for "Buddy memory allocation" - sources that describe this concept in general.</p>
|
|
||||||
<p >To use buddy allocation algorithm with a custom pool, add flag <a class="el" href="vk__mem__alloc_8h.html#a9a7c45f9c863695d98c83fa5ac940fe7a97a0dc38e5161b780594d998d313d35e" title="Enables alternative, buddy allocation algorithm in this pool.">VMA_POOL_CREATE_BUDDY_ALGORITHM_BIT</a> to <a class="el" href="struct_vma_pool_create_info.html#a8405139f63d078340ae74513a59f5446" title="Use combination of VmaPoolCreateFlagBits.">VmaPoolCreateInfo::flags</a> while creating <a class="el" href="struct_vma_pool.html" title="Represents custom memory pool.">VmaPool</a> object.</p>
|
<p >To use buddy allocation algorithm with a custom pool, add flag <a class="el" href="vk__mem__alloc_8h.html#a9a7c45f9c863695d98c83fa5ac940fe7a97a0dc38e5161b780594d998d313d35e" title="Enables alternative, buddy allocation algorithm in this pool.">VMA_POOL_CREATE_BUDDY_ALGORITHM_BIT</a> to <a class="el" href="struct_vma_pool_create_info.html#a8405139f63d078340ae74513a59f5446" title="Use combination of VmaPoolCreateFlagBits.">VmaPoolCreateInfo::flags</a> while creating <a class="el" href="struct_vma_pool.html" title="Represents custom memory pool.">VmaPool</a> object.</p>
|
||||||
<p >Several limitations apply to pools that use buddy algorithm:</p>
|
<p >Several limitations apply to pools that use buddy algorithm:</p>
|
||||||
<ul>
|
<ul>
|
||||||
|
@ -170,7 +170,9 @@ Statistics</h1>
|
|||||||
</div><!-- fragment --><p >You can also request a full list of allocations and free regions as a string in JSON format by calling <a class="el" href="vk__mem__alloc_8h.html#a52d810e1222c592e5d80556ad005f1e6" title="Builds and returns a null-terminated string in JSON format with information about given VmaVirtualBlo...">vmaBuildVirtualBlockStatsString()</a>. Returned string must be later freed using <a class="el" href="vk__mem__alloc_8h.html#a47fb8d8aa69df4a7c23a9719b4080623" title="Frees a string returned by vmaBuildVirtualBlockStatsString().">vmaFreeVirtualBlockStatsString()</a>. The format of this string differs from the one returned by the main Vulkan allocator, but it is similar.</p>
|
</div><!-- fragment --><p >You can also request a full list of allocations and free regions as a string in JSON format by calling <a class="el" href="vk__mem__alloc_8h.html#a52d810e1222c592e5d80556ad005f1e6" title="Builds and returns a null-terminated string in JSON format with information about given VmaVirtualBlo...">vmaBuildVirtualBlockStatsString()</a>. Returned string must be later freed using <a class="el" href="vk__mem__alloc_8h.html#a47fb8d8aa69df4a7c23a9719b4080623" title="Frees a string returned by vmaBuildVirtualBlockStatsString().">vmaFreeVirtualBlockStatsString()</a>. The format of this string differs from the one returned by the main Vulkan allocator, but it is similar.</p>
|
||||||
<h1><a class="anchor" id="virtual_allocator_additional_considerations"></a>
|
<h1><a class="anchor" id="virtual_allocator_additional_considerations"></a>
|
||||||
Additional considerations</h1>
|
Additional considerations</h1>
|
||||||
<p >Note that the "virtual allocator" functionality is implemented on a level of individual memory blocks. Keeping track of a whole collection of blocks, allocating new ones when out of free space, deleting empty ones, and deciding which one to try first for a new allocation must be implemented by the user. </p>
|
<p >The "virtual allocator" functionality is implemented on a level of individual memory blocks. Keeping track of a whole collection of blocks, allocating new ones when out of free space, deleting empty ones, and deciding which one to try first for a new allocation must be implemented by the user.</p>
|
||||||
|
<p >Alternative allocation algorithms are supported, just like in custom pools of the real GPU memory. See enum <a class="el" href="vk__mem__alloc_8h.html#a88bcf8c1cd3bb1610ff7343811c65bca" title="Flags to be passed as VmaVirtualBlockCreateInfo::flags.">VmaVirtualBlockCreateFlagBits</a> to learn how to specify them (e.g. <a class="el" href="vk__mem__alloc_8h.html#a88bcf8c1cd3bb1610ff7343811c65bcaae6423e2fa2f3c9211b21c819e3f10f96" title="Enables alternative, linear allocation algorithm in this virtual block.">VMA_VIRTUAL_BLOCK_CREATE_LINEAR_ALGORITHM_BIT</a>). You can find their description in chapter <a class="el" href="custom_memory_pools.html">Custom memory pools</a>. Allocation strategies are also supported. See enum <a class="el" href="vk__mem__alloc_8h.html#a2e9c64d405b14156fea7e10c4ad06cb6" title="Flags to be passed as VmaVirtualAllocationCreateInfo::flags.">VmaVirtualAllocationCreateFlagBits</a> to learn how to specify them (e.g. <a class="el" href="vk__mem__alloc_8h.html#a2e9c64d405b14156fea7e10c4ad06cb6a562d10a46012719d33167d3dc5dbbf9b" title="Allocation strategy that tries to minimize allocation time.">VMA_VIRTUAL_ALLOCATION_CREATE_STRATEGY_MIN_TIME_BIT</a>).</p>
|
||||||
|
<p >Following features are supported only by the allocator of the real GPU memory and not by virtual allocations: buffer-image granularity, <a class="el" href="lost_allocations.html">Lost allocations</a>, <code>VMA_DEBUG_MARGIN</code>, <code>VMA_MIN_ALIGNMENT</code>. </p>
|
||||||
</div></div><!-- contents -->
|
</div></div><!-- contents -->
|
||||||
</div><!-- PageDoc -->
|
</div><!-- PageDoc -->
|
||||||
<!-- start footer part -->
|
<!-- start footer part -->
|
||||||
|
@ -5143,6 +5143,10 @@ public:
|
|||||||
|
|
||||||
protected:
|
protected:
|
||||||
const VkAllocationCallbacks* GetAllocationCallbacks() const { return m_pAllocationCallbacks; }
|
const VkAllocationCallbacks* GetAllocationCallbacks() const { return m_pAllocationCallbacks; }
|
||||||
|
VkDeviceSize GetDebugMargin() const
|
||||||
|
{
|
||||||
|
return IsVirtual() ? 0 : VMA_DEBUG_MARGIN;
|
||||||
|
}
|
||||||
|
|
||||||
#if VMA_STATS_STRING_ENABLED
|
#if VMA_STATS_STRING_ENABLED
|
||||||
void PrintDetailedMap_Begin(class VmaJsonWriter& json,
|
void PrintDetailedMap_Begin(class VmaJsonWriter& json,
|
||||||
@ -7890,6 +7894,8 @@ bool VmaBlockMetadata_Generic::Validate() const
|
|||||||
// True if previous visited suballocation was free.
|
// True if previous visited suballocation was free.
|
||||||
bool prevFree = false;
|
bool prevFree = false;
|
||||||
|
|
||||||
|
const VkDeviceSize debugMargin = GetDebugMargin();
|
||||||
|
|
||||||
for(const auto& subAlloc : m_Suballocations)
|
for(const auto& subAlloc : m_Suballocations)
|
||||||
{
|
{
|
||||||
// Actual offset of this suballocation doesn't match expected one.
|
// Actual offset of this suballocation doesn't match expected one.
|
||||||
@ -7912,7 +7918,7 @@ bool VmaBlockMetadata_Generic::Validate() const
|
|||||||
++freeSuballocationsToRegister;
|
++freeSuballocationsToRegister;
|
||||||
|
|
||||||
// Margin required between allocations - every free space must be at least that large.
|
// Margin required between allocations - every free space must be at least that large.
|
||||||
VMA_VALIDATE(subAlloc.size >= VMA_DEBUG_MARGIN);
|
VMA_VALIDATE(subAlloc.size >= debugMargin);
|
||||||
}
|
}
|
||||||
else
|
else
|
||||||
{
|
{
|
||||||
@ -7923,7 +7929,7 @@ bool VmaBlockMetadata_Generic::Validate() const
|
|||||||
}
|
}
|
||||||
|
|
||||||
// Margin required between allocations - previous allocation must be free.
|
// Margin required between allocations - previous allocation must be free.
|
||||||
VMA_VALIDATE(VMA_DEBUG_MARGIN == 0 || prevFree);
|
VMA_VALIDATE(debugMargin == 0 || prevFree);
|
||||||
}
|
}
|
||||||
|
|
||||||
calculatedOffset += subAlloc.size;
|
calculatedOffset += subAlloc.size;
|
||||||
@ -8052,9 +8058,11 @@ bool VmaBlockMetadata_Generic::CreateAllocationRequest(
|
|||||||
pAllocationRequest->type = VmaAllocationRequestType::Normal;
|
pAllocationRequest->type = VmaAllocationRequestType::Normal;
|
||||||
pAllocationRequest->size = allocSize;
|
pAllocationRequest->size = allocSize;
|
||||||
|
|
||||||
|
const VkDeviceSize debugMargin = GetDebugMargin();
|
||||||
|
|
||||||
// There is not enough total free space in this block to fulfill the request: Early return.
|
// There is not enough total free space in this block to fulfill the request: Early return.
|
||||||
if(canMakeOtherLost == false &&
|
if(canMakeOtherLost == false &&
|
||||||
m_SumFreeSize < allocSize + 2 * VMA_DEBUG_MARGIN)
|
m_SumFreeSize < allocSize + 2 * debugMargin)
|
||||||
{
|
{
|
||||||
return false;
|
return false;
|
||||||
}
|
}
|
||||||
@ -8065,11 +8073,11 @@ bool VmaBlockMetadata_Generic::CreateAllocationRequest(
|
|||||||
{
|
{
|
||||||
if(strategy == VMA_ALLOCATION_CREATE_STRATEGY_BEST_FIT_BIT)
|
if(strategy == VMA_ALLOCATION_CREATE_STRATEGY_BEST_FIT_BIT)
|
||||||
{
|
{
|
||||||
// Find first free suballocation with size not less than allocSize + 2 * VMA_DEBUG_MARGIN.
|
// Find first free suballocation with size not less than allocSize + 2 * debugMargin.
|
||||||
VmaSuballocationList::iterator* const it = VmaBinaryFindFirstNotLess(
|
VmaSuballocationList::iterator* const it = VmaBinaryFindFirstNotLess(
|
||||||
m_FreeSuballocationsBySize.data(),
|
m_FreeSuballocationsBySize.data(),
|
||||||
m_FreeSuballocationsBySize.data() + freeSuballocCount,
|
m_FreeSuballocationsBySize.data() + freeSuballocCount,
|
||||||
allocSize + 2 * VMA_DEBUG_MARGIN,
|
allocSize + 2 * debugMargin,
|
||||||
VmaSuballocationItemSizeLess());
|
VmaSuballocationItemSizeLess());
|
||||||
size_t index = it - m_FreeSuballocationsBySize.data();
|
size_t index = it - m_FreeSuballocationsBySize.data();
|
||||||
for(; index < freeSuballocCount; ++index)
|
for(; index < freeSuballocCount; ++index)
|
||||||
@ -8258,7 +8266,7 @@ VkResult VmaBlockMetadata_Generic::CheckCorruption(const void* pBlockData)
|
|||||||
{
|
{
|
||||||
if(suballoc.type != VMA_SUBALLOCATION_TYPE_FREE)
|
if(suballoc.type != VMA_SUBALLOCATION_TYPE_FREE)
|
||||||
{
|
{
|
||||||
if(!VmaValidateMagicValue(pBlockData, suballoc.offset - VMA_DEBUG_MARGIN))
|
if(!VmaValidateMagicValue(pBlockData, suballoc.offset - GetDebugMargin()))
|
||||||
{
|
{
|
||||||
VMA_ASSERT(0 && "MEMORY CORRUPTION DETECTED BEFORE VALIDATED ALLOCATION!");
|
VMA_ASSERT(0 && "MEMORY CORRUPTION DETECTED BEFORE VALIDATED ALLOCATION!");
|
||||||
return VK_ERROR_UNKNOWN;
|
return VK_ERROR_UNKNOWN;
|
||||||
@ -8443,6 +8451,8 @@ bool VmaBlockMetadata_Generic::CheckAllocation(
|
|||||||
*pSumFreeSize = 0;
|
*pSumFreeSize = 0;
|
||||||
*pSumItemSize = 0;
|
*pSumItemSize = 0;
|
||||||
|
|
||||||
|
const VkDeviceSize debugMargin = GetDebugMargin();
|
||||||
|
|
||||||
if(canMakeOtherLost)
|
if(canMakeOtherLost)
|
||||||
{
|
{
|
||||||
VMA_ASSERT(!IsVirtual());
|
VMA_ASSERT(!IsVirtual());
|
||||||
@ -8474,10 +8484,10 @@ bool VmaBlockMetadata_Generic::CheckAllocation(
|
|||||||
// Start from offset equal to beginning of this suballocation.
|
// Start from offset equal to beginning of this suballocation.
|
||||||
*pOffset = suballocItem->offset;
|
*pOffset = suballocItem->offset;
|
||||||
|
|
||||||
// Apply VMA_DEBUG_MARGIN at the beginning.
|
// Apply debugMargin at the beginning.
|
||||||
if(VMA_DEBUG_MARGIN > 0)
|
if(debugMargin > 0)
|
||||||
{
|
{
|
||||||
*pOffset += VMA_DEBUG_MARGIN;
|
*pOffset += debugMargin;
|
||||||
}
|
}
|
||||||
|
|
||||||
// Apply alignment.
|
// Apply alignment.
|
||||||
@ -8522,7 +8532,7 @@ bool VmaBlockMetadata_Generic::CheckAllocation(
|
|||||||
const VkDeviceSize paddingBegin = *pOffset - suballocItem->offset;
|
const VkDeviceSize paddingBegin = *pOffset - suballocItem->offset;
|
||||||
|
|
||||||
// Calculate required margin at the end.
|
// Calculate required margin at the end.
|
||||||
const VkDeviceSize requiredEndMargin = VMA_DEBUG_MARGIN;
|
const VkDeviceSize requiredEndMargin = debugMargin;
|
||||||
|
|
||||||
const VkDeviceSize totalSize = paddingBegin + allocSize + requiredEndMargin;
|
const VkDeviceSize totalSize = paddingBegin + allocSize + requiredEndMargin;
|
||||||
// Another early return check.
|
// Another early return check.
|
||||||
@ -8619,10 +8629,10 @@ bool VmaBlockMetadata_Generic::CheckAllocation(
|
|||||||
// Start from offset equal to beginning of this suballocation.
|
// Start from offset equal to beginning of this suballocation.
|
||||||
*pOffset = suballoc.offset;
|
*pOffset = suballoc.offset;
|
||||||
|
|
||||||
// Apply VMA_DEBUG_MARGIN at the beginning.
|
// Apply debugMargin at the beginning.
|
||||||
if(VMA_DEBUG_MARGIN > 0)
|
if(debugMargin > 0)
|
||||||
{
|
{
|
||||||
*pOffset += VMA_DEBUG_MARGIN;
|
*pOffset += debugMargin;
|
||||||
}
|
}
|
||||||
|
|
||||||
// Apply alignment.
|
// Apply alignment.
|
||||||
@ -8660,7 +8670,7 @@ bool VmaBlockMetadata_Generic::CheckAllocation(
|
|||||||
const VkDeviceSize paddingBegin = *pOffset - suballoc.offset;
|
const VkDeviceSize paddingBegin = *pOffset - suballoc.offset;
|
||||||
|
|
||||||
// Calculate required margin at the end.
|
// Calculate required margin at the end.
|
||||||
const VkDeviceSize requiredEndMargin = VMA_DEBUG_MARGIN;
|
const VkDeviceSize requiredEndMargin = debugMargin;
|
||||||
|
|
||||||
// Fail if requested size plus margin before and after is bigger than size of this suballocation.
|
// Fail if requested size plus margin before and after is bigger than size of this suballocation.
|
||||||
if(paddingBegin + allocSize + requiredEndMargin > suballoc.size)
|
if(paddingBegin + allocSize + requiredEndMargin > suballoc.size)
|
||||||
@ -8900,7 +8910,8 @@ bool VmaBlockMetadata_Linear::Validate() const
|
|||||||
|
|
||||||
VkDeviceSize sumUsedSize = 0;
|
VkDeviceSize sumUsedSize = 0;
|
||||||
const size_t suballoc1stCount = suballocations1st.size();
|
const size_t suballoc1stCount = suballocations1st.size();
|
||||||
VkDeviceSize offset = VMA_DEBUG_MARGIN;
|
const VkDeviceSize debugMargin = GetDebugMargin();
|
||||||
|
VkDeviceSize offset = debugMargin;
|
||||||
|
|
||||||
if(m_2ndVectorMode == SECOND_VECTOR_RING_BUFFER)
|
if(m_2ndVectorMode == SECOND_VECTOR_RING_BUFFER)
|
||||||
{
|
{
|
||||||
@ -8932,7 +8943,7 @@ bool VmaBlockMetadata_Linear::Validate() const
|
|||||||
++nullItem2ndCount;
|
++nullItem2ndCount;
|
||||||
}
|
}
|
||||||
|
|
||||||
offset = suballoc.offset + suballoc.size + VMA_DEBUG_MARGIN;
|
offset = suballoc.offset + suballoc.size + debugMargin;
|
||||||
}
|
}
|
||||||
|
|
||||||
VMA_VALIDATE(nullItem2ndCount == m_2ndNullItemsCount);
|
VMA_VALIDATE(nullItem2ndCount == m_2ndNullItemsCount);
|
||||||
@ -8974,7 +8985,7 @@ bool VmaBlockMetadata_Linear::Validate() const
|
|||||||
++nullItem1stCount;
|
++nullItem1stCount;
|
||||||
}
|
}
|
||||||
|
|
||||||
offset = suballoc.offset + suballoc.size + VMA_DEBUG_MARGIN;
|
offset = suballoc.offset + suballoc.size + debugMargin;
|
||||||
}
|
}
|
||||||
VMA_VALIDATE(nullItem1stCount == m_1stNullItemsBeginCount + m_1stNullItemsMiddleCount);
|
VMA_VALIDATE(nullItem1stCount == m_1stNullItemsBeginCount + m_1stNullItemsMiddleCount);
|
||||||
|
|
||||||
@ -9008,7 +9019,7 @@ bool VmaBlockMetadata_Linear::Validate() const
|
|||||||
++nullItem2ndCount;
|
++nullItem2ndCount;
|
||||||
}
|
}
|
||||||
|
|
||||||
offset = suballoc.offset + suballoc.size + VMA_DEBUG_MARGIN;
|
offset = suballoc.offset + suballoc.size + debugMargin;
|
||||||
}
|
}
|
||||||
|
|
||||||
VMA_VALIDATE(nullItem2ndCount == m_2ndNullItemsCount);
|
VMA_VALIDATE(nullItem2ndCount == m_2ndNullItemsCount);
|
||||||
@ -9808,14 +9819,16 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_UpperAddress(
|
|||||||
// Start from offset equal to end of free space.
|
// Start from offset equal to end of free space.
|
||||||
VkDeviceSize resultOffset = resultBaseOffset;
|
VkDeviceSize resultOffset = resultBaseOffset;
|
||||||
|
|
||||||
// Apply VMA_DEBUG_MARGIN at the end.
|
const VkDeviceSize debugMargin = GetDebugMargin();
|
||||||
if(VMA_DEBUG_MARGIN > 0)
|
|
||||||
|
// Apply debugMargin at the end.
|
||||||
|
if(debugMargin > 0)
|
||||||
{
|
{
|
||||||
if(resultOffset < VMA_DEBUG_MARGIN)
|
if(resultOffset < debugMargin)
|
||||||
{
|
{
|
||||||
return false;
|
return false;
|
||||||
}
|
}
|
||||||
resultOffset -= VMA_DEBUG_MARGIN;
|
resultOffset -= debugMargin;
|
||||||
}
|
}
|
||||||
|
|
||||||
// Apply alignment.
|
// Apply alignment.
|
||||||
@ -9851,7 +9864,7 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_UpperAddress(
|
|||||||
const VkDeviceSize endOf1st = !suballocations1st.empty() ?
|
const VkDeviceSize endOf1st = !suballocations1st.empty() ?
|
||||||
suballocations1st.back().offset + suballocations1st.back().size :
|
suballocations1st.back().offset + suballocations1st.back().size :
|
||||||
0;
|
0;
|
||||||
if(endOf1st + VMA_DEBUG_MARGIN <= resultOffset)
|
if(endOf1st + debugMargin <= resultOffset)
|
||||||
{
|
{
|
||||||
// Check previous suballocations for BufferImageGranularity conflicts.
|
// Check previous suballocations for BufferImageGranularity conflicts.
|
||||||
// If conflict exists, allocation cannot be made here.
|
// If conflict exists, allocation cannot be made here.
|
||||||
@ -9900,6 +9913,7 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_LowerAddress(
|
|||||||
VmaAllocationRequest* pAllocationRequest)
|
VmaAllocationRequest* pAllocationRequest)
|
||||||
{
|
{
|
||||||
const VkDeviceSize blockSize = GetSize();
|
const VkDeviceSize blockSize = GetSize();
|
||||||
|
const VkDeviceSize debugMargin = GetDebugMargin();
|
||||||
SuballocationVectorType& suballocations1st = AccessSuballocations1st();
|
SuballocationVectorType& suballocations1st = AccessSuballocations1st();
|
||||||
SuballocationVectorType& suballocations2nd = AccessSuballocations2nd();
|
SuballocationVectorType& suballocations2nd = AccessSuballocations2nd();
|
||||||
|
|
||||||
@ -9917,10 +9931,10 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_LowerAddress(
|
|||||||
// Start from offset equal to beginning of free space.
|
// Start from offset equal to beginning of free space.
|
||||||
VkDeviceSize resultOffset = resultBaseOffset;
|
VkDeviceSize resultOffset = resultBaseOffset;
|
||||||
|
|
||||||
// Apply VMA_DEBUG_MARGIN at the beginning.
|
// Apply debugMargin at the beginning.
|
||||||
if(VMA_DEBUG_MARGIN > 0)
|
if(debugMargin > 0)
|
||||||
{
|
{
|
||||||
resultOffset += VMA_DEBUG_MARGIN;
|
resultOffset += debugMargin;
|
||||||
}
|
}
|
||||||
|
|
||||||
// Apply alignment.
|
// Apply alignment.
|
||||||
@ -9956,7 +9970,7 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_LowerAddress(
|
|||||||
suballocations2nd.back().offset : blockSize;
|
suballocations2nd.back().offset : blockSize;
|
||||||
|
|
||||||
// There is enough free space at the end after alignment.
|
// There is enough free space at the end after alignment.
|
||||||
if(resultOffset + allocSize + VMA_DEBUG_MARGIN <= freeSpaceEnd)
|
if(resultOffset + allocSize + debugMargin <= freeSpaceEnd)
|
||||||
{
|
{
|
||||||
// Check next suballocations for BufferImageGranularity conflicts.
|
// Check next suballocations for BufferImageGranularity conflicts.
|
||||||
// If conflict exists, allocation cannot be made here.
|
// If conflict exists, allocation cannot be made here.
|
||||||
@ -10007,10 +10021,10 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_LowerAddress(
|
|||||||
// Start from offset equal to beginning of free space.
|
// Start from offset equal to beginning of free space.
|
||||||
VkDeviceSize resultOffset = resultBaseOffset;
|
VkDeviceSize resultOffset = resultBaseOffset;
|
||||||
|
|
||||||
// Apply VMA_DEBUG_MARGIN at the beginning.
|
// Apply debugMargin at the beginning.
|
||||||
if(VMA_DEBUG_MARGIN > 0)
|
if(debugMargin > 0)
|
||||||
{
|
{
|
||||||
resultOffset += VMA_DEBUG_MARGIN;
|
resultOffset += debugMargin;
|
||||||
}
|
}
|
||||||
|
|
||||||
// Apply alignment.
|
// Apply alignment.
|
||||||
@ -10050,7 +10064,7 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_LowerAddress(
|
|||||||
{
|
{
|
||||||
VMA_ASSERT(!IsVirtual());
|
VMA_ASSERT(!IsVirtual());
|
||||||
while(index1st < suballocations1st.size() &&
|
while(index1st < suballocations1st.size() &&
|
||||||
resultOffset + allocSize + VMA_DEBUG_MARGIN > suballocations1st[index1st].offset)
|
resultOffset + allocSize + debugMargin > suballocations1st[index1st].offset)
|
||||||
{
|
{
|
||||||
// Next colliding allocation at the beginning of 1st vector found. Try to make it lost.
|
// Next colliding allocation at the beginning of 1st vector found. Try to make it lost.
|
||||||
const VmaSuballocation& suballoc = suballocations1st[index1st];
|
const VmaSuballocation& suballoc = suballocations1st[index1st];
|
||||||
@ -10112,7 +10126,7 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_LowerAddress(
|
|||||||
|
|
||||||
// Special case: There is not enough room at the end for this allocation, even after making all from the 1st lost.
|
// Special case: There is not enough room at the end for this allocation, even after making all from the 1st lost.
|
||||||
if(index1st == suballocations1st.size() &&
|
if(index1st == suballocations1st.size() &&
|
||||||
resultOffset + allocSize + VMA_DEBUG_MARGIN > blockSize)
|
resultOffset + allocSize + debugMargin > blockSize)
|
||||||
{
|
{
|
||||||
// TODO: This is a known bug that it's not yet implemented and the allocation is failing.
|
// TODO: This is a known bug that it's not yet implemented and the allocation is failing.
|
||||||
VMA_DEBUG_LOG("Unsupported special case in custom pool with linear allocation algorithm used as ring buffer with allocations that can be lost.");
|
VMA_DEBUG_LOG("Unsupported special case in custom pool with linear allocation algorithm used as ring buffer with allocations that can be lost.");
|
||||||
@ -10120,8 +10134,8 @@ bool VmaBlockMetadata_Linear::CreateAllocationRequest_LowerAddress(
|
|||||||
}
|
}
|
||||||
|
|
||||||
// There is enough free space at the end after alignment.
|
// There is enough free space at the end after alignment.
|
||||||
if((index1st == suballocations1st.size() && resultOffset + allocSize + VMA_DEBUG_MARGIN <= blockSize) ||
|
if((index1st == suballocations1st.size() && resultOffset + allocSize + debugMargin <= blockSize) ||
|
||||||
(index1st < suballocations1st.size() && resultOffset + allocSize + VMA_DEBUG_MARGIN <= suballocations1st[index1st].offset))
|
(index1st < suballocations1st.size() && resultOffset + allocSize + debugMargin <= suballocations1st[index1st].offset))
|
||||||
{
|
{
|
||||||
// Check next suballocations for BufferImageGranularity conflicts.
|
// Check next suballocations for BufferImageGranularity conflicts.
|
||||||
// If conflict exists, allocation cannot be made here.
|
// If conflict exists, allocation cannot be made here.
|
||||||
@ -10285,7 +10299,7 @@ VkResult VmaBlockMetadata_Linear::CheckCorruption(const void* pBlockData)
|
|||||||
const VmaSuballocation& suballoc = suballocations1st[i];
|
const VmaSuballocation& suballoc = suballocations1st[i];
|
||||||
if(suballoc.type != VMA_SUBALLOCATION_TYPE_FREE)
|
if(suballoc.type != VMA_SUBALLOCATION_TYPE_FREE)
|
||||||
{
|
{
|
||||||
if(!VmaValidateMagicValue(pBlockData, suballoc.offset - VMA_DEBUG_MARGIN))
|
if(!VmaValidateMagicValue(pBlockData, suballoc.offset - GetDebugMargin()))
|
||||||
{
|
{
|
||||||
VMA_ASSERT(0 && "MEMORY CORRUPTION DETECTED BEFORE VALIDATED ALLOCATION!");
|
VMA_ASSERT(0 && "MEMORY CORRUPTION DETECTED BEFORE VALIDATED ALLOCATION!");
|
||||||
return VK_ERROR_UNKNOWN;
|
return VK_ERROR_UNKNOWN;
|
||||||
@ -10304,7 +10318,7 @@ VkResult VmaBlockMetadata_Linear::CheckCorruption(const void* pBlockData)
|
|||||||
const VmaSuballocation& suballoc = suballocations2nd[i];
|
const VmaSuballocation& suballoc = suballocations2nd[i];
|
||||||
if(suballoc.type != VMA_SUBALLOCATION_TYPE_FREE)
|
if(suballoc.type != VMA_SUBALLOCATION_TYPE_FREE)
|
||||||
{
|
{
|
||||||
if(!VmaValidateMagicValue(pBlockData, suballoc.offset - VMA_DEBUG_MARGIN))
|
if(!VmaValidateMagicValue(pBlockData, suballoc.offset - GetDebugMargin()))
|
||||||
{
|
{
|
||||||
VMA_ASSERT(0 && "MEMORY CORRUPTION DETECTED BEFORE VALIDATED ALLOCATION!");
|
VMA_ASSERT(0 && "MEMORY CORRUPTION DETECTED BEFORE VALIDATED ALLOCATION!");
|
||||||
return VK_ERROR_UNKNOWN;
|
return VK_ERROR_UNKNOWN;
|
||||||
@ -13583,7 +13597,7 @@ void VmaBlockVectorDefragmentationContext::Begin(bool overlappingMoveSupported,
|
|||||||
/*
|
/*
|
||||||
Fast algorithm is supported only when certain criteria are met:
|
Fast algorithm is supported only when certain criteria are met:
|
||||||
- VMA_DEBUG_MARGIN is 0.
|
- VMA_DEBUG_MARGIN is 0.
|
||||||
- All allocations in this block vector are moveable.
|
- All allocations in this block vector are movable.
|
||||||
- There is no possibility of image/buffer granularity conflict.
|
- There is no possibility of image/buffer granularity conflict.
|
||||||
- The defragmentation is not incremental
|
- The defragmentation is not incremental
|
||||||
*/
|
*/
|
||||||
@ -19234,6 +19248,7 @@ less memory consumed by metadata.
|
|||||||
|
|
||||||
With this one flag, you can create a custom pool that can be used in many ways:
|
With this one flag, you can create a custom pool that can be used in many ways:
|
||||||
free-at-once, stack, double stack, and ring buffer. See below for details.
|
free-at-once, stack, double stack, and ring buffer. See below for details.
|
||||||
|
You don't need to specify explicitly which of these options you are going to use - it is detected automatically.
|
||||||
|
|
||||||
\subsection linear_algorithm_free_at_once Free-at-once
|
\subsection linear_algorithm_free_at_once Free-at-once
|
||||||
|
|
||||||
@ -19269,7 +19284,7 @@ stacks:
|
|||||||
- First, default one, growing up from offset 0.
|
- First, default one, growing up from offset 0.
|
||||||
- Second, "upper" one, growing down from the end towards lower offsets.
|
- Second, "upper" one, growing down from the end towards lower offsets.
|
||||||
|
|
||||||
To make allocation from upper stack, add flag #VMA_ALLOCATION_CREATE_UPPER_ADDRESS_BIT
|
To make allocation from the upper stack, add flag #VMA_ALLOCATION_CREATE_UPPER_ADDRESS_BIT
|
||||||
to VmaAllocationCreateInfo::flags.
|
to VmaAllocationCreateInfo::flags.
|
||||||
|
|
||||||
![Double stack](../gfx/Linear_allocator_7_double_stack.png)
|
![Double stack](../gfx/Linear_allocator_7_double_stack.png)
|
||||||
@ -19304,11 +19319,11 @@ VmaPoolCreateInfo::maxBlockCount must be 1. Otherwise behavior is undefined.
|
|||||||
\section buddy_algorithm Buddy allocation algorithm
|
\section buddy_algorithm Buddy allocation algorithm
|
||||||
|
|
||||||
There is another allocation algorithm that can be used with custom pools, called
|
There is another allocation algorithm that can be used with custom pools, called
|
||||||
"buddy". Its internal data structure is based on a tree of blocks, each having
|
"buddy". Its internal data structure is based on a binary tree of blocks, each having
|
||||||
size that is a power of two and a half of its parent's size. When you want to
|
size that is a power of two and a half of its parent's size. When you want to
|
||||||
allocate memory of certain size, a free node in the tree is located. If it is too
|
allocate memory of certain size, a free node in the tree is located. If it is too
|
||||||
large, it is recursively split into two halves (called "buddies"). However, if
|
large, it is recursively split into two halves (called "buddies"). However, if
|
||||||
requested allocation size is not a power of two, the size of a tree node is
|
requested allocation size is not a power of two, the size of the allocation is
|
||||||
aligned up to the nearest power of two and the remaining space is wasted. When
|
aligned up to the nearest power of two and the remaining space is wasted. When
|
||||||
two buddy nodes become free, they are merged back into one larger node.
|
two buddy nodes become free, they are merged back into one larger node.
|
||||||
|
|
||||||
@ -19317,7 +19332,6 @@ two buddy nodes become free, they are merged back into one larger node.
|
|||||||
The advantage of buddy allocation algorithm over default algorithm is faster
|
The advantage of buddy allocation algorithm over default algorithm is faster
|
||||||
allocation and deallocation, as well as smaller external fragmentation. The
|
allocation and deallocation, as well as smaller external fragmentation. The
|
||||||
disadvantage is more wasted space (internal fragmentation).
|
disadvantage is more wasted space (internal fragmentation).
|
||||||
|
|
||||||
For more information, please search the Internet for "Buddy memory allocation" -
|
For more information, please search the Internet for "Buddy memory allocation" -
|
||||||
sources that describe this concept in general.
|
sources that describe this concept in general.
|
||||||
|
|
||||||
@ -19945,10 +19959,19 @@ The format of this string differs from the one returned by the main Vulkan alloc
|
|||||||
|
|
||||||
\section virtual_allocator_additional_considerations Additional considerations
|
\section virtual_allocator_additional_considerations Additional considerations
|
||||||
|
|
||||||
Note that the "virtual allocator" functionality is implemented on a level of individual memory blocks.
|
The "virtual allocator" functionality is implemented on a level of individual memory blocks.
|
||||||
Keeping track of a whole collection of blocks, allocating new ones when out of free space,
|
Keeping track of a whole collection of blocks, allocating new ones when out of free space,
|
||||||
deleting empty ones, and deciding which one to try first for a new allocation must be implemented by the user.
|
deleting empty ones, and deciding which one to try first for a new allocation must be implemented by the user.
|
||||||
|
|
||||||
|
Alternative allocation algorithms are supported, just like in custom pools of the real GPU memory.
|
||||||
|
See enum #VmaVirtualBlockCreateFlagBits to learn how to specify them (e.g. #VMA_VIRTUAL_BLOCK_CREATE_LINEAR_ALGORITHM_BIT).
|
||||||
|
You can find their description in chapter \ref custom_memory_pools.
|
||||||
|
Allocation strategies are also supported.
|
||||||
|
See enum #VmaVirtualAllocationCreateFlagBits to learn how to specify them (e.g. #VMA_VIRTUAL_ALLOCATION_CREATE_STRATEGY_MIN_TIME_BIT).
|
||||||
|
|
||||||
|
Following features are supported only by the allocator of the real GPU memory and not by virtual allocations:
|
||||||
|
buffer-image granularity, \ref lost_allocations, `VMA_DEBUG_MARGIN`, `VMA_MIN_ALIGNMENT`.
|
||||||
|
|
||||||
|
|
||||||
\page debugging_memory_usage Debugging incorrect memory usage
|
\page debugging_memory_usage Debugging incorrect memory usage
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user