This change makes the bspline patch tess control/hull shader revert to
control vertex mirroring for boundary edges when the patch sharpness is
zero. This helps the driver's shader code generation and improves
L1 cache behavior on (at least) Kepler GPUs with recent drivers.
Re-organized the single-crease patch code path in the tessellation
control/hull shaders to improve performance in the case where no patches
have non-zero crease sharpnesses.
Changed a couple of local 4x4 matrices to global uniforms to
work around a performance problem on recent GL drivers.
There are two local 4x4 float matrices with constant initializers
in the function OsdComputePerPatchVertexBSpline(...). Changing
these from local variables to global initialized uniforms improves
performance dramatically on recent NVIDIA drivers (e.g. 361.48 on Windows).
There is no such difference with Direct3D, but this change updates
the shader code for both implementations for consistency.
For now, the common patch shader code supports fractional spacing
modes only when screen-space tessellation is also enabled.
It's possible to relax this restriction, but that requires changing
the client shader interface.
This change includes support for both fractional_even_spacing
and fractional_odd_spacing.
The implementation follows the existing pattern of re-parameterizing
the tessellation domain only along transition boundary edges (a rough
sketch follows below). This allows for crack-free tessellation, but it
might be better to consistently re-parameterize all of the outer edges
of all patches, which would also be required for numerically watertight
tessellation.
This is implemented in a way that requires no changes to the client
shader API. It should be more efficient to move some computations to
the control/hull shaders and reduce divergence in the execution of
eval/domain shaders.
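As a rough illustration of the kind of re-parameterization involved (the
function name and form here are hypothetical, not the shader library's
actual code), mapping a coordinate on one half of a split transition edge
back into the unsplit neighbor's edge parameterization looks like:

// Illustrative sketch only: maps a parametric coordinate t in [0,1] on
// one half of a split transition edge into the [0,1] parameterization
// of the unsplit edge on the adjacent patch, so both sides can sample
// matching points along the seam.
float ReparameterizeHalfEdge(float t, bool isSecondHalf) {
    return isSecondHalf ? 0.5f * t + 0.5f : 0.5f * t;
}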
The issue here is that some of the functions were no longer considered
templated because all of their template arguments were specified, so the
compiler was creating an implementation of them in every file that
included the header. This caused errors during linking about the same
symbol being defined in several places.
Marking those functions inline solves the problem and should not cause
any bad side effects: they're small enough that they are likely being
inlined by the optimizer anyway.
This change restores the use of 4-bits in Far::PatchParam to
encode the refinement level of a patch. This restores one bit
that was stolen to allow for more general encoding of boundary
edge and transition edge masks. In order to accommodate all
of the bits that are required, the transition edge mask bits
are now stored along with the faceId bits.
Also, accessors are now exposed directly as members of Far::PatchParam
and the internal bitfield class is no longer directly exposed.
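A minimal sketch of the kind of packing described (the 4-bit refinement
level and the co-location of the transition mask with the faceId bits
come from this change; all other field widths and names here are
hypothetical, not the actual Far::PatchParam layout):

struct PatchParamSketch {
    unsigned int field0;   // faceId stored together with the transition edge mask
    unsigned int field1;   // refinement level, boundary edge mask, etc.

    void SetFaceBits(unsigned int faceId, unsigned int transitionMask) {
        field0 = (faceId << 4) | (transitionMask & 0xf);
    }
    unsigned int GetFaceId() const     { return field0 >> 4; }
    unsigned int GetTransition() const { return field0 & 0xf; }

    void SetLevelBits(unsigned int level, unsigned int boundaryMask) {
        field1 = ((boundaryMask & 0xf) << 4) | (level & 0xf); // 4 bits for the level
    }
    unsigned int GetDepth() const    { return field1 & 0xf; }
    unsigned int GetBoundary() const { return (field1 >> 4) & 0xf; }
};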
Unified transition patch drawing affects the calculation of
tessellation level metrics. Because a single edge of a shader
patch might be split into two halves along a transition edge,
the effective maximum number of spans along any adaptive edge
is limited to half of the device maximum.
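A sketch of the resulting clamp (the function and variable names are
illustrative; the device maximum would come from GL_MAX_TESS_GEN_LEVEL
or its D3D equivalent):

// Since one shader-patch edge may be split into two halves along a
// transition edge, each half can only be granted half of the device's
// maximum tessellation spans.
float ClampAdaptiveTessLevel(float requestedLevel, float deviceMaxLevel) {
    float effectiveMax = 0.5f * deviceMaxLevel;
    return requestedLevel < effectiveMax ? requestedLevel : effectiveMax;
}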
Besides the fact that we have not been computing accurate derivatives
on gregory patches, there was a separate bug in the shaders which gave
completely bogus dUdV at corner vertices. This change fixes that
significant artifact; however, it still approximates derivatives by
ignoring the rational components.
- add HLSL equivalents of the previous GLSL change
- rename OsdGetSingleCreaseSegmentParameter to
OsdGetPatchSingleCreaseSegmentParameter.
- add shadingMode UI for dxViewer similar to glViewer
Use boundaryMask to identify the crease edge among the 4 edges.
With this change, a single-crease patch no longer needs to be rotated
when it is populated.
In the shaders, experimentally use the same infinitely sharp matrix for
both boundary and single-crease patches.
Added a size specifier to the shader output array declaration
in the GregoryBasis and Gregory control shaders. This seems
to be required by the GLSL compiler on AMD and is harmless elsewhere.
Added a size specifier to the shader output array declaration
in the BSpline control shader. This seems to be required by the
GLSL compiler on AMD and is harmless elsewhere.
This change refactors the GLSL and HLSL patch shader code so that
most of the work is implemented within a library of common functions
and the remaining shader snippets just manage plumbing.
There is more to do here:
- varying and face-varying data can be managed entirely by the client
- similarly, displacement can be implemented in client code
- there's still quite a bit of residual boiler-plate code needed
in each shader stage that we should be able to wrap up in a more
convenient form.
To encapsulate the endcap functions from the public API, add methods to
query the number of patch points needed (GetNumLocalPoints()) and
to compute those patch points as a change of basis from
the refined vertices (ComputeLocalPointValues()).
ComputeLocalPointValues takes contiguous source data of all levels,
including the level 0 control vertices.
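A usage sketch, assuming these methods are exposed on Far::PatchTable
(the message doesn't name the class); the Vertex type and the buffer
sizing here are illustrative:

#include <vector>
#include <opensubdiv/far/patchTable.h>

using namespace OpenSubdiv;

// Minimal primvar type providing the Clear()/AddWithWeight() interface.
struct Vertex {
    float p[3];
    void Clear() { p[0] = p[1] = p[2] = 0.0f; }
    void AddWithWeight(Vertex const &src, float w) {
        p[0] += w * src.p[0]; p[1] += w * src.p[1]; p[2] += w * src.p[2];
    }
};

void AppendEndCapPoints(Far::PatchTable const *patchTable,
                        std::vector<Vertex> &verts,  // level 0 + all refined levels
                        int numBaseAndRefinedVerts) {
    // Make room for the extra patch points required by the endcaps ...
    verts.resize(numBaseAndRefinedVerts + patchTable->GetNumLocalPoints());
    // ... and compute them as a change of basis from the refined vertices.
    // The source must be the contiguous data of all levels, level 0 included.
    patchTable->ComputeLocalPointValues(
        &verts[0], &verts[numBaseAndRefinedVerts]);
}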
It looks like there's a compiler bug in some earlier NVIDIA driver
releases (340/346). It has been fixed in 348.07 (win) as far as I can tell.
The following code behaves incorrectly:
void f(int a) {
    for (int i=0; i<3; ++i) doSomething(a, i);   // loop counter named 'i'
}
void g() {
    for (int i=0; i<100; ++i) f(i);              // same identifier 'i' reused here
}
The workaround is to use different identifiers for each function.
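Sketched as plain C-style code, the workaround is simply:

void doSomething(int a, int i);

// Workaround: give each function's loop counter a distinct identifier
// so the buggy drivers cannot conflate them across functions.
void f(int a) {
    for (int fi = 0; fi < 3; ++fi) doSomething(a, fi);
}
void g() {
    for (int gi = 0; gi < 100; ++gi) f(gi);
}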
Add EvalStencils and EvalPatches API to most of the CPU and GPU evaluators.
With this change, the Eval API in the osd layer consists of the following
parts (a usage sketch follows after this list):
- Evaluators (Cpu, Omp, Tbb, Cuda, CL, GLXFB, GLCompute, D3D11Compute)
implement EvalStencils and EvalPatches(*). Both support derivatives
(not fully implemented yet).
- Interop vertex buffer classes (optional, same as before)
Note that these classes are not necessary to use the Evaluators.
All evaluators have EvalStencils/Patches overloads which take device-specific
buffer objects. For example, GLXFBEvaluator can take a GLuint directly
for both stencil tables and input primvars. However, using these
interop classes makes it easy to integrate osd into relatively
simple applications.
- device-dependent StencilTable and PatchTable (optional)
These can be used simply as substitutes for Far::StencilTable and
Far::PatchTable with the osd evaluators.
- PatchArray, PatchCoord, PatchParam
These are tiny structs used for GPU-based patch evaluation.
(*) TODO and known issues:
- CLEvaluator's and D3D11Evaluator's EvalPatches() have not been implemented.
- GPU Gregory patch evaluation has not been implemented in EvalPatches().
- CudaEvaluator::EvalPatches() is very unstable.
- None of the patch evaluation kernels have been well optimized.
- The GLXFB kernel doesn't currently support derivative evaluation;
there's a technical difficulty with the multi-stream output.
- EvalPatches() currently takes a raw count and pointer for the input PatchCoords.
- add derivative evaluations.
- enhance the glEvalLimit example to verify that derivative evaluation works.
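A usage sketch of the stencil path, assuming the CpuEvaluator and
CpuVertexBuffer signatures follow the pattern described above (the exact
argument lists may differ):

#include <opensubdiv/far/stencilTable.h>
#include <opensubdiv/osd/cpuEvaluator.h>
#include <opensubdiv/osd/cpuVertexBuffer.h>

using namespace OpenSubdiv;

void RefineWithStencils(Far::StencilTable const *stencilTable,
                        Osd::CpuVertexBuffer *srcVerts,
                        Osd::CpuVertexBuffer *dstVerts) {
    // 3 floats per vertex, tightly packed, starting at offset 0.
    Osd::VertexBufferDescriptor srcDesc(/*offset*/ 0, /*length*/ 3, /*stride*/ 3);
    Osd::VertexBufferDescriptor dstDesc(0, 3, 3);

    // The cpu evaluator is a set of static functions; no instance needed.
    Osd::CpuEvaluator::EvalStencils(srcVerts, srcDesc,
                                    dstVerts, dstDesc,
                                    stencilTable);
}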
Cleaned up the Legacy Gregory shader source by accessing buffer
data through helper functions.
Switched to performing tessellation in untransformed (object) space.
In the osd layer, we use GLPatchTable (D3D11PatchTable) as a
device-specific representation of FarPatchTables instead of
DrawContext. GLPatchTable may be used not only for drawing
but also for the GPU eval APIs (not yet supported, though;
we may add CudaPatchTable etc. as needed).
The legacy gregory patch drawing buffers are carved out into a
separate class, named GLLegacyGregoryPatchTable.
Also, face-varying data handling is moved to the client side for now,
until we add a new and more robust face-varying drawing structure
(scheduled for the 3.1 release).
Tentatively replicate the PatchArray structure in GLPatchTable. It will
be revised in an upcoming change.
Shifting hard-coded SRV locations of legacy gregory buffers in HLSL shaders.
- this change completely deprecates AddWithWeight(T, float, float, float)
- added a new EvaluateBasis() method to PatchTables (usage sketch below)
- replaced usage of the old Evaluate...<T,U>() methods with EvaluateBasis()
- removed the old Evaluate...<T,U>() methods
- removed the now unused Interpolate...<T,U>() functions in far/interpolate.h
- moved the low-level basis code from far/interpolate.* to patchBasis.*
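A sketch of limit evaluation through the new method. The PatchMap lookup
and the weight-array form follow the common Far usage pattern, using the
class names of this era (PatchTables); the buffer handling is illustrative:

#include <opensubdiv/far/patchTables.h>
#include <opensubdiv/far/patchMap.h>

using namespace OpenSubdiv;

// Evaluate a limit position on 'face' at (u, v) from refined control points.
void EvalPoint(Far::PatchTables const &patchTables,
               Far::PatchMap const &patchMap,
               int face, float u, float v,
               float const (*verts)[3], float result[3]) {
    Far::PatchTables::PatchHandle const *handle =
        patchMap.FindPatch(face, u, v);
    if (!handle) return;

    float wP[20], wDu[20], wDv[20];  // 20 weights covers gregory-basis patches
    patchTables.EvaluateBasis(*handle, u, v, wP, wDu, wDv);

    Far::ConstIndexArray cvs = patchTables.GetPatchVertices(*handle);
    result[0] = result[1] = result[2] = 0.0f;
    for (int i = 0; i < cvs.size(); ++i)
        for (int k = 0; k < 3; ++k)
            result[k] += wP[i] * verts[cvs[i]][k];
}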
hlslPatchGregoryBasis.hlsl is the HLSL equivalent of glslPatchGregoryBasis.
Update dxViewer to be able to switch among bspline, gregorybasis, and
legacy end capping.
This also fixes a bug in the GLSL legacy gregory shader, which had
resource names inconsistent with the example code.
It looks like there's still an issue with D3D11 patchParam data fetching;
we'll come back to that bug.
While this may be worth revisiting, we should first quantify the benefits and
identify the compilers that support it. Ultimately, we may never use pragma
once in favor of strictly using standard C++.
As preparation for retiring DrawContext, move the
SupportsAdaptiveTessellation method to examples/common/glUtils, which is
renamed and namespaced from gl_common.{cpp,h} to be consistent with the
other files. The same renaming is applied to the other example files.
Remove DrawRegistry from the osd layer and put a simple shader caching
utility into examples/common. The osd layer now only provides the patch
shader snippets, and the client configures and compiles the code. Clients
also manage the lifetime of the shader objects, which is preferable for
real application integration.
Update all examples to use the new scheme.
These are now redundant since all bspline patches are encoded in
the patch tables consistently using 16 point indices with boundary
and corner edges indicated in the boundary mask of the patch params.
Since the unified shading work already removed subPatch info from
Osd::PatchDescriptor, the difference between Far::PatchDescriptor and
Osd::PatchDescriptor is just maxValence and numElements, which are used
for legacy gregory patch drawing.
Both maxValence and numElements are actually constant within a topology
(drawContext). This change moves maxValence to DrawContext and lets the
client manage numElements; then we can eliminate Osd::PatchDescriptor
and simply use Far::PatchDescriptor instead.
This is still an intermediate step toward further DrawRegistry
refactoring. For the time being, an EffectDesc struct is added to carry
maxValence and numElements, to be maintained by the clients (see the
sketch below). They will be cleaned up later.
A side benefit of this change is that we no longer need to recompile the
regular b-spline shaders for different max valences.
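A sketch of what that struct might look like (the member set is taken
from this message; anything beyond maxValence/numElements is up to the
client):

// Per-effect description maintained by the client rather than by
// Osd::PatchDescriptor; both values are constant within a topology.
struct EffectDesc {
    int maxValence;    // max vertex valence, needed for legacy gregory drawing
    int numElements;   // primvar element count, managed by the client
};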
- Remove the MeshPtexData bit from Osd::MeshBits. It's not used anymore.
- Rename ptexIndexBuffer in D3D11DrawContext to patchParamBuffer.
- Remove Is/SetPtexEnabled from D3D11DrawRegistry.
In OpenSubdiv 2.x, we encapsulated the subdivision tables in a compute
context in the osd layer, since those tables are order-dependent and
have to be applied in a certain manner. In 3.0, we adopted stencil table
based refinement, which is simpler, so such an encapsulation is no
longer needed. The 2.0 API also had several ownership issues around GPU
kernel caching, and it forced unnecessary instantiation of controllers
even though the cpu kernels, unlike the GPU ones, typically don't need
instances.
This change completely revisits the osd client-facing APIs. All contexts
and controllers are replaced with device-specific tables and evaluators.
While we can still use a consistent API across the various device
backends, unnecessary complexity has been removed. For example, the cpu
evaluator is just a set of static functions, and there's no need to
replicate FarStencilTables into a ComputeContext.
The new API also delegates ownership of the compiled GPU kernels to
clients, for better resource management, especially in multi-GPU
environments.
In addition to integrating ComputeController and EvalStencilController
into a single function, Evaluator::EvalStencils(), an EvalLimit API is
also added to the Evaluator. This is working but still in progress, and
we'll make a follow-up change for the complete implementation.
- some naming convention changes:
  GLSLTransformFeedback to GLXFBEvaluator
  GLSLCompute to GLComputeEvaluator
- move the LimitLocation struct into examples/glEvalLimit.
We're still discussing the patch evaluation interface. Basically we'd
like to tease all ptex-specific parametrization out of the far/osd layers.
TODO:
- implement EvalPatches() in the right way
- the derivative evaluation API is still interim
- VertexBufferDescriptor needs a better API to advance its location
- the synchronization mechanism is not ideal (too global)
- the OsdMesh class is hacky; need to fix it
Changing all device kernels to take two buffer identifiers, for the
source and destination separately.
This change is an intermediate step toward the upcoming
context/controller refactoring.
Previously we had the limitation that the source and destination vertex
buffers had to be a single buffer, since the subdivision kernels were
applied iteratively, level by level. With stencil tables, we don't have
such a limitation any more, so we may want to apply stencils from a
separate source buffer into another destination buffer.
To specify the output location within the destination buffer, we can use
VertexBufferDescriptor.offset. This allows us not only to configure
arbitrary batching schemes, but also to relax the limitation that the
source and destination buffers share the same interleaved layout. For
example, we could include derivatives only in the destination buffer, so
they need not be allocated in the source buffer (see the sketch below).
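A sketch of the descriptors for that example (the constructor order
offset/length/stride is assumed from the common VertexBufferDescriptor
usage; the interleaved destination layout shown is illustrative):

#include <opensubdiv/osd/vertexDescriptor.h>

using namespace OpenSubdiv;

// Source buffer: tightly packed positions, 3 floats per vertex.
Osd::VertexBufferDescriptor srcDesc(/*offset*/ 0, /*length*/ 3, /*stride*/ 3);

// Destination buffer: position + du + dv interleaved, 9 floats per
// vertex. The derivatives exist only in the destination layout.
Osd::VertexBufferDescriptor dstDesc(0, 3, 9);
Osd::VertexBufferDescriptor duDesc(3, 3, 9);
Osd::VertexBufferDescriptor dvDesc(6, 3, 9);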
We're teasing ptex-specific data out of the core osd entities, so
there's no reason to keep the ptex texturing utilities in core osd.
Move them into the example libs and let clients assemble shader snippets
as needed.
Also remove the older ptex texturing code (without mipmap support).
Each patch has a corresponding patchParam. This is a set of three values
specifying additional information about the patch:
faceId -- topological face identifier (e.g. Ptex FaceId)
bitfield -- refinement-level, non-quad, boundary, transition, uv-offset
sharpness -- crease sharpness for a single-crease patch
These are stored in the OsdPatchParamBuffer, indexed by the value
returned from OsdGetPatchIndex(), which is a function of the current
PrimitiveID along with an optional client-provided offset.
Accessors are provided to extract values from a patchParam. These are
all named OsdGetPatch*().
While drawing patches, the patchParam is condensed into a patchCoord,
which has four values (u, v, faceLevel, faceId). These patchCoords are
treated as int values during per-prim processing, but are converted to
float values during per-vertex processing, where the values are
interpolated.
Also, cleaned up more of the shader namespace by giving an Osd prefix
to public functions, and consolidated boundary and transition handling
code into the PatchCommon shader files. The functions determining
tessellation levels are now all named OsdGetTessLevel*().
- resolve the DX-CL interop functions in Osd::ClD3D11VertexBuffer.
- enable CL kernels in the DX build.
- more cleanup in the test harnesses, adding D3D11 initialization to
  DeviceContext.
- add new defines OPENSUBDIV_HAS_OPENGL and OPENSUBDIV_HAS_DX for
  convenience.
Refactor the CL/CUDA-specific initialization code into
examples/common/clDeviceContext and cudaDeviceContext, and
update the examples to use those structs.
Also:
- remove the CL/CUDA tests from osd_regression. The tests for those
  kernels will be covered by glImaging.
- update the cuda initialization to use the GL-interoperable device if
  available.
- remove the CL specialization from glShareTopology, following the same
  pattern we took in the previous OsdGLMesh refactoring (still something
  strange with the XFB kernels, though).
- fix file permissions.
Removed the OpenCL/D3D11 specializations and added DEVICE_CONTEXT as a
template parameter. Kernels which don't need a context object (e.g.
CPU, OpenGL, cuda) just ignore the context, while kernels which use one
(e.g. OpenCL, DirectX) take a context, or a user-defined class which
encapsulates the device contexts. Note that OpenCL requires two objects,
cl_context and cl_command_queue; the user-defined class must provide
GetContext() and GetCommandQueue() for strongly typed binding to the osd
VertexBuffers and ComputeContexts.
Osd::Mesh and MeshInterface have been a handy harness for hosting
multiple GPU kernels and graphics APIs. However, they had CL/DirectX
specializations and duplicated a large amount of plumbing code. With
this change, glMesh.h and d3d11Mesh.h become just typedefs, and all the
logic is put into mesh.h without specializations.
Also cleaned up unused header files and code formatting.
- rename "Regular end cap" to "BSplineBasis end cap"
- revert templating and add EndCapType into PatchTablesFactory::Options.
- make EndCapFactories internal in PatchTablesFactory.
- move end cap stencils into PatchTables, keep them relative to the max level.
- add a utility StencilTablesFactory::AppendEndCapStencilTables to splice and factorize endcap stencil tables.
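A usage sketch of the splicing utility; the signature is assumed from
the description above (the factory may take additional options):

#include <opensubdiv/far/stencilTablesFactory.h>

using namespace OpenSubdiv;

Far::StencilTables const *
BuildStencilsWithEndCaps(Far::TopologyRefiner const &refiner,
                         Far::StencilTables const *baseStencils,
                         Far::StencilTables const *endCapStencils) {
    // The endcap stencils are stored relative to the max refinement
    // level, so they are spliced onto and factorized against the base
    // table to produce a single table covering all points.
    return Far::StencilTablesFactory::AppendEndCapStencilTables(
        refiner, baseStencils, endCapStencils);
}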
Compute edge lengths using limit surface points. Made this
the default screen-space metric so that we avoid cracks when
using Gregory Basis or Regular B-spline end caps.
The alternative method, which computes edge lengths using the
distance between B-spline control points, is still available.
Added a diagram and comments to explain how the control
points and limit points are organized.
This change moves all gregory patch generation out of
Far::PatchTablesFactory, so that we can construct patch tables without
stencil tables, and clients can choose any end patch strategy (we have 3
options for now: legacy 2.x style gregory patches, gregory basis patches,
and an experimental regular patch approximation).
Also, Far::EndCapGregoryBasisPatchFactory provides an index mapping from
patch index to vtr face index, which can be used for single gregory patch
evaluation on top of the refined points, without involving the heavier
stencil table generation.
This is the first step toward teasing the Osd compute
controllers/contexts away from the Far API.
Currently FarStencilTable only creates a kernel batch for the entire
range, so we can use [0, numStencils) for all cases instead of
KernelBatch (see the sketch below). This might not hold if we apply
non-factorized level-wise stencils; we'll then add another modular
utility to serve those cases.
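A sketch of what the single full-range application looks like (the
Vertex type is illustrative; the accessors follow the Far stencil
interface and the plural class naming of this era):

#include <opensubdiv/far/stencilTables.h>

using namespace OpenSubdiv;

struct Vertex {
    float p[3];
    void Clear() { p[0] = p[1] = p[2] = 0.0f; }
    void AddWithWeight(Vertex const &src, float w) {
        p[0] += w * src.p[0]; p[1] += w * src.p[1]; p[2] += w * src.p[2];
    }
};

// With fully factorized stencils the whole table is applied in one pass
// over [0, numStencils) -- no KernelBatch bookkeeping required.
void ApplyStencils(Far::StencilTables const &stencils,
                   Vertex const *src, Vertex *dst) {
    for (int i = 0; i < stencils.GetNumStencils(); ++i) {
        Far::Stencil st = stencils.GetStencil(i);
        dst[i].Clear();
        for (int j = 0; j < st.GetSize(); ++j)
            dst[i].AddWithWeight(src[st.GetVertexIndices()[j]],
                                 st.GetWeights()[j]);
    }
}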