OpenSubdiv

mirror of https://github.com/PixarAnimationStudios/OpenSubdiv synced 2024-11-23 12:10:08 +00:00

Author	SHA1	Message	Date
David G. Yu	c588e95e13	Disabled fractional spacing w/o screen space tess For now, the common patch shader code supports fractional spacing modes only when screen-space tessellation is also enabled. It's possible to relax this restriction, but that requires changing the client shader interface.	2015-11-18 18:54:19 -08:00
David G. Yu	b0d612f837	Restored support for fractional tessellation This change includes support for both fractional_even_spacing and fractional_odd_spacing. The implementation follows the existing pattern of re-parameterizing the tessellation domain only along transition boundary edges. This allows for crack-free tessellation, but it might be better to consistently re-parameterize all of the outer edges of all patches, which also would be required for numerically watertight tessellation. This is implemented in a way that requires no changes to the client shader API. It should be more efficient to move some computations to the control/hull shaders and reduce divergence in the execution of eval/domain shaders.	2015-10-23 15:24:35 -07:00
David G. Yu	6bbf0f78ce	Fixed out of bounds mem access for OMP and TBB	2015-09-30 13:26:35 -07:00
Jeremy Cowles	f744e20c4b	Merge pull request #742 from anshuarya/dev Set layout to std430 for GLSL Compute kernel	2015-09-21 23:50:56 -07:00
Takahito Tejima	007a2e8885	Removing debug code that mistakenly slipped into glPatchTable.cpp. Also use GL_ARRAY_BUFFER for the binding point of updating indices instead of GL_ELEMENT_ARRAY_BUFFER	2015-09-21 14:33:11 -07:00
Anshu Arya	4be4b25cc9	Set layout to std430 for GLSL Compute kernel Fixes corrupted mesh issues with GLSL Compute on AMD platforms.	2015-09-21 09:01:47 -07:00
Jeremy Cowles	84ff9d3604	Merge pull request #728 from significant-bit/win32-lean use WIN32_LEAN_AND_MEAN properly	2015-09-20 22:36:25 -07:00
Sergey Sharybin	b25c0dd777	Fix GLSL program being deleted twice in GLSL Compute controller GLSL programs used by stencil and patch kernels were freed from both GLSL Compute Evaluator and from kernel classes themselves.	2015-09-10 20:00:42 +05:00
Mike Erwin	585858c1e4	use WIN32_LEAN_AND_MEAN properly Misspelled W32_LEAN_AND_MEAN in osd/opengl.h had no effect on windows.h. Also added to example code wherever windows.h is included.	2015-08-07 17:45:41 -04:00
David G Yu	e897a1c0e5	Merge pull request #702 from daz3d/cmake_error_on_no_gpu Fixes cmake projects for non-gpu use case	2015-08-04 08:00:58 -07:00
Jeremy Cowles	5f393dbf23	Merge pull request #703 from Nazg-Gul/dev Fixing linking errors on both Linux and Windows	2015-08-03 22:51:52 -07:00
barfowl	db3fe9a8e8	Reduced warnings resulting from GCC's -Wshadow option - eliminated warnings from core libraries and regression code	2015-07-29 18:46:18 -07:00
Takahito Tejima	8a8771c97d	Revert "Add ISPC limit surface evaluation"	2015-07-20 17:13:51 -07:00
Sheng Fu	d3f8725e79	Add ISPC limit surface evaluation	2015-07-20 14:12:11 -07:00
Sergey Sharybin	a095ebf1f0	Fix linking error when using GCC and including osd/mesh.h from several places The issue here is that some of the functions were no longer considered templated because all their template arguments were specified, so the compiler created an implementation for them in every file that included the header. This causes linking errors because the same symbol is implemented in several places. Marking those functions as inline solves the problem and should not cause any bad side effects because they're small enough and are likely being inlined by an optimizer anyway.	2015-07-19 18:06:54 +02:00
Sheng Fu	59382abeaa	Fix Intel compiler warning	2015-07-16 09:27:48 -07:00
Christopher Jones	87dc09e769	Fix cmake projects to not require GLFW when NO_OPENGL provided and not build gpu lib if not needed.	2015-07-15 23:55:14 -06:00
Takahito Tejima	51e1833139	Merge pull request #699 from daz3d/tbb_offset_fixes Fix offsets in TbbEvalStencils when start is non-zero.	2015-07-13 11:40:49 -07:00
Christopher Jones	89b698e519	Fix offsets in TbbEvalStencils when start is non-zero.	2015-07-12 16:03:09 -06:00
Christopher Jones	d317cbc86b	Fix offsets in OmpEvalStencils when start is non-zero.	2015-07-05 11:22:38 -06:00
Takahito Tejima	e008776325	fix legacy gregory patch shader to support higher valence Note that the actual limitation stays same, due to hardware capability (GL_MAX_VARYING_VECTORS)	2015-06-22 11:08:11 -07:00
Takahito Tejima	8b7e55b6f5	Fix undesirable doxygen annotations in osd.	2015-06-18 18:12:08 -07:00
David G Yu	5c914a0622	Fixed misspelled point indices in shader comments	2015-06-17 11:18:52 -07:00
David G. Yu	d4a18a6c98	Fixed crash in GL viewers w/ LegacyGregory Fixed a crash (assertion failure on Windows) that occurs when creating GLLegacyGregoryPatchTable for a mesh that requires no Gregory patches.	2015-06-11 17:30:14 -07:00
David G. Yu	f0128a5f5e	Fixed Far::PatchParam encoding of refinement level This change restores the use of 4-bits in Far::PatchParam to encode the refinement level of a patch. This restores one bit that was stolen to allow for more general encoding of boundary edge and transition edge masks. In order to accommodate all of the bits that are required, the transition edge mask bits are now stored along with the faceId bits. Also, accessors are now exposed directly as members of Far::PatchParam and the internal bitfield class is no longer directly exposed.	2015-06-11 15:10:30 -07:00
David G. Yu	80fe31225c	Fixed the tessellation metrics calculations Unified transition patch drawing affects the calculation of tessellation level metrics. Because a single edge of a shader patch might be split into two halves along a transition edge, the effective maximum number of spans along any adaptive edge is limited to half of the device maximum.	2015-06-03 18:01:19 -07:00
Takahito Tejima	16d566884a	Fix gregory patch shader: wrong tangents on corner corners Besides we've not been computing accurate derivatives on gregory patch, there was a separate bug in shaders which gives completely bogus dUdV on corner vertices. This change fixes that significant artifact, however, is still approximating derivatives by ignoring rational components.	2015-06-02 18:27:36 -07:00
Takahito Tejima	670ce267f3	Fix tessellation cracks when screen space tessellation is turned off. The tess factor of transition edges has to be clamped to 2.	2015-06-02 17:08:17 -07:00
Takahito Tejima	5c54962d3b	Improve OpenCL/DX interop configuration build osd and examples with DX/CL interop only if cmake finds cl_d3d11.h or cl_d3d11_ext.h.	2015-06-02 16:20:59 -07:00
Takahito Tejima	1e02fbebf8	Sync HLSL shaders for the single crease orientation fix. - add HLSL equivalents of the previous GLSL change - rename OsdGetSingleCreaseSegmentParameter to OsdGetPatchSingleCreaseSegmentParameter. - add shadingMode UI for dxViewer similar to glViewer	2015-06-02 12:03:27 -07:00
Takahito Tejima	fe92265db9	Unlabel the rotation of single crease patch. use boundaryMask to identify the crease edge from 4 edges. with this change, single-crease patch no longer needs to be rotated on its population. In shader, experimentally use same infinite sharp matrix for both boundary and single-crease patch.	2015-06-01 21:26:39 -07:00
David G. Yu	1b6d9ba409	Workaround GLSL shader compiler on AMD Added a size specifier to the shader output array declaration in the GregoryBasis and Gregory control shaders. This seems to be required by the GLSL compiler on AMD and is harmless elsewhere.	2015-05-31 10:59:01 -07:00
David G. Yu	99b214ca92	Workaround GLSL shader compiler on AMD Added a size specifier to the shader output array declaration in the BSpline control shader. This seems to be required by the GLSL compiler on AMD and is harmless elsewhere.	2015-05-31 10:26:50 -07:00
David G. Yu	bd6d9b4ea9	Sync'd screen-space single-crease patch fix to DX	2015-05-31 08:18:18 -07:00
David G Yu	4bf1ef722c	Merge pull request #564 from takahito-tejima/singlecrease Fix screen-space tessellation on single crease patch.	2015-05-30 23:58:39 -07:00
George ElKoura	cb5044c462	Fix out of bounds mem access in CPU stencil eval. When the stencil table contains no stencils, bail out on the EvalStencil before indexing into the stencil table.	2015-05-30 22:55:43 -07:00
Takahito Tejima	2102b4f6b2	Fix screen-space tessellation on single crease patch. - Rename OsdPerPatchVertexBSpline to OsdPerPatchVertexBezier - Split OsdGetTessLevels into 3 functions, OsdGetTessLevelUniform, OsdGetTessLevelAdaptiveRefinedPoints, OsdGetTessLevelAdaptiveLimitPoints. - Add a regression test shape : catmark_single_crease	2015-05-30 22:41:24 -07:00
David G. Yu	c245407854	Re-organized patch tessellation shader code This change refactors the GLSL and HLSL patch shader code so that most of the work is implemented within a library of common functions and the remaining shader snippets just manage plumbing. There is more to do here: - varying and face-varying data can be managed entirely by the client - similarly, displacement can be implemented in client code - there's still quite a bit of residual boiler-plate code needed in each shader stage that we should be able to wrap up in a more convenient form.	2015-05-29 23:00:19 -07:00
Takahito Tejima	4e807a776d	Add Far::PatchTable::ComputeLocalPointValues() to compute endcap patch points. To encapsulate endcap functions from public API, add methods to tell the number of patch points needed (GetNumLocalPoints()) and to compute those patch points as a result of change of basis from the refined vertices (ComputeLocalPointValues()). ComputeLocalPointValues takes contiguous source data of all levels including level0 control vertices.	2015-05-29 12:41:22 -07:00
Takahito Tejima	f9523f916f	osdlayer: rename Osd::VertexBufferDescriptor to Osd::BufferDescriptor. A mechanical substitution of VertexBufferDescriptor with BufferDescriptor. Also removing unused files from osd.	2015-05-29 09:21:14 -07:00
Takahito Tejima	9cfdb6c5c1	Mac build and shader compiler error fix.	2015-05-28 10:27:23 -07:00
Takahito Tejima	9dcbaee147	Add GPU stencil/patch derivative kernels implements cuda, CL, GLXFB and GLCompute kernels of derivative evaluation for both limit stencils and patches.	2015-05-27 17:23:36 -07:00
Takahito Tejima	49a16539dd	A workaround for the GLSL compute crash on some nvidia drivers It looks like there's a compiler bug in some earlier nvidia driver 340/346 releases. It has been fixed in 348.07 (win) as far as I can tell. Following code behaves incorrectly. void f(int a) { for (int i=0; i<3; ++i) doSomething(a, i); } void g() { for (int i=0; i<100; ++i) f(i); } The workaround is to use different identifiers for each function.	2015-05-27 08:57:47 -07:00
Takahito Tejima	749bbf4271	Fix OpenCL and CUDA EvalPatches kernels.	2015-05-26 11:13:30 -07:00
Takahito Tejima	541aeddd3a	Osd API refactor: EvalStencils and EvalPatches Add EvalStencils and EvalPatches API for most of CPU and GPU evaluators. with this change, Eval API in the osd layer consists of the following parts: - Evaluators (Cpu, Omp, Tbb, Cuda, CL, GLXFB, GLCompute, D3D11Compute) implement EvalStencils and EvalPatches(). Both support derivatives (not fully implemented though) - Interop vertex buffer classes (optional, same as before) Note that these classes are not necessary to use Evaluators. All evaluators have EvalStencils/Patches which take device-specific buffer objects. For example, GLXFBEvaluator can take GLuint directly for both stencil tables and input primvars. Using these interop classes, however, makes it easy to integrate osd into relatively simple applications. - device-dependent StencilTable and PatchTable (optional) These are also optional, but can be used simply as a substitute for Far::StencilTable and Far::PatchTable for osd evaluators. - PatchArray, PatchCoord, PatchParam They are tiny structs used for GPU based patch evaluation. () TODO and known issues: - CLEvaluator and D3D11Evaluator's EvalPatches() have not been implemented. - GPU Gregory patch evaluation has not been implemented in EvalPatches(). - CudaEvaluator::EvalPatches() is very unstable. - All patch evaluation kernels have not been well optimized. - Currently GLXFB kernel doesn't support derivative evaluation. There's a technical difficulty for the multi-stream output.	2015-05-25 22:43:43 -07:00
Takahito Tejima	a7c5179ef9	renaming (2/2) PatchTables and StencilTables to PatchTable and StencilTable replace all occurrences referring to PatchTables, StencilTables, and their factories.	2015-05-22 11:50:01 -07:00
barfowl	ff13665a02	Introduced Far::TopologyLevel methods in place of TopologyRefiner in Osd	2015-05-21 20:20:16 -07:00
Takahito Tejima	2e26f932e1	implements Osd::CpuEvaluator::EvalPatches function. - it takes number and pointer for the input PatchCoords. - add derivative evaluations. - enhance glEvalLimit example to see the derivative evaluation works.	2015-05-21 17:45:13 -07:00
David G Yu	f4a8cafe86	Sync'ed up GLSL and HLSL patch shader source. Cleaned up the Legacy Gregory shader source by accessing buffer data through helper functions. Switched to performing tessellation in untransformed (object) space.	2015-05-21 09:56:25 -07:00
David G. Yu	d1452a02cf	Fixed HLSL shader compile error.	2015-05-20 21:48:52 -07:00
George ElKoura	a7d9d6a9b4	Merge pull request #504 from davidgyu/cleanup Comment cleanups discovered while reviewing Far	2015-05-20 19:40:49 -07:00
David G Yu	c82b3d551d	Merge pull request #502 from barfowl/patch_eval Refactor public patch evaluation methods exposed by Far::PatchTables	2015-05-20 19:39:32 -07:00
Takahito Tejima	a1424ae163	Fix gregory patch tesslevel computation - flipping UV - using limit position in legacy gregory shader.	2015-05-20 18:27:32 -07:00
David G Yu	11e987a7d8	Fixed the misspellings of Ignacio Castano's name.	2015-05-20 17:39:06 -07:00
barfowl	be50309752	Fixed missing qualification of IndexArray	2015-05-20 17:21:02 -07:00
Takahito Tejima	109a3f5383	osd layer: Add GLPatchTable and retire DrawContext In osd layer, we use GLPatchTable (D3D11PatchTable) as a device-specific representation of FarPatchTables instead of DrawContext. GLPatchTable may be used not only for drawing but also for GPU eval APIs (not yet supported though. We may add CudaPatchTable etc as needed). The legacy gregory patch drawing buffers are carved out to the separate class, named GLLegacyGregoryPatchTable. Also face-varying data are split into client side for now, until we add new and more robust face-varying drawing structure (scheduled at 3.1 release) Tentatively replicate PatchArray structure in GLPatchTables. It will be revised in the upcoming change. Shifting hard-coded SRV locations of legacy gregory buffers in HLSL shaders.	2015-05-20 15:55:06 -07:00
barfowl	891772f58b	Refactor public patch evaluation methods exposed by Far::PatchTables: - changes completely deprecate AddWithWeight(T, float, float, float) - added new EvaluateBasis() method to PatchTables - replaced usage of old Evaluate...<T,U>() methods with EvaluateBasis() - removed old Evaluate...<T,U>() methods - removed now unused Interpolate...<T,U>() functions in far/interpolate.h - moved low-level basis code from far/interpolate.* to patchBasis.*	2015-05-20 15:44:29 -07:00
David G. Yu	00ce36433f	Fixed handedness flip in Gregory{Basis} shaders There was an unnecessary double-flip in the shading calculations.	2015-05-20 14:17:31 -07:00
David G Yu	0811046c2f	Merge pull request #497 from jcowles/dev-remove-pragma Remove #pragma once	2015-05-20 12:57:21 -07:00
Takahito Tejima	8d9ae550ff	Add a HLSL version of GregoryBasis patch shader. hlslPatchGregoryBasis.hlsl is an equivalent to glslPatchGregoryBasis. Update dxViewer to be able to switch among bspline, gregorybasis, legacy end capping. also fixes a bug of GLSL legacy gregory shader which had an inconsistent resource naming with example codes. It looks like there's still an issue of D3D11 patchParam data fetching. we'll come back to that bug.	2015-05-20 10:49:45 -07:00
jcowles	392e5e8bed	Remove #pragma once While this may be worth revisiting, we should first quantify the benefits and identify the compilers that support it. Ultimately, we may never use pragma once in favor of strictly using standard C++.	2015-05-20 09:59:18 -07:00
Takahito Tejima	2f90e3799a	Merge pull request #488 from jcowles/dev-size Prefix header guards and add pragma once.	2015-05-19 13:18:18 -07:00
jcowles	9d32712ec4	Prefix header guards and add pragma once.	2015-05-19 11:22:37 -07:00
Takahito Tejima	c3aa00e706	remove SupportsAdaptiveTessellation from OsdDrawContext, and example cleanups As a preparation for retiring DrawContext, move SupportsAdaptiveTessellation method to examples/common/glUtils, which is renamed and namespaced from gl_common.{cpp,h} to be consistent to other files. Same renamings applied to other example files.	2015-05-19 10:30:16 -07:00
jcowles	7b4aa33a1f	Change stencils sizes from unsigned char to int.	2015-05-19 10:16:56 -07:00
David G Yu	8716a8eebf	Fixed example code use of single crease patches Also made a couple minor changes to true up the GLSL patch shaders.	2015-05-15 11:52:41 -07:00
Takahito Tejima	4a4322983f	Osd drawing API refactoring. Remove DrawRegistry from osd layer and put a simple shader caching utility into examples/common. osd layer only provides patch shader snippet and let client configure and compile the code. Clients also maintain the lifetime of shader object, which is preferable for the actual application integration. update all examples to use the new scheme.	2015-05-13 17:35:46 -07:00
David G Yu	fb009f41b2	Removed BOUNDARY and CORNER patch types These are now redundant since all bspline patches are encoded in the patch tables consistently using 16 point indices with boundary and corner edges indicated in the boundary mask of the patch params.	2015-05-13 11:31:02 -07:00
David G Yu	1c91945d10	Added stronger guards for OSD_MAX_VALENCE	2015-05-12 11:15:51 -07:00
Takahito Tejima	917974f7d1	fix typo in the include guard.	2015-05-11 18:07:26 -07:00
Takahito Tejima	b66380ee52	Refactor Osd::DrawContext Since unified shading work already removed subPatch info from Osd::PatchDescriptor, the difference between Far::PatchDescriptor and Osd::PatchDescriptor is just maxValence and numElements. They are used for legacy gregory patch drawing. Both maxValence and numElements are actually constant within a topology (drawContext). This change moves maxValence to DrawContext and lets the client manage numElements, then we can eliminate Osd::PatchDescriptor and simply use Far::PatchDescriptor instead. This is still an intermediate step toward further DrawRegistry refactoring. For the time being, adding EffectDesc struct to include maxValence and numValence to be maintained by the clients. They will be cleaned up later. The side benefit of this change is we no longer need to recompile regular b-spline shaders for the different max-valences.	2015-05-11 18:06:46 -07:00
Takahito Tejima	94d6eb6491	cleanup remaining ptex specific code in osd layer - Remove MeshPtexData bit from Osd::MeshBits. It's not used any more - Rename ptexIndexBuffer in D3D11DrawContext to paramParamBuffer - Remove Is/SetPtexEnabled from D3D11DrawRegistry	2015-05-11 10:59:16 -07:00
Takahito Tejima	33bfbf699b	Refurbish osd layer API. In OpenSubdiv 2.x, we encapsulated subdivision tables into compute context in osd layer since those tables are order-dependent and have to be applied in a certain manner. In 3.0, we adopted stencil table based refinement. It's simpler and such an encapsulation is no longer needed. Also 2.0 API has several ownership issues of GPU kernel caching, and forces unnecessary instantiation of controllers even though the cpu kernels typically don't need instances unlike GPU ones. This change completely revisits osd client facing APIs. All contexts and controllers were replaced with device-specific tables and evaluators. While we can still use consistent API across various device backends, unnecessary complexities have been removed. For example, cpu evaluator is just a set of static functions and also there's no need to replicate FarStencilTables to ComputeContext. Also the new API delegates the ownership of compiled GPU kernels to clients, for the better management of resources especially in multiple GPU environment. In addition to integrating ComputeController and EvalStencilController into a single function Evaluator::EvalStencils(), EvalLimit API is also added into Evaluator. This is working but still in progress, and we'll make a followup change for the complete implementation. -some naming convention changes: GLSLTransformFeedback to GLXFBEvaluator GLSLCompute to GLComputeEvaluator -move LimitLocation struct into examples/glEvalLimit. We're still discussing patch evaluation interface. Basically we'd like to tease all ptex-specific parametrization out of far/osd layer. TODO: -implements EvalPatches() in the right way -derivative evaluation API is still interim. -VertexBufferDescriptor needs a better API to advance its location -synchronization mechanism is not ideal (too global). -OsdMesh class is hacky. need to fix it.	2015-05-08 17:31:26 -07:00
Takahito Tejima	1a37da1c26	osd refactor: split the vertex buffer argument into src and dst Changing all device kernels to take two buffer identifiers for source and destination separately. This change is an intermediate step toward upcoming context/controller refactoring. Previously we had a limitation that the source and destination vertex buffer has to be a single buffer, since the subdivision kernels are iteratively applied by level. With stencil tables, we don't have such a limitation any more, so we may want to apply stencils from a separate source buffer to another. To specify the output location within the destination buffer, we can use VertexBufferDescriptor.offset. This allows us not only to configure an arbitrary batching scheme, but also to relax the limitation that source and destination buffers are in the same interleaved layout. For example, we could include derivatives only in the destination buffer, which doesn't need to be allocated in the source buffer.	2015-05-07 16:11:00 -07:00
Takahito Tejima	8fc0e5db93	Move Ptex code from osd to examples. we're teasing out ptex specific data from core osd entities, so there's no reason to keep ptex texturing utilities in core osd. move them into example libs and let clients assemble shader snippets as needed. Also removing older ptex texturing code (without mipmap)	2015-05-07 13:56:48 -07:00
Takahito Tejima	f3aec9eb31	Remove osd/vertex.h OsdVertex was used in 2.x FarMesh as a placeholder for Hbr template parameter. We don't need such a class any more.	2015-05-07 12:34:09 -07:00
David G Yu	8b4ef28116	Updated handling of patchParam and patchCoord Each patch has a corresponding patchParam. This is a set of three values specifying additional information about the patch: faceId -- topological face identifier (e.g. Ptex FaceId) bitfield -- refinement-level, non-quad, boundary, transition, uv-offset sharpness -- crease sharpness for a single-crease patch These are stored in OsdPatchParamBuffer indexed by the value returned from OsdGetPatchIndex() which is a function of the current PrimitiveID along with an optional client provided offset. Accessors are provided to extract values from a patchParam. These are all named OsdGetPatch(). While drawing patches, the patchParam is condensed into a patchCoord which has four values (u, v, faceLevel, faceId). These patchCoords are treated as int values during per-prim processing but are converted to float values during per-vertex processing where the values are interpolated. Also, cleaned up more of the shader namespace by giving an Osd prefix to public functions, and consolidated boundary and transition handling code into the PatchCommon shader files. The functions determining tessellation levels are now all named OsdGetTessLevel().	2015-05-06 13:47:33 -07:00
Takahito Tejima	dcb022e1db	Fix DX-OpenCL/CUDA interop. - resolves DX-CL interop functions in Osd::ClD3D11VertexBuffer. - enable CL kernels in DX build. - more cleanup in test harnesses, adding D3D11 initializations into DeviceContext. - add new defines OPENSUBDIV_HAS_OPENGL and OPENSUBDIV_HAS_DX for convenience.	2015-04-29 11:51:12 -07:00
Takahito Tejima	82a0513326	cleanup CL/CUDA example harnesses. refactor CL/CUDA specific initialization stuffs into examples/common/clDeviceContext and cudaDeviceContext, and update examples to use those structs. also - remove CL/CUDA tests from osd_regression. The tests for those kernels will be covered by glImaging. - update cuda initialization to use the GL-interoperable device if available. - remove CL specialization from glShareTopology, following the same pattern as we took in the previous OsdGLMesh refactoring. (still something strange with XFB kernels though) - fix file permissions.	2015-04-28 15:46:37 -07:00
Takahito Tejima	de0082ba95	Simplify Osd::MeshInterface. Removed OpenCL/D3D11 specialization and add DEVICE_CONTEXT as a template parameter. The kernels which don't need a context object (e.g. CPU, OpenGL, cuda) just ignore the context, and the kernels which use a context (e.g. OpenCL, DirectX) take a context or a user-defined class which encapsulates device contexts. Note that OpenCL requires two objects, cl_context and cl_command_queue. The user-defined class must provide GetContext() and GetCommandQueue() for strongly typed binding to osd VertexBuffers and ComputeContexts. Osd::Mesh and MeshInterface have been used as a handy harness to host multiple GPU kernels and graphics APIs. However it has CL/DirectX specializations and duplicates a large amount of plumbing code. With this change, glMesh.h and d3d11Mesh.h become just typedefs and all logic is put into mesh.h without specializations. Also cleaned up unused header files and code formatting.	2015-04-27 11:27:05 -07:00
Takahito Tejima	74b78b3d17	Refactor again FarPatchTableFactory. - rename "Regular end cap" to "BSplineBasis end cap" - revert templating and add EndCapType into PatchTablesFactory::Options. - make EndCapFactories internal in PatchTablesFactory. - move end cap stencils into PatchTables, keep them relative to the max level. - add a utility StencilTablesFactory::AppendEndCapStencilTables to splice and factorize endcap stencil tables.	2015-04-23 16:58:56 -07:00
David G Yu	7d9dcf8c03	Added a new screen-space adaptive tessellation metric which computes edge lengths using limit surface points. Made this the default screen-space metric so that we avoid cracks when using Gregory Basis or Regular B-spline end caps. The alternative method which computes edge lengths using the distance between B-spline control points is still available. Added a diagram and comments to explain how the control points and limit points are organized.	2015-04-21 14:17:29 -07:00
Takahito Tejima	43aa2500c4	Refactor far factories. This change moves all gregory patch generation from Far::PatchTablesFactory so that we can construct patch tables without stencil tables as well as client can choose any end patch strategies (we have 3 options for now: legacy 2.x style gregory patch, gregory basis patch and experimental regular patch approximation). Also Far::EndCapGregoryBasisPatchFactory provides index mapping from patch index to vtr face index, which can be used for single gregory patch evaluation on top of refined points, without involving heavier stencil tables generation.	2015-04-20 18:59:07 -07:00
David G. Yu	9f1da077e8	Updated HLSL shaders for unified patch drawing	2015-04-18 14:08:53 -07:00
David G. Yu	cb689194ab	Fixed windows build errors.	2015-04-17 18:29:56 -07:00
Takahito Tejima	c58e63c8d5	Refactor single crease patch to be in unified shader.	2015-04-17 16:26:57 -07:00
David G Yu	336fbb6748	Removed some more obsolete transition patch state Temporarily disabled fractional tessellation for Gregory patches.	2015-04-17 15:24:48 -07:00
David G Yu	759513d2dc	Fixed adaptive tessellation and removed debug code	2015-04-17 12:04:21 -07:00
David G Yu	010a31efc4	Checkpoint changes to unify adaptive patch drawing.	2015-04-17 07:42:53 -07:00
Takahito Tejima	3b748e3b8d	Fixes a gregory basis evaluation bug.	2015-04-10 17:44:33 -07:00
Takahito Tejima	7b37eb8cd6	#version 410 for XFB kernel and then OSX can run the XFB kernel.	2015-04-10 15:42:25 -07:00
Takahito Tejima	1e9ca564b0	remove subroutines from GLSL kernels.	2015-04-10 15:34:04 -07:00
Takahito Tejima	1ec0b2b1f2	add includes	2015-04-10 11:09:14 -07:00
Takahito Tejima	24a435da01	avoid launching stencil kernel and do not allocate device tables if the stencil table is empty.	2015-04-10 11:08:51 -07:00
Takahito Tejima	8da827336d	Removes FarKernelBatch. This is the first step to tease off Osd compute controller/contexts from Far API. Currently FarStencilTable only creates a kernelbatch for the entire range, so we can use [0, numStencils) for all cases instead of KernelBatch. This might not be true if we apply non-factorized level-wise stencils, then we'll add another modular utility to serve those cases.	2015-04-10 11:08:24 -07:00
Takahito Tejima	24f9dc154b	Enable GregoryBasis patch. PatchTablesFactory fills 20 indices topology into patchtable, and use it for eval and draw. note: currently screen-space adaptive tessellation of gregory basis patches is broken and cracks appear around them.	2015-04-08 17:07:10 -07:00
barfowl	9222c9e169	Removed use of "full topology" flag with Far::AdaptiveOptions: - it was always ignored as full topology is required for PatchTables - it will soon be removed from AdaptiveOptions entirely	2015-04-01 14:57:28 -07:00
manuelk	d0a624f373	Add "smooth UVs" to Far - extend Far::PatchTables data structures & interfaces to store requisite information for channels of face-varying bi-cubic patches - implement gather function in Far::PatchTablesFactory to populate face-varying channels with adaptive patches - extend accessor interface in Vtr::Level - propagate code fall-out throughout OpenSubdiv code base, examples & tutorials - extend vtrViewer code to visualize tessellated bi-cubic face-varying patches	2015-02-26 13:57:47 -08:00
manuelk	866b88b7c9	Add support for single-crease patches to D3D draw context & controller - also updated the dxViewer code example note: dxPtexViewer still needs to be updated	2015-02-08 17:31:55 -08:00
manuelk	c646ac2e3c	Refactor & cleanup patch interpolation - move patch interpolation code out of Far::PatchTables into far/interpolate - add bilinear quad interpolation function with derivatives - switch OsdCpuEvalLimitController to far/interpolate - add support for bilinear quad interpolation & clean varying interpolation	2015-01-27 15:38:30 -08:00
Manuel Kraemer	558733298b	Merge pull request #375 from hsdk/gregory_optimizations Minor gregory patch shader optimizations	2015-01-13 16:03:49 -08:00
Manuel Kraemer	2071123148	Merge pull request #378 from hsdk/FixComputeBoundaryBug Fix boundary bug of DX/CL kernels.	2015-01-13 09:55:19 -08:00
Henry Schaefer	eb8abc0da6	Optimize GL/DX compute dispatches.	2015-01-13 10:59:06 +01:00
Henry Schaefer	19aad3cd54	Fixing boundary bug of DX/CL kernels.	2015-01-13 10:40:51 +01:00
Takahito Tejima	d1863eea12	fix batching bug of GLSL/GLSLXFB compute controllers.	2015-01-12 17:03:21 -08:00
Takahito Tejima	0178b09a38	fix boundary bug of GLSL/GLSLXFB kernels.	2015-01-12 17:02:54 -08:00
Henry Schaefer	18440e2f08	Fix GLTransformFeedback crash in computeStencil - changed loop counter names in addWithWeight and readVertex that have overwritten the outer loop counter due to unrolling	2015-01-11 02:55:06 +01:00
Henry Schaefer	1b05cd8ee2	gregory patch shader optimization - reduces instruction count by ~1/5 - improves readability	2015-01-10 21:28:54 +01:00
manuelk	974462e3db	Fix 'single-crease' crash bug in OsdGLDrawContext - unset sharpness indices are -1, which should not be used to look up into the sharpness values vector...	2015-01-09 19:50:12 -08:00
Henry Schaefer	7c73cf0128	dxPtexViewer fixes: - changed ptex layout data types in shaders to match srv format - changed ptex srv type to unorm format for uchar data - fixed hlsl compiler warning: initialized edgeDistance of OutputVertex struct in domain shader even if we are not in wireframe mode - added directx debug device and enabled automatic break points to easily spot dx errors	2015-01-09 02:03:42 +01:00
manuelk	bfbd868fe2	Modify Far::TopologyRefiner refinement options - move level of refinement / isolation into the Options structs - fix splash damage in rest of the code note 1: this is less than ideal, because most compilers accept the previous call to these functions with an incorrect parameter list (ie. passing the level instead of the struct issues no warnings and compiles...) caveat emptor... note 2: the level parameter names may not be final for adaptive modes as we will likely want independent controls over crease vs. extraordinary vertex isolation.	2014-12-30 14:07:24 -08:00
manuelk	7954fbab37	Fix tangents in Osd::EvalLimitController - don't rotate (s,t) coordinates but rotate the patch instead ! - refactor osd/cpuEvalLimitKernels to share Far::PatchTables cubic spline interpolation functions : this replaces tensor product formulation with weight matrices, which does not really impact performance here, but would have to be replaced when implementing regular gridding functions. - fix OsdCpuEvalLimitController to not rotate coordinates and pass the rotation bitfields - expose Far::PatchTables spline interpolation API (protected -> public) - fix glEvalLimit tangent buffers (remove empty padding - see below) - change policy for tangent buffers : the output buffer descriptor is NO LONGER APPLIED to tangent output buffers. Tangent primvar data buffers are no longer applying the offset and stride from the descriptor (because it doesn't make sense to share it). If more flexibility is required, we will consider adding independent descriptors for the tangent buffers. This change will impact existing code that generates tangents with the EvalLimit controller. fixes #370	2014-12-25 13:22:27 -08:00
manuelk	5944ada0f9	Add Options structs to Far::TopologyRefiner refinement methods - fix all splash damage to tutorials / examples...	2014-12-23 10:07:24 -08:00
manuelk	48cf4b6528	Refactor Vtr::Array for better "const" safety Const' declared instances of Vtr::Array do not protect the pointer held privately by the class properly. In order to force the compiler to protect this pointer, we removed all non-const accessors from Vtr::Array (now renamed Vtr::ConstArray) and moved them to a child class (Vtr::Array), which requires const_cast<> operators internally to allow access. The change & renaming is then propagated to all internal dependencies.	2014-12-15 10:23:13 -08:00
manuelk	0ff7b15711	Fix Win build - make sure we don't get conflicting enums (CODE_ERROR) - fix template specialization for Far::TopologyRefinerFactory in regression/common/vtr_utils - fix remaining error reporting code around osd	2014-12-03 20:04:35 -08:00
manuelk	cc6c0925a3	Refactor error reporting system - change error codes from situational to general (fatal / coding / run-time...) - pull error functions from Osd into Far - add a templated topology validation reporting system to Far::TopologyRefinerFactory - fix fallout on rest of code-base	2014-12-03 17:19:07 -08:00
manuelk	538c13f394	Fix Windows build	2014-11-25 14:14:59 -08:00
manuelk	b27b55e4a8	Refactor Far::PatchTables - split Far::PatchDescriptor into its own class (mirrors Far::PatchParam) - hide PatchArray as a private internal structure - add public accessors patterned after Far::TopologyRefiner (returning Vtr::Arrays) - propagate new API to all dependent code note: some direct table accessors have not been removed yet - see code for details	2014-11-25 12:41:19 -08:00
manuelk	9158a430ac	Fix Far::PatchTables control vertex indexing to use 4 CVs to represent the 0-ring - we were allocating 20 CVs for the Gregory basis, but only need 4 (the 20 CVs are stored in the stencil table)	2014-11-12 13:20:55 -08:00
manuelk	e8210886a8	Extend stencil-driven Gregory basis code support to Osd::EvalLimit (cpu)	2014-11-11 17:35:52 -08:00
manuelk	82a53c4278	Fix bug in cpuEvalLimitKernel.cpp apparently an offset to a basis vertex was incorrectly set	2014-10-31 17:06:34 -07:00
David G Yu	1e00056d8a	Improved stability on OSX drivers by not declaring unused shader in/out vars.	2014-10-23 15:59:05 -07:00
David G. Yu	b11d5e22f0	Fixed d3d11 and gl mesh wrappers to be consistent wrt the MeshInterface API.	2014-10-23 13:45:57 -07:00
David G. Yu	6cc24d32fd	Fixed missing namespace closing brace.	2014-10-23 13:43:45 -07:00
David G. Yu	fce3dcca00	Fixed vec array initializer.	2014-10-23 13:41:27 -07:00
manuelk	f4c2f8d9e1	Fix Windows build of Osd OMP CPU kernel note: we #include <vector> instead malloc.h for portability	2014-10-23 12:07:12 -07:00
manuelk	f145f70fd2	Refactor Far::StencilTablesFactory - re-implement the pool allocator - use templates to remove code redundancy between regular & limit stencils - leverage [] operator overloading to simplify stencil factorization - add the ability to treat subdivision levels independently (see below) - refactor Far::TopologyRefiner::Interpolate<>() methods to pass buffers by reference (allows overloading of [] operator) - rename some of the stencil factory options - propagate changes to Osd / examples / tutorials...	2014-10-21 16:36:26 -07:00
manuelk	a16d558a3e	Fix GLSL compute code version definition - remove #version declaration from the kernel code - move it in front of shader sources before compiling to prevent some drivers from throwing errors fixes #360	2014-10-20 18:09:30 -07:00
Takahito Tejima	c0907c7bc1	Introduces 'single-crease patch' : implements "Efficient Evaluation of Semi-Smooth Creases in Catmull-Clark Subdivision Surfaces", Niessner et al, Eurographics 2012. This change includes; -topology identification for single-crease patch during adaptive refinement. -patch array population (similar to boundary) -sharpness buffer generation -glsl shader Eval stuffs will be coming.	2014-10-13 08:52:09 -07:00
manuelk	cea1dd1880	Change FarStencilTables indices from 'unsigned int' to Far::Index type	2014-10-09 16:36:23 -07:00
manuelk	a79832b3a1	Far::PatchTables re-factor & clean-up - remove Descriptor iterators - switch 'unsigned int' indices to typed Far::Index We aren't done yet... but it's a step in the right direction	2014-10-09 14:48:50 -07:00
manuelk	639788abbe	Fix vec array initialization in GLSL & HLSL (should fix some DX compiler problems) Fixes #354	2014-10-04 16:22:43 -07:00
manuelk	d3fa91deaa	Fix incorrect assert in Osd::DrawContext	2014-10-03 10:35:00 -07:00
manuelk	82e7a1bcc5	Minor EvalLimit naming refactor - also added some additional animation to the glEvalLimit code example	2014-09-22 16:01:43 -07:00
manuelk	ad402d3623	Fix inconsistent file access permissions	2014-09-17 13:45:50 -07:00
Nathan Litke	78ae830794	Fixed Osd::Mesh constructors that initialize member variables from user variables.	2014-09-17 13:41:45 -07:00
Manuel Kraemer	8fc5371ec9	Merge pull request #350 from nathan-at-digitalfish/build_android Fix build for Android	2014-09-12 17:27:57 -07:00
manuelk	c66ee724a8	WIP limit stencils factory todo: - fix rotated tangents on patches - add "end-caps" (bilinear or Gregory ?) - add pure bilinear stencil mode	2014-09-12 16:59:16 -07:00
manuelk	e538261f7d	Fix Cuda compile warning in osd/cudaKernel.cu	2014-09-12 10:43:18 -07:00
Julien Demouth	29b2d033e3	Introduces a kernel for NUM_ELEMENTS=4. It's faster and not buggy, anymore.	2014-09-12 10:22:30 -07:00
Julien Demouth	cf0fe60ab4	Add a first pass of optimization in CUDA kernels Conflicts: opensubdiv/osd/cudaKernel.cu	2014-09-12 10:21:46 -07:00
Nathan Litke	d7311801c1	Fix compiler errors for OpenGL-ES.	2014-09-11 16:30:50 -07:00
Takahito Tejima	b3f6f52bb9	Fixes the bug of glShareTopology. This example mutates patchDescriptor in drawcontext, so it needs a write access. Adding another GetPatchArray() returning non-const reference. We should think about better API for this.	2014-09-10 15:24:16 -07:00
manuelk	c399655dcc	Landing 3.0.0.alpha Sync'ing the 'dev' branch with the 'feature_3.0dev' branch at commit 68c6d11fc36761ae1a5e6cdc3457be16f2e9704a The branch 'feature_3.0dev' is now locked and preserved for historical purposes.	2014-09-05 15:07:46 -07:00
Nathan Litke	0128417447	Allow NEON to be disabled on Android. Works similar to GCD on Mac OS X.	2014-08-07 13:10:54 -07:00
Rachid El Guerrab	035e58f44a	Added subdivision kernels for ARM NEON * assembler kernels are based on the C implementation in neonKernel.cpp * enable assembler kernel functions in neonComputeController.cpp with #define USE_ASM_KERNELS 1	2014-08-05 22:09:12 -07:00
Manuel Kraemer	df719eb9ed	Revert "added memcpyasync and streams to cuda backend" This reverts commit `84212cd725`.	2014-08-02 12:40:08 -07:00
Frost	84212cd725	added memcpyasync and streams to cuda backend	2014-07-24 10:18:09 -04:00
Sergey Sharybin	1ebf2449dd	Fix mismatch in declaration and usage of OsdCudaComputeRestrictedVertexA Unused argument `pass` was defined in the CUDA kernel and it was never passed to this function from the C++ code. This argument also wasn't used by the function itself.	2014-07-03 16:54:15 +06:00
Sergey Sharybin	7ac06db165	Fix crashes when using rather low-end cards like Intel ones Solved by checking at run-time whether texture buffer objects are supported. When building with the GLEW library, a compile-time check is not enough, because actual information about existing features is only known at run-time. This only makes it so the CPU backend works; GLSL backends still require some work if we want to make them work. Not sure it's worth doing this now.	2014-07-02 22:17:58 +06:00

549 Commits