Commit Graph

  • 33432e5dd7 Add 1 main maxjhandsome 2025-01-24 09:52:25 +0800
  • 9d966660b6
    Merge pull request #116 from ziliangzl/inline-asm zhoujingya 2024-05-20 09:43:55 +0800
  • c168c442ce
    Merge pull request #118 from THU-DSP-LAB/checkInstr zhoujingya 2024-05-18 14:17:34 +0800
  • 60f388930d [VENTUS][fix]Assign initial value for VastartStoreFrameIndex VastartStoreFrameIndex havn't initial value, caused issue THU-DSP-LAB/llvm-project#117 ziliangzl 2024-05-11 14:47:19 +0800
  • 451062314a
    Merge pull request #119 from ziliangzl/workgroup ziliangzl 2024-05-15 11:36:10 +0800
  • b465e58817 [VENTUS][fix] Add a switch to the C extension compress_instruction_disassemble qinfan 2024-05-14 17:26:24 +0800
  • 6a8e4d4667 [VENTUS][#119]Fix workgroup function barrier scope ziliangzl 2024-05-14 14:03:20 +0800
  • d977b0bf8b [VENTUS][#119]Complete workgroup function implementation Implement work_group_reduce_<op> functions in wgreduce.cl . Implement work_group_scan_inclusive_<op> work_group_scan_exclusive_<op> functions in wgscan.cl . Passed corresponding OPENCL-CTS tests. ziliangzl 2024-05-14 13:36:09 +0800
  • 408ed74df2 [VENTUS][fix] Modify the disassembly result of a compress instruction error qinfan 2024-05-13 15:57:55 +0800
  • 4789f2096b [VENTUS][#119]Add work_group_broadcast implementation Passed corresponding OPENCL-CTS test. ziliangzl 2024-05-13 11:22:56 +0800
  • d63caa094a [VENTUS][#119]Fix __wg_scratch multiple define 1.Global variable shouldn't define in header. 2.Fix code format. ziliangzl 2024-05-13 11:18:03 +0800
  • 6b70120436 [VENTUS][libclc][feat]Start workgroup function implementation 1.Implement barrier and work_group_barrier function with intrinsics. 2.Implement work_group_all and work_group_any function,passed corresponding OPENCL-CTS test. ziliangzl 2024-05-11 16:38:52 +0800
  • 3267496a27 [VENTUS][NFC]Add inline-asm testcase Add an inline-asm testcase for constraint r Add an inline-asm testcase for branch instruction ziliangzl 2024-05-07 15:33:42 +0800
  • 5816925f3f Merge branch 'compile-guide' into terapines-dev terapines-dev ziliangzl 2024-05-06 14:39:57 +0800
  • d59d0805ff [VENTUS][fix] Do not build workitem functions to static library workitem zhoujing 2024-04-30 16:14:41 +0800
  • 8db464ae81 [VENTUS][fix] Add function sections for workitem functions zhoujing 2024-04-30 14:13:51 +0800
  • 279251d31d
    Merge pull request #114 from ziliangzl/compile-guide ziliangzl 2024-04-29 16:15:33 +0800
  • 5ff2eddbad [VENTUS][NFC]Add compile guide in README 1. Explained how to generate ELF file step-by-step. 2.Add compile assembly code to object file example. ziliangzl 2024-04-26 16:38:57 +0800
  • 42a893c3be
    Merge pull request #113 from THU-DSP-LAB/workaround ziliangzl 2024-04-26 16:12:15 +0800
  • 573ae5e8ee [VENTUS][workaround] Fix flw/fsw assembly errors workaround zhoujing 2024-04-26 15:14:22 +0800
  • 45856f653d
    Merge pull request #112 from ziliangzl/Pseudo-FLW/FSW zhoujingya 2024-04-26 13:28:05 +0800
  • 8be3150696 [#112][fix]Remove flw/fsw InstAlias 1.Removed flw/fsw InstAlias for now,cause flw/fsw could not match correctly. 2.Modified kernel_arg testcase. ziliangzl 2024-04-26 10:59:35 +0800
  • 4354b039f3 [VENTUS][fix]Fix FLW/FSW instruction coding conflict Replace FLW/FSW instruction with PseudoFLW/PseudoFSW ziliangzl 2024-04-25 10:44:57 +0800
  • 968f0c07cd
    Merge pull request #110 from ziliangzl/divergent-analysis zhoujingya 2024-04-23 14:22:12 +0800
  • 47a0946abc [VENTUS][fix]Fix kernel divergent analysis 1.Kernel function argument is not divergent 2.GPRF32 is not divergent 3.Set PHINode is divergent 4.Add flw/fsw instruction for GPRF32 5.Add a floating add kernel function test case ziliangzl 2024-04-16 14:55:49 +0800
  • 1347c06d50
    Merge pull request #109 from ziliangzl/divergent-analyse zhoujingya 2024-04-22 15:58:52 +0800
  • d138bdacf6 [VENTUS][fix]Fix kernel divergent analysis ziliangzl 2024-04-16 14:55:49 +0800
  • 4d31e996d7
    Merge pull request #108 from THU-DSP-LAB/mv_after_join zhoujingya 2024-04-03 13:06:32 +0800
  • b0e72840f8 Merge branch 'mv_after_join' into terapines-dev qinfan 2024-03-29 16:18:22 +0800
  • f781479b52 [VENTUS][RISCV] Fix move instructions after JOIN move forward bug qinfan 2024-03-29 16:15:12 +0800
  • 2fda4b01d8 Merge remote-tracking branch 'origin/main' into terapines-dev zhoujing 2024-03-11 09:33:27 +0800
  • 078bb26e2e
    Merge pull request #106 from THU-DSP-LAB/patch zhoujingya 2024-03-11 09:28:55 +0800
  • 797c85d829 [patch] Add a fix patch from terapines_dev branch zhoujing 2024-03-08 18:23:47 +0800
  • 1a36d94fd0
    Merge pull request #105 from THU-DSP-LAB/test-case-fix zhoujingya 2024-03-07 11:06:12 +0800
  • 9b11eb8feb [NFC][fix] Fix test cases failure zhoujing 2024-03-07 10:57:30 +0800
  • efef613b61
    Merge pull request #83 from THU-DSP-LAB/34_local_addressed_variables_into_stack zhoujingya 2024-03-06 09:19:25 +0800
  • 45a884a824
    Merge pull request #104 from THU-DSP-LAB/fix-insert-vmv zhoujingya 2024-03-05 17:06:42 +0800
  • 87fe5f3ce8 [VENTUS][fix] Put local variables declared in kernel function into shared memory zhoujing 2024-03-05 16:32:59 +0800
  • a909be0434 [VENTUS][fix] Fix insert vmv instruction bug when vmv instruction is in JOIN MBB insert-vmv-instruction zhoujing 2024-03-05 15:26:43 +0800
  • a48f51ab76
    Merge pull request #101 from THU-DSP-LAB/resource_manage zhoujingya 2024-03-05 14:16:57 +0800
  • 292c639510
    Merge pull request #103 from THU-DSP-LAB/102-code-refactoring zhoujingya 2024-03-04 23:01:27 +0800
  • 1a04073d12 [VENTUS][NFC] Remove dead code and refactor code Jules-Kong 2024-03-04 18:51:34 +0800
  • c42c00f67e [VENTUS][fix] Modified the resource statistics interface qinfan 2024-03-04 13:04:43 +0800
  • adb1bcd308 [VENTUS][fix] Modified the resource statistics interface qinfan 2024-03-04 14:19:29 +0800
  • ed93af0195
    Merge pull request #99 from THU-DSP-LAB/fix-runtime-error Jules-Kong 2024-02-23 15:04:33 +0800
  • 18f624b661 [VENTUS][fix] Add files to install package && add VENTUS_INSTALL_PREFIX enviroments variable Add VENTUS_INSTALL_PREFIX for runtime use(pocl) zhoujing 2024-02-23 10:52:28 +0800
  • 0392670a46 [VENTUS][fix] Add files to install package && add VENTUS_INSTALL_PREFIX enviroments variable Add VENTUS_INSTALL_PREFIX for runtime use(pocl) zhoujing 2024-02-18 16:14:22 +0800
  • f2517d8bc4 [VENTUS] Solve conflict with 8099fdef1c qinfan 2024-02-05 12:05:52 +0800
  • 2cc69bc468 Merge remote-tracking branch 'origin/main' into terapines-dev zhoujing 2024-02-01 15:38:29 +0800
  • b32b529523
    Merge pull request #98 from THU-DSP-LAB/56_workitem_function_fix zhoujingya 2024-02-01 15:05:25 +0800
  • 6cac00d141 [NFC] comment fix 56_workitem_function_fix zhoujing 2024-02-01 15:03:53 +0800
  • 2d8a9b9c1b [VENTUS] Solve conflict with 56_workitem_function_fix qinfan 2024-02-01 14:58:41 +0800
  • 4b25812260 [VENTUS] Fix some comments qinfan 2024-01-31 15:44:16 +0800
  • 3bd573e3b3 [VENTUS][fix] Remove codes and fix wrong register error in workitem.s zhoujing 2024-01-31 15:27:21 +0800
  • 03759b1bed [VENTUS][fix] Fix get_local_id builtin function implementation zhoujing 2024-01-31 14:14:44 +0800
  • efd82b9d86 [#56][fix] Fix the implementation of get_local_linear_id zhoujingya 2023-11-10 09:26:41 +0800
  • e04c1a6ec7 [#56][fix] Fix workitem function(enqueued_local_size & local_linear_id) bugs in libclc Support get_enqueued_local_size function and fix the calculation of get_local_linear_id zhoujingya 2023-11-06 16:07:27 +0800
  • 49c039a902
    Merge pull request #89 from THU-DSP-LAB/eliminate_call_frame zhoujingya 2024-02-01 14:54:42 +0800
  • 6b17accc5f
    Merge pull request #70 from THU-DSP-LAB/resource_manage zhoujingya 2024-02-01 13:17:45 +0800
  • 965f8c1fb6
    Merge branch 'main' into eliminate_call_frame eliminate_call_frame zhoujingya 2024-02-01 13:15:03 +0800
  • a3fc326ce4
    Merge pull request #53 from THU-DSP-LAB/libclc-precision zhoujingya 2024-01-31 19:19:54 +0800
  • aaf1c41a21 [VENTUS][fix] Fix clzl function implementation in floatdidf zhoujing 2024-01-31 16:12:58 +0800
  • dfd2affa51 [VENTUS][fix] Fix float precision issue in libclc for ventus zhoujingya 2023-10-30 10:59:59 +0800
  • 9c7f45b2b1 Merge branch '56_workitem_function_fix' into terapines-dev qinfan 2024-01-31 15:50:47 +0800
  • 3243d9a5b1 [VENTUS] Fix some comments qinfan 2024-01-31 15:44:16 +0800
  • 827903cdd4 [VENTUS][fix] Remove codes and fix wrong register error in workitem.s zhoujing 2024-01-31 15:27:21 +0800
  • 5edf4db0ee [VENTUS][fix] Fix get_local_id builtin function implementation zhoujing 2024-01-31 14:14:44 +0800
  • 7fd690a3c4 [#56][fix] Fix the implementation of get_local_linear_id zhoujingya 2023-11-10 09:26:41 +0800
  • fcae0cdc6b [#56][fix] Fix workitem function(enqueued_local_size & local_linear_id) bugs in libclc Support get_enqueued_local_size function and fix the calculation of get_local_linear_id zhoujingya 2023-11-06 16:07:27 +0800
  • 91b9888732
    Merge pull request #66 from THU-DSP-LAB/39_parameter_types zhoujingya 2024-01-24 11:46:59 +0800
  • 0b7be4b4a5
    Merge branch 'main' into 39_parameter_types zhoujingya 2024-01-24 11:43:08 +0800
  • a87bae445c
    Merge pull request #49 from THU-DSP-LAB/instructions-remove zhoujingya 2024-01-24 09:41:33 +0800
  • 3085032da8
    Merge pull request #51 from THU-DSP-LAB/regexti wangqinfan 2024-01-23 13:21:26 +0800
  • 5a4e7b3c6c [VENTUS][fix] Fix test file regexti qinfan 2024-01-23 10:25:46 +0800
  • 71caf2361b [VENTUS][fix] Fix register extension qinfan 2024-01-22 15:28:36 +0800
  • 7adec4402a [VENTUS][fix] Support the regexti instruction zhoujingya 2023-10-27 15:57:45 +0800
  • b793a55a42
    Merge pull request #95 from THU-DSP-LAB/clamp zhoujingya 2024-01-22 22:27:44 +0800
  • 8099fdef1c
    Merge pull request #96 from THU-DSP-LAB/mad_hi zhoujingya 2024-01-22 22:27:23 +0800
  • 5fb8d44245 Merge branch 'mad_hi' into terapines-dev qinfan 2024-01-22 16:57:55 +0800
  • eb8de4e634 [VENTUS][fix] Add mul_hi function qinfan 2024-01-22 16:55:53 +0800
  • 9c9120fad5 Merge branch 'clamp' into terapines-dev qinfan 2024-01-22 16:45:31 +0800
  • 9181e1a435 [VENTUS][fix] Fix clamp function qinfan 2024-01-22 16:44:35 +0800
  • 87c5b8751f Merge branch 'regexti' into terapines-dev qinfan 2024-01-22 15:30:40 +0800
  • 80d51133b4 [VENTUS][fix] Fix register extension qinfan 2024-01-22 15:28:36 +0800
  • cca98d2b4a Merge branch 'ctz' into terapines-dev qinfan 2024-01-19 14:15:10 +0800
  • 5b0ffc2cfa
    Merge pull request #88 from THU-DSP-LAB/ctz Jules-Kong 2024-01-15 14:37:15 +0800
  • 7e8e66058c [VENTUS][fix] Add ctz function implementation zhoujingya 2023-12-15 17:09:06 +0800
  • 6028fdc2f8 Merge branch 'ctz' into terapines-dev qinfan 2023-12-27 13:17:06 +0800
  • a3a061e46c [VENTUS][fix] Modify the implementation of ctz to builtin qinfan 2023-12-27 11:45:24 +0800
  • 857555ce0f Merge branch 'ctz' into terapines-dev qinfan 2023-12-27 11:49:15 +0800
  • 5523bb49b5 [VENTUS][fix] Modify the implementation of ctz to builtin qinfan 2023-12-27 11:45:24 +0800
  • 747049464b Merge branch 'branch_instruction' into terapines-dev qinfan 2023-12-26 16:45:48 +0800
  • 715fd8dead [VENTUS][fix] Add phi-node instruction to divergence for non-kernel function branch_instruction qinfan 2023-12-26 16:38:50 +0800
  • 83a92a8be1 Merge branch 'regexti' into terapines-dev qinfan 2023-12-25 16:16:29 +0800
  • 6d92b790df Merge branch 'eliminate_call_frame' into terapines-dev qinfan 2023-12-25 14:41:21 +0800
  • 93c99240db [VENTUS][fix] Fix the calculation of stack size qinfan 2023-12-25 13:26:25 +0800
  • 6be8ad9c24 [VENTUS][docs] Add test file qinfan 2023-12-22 17:51:05 +0800
  • 1b0d9edaf0 [VENTUS][fix] Support the regexti instruction zhoujingya 2023-10-27 15:57:45 +0800
  • d809d3a2bd [VENTUS][fix] Fix the Offset of private variable offset on stack qinfan 2023-12-22 14:13:04 +0800
  • 447c152448 [VENTUS][fix] Add ctz function implementation zhoujingya 2023-12-15 17:09:06 +0800