
[WIP] Export CTC decoding algorithm to sherpa #1093

Merged
merged 7 commits into k2-fsa:master on Nov 9, 2022

Conversation

@pkufool (Collaborator) commented Nov 6, 2022

See k2-fsa/sherpa#177 for our plans.

  • Move https://github.com/k2-fsa/k2/blob/master/k2/csrc/torch_api.h to https://github.com/k2-fsa/k2/tree/master/k2/torch/csrc
  • Remove the dependency kaldifeat_core from k2_torch. We can make the binaries depend on kaldifeat_core directly.
  • Export the library k2_torch in
    set(K2_LIBRARIES k2_torch_api k2_log k2context k2fsa)
  • Add the following functions to torch_api.h
    • A function to load HLG.pt. The returned type can be std::shared_ptr<k2::FsaClass>. We can define an alias FsaClassPtr for it, like RaggedShapePtr
    • A function to wrap k2::CtcTopo(). It should also return a value of type FsaClassPtr.
    • A function for CTC/HLG decoding. It takes the following inputs:
      - log_softmax_out: a 3-D tensor of shape (N, T, C)
      - log_softmax_out_lens: a 1-D tensor of shape (N,)
      - FsaClassPtr: can be either a CtcTopo or an HLG
      and it returns std::vector<std::vector<int32_t>> (see the sketch after this list)
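Below is a minimal header sketch of what these additions to torch_api.h could look like. It is only an illustration: FsaClassPtr is the alias proposed above, while the function names LoadFsa, GetCtcTopo, and Decode (and their exact signatures) are assumptions, not the final API.

// Hypothetical sketch of the proposed additions; names and signatures are
// illustrative and may differ from what finally lands in torch_api.h.
#include <memory>
#include <string>
#include <vector>

#include "torch/torch.h"

namespace k2 {

class FsaClass;  // defined in k2/torch/csrc
using FsaClassPtr = std::shared_ptr<FsaClass>;  // analogous to RaggedShapePtr

// Load a decoding graph such as HLG.pt from disk.
FsaClassPtr LoadFsa(const std::string &filename);

// Wrap k2::CtcTopo(): build a CTC topology covering tokens 0..max_token_id.
FsaClassPtr GetCtcTopo(int32_t max_token_id);

// CTC/HLG decoding.
//   log_softmax_out:      3-D tensor of shape (N, T, C)
//   log_softmax_out_lens: 1-D tensor of shape (N,)
//   decoding_graph:       either a CtcTopo or an HLG
// Returns one sequence of IDs per utterance.
std::vector<std::vector<int32_t>> Decode(torch::Tensor log_softmax_out,
                                         torch::Tensor log_softmax_out_lens,
                                         FsaClassPtr decoding_graph);

}  // namespace k2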

@pkufool changed the title from "Export CTC decoding algorithm to sherpa" to "[WIP] Export CTC decoding algorithm to sherpa" on Nov 6, 2022
@@ -65,3 +87,21 @@ if(K2_ENABLE_TESTS)
k2_add_torch_test(${source})
endforeach()
endif()

file(MAKE_DIRECTORY
  DESTINATION
  ${PROJECT_BINARY_DIR}/include/k2
)

Collaborator:

Suggested change (remove the DESTINATION line):

file(MAKE_DIRECTORY
  ${PROJECT_BINARY_DIR}/include/k2
)

install(TARGETS k2_torch_api

Collaborator:

Suggested change (also install the k2_torch target):

-install(TARGETS k2_torch_api
+install(TARGETS k2_torch_api k2_torch

#----------------------------------------
# CTC decoding
#----------------------------------------
-set(ctc_decode_srcs ctc_decode.cu)
+set(ctc_decode_srcs ctc_decode.cu ${feature_srcs})
Collaborator:

It will recompile feature_srcs for each binary. Shall we make it a library that can be shared?

Collaborator Author:

OK, I think so; let's change it back to the original.

@csukuangfj (Collaborator)

I suggest that we create a new PR to bind the exposed APIs to Python and provide python APIs and examples to decode models trained using CTC loss from various frameworks, such as icefall, nemo, espnet, speechbrain, wenet, etc.

@pkufool (Collaborator Author) commented Nov 8, 2022

> I suggest that we create a new PR to bind the exposed APIs to Python and provide python APIs and examples to decode models trained using CTC loss from various frameworks, such as icefall, nemo, espnet, speechbrain, wenet, etc.

I think we MUST create a new PR, because the binding and demo code will be in sherpa, not k2.

@csukuangfj (Collaborator) commented Nov 8, 2022

k2 only requires log_softmax_out + TLG for decoding, so it is easier to use.
People can just import k2 and provide the required inputs for decoding and they may not want to install sherpa.

@pkufool (Collaborator Author) commented Nov 8, 2022

> k2 only requires log_softmax_out + TLG for decoding, so it is easier to use. People can just import k2 and provide the required inputs for decoding and they may not want to install sherpa.

OK, I see your point. To do that, the functions to be wrapped are in k2/torch/*, not in torch_api.h. That is not related to this PR; I will do it in another change, thanks!

int32_t min_activate_states,
int32_t max_activate_states,
int32_t subsampling_factor) {
FsaClassPtr GetLattice(torch::Tensor log_softmax_out,
Collaborator:

I suggest that we provide only a single method Decode() to give the results directly.

In the current approach, users have to call BestPath on the returned lattice and that is the only function that users can call for the returned lattice.

That is an implementation detail and we can hide it from the users.

Collaborator Author:

> I suggest that we provide only a single method Decode() to give the results directly.
>
> In the current approach, users have to call BestPath on the returned lattice and that is the only function that users can call for the returned lattice.
>
> That is an implementation detail and we can hide it from the users.

I thought we could use this lattice to do rescoring; we can implement one_best_decoding and ngram_rescoring in sherpa.

@pkufool (Collaborator Author) Nov 8, 2022

I think there is no need to hide the lattice.
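To make the trade-off concrete, here is a rough caller-side sketch of the workflow pkufool describes. It is only an assumption-laden illustration: the exact signatures of GetLattice and BestPath are not visible in this excerpt, and NgramRescore is a purely hypothetical name for the sherpa-side rescoring pass mentioned above.

// Caller-side sketch only. The GetLattice/BestPath calls are simplified
// (their full parameter lists are not shown in this PR excerpt); NgramRescore
// is a hypothetical placeholder for ngram_rescoring implemented in sherpa.
#include <vector>

#include "k2/torch/csrc/torch_api.h"  // proposed location after this PR
#include "torch/torch.h"

// Hypothetical helper that would live in sherpa, not in k2.
k2::FsaClassPtr NgramRescore(k2::FsaClassPtr lattice);

std::vector<std::vector<int32_t>> DecodeBatch(
    torch::Tensor log_softmax_out,       // (N, T, C)
    torch::Tensor log_softmax_out_lens,  // (N,)
    k2::FsaClassPtr decoding_graph,      // CtcTopo or HLG
    bool use_ngram_rescoring) {
  // Returning the lattice (rather than hiding it behind a single Decode())
  // lets the caller choose the post-processing step.
  k2::FsaClassPtr lattice =
      k2::GetLattice(log_softmax_out, log_softmax_out_lens, decoding_graph);
  if (use_ngram_rescoring) {
    lattice = NgramRescore(lattice);
  }
  // One-best decoding over the (possibly rescored) lattice.
  return k2::BestPath(lattice);
}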

@pkufool merged commit e552812 into k2-fsa:master Nov 9, 2022