; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026817 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026817
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr10:42252559..42253278
RNA-Seq ExpressionLag0026817
SyntenyLag0026817
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]1.2e-2836.79Show/hide
Query:  FCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLRIVAVTSWAIWGDKNK
        F W+     +P+  NL  RG++    C +C    E+TDH LF C +AK++W++       + +FN+S++D  L L + LS  D  +V V  WAIW D+N 
Subjt:  FCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLRIVAVTSWAIWGDKNK

Query:  KIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSS--LQAGSMLNVASRSQSWSPPPDNAWKINVDAAW--DDLSTGIGAICRNSRGEILGACS
           + ++P   IRS WIL Y+ +F   +        +Q  +  N  +    WSPPP    KINVDAA       TGIG +CRN +G+IL A S
Subjt:  KIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSS--LQAGSMLNVASRSQSWSPPPDNAWKINVDAAW--DDLSTGIGAICRNSRGEILGACS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.8e-2732.71Show/hide
Query:  WKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHV-ILERNFNHSLEDRWLALCDELSMEDL
        W  +WKL VP+K+K F W++    +PT  NL  RG+     C +C    ES  H  F C +A+QIW   F  +  L    N S  + W +L ++L  +DL
Subjt:  WKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHV-ILERNFNHSLEDRWLALCDELSMEDL

Query:  RIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLSTGIGAICRNSRGEI
         + A+T W IW D+N  IH  +V     +  W+  +L    +A+    S+    +  N     Q W P    + K+N DAA    ST  G I R+S   +
Subjt:  RIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLSTGIGAICRNSRGEI

Query:  LGACSKFLDFSLPP
        + A S  + F L P
Subjt:  LGACSKFLDFSLPP

XP_023893701.1 uncharacterized protein LOC112005643 [Quercus suber]1.8e-2635.68Show/hide
Query:  SKVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSME
        +++WK VW+LK+P+KV+ F W+A    LPT +NL+ RG+N+   CP+C  A ES+ H L  C K  ++W       I     N S  D  L + D  S  
Subjt:  SKVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSME

Query:  DLRIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDL--STGIGAICRNS
        DL  + VT+WAIW ++N+ +HEA+  SP+ +  W     S   R  E  K+++        A+    W+ PP + +KINV+ A   +   + IG + R+S
Subjt:  DLRIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDL--STGIGAICRNS

Query:  RGEILGACSKFLD
        RG ++ A SK L+
Subjt:  RGEILGACSKFLD

XP_023913142.1 uncharacterized protein LOC112024740 [Quercus suber]5.4e-2632.52Show/hide
Query:  KVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMED
        + W  +W+L+VPSK+K F W+A    LP+ VNL+ R +   + C +C  +PE+T H ++ CS A+ +W+ +   +    +        +  +  +LS+E+
Subjt:  KVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMED

Query:  LRIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLSTGIGAICRNSRGE
        L I  V SW IW  +N  ++   +  P   ++   +YL E+  A+    + L AGS        Q+W PPP   +K+N DAA  D  +G+GA+ RN+ GE
Subjt:  LRIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLSTGIGAICRNSRGE

Query:  ILGACS
        ++ A S
Subjt:  ILGACS

XP_030943489.1 uncharacterized protein LOC115968280 [Quercus lobata]5.7e-2833.81Show/hide
Query:  VWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDL
        +WK +W LK+PSK++ F W+A    LPT  NL  RG+N+   CP C   PES  H L  C  AK++W+      I   +      D  L + ++ +  DL
Subjt:  VWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDL

Query:  RIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDD--LSTGIGAICRNSRG
         +  V +W+IW ++N+ +HE+    PN    +   YL ++   ++   SS Q  + +     S SW PPP   +KINVD A  +   ++ +G I R+S+G
Subjt:  RIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDD--LSTGIGAICRNSRG

Query:  EILGACSKFL
         I+ AC+ +L
Subjt:  EILGACSKFL

TrEMBL top hitse value%identityAlignment
A0A6J1CTE3 uncharacterized protein LOC1110145785.6e-2936.79Show/hide
Query:  FCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLRIVAVTSWAIWGDKNK
        F W+     +P+  NL  RG++    C +C    E+TDH LF C +AK++W++       + +FN+S++D  L L + LS  D  +V V  WAIW D+N 
Subjt:  FCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLRIVAVTSWAIWGDKNK

Query:  KIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSS--LQAGSMLNVASRSQSWSPPPDNAWKINVDAAW--DDLSTGIGAICRNSRGEILGACS
           + ++P   IRS WIL Y+ +F   +        +Q  +  N  +    WSPPP    KINVDAA       TGIG +CRN +G+IL A S
Subjt:  KIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSS--LQAGSMLNVASRSQSWSPPPDNAWKINVDAAW--DDLSTGIGAICRNSRGEILGACS

A0A6J1DX30 uncharacterized protein LOC1110248741.4e-2732.71Show/hide
Query:  WKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHV-ILERNFNHSLEDRWLALCDELSMEDL
        W  +WKL VP+K+K F W++    +PT  NL  RG+     C +C    ES  H  F C +A+QIW   F  +  L    N S  + W +L ++L  +DL
Subjt:  WKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHV-ILERNFNHSLEDRWLALCDELSMEDL

Query:  RIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLSTGIGAICRNSRGEI
         + A+T W IW D+N  IH  +V     +  W+  +L    +A+    S+    +  N     Q W P    + K+N DAA    ST  G I R+S   +
Subjt:  RIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLSTGIGAICRNSRGEI

Query:  LGACSKFLDFSLPP
        + A S  + F L P
Subjt:  LGACSKFLDFSLPP

A0A7N2RFA1 Uncharacterized protein2.3e-2735.19Show/hide
Query:  SKVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDE----
        SKVW M+WKLKVP+K+K F W+  +  LPT  NL  R +   + C +C    E+  H L+ C  A+ +W+ +    +++R      +   + L +E    
Subjt:  SKVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDE----

Query:  LSMEDLRIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQS---WSPPPDNAWKINVDAA-WDDLS-TGI
        L+ E+L +  V +W IW  +N+  H  ++ SP   ++   + L+EF +A+E+          L +ASR+ +   W PPP++ +K+N DAA + DL  +GI
Subjt:  LSMEDLRIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQS---WSPPPDNAWKINVDAA-WDDLS-TGI

Query:  GAICRNSRGEILGACS
        GAI RN RGE++GA S
Subjt:  GAICRNSRGEILGACS

A0A803NM27 Uncharacterized protein1.2e-2631.91Show/hide
Query:  WKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLR
        WK  W+LK+P KVK F WKA+   LP    L  R       C +C  A ES  H +F+C  A+ +W +       +   +  +ED    + +  +  +L 
Subjt:  WKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLR

Query:  IVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAG-SMLNVASRSQSWSPPPDNAWKINVDAAWDDL--STGIGAICRNSRG
        ++  T W+IW D+N  +H      P++ S    ++LS F  A++    SL AG +  +  +  ++W+PPP N  K+NVDAA+DD     G GAI R+S G
Subjt:  IVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAG-SMLNVASRSQSWSPPPDNAWKINVDAAWDDL--STGIGAICRNSRG

Query:  EILGACSKFLDFSLPPPWLNLGLSRKALILLSLWA
         +  A S  +D    P      +  K L     WA
Subjt:  EILGACSKFLDFSLPPPWLNLGLSRKALILLSLWA

A0A803Q8J4 Uncharacterized protein4.4e-2634.3Show/hide
Query:  WKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLR
        WK  W+LK+P KVK F WKA+   LP    L  +       C +C  A ES  H LFSC  A+ +W +       +   + ++ED    + +  +  +L 
Subjt:  WKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLR

Query:  IVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAG-SMLNVASRSQSWSPPPDNAWKINVDAAWDDL--STGIGAICRNSRG
         +  T W+IW D+N  IH      P + S    N+L+ +   +  ++ SL AG S     S S++WSPPP    K+NVDAA+D+     G GAI R+S G
Subjt:  IVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAG-SMLNVASRSQSWSPPPDNAWKINVDAAWDDL--STGIGAICRNSRG

Query:  EILGACS
         +  A S
Subjt:  EILGACS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-0629.75Show/hide
Query:  KMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLED--RWLALCDELSMEDL
        K VW      K  F  W +    LPT   L+S G      C +C++  ES DH LFSC  A Q+W L F  +   +    S  +   W+      +   L
Subjt:  KMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLED--RWLALCDELSMEDL

Query:  RIVAVTS--WAIWGDKNKKIH
        R V+  +  + IW  +N  +H
Subjt:  RIVAVTS--WAIWGDKNKKIH

AT2G02650.1 Ribonuclease H-like superfamily protein7.2e-1323.72Show/hide
Query:  SKVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFN--HSLEDR-----WLAL
        ++V + +WKL V  K+K F W+ + G L T   L SR ++    C  C +  E+  H +F+C   + +W     ++I+   +    S ED       L+ 
Subjt:  SKVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFN--HSLEDR-----WLAL

Query:  CDELSMEDLRIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSS---LQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLS--
            +  D  +     W +W  +N  + + +  SP+  +R  +   +E+  A E  +++   +    +      S  W+PPP+   K N D+ +   S  
Subjt:  CDELSMEDLRIVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSS---LQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLS--

Query:  TGIGAICRNSRGEIL
        T  G   R   G I+
Subjt:  TGIGAICRNSRGEIL

AT3G09510.1 Ribonuclease H-like superfamily protein1.3e-1428.08Show/hide
Query:  VWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLT----FRHVILERNFNHSLEDRWLALCDELSMEDL
        +W L +  K+K F W+AL   L T   L++RGM I   CP C    ES +H LF+C  A   W L+     R+ ++  +F  ++ +  L    + +M D 
Subjt:  VWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLT----FRHVILERNFNHSLEDRWLALCDELSMEDL

Query:  R--IVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSE-FDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWD--DLSTGIGAICRN
           +     W IW  +N  +      SP   S+ +L+  +E  D     +           +A     W  PP    K N DA +D   L    G I RN
Subjt:  R--IVAVTSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSE-FDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWD--DLSTGIGAICRN

Query:  SRG
          G
Subjt:  SRG

AT3G25270.1 Ribonuclease H-like superfamily protein1.4e-1126.29Show/hide
Query:  VWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLT-FRHVILERNFNHSLEDRW---LALCDELSMEDL
        +WKLK   K+K F WK L G L TG NL  R +  +  C  C    E++ H  F C  A+Q+W  +   H  L R    ++E +    L+ C       L
Subjt:  VWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLT-FRHVILERNFNHSLEDRW---LALCDELSMEDL

Query:  RIVAV-TSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQ-------SWSPPPDNAWKINVDAAWDDLSTG--IG
          +A+   W +W  +N+ + + +  S     +   N + E+    E   + +Q+ +    +SR Q        W  PP    K N D A++  +     G
Subjt:  RIVAV-TSWAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQ-------SWSPPPDNAWKINVDAAWDDLSTG--IG

Query:  AICRNSRGEILGA
         + R+  G  +G+
Subjt:  AICRNSRGEILGA

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-0427.83Show/hide
Query:  SRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFR----HVILERNFNHSLEDRWLALCDELSMEDLRIVAVTS--WAIWGDKNKKIHEAEVPSP-
        S G+ +   C +CS APE+ DH L  CS +K +WS        H ++  N+   L   W+      +   LR +   S  +A W  +N  +H +    P 
Subjt:  SRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFR----HVILERNFNHSLEDRWLALCDELSMEDLRIVAVTS--WAIWGDKNKKIHEAEVPSP-

Query:  ---NIRSRWILNYLS
            I  R I+N ++
Subjt:  ---NIRSRWILNYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAAGTCTGGAAGATGGTCTGGAAGTTAAAAGTCCCTTCGAAAGTGAAATTCTTTTGCTGGAAGGCTCTGAAAGGGTTTCTCCCTACCGGAGTTAATTTATCCAG
TAGAGGGATGAATATCTATAATGGTTGTCCTATGTGTTCTATGGCTCCTGAATCAACGGATCACTGCCTTTTCTCTTGTTCAAAAGCAAAACAGATATGGAGTCTTACCT
TCCGCCATGTTATTCTGGAGAGGAATTTCAACCATAGCCTTGAAGATAGGTGGTTAGCTCTCTGTGACGAACTGTCAATGGAGGATCTCAGAATTGTGGCAGTCACAAGC
TGGGCTATATGGGGAGACAAGAATAAGAAAATTCATGAGGCTGAAGTTCCCTCTCCGAATATTCGTAGCAGATGGATATTAAATTACCTGTCTGAGTTTGATCGAGCTGA
AGAAAGGAGAAAGTCCAGCCTTCAAGCTGGTAGCATGCTGAATGTAGCATCTAGATCTCAGTCCTGGTCGCCTCCTCCTGATAACGCCTGGAAAATTAACGTCGATGCGG
CTTGGGATGATTTGTCTACTGGTATTGGAGCAATTTGCAGGAATAGCAGAGGGGAAATTCTGGGAGCTTGCAGTAAATTTCTTGATTTTTCCCTTCCTCCCCCATGGCTG
AACTTAGGGCTATCAAGGAAGGCGTTGATCTTGCTATCTCTCTGGGCGGAAGCAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTAAAGTCTGGAAGATGGTCTGGAAGTTAAAAGTCCCTTCGAAAGTGAAATTCTTTTGCTGGAAGGCTCTGAAAGGGTTTCTCCCTACCGGAGTTAATTTATCCAG
TAGAGGGATGAATATCTATAATGGTTGTCCTATGTGTTCTATGGCTCCTGAATCAACGGATCACTGCCTTTTCTCTTGTTCAAAAGCAAAACAGATATGGAGTCTTACCT
TCCGCCATGTTATTCTGGAGAGGAATTTCAACCATAGCCTTGAAGATAGGTGGTTAGCTCTCTGTGACGAACTGTCAATGGAGGATCTCAGAATTGTGGCAGTCACAAGC
TGGGCTATATGGGGAGACAAGAATAAGAAAATTCATGAGGCTGAAGTTCCCTCTCCGAATATTCGTAGCAGATGGATATTAAATTACCTGTCTGAGTTTGATCGAGCTGA
AGAAAGGAGAAAGTCCAGCCTTCAAGCTGGTAGCATGCTGAATGTAGCATCTAGATCTCAGTCCTGGTCGCCTCCTCCTGATAACGCCTGGAAAATTAACGTCGATGCGG
CTTGGGATGATTTGTCTACTGGTATTGGAGCAATTTGCAGGAATAGCAGAGGGGAAATTCTGGGAGCTTGCAGTAAATTTCTTGATTTTTCCCTTCCTCCCCCATGGCTG
AACTTAGGGCTATCAAGGAAGGCGTTGATCTTGCTATCTCTCTGGGCGGAAGCAAGTTGA
Protein sequenceShow/hide protein sequence
MSKVWKMVWKLKVPSKVKFFCWKALKGFLPTGVNLSSRGMNIYNGCPMCSMAPESTDHCLFSCSKAKQIWSLTFRHVILERNFNHSLEDRWLALCDELSMEDLRIVAVTS
WAIWGDKNKKIHEAEVPSPNIRSRWILNYLSEFDRAEERRKSSLQAGSMLNVASRSQSWSPPPDNAWKINVDAAWDDLSTGIGAICRNSRGEILGACSKFLDFSLPPPWL
NLGLSRKALILLSLWAEAS