; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011902 (gene) of Snake gourd v1 genome

Gene IDTan0011902
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG11:20781457..20782407
RNA-Seq ExpressionTan0011902
SyntenyTan0011902
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010681662.1 PREDICTED: uncharacterized protein LOC104896592 [Beta vulgaris subsp. vulgaris]2.2e-3731.48Show/hide
Query:  MDSIRRSLKFDGCLTVESKG----KSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLN
        MD++R+ L+F G L V+ +G    + GGL LLWK+E AV +K+FS +HI+  +     + WRFTG+YG  E GNK  TCELL RL  + + PW+  GD N
Subjt:  MDSIRRSLKFDGCLTVESKG----KSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLN

Query:  ECLWEKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKRVLHL-------DWYG-----------SDHRLICAHLNNPVTRST
          +W  EK+GG+        +FR   D  +L DL ++G  +TWTN    +  I +R+  +       D +G           SDH  I   + +   + T
Subjt:  ECLWEKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKRVLHL-------DWYG-----------SDHRLICAHLNNPVTRST

Query:  QPNGHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYK---KKLQEAYEAKDEVDFKKIFDIE
               FRFEE+W +E +CE I+   W+E               S ++  +   L+ W  +    +  +  + +    +L E  +  + +   K  D+ 
Subjt:  QPNGHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYK---KKLQEAYEAKDEVDFKKIFDIE

Query:  RLLDQALTKEEIYLKQRSREERLE
          +D    +EE+Y KQRSR+E L+
Subjt:  RLLDQALTKEEIYLKQRSREERLE

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]7.1e-3632.4Show/hide
Query:  RRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTV-PWLVRGDLNECLWEKE
        + SL +   L V   G  GG+ LLW++   VTL S + NH DC +L++D   W F+ IYG  E  NKK T  L+ RLA+   + PWL+ GDLNE    + 
Subjt:  RRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTV-PWLVRGDLNECLWEKE

Query:  KKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDHRLICAHL-----NNPVTRSTQP
        K  G  R  + ++ FR+T D   L  L  +G+ +TW    +  T + +R                  + HLD++GSDHR++   +     +NPV R    
Subjt:  KKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDHRLICAHL-----NNPVTRSTQP

Query:  NGHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEV--DFKKIFDIERLL
             FRFE+IW ++EEC +I+ NCW          + P      +L +    L+ W  RK  +M  DI++ +K++     A        +++   E++L
Subjt:  NGHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEV--DFKKIFDIERLL

Query:  DQALTKEEIYLKQRSREERLE
        ++ L  EE Y +QRSR E L+
Subjt:  DQALTKEEIYLKQRSREERLE

XP_030486845.1 uncharacterized protein LOC115703751 [Cannabis sativa]1.7e-3733.65Show/hide
Query:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTV-PWLVRGDLNECL
        +   R  LKF   + V   G SGGL  LWK+   V++ ++  N +DC + + D  SW F+G YGA     ++ T ELL++L +   + PWLV GD NE L
Subjt:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTV-PWLVRGDLNECL

Query:  WEKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDHRLICAHLNNPVTRSTQPN
         + +K GG  R  + ++ FR   D   L DL Y G+ +TW N   +++ + +R                  V HLD++GSDHR +   +  P  +S QP 
Subjt:  WEKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDHRLICAHLNNPVTRSTQPN

Query:  GHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKD-EVD-FKKIFDIERLLD
            FRFE+IW QEE+C +I+   W  + T+      P  Q    +      L+ W   K  ++   I   + K+       D  VD F+++   E +LD
Subjt:  GHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKD-EVD-FKKIFDIERLLD

Query:  QALTKEEIYLKQRSR
        + LTKEE Y KQRSR
Subjt:  QALTKEEIYLKQRSR

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa]9.3e-3633.12Show/hide
Query:  LKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTV-PWLVRGDLNECLWEKEKKG
        L F   L V   G  GGL LLW++   VTL S + NH DC +L+ D   W  + IYG  E  NKK T +L++RLA+   + PWL+ GD+NE    + K G
Subjt:  LKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTV-PWLVRGDLNECLWEKEKKG

Query:  GSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQK------------------STPIFKRVLHLDWYGSDHRLICAHLNNPVTRSTQPNGHHAFRF
        G  R  + ++ FR+T D   L ++   G+ +TW    +K                   T  + ++ HLD+YGSDHR++ A+++   T+  Q      FRF
Subjt:  GSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQK------------------STPIFKRVLHLDWYGSDHRLICAHLNNPVTRSTQPNGHHAFRF

Query:  EEIWSQEEECENILHNCW-NEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEA-KDEVDFK-KIFDIERLLDQALTKE
        E++W +++EC  I+ + W + + ++S       TQ   +L      L+ W  RK  +M  DI + ++ +     A   + DF+ KI   E +LD  L  E
Subjt:  EEIWSQEEECENILHNCW-NEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEA-KDEVDFK-KIFDIERLLDQALTKE

Query:  EIYLKQRSREERLE
        E Y +QRSR + L+
Subjt:  EIYLKQRSREERLE

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis]3.2e-3635Show/hide
Query:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLW
        MD+++R L F  C +V+S+G+SGGL LLW ++  V L+SFS  HID +I   D   WRFTG+YG  +   +  T  L+  L+  + +PWLV GDLNE L 
Subjt:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLW

Query:  EKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDHRLICAHLNNPVTRSTQPNG
          EK+GG  R  S +E FR      +L+DLGY G  +TW N    +  IF+R                  V H +   SDH  I    +N       P  
Subjt:  EKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDHRLICAHLNNPVTRSTQPNG

Query:  HHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIE-RLLDQA
           FRFE +W  EE+CE I+   W+E      RG + L       G+    LK W +             KKKL EA    ++  F    D     L QA
Subjt:  HHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIE-RLLDQA

Query:  -------LTKEEIYLKQRSR
               L +EE+  +QRSR
Subjt:  -------LTKEEIYLKQRSR

TrEMBL top hitse value%identityAlignment
A0A803P9R9 Uncharacterized protein3.7e-3833.23Show/hide
Query:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNI-LWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLW
        M+ IR  L FD C  V ++GKSGGL+LLWK+   VT+ SF+ +HID  +       WRFTG YG+ + G +K +  L+ERL +    PW+  GD NE + 
Subjt:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNI-LWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLW

Query:  EKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFK-----------------RVLHLDWYGSDHR-LICAHLNNPVTRSTQPNG
        EKEKKGG  + +S +  F+        K++   G  +TW N  Q +    K                 +V  L W+ SDHR LI    N   +  +    
Subjt:  EKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFK-----------------RVLHLDWYGSDHR-LICAHLNNPVTRSTQPNG

Query:  HHAFRFEEIWSQEEECENILHNCW----NEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLL
           F +E+ W+ EEEC  I++N W    N    + IRG          +    E+L  W + KK E+ +     K++L+    +  E+D+     +E+ L
Subjt:  HHAFRFEEIWSQEEECENILHNCW----NEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLL

Query:  DQALTKEEIYLKQRSR
        + A  KEE+  KQRSR
Subjt:  DQALTKEEIYLKQRSR

A0A803PCN1 Uncharacterized protein2.4e-3733.23Show/hide
Query:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNI-LWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLW
        M+ IR    FD C  V +KGKSGGL+LLWK+   +T+ SF+ +HID  +       WRFTG YG+ + G +K +  L+ERL +    PW+  GD NE + 
Subjt:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNI-LWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLW

Query:  EKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFK-----------------RVLHLDWYGSDHR-LICAHLNNPVTRSTQPNG
        EKEKKGG  + +S +  F+        K++   G  +TW N  Q +    K                 +V  L W+ SDHR LI    N   T  +    
Subjt:  EKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFK-----------------RVLHLDWYGSDHR-LICAHLNNPVTRSTQPNG

Query:  HHAFRFEEIWSQEEECENILHNCW----NEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLL
           F +E+ W+ EEEC  I++N W    N    + IRG          +    E+L  W + KK E+ +     KK+L     +  E +++    +E+ L
Subjt:  HHAFRFEEIWSQEEECENILHNCW----NEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLL

Query:  DQALTKEEIYLKQRSR
        + A  KEE+  KQRSR
Subjt:  DQALTKEEIYLKQRSR

A0A803PHH5 Uncharacterized protein9.1e-3733.12Show/hide
Query:  DSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLWE
        +S+R SL F GC  VE+KGKSGGL LLW  +   ++ SFS  HID  I  ++ + WRFTG YG  +   +  +  LL R+A   T PW++ GD NE L +
Subjt:  DSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQD-KSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLWE

Query:  KEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDH---RLICAHLNNPVTRSTQP
        KEKKGG+ +    +  FR   D   L+++ + GNM+TW N  Q++  IF+R                  V+HL+   SDH    L C        +  + 
Subjt:  KEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDH---RLICAHLNNPVTRSTQP

Query:  NGHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLLDQ
          H  F FE  W +E+ C  +++  W    T +        Q    L +   +L  W + +K+EM   I  Y+ K+       D   +  + +IE+  + 
Subjt:  NGHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLLDQ

Query:  ALTKEEIYLKQRSR
         L KEE + KQRSR
Subjt:  ALTKEEIYLKQRSR

A0A803PMD0 Uncharacterized protein5.3e-3734.62Show/hide
Query:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNI-LWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLW
        M+ IR  L F+GC  V +KGKSGGL+LLWK    V +KSF+ +HID  +       WRFTG YG+ + G +K +  L+ERL +     W+  GD NE + 
Subjt:  MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNI-LWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLW

Query:  EKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFK-----------------RVLHLDWYGSDHR-LICAHLNNPVTRSTQPNG
          EKKGG  +    +  FR        K++   G  +TW N  Q S    K                 +V  L W+ SDHR LI    +N     T P  
Subjt:  EKEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFK-----------------RVLHLDWYGSDHR-LICAHLNNPVTRSTQPNG

Query:  HHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLLDQAL
           F +E+ W++EEEC  I+   W +    +    Q L +    + R  E+LK W + KK E+ +     K++L+    +  E+D+K    IER L+ A 
Subjt:  HHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLLDQAL

Query:  TKEEIYLKQRSR
         K+EI  KQRSR
Subjt:  TKEEIYLKQRSR

A0A803PRV5 Uncharacterized protein3.1e-3734.19Show/hide
Query:  DSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNI-LWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLWE
        +++R  L F GC TVE++GKSGGL LLW  +    + S+S  HID  I +   + WRFTG YG  +   +  + +LL+RL    T PW+V GD NE L +
Subjt:  DSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNI-LWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLWE

Query:  KEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDHRLICAHLNNPVTRSTQPNGH
        KEK GG  + +  +  FR   D  QL+D+GY GN YTW N  +K+  IF+R                  V HLD   SDH  +    ++P  +       
Subjt:  KEKKGGSSRLSSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKR------------------VLHLDWYGSDHRLICAHLNNPVTRSTQPNGH

Query:  HA-FRFEEIWSQEEECENILHNCW-NEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLLDQA
           F FE  W+++EEC +I+ + W       S +G          LG+    L+ W + K++EM  ++  Y+ K+     + +  D++ + D+ER  +  
Subjt:  HA-FRFEEIWSQEEECENILHNCW-NEAPTESIRGHQPLTQFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLLDQA

Query:  LTKEEIYLKQRSR
        L KEE + KQRSR
Subjt:  LTKEEIYLKQRSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCATTAGGAGGAGTCTCAAATTTGATGGTTGTTTAACTGTTGAAAGTAAAGGAAAGAGTGGGGGCCTAAGCTTACTTTGGAAGAACGAATGTGCAGTAACACT
TAAATCCTTCTCCTTTAATCATATAGATTGTAATATCCTTTGGCAAGATAAAAGTTGGCGTTTTACTGGCATTTATGGGGCCCTAGAAGGAGGGAACAAGAAAATCACTT
GTGAACTCCTTGAAAGACTGGCGGAGGAAGATACTGTCCCTTGGCTGGTGAGAGGTGACCTGAACGAATGTTTATGGGAAAAGGAGAAAAAAGGTGGATCGTCCAGACTT
TCTTCCAATGTGGAATTATTCAGATCCACATTTGACACTCTACAATTGAAGGATTTGGGCTACATTGGTAATATGTACACATGGACAAACAAATGGCAGAAAAGTACACC
TATCTTCAAAAGAGTGCTCCATCTCGATTGGTATGGTTCGGATCATAGGCTAATTTGTGCGCATCTGAATAACCCCGTGACCAGAAGTACACAACCCAATGGTCATCATG
CATTTAGATTCGAGGAAATATGGTCTCAAGAGGAGGAATGCGAGAATATATTGCATAACTGCTGGAATGAGGCTCCAACTGAATCTATCAGAGGCCATCAGCCACTGACT
CAATTTTCAGTCGCACTAGGAAGAAGCAGGGAGCTTTTGAAACCATGGGGCCAAAGGAAAAAGAGGGAAATGTTCTCAGATATTAATTATTACAAGAAAAAACTACAGGA
GGCGTATGAGGCTAAGGATGAGGTGGACTTTAAGAAGATCTTTGATATTGAAAGGCTTCTAGACCAGGCGTTGACTAAGGAAGAAATCTACTTGAAGCAGAGGTCCAGGG
AAGAACGGCTAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCATTAGGAGGAGTCTCAAATTTGATGGTTGTTTAACTGTTGAAAGTAAAGGAAAGAGTGGGGGCCTAAGCTTACTTTGGAAGAACGAATGTGCAGTAACACT
TAAATCCTTCTCCTTTAATCATATAGATTGTAATATCCTTTGGCAAGATAAAAGTTGGCGTTTTACTGGCATTTATGGGGCCCTAGAAGGAGGGAACAAGAAAATCACTT
GTGAACTCCTTGAAAGACTGGCGGAGGAAGATACTGTCCCTTGGCTGGTGAGAGGTGACCTGAACGAATGTTTATGGGAAAAGGAGAAAAAAGGTGGATCGTCCAGACTT
TCTTCCAATGTGGAATTATTCAGATCCACATTTGACACTCTACAATTGAAGGATTTGGGCTACATTGGTAATATGTACACATGGACAAACAAATGGCAGAAAAGTACACC
TATCTTCAAAAGAGTGCTCCATCTCGATTGGTATGGTTCGGATCATAGGCTAATTTGTGCGCATCTGAATAACCCCGTGACCAGAAGTACACAACCCAATGGTCATCATG
CATTTAGATTCGAGGAAATATGGTCTCAAGAGGAGGAATGCGAGAATATATTGCATAACTGCTGGAATGAGGCTCCAACTGAATCTATCAGAGGCCATCAGCCACTGACT
CAATTTTCAGTCGCACTAGGAAGAAGCAGGGAGCTTTTGAAACCATGGGGCCAAAGGAAAAAGAGGGAAATGTTCTCAGATATTAATTATTACAAGAAAAAACTACAGGA
GGCGTATGAGGCTAAGGATGAGGTGGACTTTAAGAAGATCTTTGATATTGAAAGGCTTCTAGACCAGGCGTTGACTAAGGAAGAAATCTACTTGAAGCAGAGGTCCAGGG
AAGAACGGCTAGAATGA
Protein sequenceShow/hide protein sequence
MDSIRRSLKFDGCLTVESKGKSGGLSLLWKNECAVTLKSFSFNHIDCNILWQDKSWRFTGIYGALEGGNKKITCELLERLAEEDTVPWLVRGDLNECLWEKEKKGGSSRL
SSNVELFRSTFDTLQLKDLGYIGNMYTWTNKWQKSTPIFKRVLHLDWYGSDHRLICAHLNNPVTRSTQPNGHHAFRFEEIWSQEEECENILHNCWNEAPTESIRGHQPLT
QFSVALGRSRELLKPWGQRKKREMFSDINYYKKKLQEAYEAKDEVDFKKIFDIERLLDQALTKEEIYLKQRSREERLE