; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006954 (gene) of Snake gourd v1 genome

Gene IDTan0006954
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionzf-RVT domain-containing protein
Genome locationLG04:23234228..23237518
RNA-Seq ExpressionTan0006954
SyntenyTan0006954
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4395958.1 hypothetical protein F8388_013127 [Cannabis sativa]5.0e-2733.33Show/hide
Query:  ERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLK--YGTKKPYIFRFEEVWTLQEDCAKIIKEGWNSLNQRGNQKVSNDAKVDSLLTLDG
        ERLDR   ++D++++FP A + HLE +NSDH PL +   ++ L    G +    F FE  W  +E C +++ +                  + SL     
Subjt:  ERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLK--YGTKKPYIFRFEEVWTLQEDCAKIIKEGWNSLNQRGNQKVSNDAKVDSLLTLDG

Query:  NWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSL-LEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLV
        +WNE  +   FH+ED   IL IP       D ++W ++KDG Y VK GY +A  + L     S  DQT  WWK  WN ++P ++K F W++  +++P   
Subjt:  NWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSL-LEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLV

Query:  NLNKRGIKVE
        NL  RG K++
Subjt:  NLNKRGIKVE

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]3.4e-3935.97Show/hide
Query:  MDNFRNIINQCNLSNMGYKGHHFTWYQSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQED
        M  F+  I  C+L  +G++G+ FTW   R G  +++ERLDR + ++D+  LFPF T+ HL    SDHSPL +   ++  K   KK   FRFE +W    +
Subjt:  MDNFRNIINQCNLSNMGYKGHHFTWYQSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQED

Query:  CAKIIKEGWNSLNQRGNQKVSNDAKVDSLLTLD-GNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDS-SSEE
        CA+II   W   +Q   Q +  DAKV  L+  D   WN   + ++F   +A  I  IP  + + PDK +WH++  G ++V+S Y+L S+L + +S +S  
Subjt:  CAKIIKEGWNSLNQRGNQKVSNDAKVDSLLTLD-GNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDS-SSEE

Query:  DQTQRW--------WKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIKVE
          +  W        W  +W  +IP K+K FIW++  + +P   NL KR I VE
Subjt:  DQTQRW--------WKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIKVE

KAG8363091.1 hypothetical protein BUALT_BualtUnG0005200 [Buddleja alternifolia]1.8e-3232.2Show/hide
Query:  MDNFRNIINQCNLSNMGYKGHHFTWYQSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEIN-----LGEKMLKYGTKKPYIFRFEEVW
        + +FR  +    LS +G+ G+ FTW   RD    ++ RLDR   S  +   FP     HL  +++DHSP+ I        + + +   ++P  FRFE  W
Subjt:  MDNFRNIINQCNLSNMGYKGHHFTWYQSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEIN-----LGEKMLKYGTKKPYIFRFEEVW

Query:  TLQEDCAKIIKEGWNSLNQRGNQK---------------------VSN------DAKVDSLLTLDGN-WNEEEIKSMFHEEDAFQILKIPRPKTIGPDKM
           ++C K+I+E WNS ++   Q                      ++N      DA V +L+      WNEE I+++F   DA  IL IP  +   PD +
Subjt:  TLQEDCAKIIKEGWNSLNQRGNQK---------------------VSN------DAKVDSLLTLDGN-WNEEEIKSMFHEEDAFQILKIPRPKTIGPDKM

Query:  IWHYSKDGIYTVKSGYYLASSLLEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPT
        IWHYSK+G+++VKS Y++A SL+  D  S    +   W+ +W   +P+KI+ FIWR+    IP+
Subjt:  IWHYSKDGIYTVKSGYYLASSLLEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPT

RYQ92969.1 hypothetical protein Ahy_B09g099214 [Arachis hypogaea]1.0e-2730.88Show/hide
Query:  MDNFRNIINQCNLSNMGYKGHHFTWYQS-RDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQE
        +D FR  +++  L ++  +G  +TW+ + R+G ++ KER+DR L + ++ + F  AT+S L  ++SDH+PL +N     +K   K+   F+FE  WT   
Subjt:  MDNFRNIINQCNLSNMGYKGHHFTWYQS-RDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQE

Query:  DCAKIIKEGWNS-----------LNQRGN---QKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYL
        DC  IIK GWNS           L +R N   Q++   ++V +    DG WN  E++  F  +   +I++ P       DK  W    DG YT+K+GY++
Subjt:  DCAKIIKEGWNS-----------LNQRGN---QKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYL

Query:  ASSLLEHDSS---SEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIKVEKEKLEEFFIMC
        A      D+S   S  D  +  W+ +W   +P KI+ F+WR  H+ +P            E E  E   ++C
Subjt:  ASSLLEHDSS---SEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIKVEKEKLEEFFIMC

RYR64854.1 hypothetical protein Ahy_A03g010888 [Arachis hypogaea]8.6e-2730.13Show/hide
Query:  FRNIINQCNLSNMGYKGHHFTWY-QSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQEDCA
        FR   +   L ++  KG  FTW+   R+G I+ +ER+DR L + ++  L+  A++  L  ++SDH PL +++ +      T+K   F+FE  WT  EDC 
Subjt:  FRNIINQCNLSNMGYKGHHFTWY-QSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQEDCA

Query:  KIIKEGWNSLNQRGNQKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSL---LEHDSSSEED
          +K+GW   + +G        ++ +       WN+ +I+S F +E   +IL  P       D + W + +DG Y++K+GYY A  +    ++ + S  +
Subjt:  KIIKEGWNSLNQRGNQKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSL---LEHDSSSEED

Query:  QTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKR
          +  W+ +W   +P KI+ F+W+  HD +P   NL KR
Subjt:  QTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKR

TrEMBL top hitse value%identityAlignment
A0A444XT63 CCHC-type domain-containing protein4.9e-2830.88Show/hide
Query:  MDNFRNIINQCNLSNMGYKGHHFTWYQS-RDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQE
        +D FR  +++  L ++  +G  +TW+ + R+G ++ KER+DR L + ++ + F  AT+S L  ++SDH+PL +N     +K   K+   F+FE  WT   
Subjt:  MDNFRNIINQCNLSNMGYKGHHFTWYQS-RDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQE

Query:  DCAKIIKEGWNS-----------LNQRGN---QKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYL
        DC  IIK GWNS           L +R N   Q++   ++V +    DG WN  E++  F  +   +I++ P       DK  W    DG YT+K+GY++
Subjt:  DCAKIIKEGWNS-----------LNQRGN---QKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYL

Query:  ASSLLEHDSS---SEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIKVEKEKLEEFFIMC
        A      D+S   S  D  +  W+ +W   +P KI+ F+WR  H+ +P            E E  E   ++C
Subjt:  ASSLLEHDSS---SEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIKVEKEKLEEFFIMC

A0A445CZL3 Uncharacterized protein1.2e-2626.64Show/hide
Query:  FRNIINQCNLSNMGYKGHHFTWY-QSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQEDCA
        FR  ++   L ++  KG  FT +   R+G+I+ +E++DR L + ++  L+P A++  L  ++SDH PL +N+ +       +K   F+FE  WT  E+C 
Subjt:  FRNIINQCNLSNMGYKGHHFTWY-QSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQEDCA

Query:  KIIKEGWNSLNQRG---------------------------------NQKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIW
         I+++GW   + +G                                 N   +ND +V  L+     W+  +I+S F +E   +IL  P       D + W
Subjt:  KIIKEGWNSLNQRG---------------------------------NQKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIW

Query:  HYSKDGIYTVKSGYYLASSLLE---HDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIKVE---------KEKLEEFFIMC-W-
         + +DG Y++K+GYY A    +   H + S  +  +  W+ +W   +P KI+ F+W+   D +P   NL KR I  +          E +E   ++C W 
Subjt:  HYSKDGIYTVKSGYYLASSLLE---HDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIKVE---------KEKLEEFFIMC-W-

Query:  -GTW
          TW
Subjt:  -GTW

A0A445DNV3 Uncharacterized protein4.2e-2730.13Show/hide
Query:  FRNIINQCNLSNMGYKGHHFTWY-QSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQEDCA
        FR   +   L ++  KG  FTW+   R+G I+ +ER+DR L + ++  L+  A++  L  ++SDH PL +++ +      T+K   F+FE  WT  EDC 
Subjt:  FRNIINQCNLSNMGYKGHHFTWY-QSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQEDCA

Query:  KIIKEGWNSLNQRGNQKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSL---LEHDSSSEED
          +K+GW   + +G        ++ +       WN+ +I+S F +E   +IL  P       D + W + +DG Y++K+GYY A  +    ++ + S  +
Subjt:  KIIKEGWNSLNQRGNQKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSL---LEHDSSSEED

Query:  QTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKR
          +  W+ +W   +P KI+ F+W+  HD +P   NL KR
Subjt:  QTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKR

A0A7J6HMS9 Uncharacterized protein2.4e-2733.33Show/hide
Query:  ERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLK--YGTKKPYIFRFEEVWTLQEDCAKIIKEGWNSLNQRGNQKVSNDAKVDSLLTLDG
        ERLDR   ++D++++FP A + HLE +NSDH PL +   ++ L    G +    F FE  W  +E C +++ +                  + SL     
Subjt:  ERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLK--YGTKKPYIFRFEEVWTLQEDCAKIIKEGWNSLNQRGNQKVSNDAKVDSLLTLDG

Query:  NWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSL-LEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLV
        +WNE  +   FH+ED   IL IP       D ++W ++KDG Y VK GY +A  + L     S  DQT  WWK  WN ++P ++K F W++  +++P   
Subjt:  NWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSL-LEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLV

Query:  NLNKRGIKVE
        NL  RG K++
Subjt:  NLNKRGIKVE

A0A7J9CXV6 Uncharacterized protein7.1e-2729.38Show/hide
Query:  MDNFRNIINQCNLSNMGYKGHHFTWYQSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEIN----LGEKMLKYGTKKPYIFRFEEVWT
        M  FR +   C L ++G+ G  FTW Q R    ++KERLDRG+ S ++  LFP   + HL    SDH PL ++    +G     YG     IFRFE  W 
Subjt:  MDNFRNIINQCNLSNMGYKGHHFTWYQSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEIN----LGEKMLKYGTKKPYIFRFEEVWT

Query:  LQEDCAKIIKEGW-------------------------------NSLNQRG------NQKVSND-----------AKVDSLLTL-DGNWNEEEIKSMFHE
        L      ++K  W                               N LN         NQ   N+           + V+ L+   D  WN E I ++  +
Subjt:  LQEDCAKIIKEGW-------------------------------NSLNQRG------NQKVSND-----------AKVDSLLTL-DGNWNEEEIKSMFHE

Query:  EDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGY------YLASSLLEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIK
        + A +I  IP  K+   D ++W +   G +TV+SGY      YL S+L    +S+ +++ + ++  LW   IP KIK  +WRLF++ +P   NL +R + 
Subjt:  EDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGY------YLASSLLEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRLFHDFIPTLVNLNKRGIK

Query:  VE------KEKLEEFFIMCW
         E      KE+LE    + W
Subjt:  VE------KEKLEEFFIMCW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1)3.7e-0424.82Show/hide
Query:  VDSLLTLDGN-WNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRL
        V+ L+  + N W  + ++++    D   IL I   +T   D   W ++K G YTVKSGY++A  L      + +   Q         S+     F  WR 
Subjt:  VDSLLTLDGN-WNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDSSSEEDQTQRWWKTLWNCSIPNKIKFFIWRL

Query:  FHDFIPTLVNLNKRGIKVEKEKLEEFFIMCWGTWNERNRVI
                     R   + ++ LE F  + W  W  +NR +
Subjt:  FHDFIPTLVNLNKRGIKVEKEKLEEFFIMCWGTWNERNRVI

AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-1033.33Show/hide
Query:  WNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDSSS-----EEDQTQRWWKT-LWNCSIPNKIKFFIWRLFHDFI
        W++ +I     + D   I +I   K+  PDK+IW+Y+  G YTV+SGY+    LL HD S+              KT +WN  I  K+K F+WR     +
Subjt:  WNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDSSS-----EEDQTQRWWKT-LWNCSIPNKIKFFIWRLFHDFI

Query:  PTLVNLNKRGIKVE
         T   L  RG++++
Subjt:  PTLVNLNKRGIKVE

AT4G29090.1 Ribonuclease H-like superfamily protein1.6e-0726.56Show/hide
Query:  VSNDAKVDSLLTLDG-NWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDSSSE---EDQTQRWWKTLWNCSIPN
        VS+  KV  L+   G  W ++ I+ +F E +   I ++        D   W Y+  G YTVKSGY++ + ++   SS +   E      ++ +W      
Subjt:  VSNDAKVDSLLTLDG-NWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDSSSE---EDQTQRWWKTLWNCSIPN

Query:  KIKFFIWRLFHDFIPTLVNLNKRGIKVE
        KI+ F+W+   + +P    L  R +  E
Subjt:  KIKFFIWRLFHDFIPTLVNLNKRGIKVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAACTTCAGGAATATAATCAACCAGTGTAATCTCTCTAATATGGGGTATAAGGGGCACCATTTTACCTGGTACCAATCTAGAGATGGACTAATCAGTATGAAAGA
AAGACTTGATAGAGGTCTATGCTCTACTGACTTTTATGAACTCTTTCCTTTTGCAACAATCTCTCACCTAGAATTCATGAATTCTGACCATAGTCCTTTGGAGATTAATC
TAGGTGAGAAAATGTTGAAATATGGAACTAAGAAGCCCTATATATTCCGTTTCGAGGAAGTTTGGACCCTTCAAGAAGATTGTGCTAAAATTATTAAAGAAGGGTGGAAT
TCGCTGAATCAAAGAGGAAATCAGAAGGTGTCAAATGATGCTAAAGTTGATAGCTTACTGACTCTAGATGGGAATTGGAATGAGGAAGAAATAAAGTCCATGTTTCATGA
GGAGGATGCTTTCCAAATTCTCAAGATCCCAAGACCTAAGACCATCGGTCCAGATAAAATGATTTGGCACTATTCGAAAGATGGGATCTACACGGTCAAATCAGGTTACT
ACCTCGCAAGTTCTCTCCTTGAACATGATTCAAGCTCTGAAGAGGATCAGACTCAGAGATGGTGGAAGACACTTTGGAATTGTTCTATTCCTAATAAGATAAAGTTTTTT
ATTTGGCGCCTGTTCCATGATTTTATTCCTACTTTAGTTAATCTCAATAAGAGAGGAATTAAAGTAGAGAAGGAAAAGCTAGAGGAATTCTTTATAATGTGCTGGGGTAC
ATGGAATGAAAGAAACAGAGTTATAGTTATGAGCAACAAAGGTGATAAGGAGAATAAGTTCGATTGGAACTCATGTCTAGCATACATCCAACAATTCAGAGAATATGTAA
TCATCAACAAAGATCAGGGAAGAAGTACGATTGGGGTTGTTATTATAAATGAGAAAGGGGAAGCGATGCTCACCATGACCAAACAGTTACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATAACTTCAGGAATATAATCAACCAGTGTAATCTCTCTAATATGGGGTATAAGGGGCACCATTTTACCTGGTACCAATCTAGAGATGGACTAATCAGTATGAAAGA
AAGACTTGATAGAGGTCTATGCTCTACTGACTTTTATGAACTCTTTCCTTTTGCAACAATCTCTCACCTAGAATTCATGAATTCTGACCATAGTCCTTTGGAGATTAATC
TAGGTGAGAAAATGTTGAAATATGGAACTAAGAAGCCCTATATATTCCGTTTCGAGGAAGTTTGGACCCTTCAAGAAGATTGTGCTAAAATTATTAAAGAAGGGTGGAAT
TCGCTGAATCAAAGAGGAAATCAGAAGGTGTCAAATGATGCTAAAGTTGATAGCTTACTGACTCTAGATGGGAATTGGAATGAGGAAGAAATAAAGTCCATGTTTCATGA
GGAGGATGCTTTCCAAATTCTCAAGATCCCAAGACCTAAGACCATCGGTCCAGATAAAATGATTTGGCACTATTCGAAAGATGGGATCTACACGGTCAAATCAGGTTACT
ACCTCGCAAGTTCTCTCCTTGAACATGATTCAAGCTCTGAAGAGGATCAGACTCAGAGATGGTGGAAGACACTTTGGAATTGTTCTATTCCTAATAAGATAAAGTTTTTT
ATTTGGCGCCTGTTCCATGATTTTATTCCTACTTTAGTTAATCTCAATAAGAGAGGAATTAAAGTAGAGAAGGAAAAGCTAGAGGAATTCTTTATAATGTGCTGGGGTAC
ATGGAATGAAAGAAACAGAGTTATAGTTATGAGCAACAAAGGTGATAAGGAGAATAAGTTCGATTGGAACTCATGTCTAGCATACATCCAACAATTCAGAGAATATGTAA
TCATCAACAAAGATCAGGGAAGAAGTACGATTGGGGTTGTTATTATAAATGAGAAAGGGGAAGCGATGCTCACCATGACCAAACAGTTACCATGA
Protein sequenceShow/hide protein sequence
MDNFRNIINQCNLSNMGYKGHHFTWYQSRDGLISMKERLDRGLCSTDFYELFPFATISHLEFMNSDHSPLEINLGEKMLKYGTKKPYIFRFEEVWTLQEDCAKIIKEGWN
SLNQRGNQKVSNDAKVDSLLTLDGNWNEEEIKSMFHEEDAFQILKIPRPKTIGPDKMIWHYSKDGIYTVKSGYYLASSLLEHDSSSEEDQTQRWWKTLWNCSIPNKIKFF
IWRLFHDFIPTLVNLNKRGIKVEKEKLEEFFIMCWGTWNERNRVIVMSNKGDKENKFDWNSCLAYIQQFREYVIINKDQGRSTIGVVIINEKGEAMLTMTKQLP