CuGenDBv2

Spg016992 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg016992
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold9:45562904..45570070
RNA-Seq Expression: Spg016992
Synteny: Spg016992
Gene Ontology terms: GO:0090304 - nucleic acid metabolic process (biological process)
    GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
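
The genome location above is written as `scaffold:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the span can be parsed and its length computed as follows. `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    length = end - start + 1
    return scaffold, start, end, length

print(parse_location("scaffold9:45562904..45570070"))
```

Under that assumption the gene spans 7167 bp on scaffold9; with a 0-based, half-open convention the length would be one less.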


Homology
GenBank top hits | e value | % identity | Alignment
BBN67784.1 VIRB2-interacting protein 2 [Prunus dulcis] | 3.3e-25 | 32.06
Query:  LSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLD
        L LS+ FGF R+L + + T++  LL L+E     TSR D R+W  +PS  F+C SF   + N     +   ++ +WK K P KV+ F+WQ +  ++NT D
Subjt:  LSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLD

Query:  RLSRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRG
         L R+ P L + P  C  C MA + ++H+L  C F+  +W+         +    G  E+    +      ++ + LW + + A++W LW ERN R F  
Subjt:  RLSRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRG

Query:  FERDSSEVL
        ++  S  V+
Subjt:  FERDSSEVL

RVX17758.1 putative ribonuclease H protein [Vitis vinifera] | 3.7e-24 | 33.48
Query:  GCSLSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVN
        G SL LS++F F R+LTD +   L  L+S +     S S  D R+W+ + S  FS +SFF+ L   S+P +      LW  KVP KV+   W V H +VN
Subjt:  GCSLSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVN

Query:  TLDRLSRKIPC-LIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRT
        T D+L  + P   + P  CI C+   E +DH+   C     +W   F   G+ +   R L +M+           + + LWQ     +IW +W ERNNR 
Subjt:  TLDRLSRKIPC-LIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRT

Query:  FRGFERDSSEVLSLVRYNTDI
        F    R    V  L+R+ + +
Subjt:  FRGFERDSSEVLSLVRYNTDI

TYK09969.1 calpain-type cysteine protease DEK1 [Cucumis melo var. makuwa] | 6.1e-35 | 42.02
Query:  SLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRL
        S S+SFGF R+L+DR+T+++++L+SL+E  +F   RRD  +W+P P  GF C+SFF CL+NS+PT  S+ S++W++KVP+K   F WQV           
Subjt:  SLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRL

Query:  SRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWG
                           EEDLDH+LW C     VWD F  +FGL +ARHR +R  +EEFL + P  E+  FLW A + A + G  G
Subjt:  SRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWG

TYK29954.1 reverse transcriptase [Cucumis melo var. makuwa] | 1.5e-25 | 48.41
Query:  RDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVV
        R+  +W+ NPS GFS +S F  LL+ SPT   +F  +W++KV KKV+FF WQV+  R N +DRL R+        CCI CR A+EDLDH+LW C +AR V
Subjt:  RDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVV

Query:  WDMFFVSFGLQFARHRGLREMIEEFL
        W  F   F ++ A  R +R+ IEEFL
Subjt:  WDMFFVSFGLQFARHRGLREMIEEFL

VVA33204.1 PREDICTED: ribonuclease H [Prunus dulcis] | 2.2e-24 | 33.5
Query:  LSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRL
        LS+ FGF R+L + + T+   LL L+E     TSR D R W  +PS  F+C S    + N     + S ++ +WK K P KV+ F+WQ +  ++NT D L
Subjt:  LSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRL

Query:  SRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRGFE
         R+ P L I P  C  C  A E +DH+L RC F+  +W+         +    G  E+    +      ++ + LW + + A++W LW ERN R F  ++
Subjt:  SRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRGFE

TrEMBL top hits | e value | % identity | Alignment
A0A5D3CI74 Calpain-type cysteine protease DEK1 | 2.9e-35 | 42.02
Query:  SLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRL
        S S+SFGF R+L+DR+T+++++L+SL+E  +F   RRD  +W+P P  GF C+SFF CL+NS+PT  S+ S++W++KVP+K   F WQV           
Subjt:  SLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRL

Query:  SRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWG
                           EEDLDH+LW C     VWD F  +FGL +ARHR +R  +EEFL + P  E+  FLW A + A + G  G
Subjt:  SRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWG

A0A5D3E255 Reverse transcriptase | 7.2e-26 | 48.41
Query:  RDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVV
        R+  +W+ NPS GFS +S F  LL+ SPT   +F  +W++KV KKV+FF WQV+  R N +DRL R+        CCI CR A+EDLDH+LW C +AR V
Subjt:  RDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVV

Query:  WDMFFVSFGLQFARHRGLREMIEEFL
        W  F   F ++ A  R +R+ IEEFL
Subjt:  WDMFFVSFGLQFARHRGLREMIEEFL

A0A5H2XKI7 VIRB2-interacting protein 2 | 1.6e-25 | 32.06
Query:  LSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLD
        L LS+ FGF R+L + + T++  LL L+E     TSR D R+W  +PS  F+C SF   + N     +   ++ +WK K P KV+ F+WQ +  ++NT D
Subjt:  LSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLD

Query:  RLSRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRG
         L R+ P L + P  C  C MA + ++H+L  C F+  +W+         +    G  E+    +      ++ + LW + + A++W LW ERN R F  
Subjt:  RLSRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRG

Query:  FERDSSEVL
        ++  S  V+
Subjt:  FERDSSEVL

M5XUF8 Reverse transcriptase domain-containing protein (Fragment) | 6.1e-25 | 33.83
Query:  SLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDR
        SLS+ FGF R L + + T+   LL L+E     TSR D R W  +PS  F+C SF   + N     + S ++ +WK K P KV+ F+WQ +  ++NT D 
Subjt:  SLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDR

Query:  LSRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRGF
        L R+ P L I P  C  C  A E +DH+L  C F+  +W+         +    G  E+    +      ++ + LW + + A++W LW ERN R F  +
Subjt:  LSRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRGF

Query:  E
        +
Subjt:  E

M5XUF8 Reverse transcriptase domain-containing protein (Fragment) | 6.4e-14 | 53.62
Query:  WVTGTYGPLRPKGRQEFWNELGDLFGLCGNNWCILEDFNVTRSPLEKASGGRVSKSMKFFNEWIEGCSL
        W++G YG   P+ R+ FW EL  LFGLCGN WCI  DFNV R   EK++GGR++ SMK FN++I+  +L
Subjt:  WVTGTYGPLRPKGRQEFWNELGDLFGLCGNNWCILEDFNVTRSPLEKASGGRVSKSMKFFNEWIEGCSL

M5XUF8 Reverse transcriptase domain-containing protein (Fragment) | 2.3e-24 | 32.18
Query:  LSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLD
        L LS+ FGF R+L + + T++  LL L+E     TSR D R+W  +PS  F+C SF   + N     +   ++ +WK K P KV+ F+WQ +  ++NT D
Subjt:  LSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFSTSRRDFRLWNPNPSMGFSCRSFFHCLLN-SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLD

Query:  RLSRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRG
         L R+ P L + P  C  C MA + ++H+L  C F+  +W+         +    G  E+    +      ++ + LW + + A++W LW ERN R F  
Subjt:  RLSRKIPCL-IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRG

Query:  FE
        ++
Subjt:  FE

SwissProt top hits | e value | % identity | Alignment
P0C2F6 Putative ribonuclease H protein At1g65750 | 8.0e-06 | 27.27
Query:  TSRRDFRLWNPNPSMGFSCRSFFHCLLNSS---PTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRC
        T  RD   W  +    FS RS +  L       P ++S F+ LWKV+VP++V+ F+W V ++ V T +   R+   L     C  C+   E + H+L  C
Subjt:  TSRRDFRLWNPNPSMGFSCRSFFHCLLNSS---PTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRC

Query:  SFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTF
             +W         Q    + L E + + L      E     W      IIW  W  R    F
Subjt:  SFARVVWDMFFVSFGLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTF

Arabidopsis top hits | e value | % identity | Alignment
AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 2.0e-04 | 29.91
Query:  VNTLDRLSRKIPCL-----IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHR---GLREMIEEFLPHPPFREQRRFLWQAGICAIIWG
        ++ LDRL  +   +     I P CC+ C  + E  DH++  C F+  +W+M      LQ  + R    L   I+      P    R+ + QA +CAI   
Subjt:  VNTLDRLSRKIPCL-----IGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGLQFARHR---GLREMIEEFLPHPPFREQRRFLWQAGICAIIWG

Query:  LWGERNN
         W +RNN
Subjt:  LWGERNN

AT4G29090.1 Ribonuclease H-like superfamily protein | 7.2e-10 | 25.33
Query:  SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGL----QFARHRGLREM
        S P+++ I+  +WK +   K+Q F+W+ +   +     L+ +   L     CI C   +E ++H+L++C+FAR+ W +  +   L      + +  L  +
Subjt:  SSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSFGL----QFARHRGLREM

Query:  IEEFLPHPPFREQRRFL-WQAGICAIIWGLWGERNNRTFRGFERDSSEVL
              +P + +  + + W      ++W LW  RN   FRG E ++ EVL
Subjt:  IEEFLPHPPFREQRRFL-WQAGICAIIWGLWGERNNRTFRGFERDSSEVL
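
In the alignment blocks above, the middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space otherwise. As a rough sketch of how those marks could be tallied for a single block, the hypothetical helper `count_matches` below counts identities and positives. It is an illustration only; the % identity reported on this page is computed over the whole alignment, which may differ from a per-block count.

```python
def count_matches(query, midline):
    """Count identities and positives in one alignment block.

    query   -- the residues from a 'Query:' line (label stripped)
    midline -- the match line beneath it (leading padding stripped)
    """
    # Trailing spaces are often lost in copied text, so pad to the query length.
    midline = midline.ljust(len(query))
    identities = sum(1 for mark in midline[:len(query)] if mark.isalpha())
    positives = identities + midline[:len(query)].count("+")
    return identities, positives, len(query)

# Last block of the TYK29954.1 alignment shown above.
query = "WDMFFVSFGLQFARHRGLREMIEEFL"
midline = "W  F   F ++ A  R +R+ IEEFL"
identities, positives, length = count_matches(query, midline)
print(identities, positives, length)
```

Gap columns (`-` in the query) are counted in the block length here; whether they belong in the denominator depends on the identity definition in use.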


Sequences
CDS sequence
ATGGCAGTGGGTCTCTGTGGAAGCGATAAGGAAGCTAAAGATTTGATTCGGATGGGCTCTGTTGCGATTTCCGATTTTTTCTCAGTCCACGAGTGGGATTTTAAGGTGAT
AGCTGAAAGAAAATGCCAAAATTTCAACGGTGGATGGATTACGGCCCTGGATGTTCCACCATTTTTTAGGACCACACAGATGATCTCCAACCTTGCAGGTATGTGCGAGG
GATTAGAGGAAGAAGAAGAGATTTTCGATGAGAATTGCTGGACCGATGAAATATGCTTCAAAGTCAAAGGGAATGATAGTGGTTTCATTCCCACGTCGTGCTACGTTAAT
CACGGGAAGACTTACTTTCCGGTGAGGTTCGTGATCGGAAAAGGGACGGGTTTCGAGCAGGCCGGCGGGGGTTTGGTAAGTCCAAGGCAATCCGGCAAAATCGTACCGCA
GACGCCAGTTGCAGCAGATATGGCAGAGAAGAGGATCCCTTCTCCGAGAAGTCACGTGAAATCTGGAGAAGGGAGAGTTGGGCCTGGGCATGGGTCTAAAAATGGGCTGC
GGGCTGATAATCCAATTTTTAACTTAGAAGACAATGGTATTATTAATTCTGAAGCCCAGAGTGTGGTTTGTAACAAGCTGAATAAGGAGGGAGATGGGCCGGCGAAGATA
AAAGATGGGCAGGGGCCTGACCTTTCCCAGAAGATGGTCTCTTCGACAGAATCTCTTGGGCTCAACTTGTATGACTTAGAGTGGGACGGTGTAATTAACCAAGCGGAGGA
TTTCAGTATTGAGGAAGAGGCGGCATCTGATCCATCAGTCGTGAGTTACTCAGAGGACGAGTCATTCTTATCGACTCCGGCAGCCAAGGAAATAGAATCCATCAACCTGA
ATGCGATGTTTTCAGAGGAGAAATCGCCGACTCAACTTGCCAGCCCGGTACAGAAGGAAGACAATCAGATAATGCCTACTGAAGATTTGCAGAGGGATTCTTTGGCTTTG
ATGGTAGGGGTGCCGCAAGATGTCTCGACTTTAGGGGATCCGACTCAACCGCTGATCAAGGTAAATTCTGACGGGATTCCTATCCAGGTACGCTCTTCTTCTTTGGATCC
TTCACCTTCTAATTCGATTGAGAGGGGAGGGTGGATGTGTGCTGGGGGGTTGTCTTATCCATACCCTTATTCTGTAGCCTTACCGTTGGGAGAAGGAAAGGGAGAGTTCT
TGGGCTTTAATCATAACCTCGGGCATCCTGTTTTTTTGAATACTTTCCAACCTTCTGGAAGTGCCTTTTCCTACCCGTCTGCTTATTCCATAGGAGAGAGAATGCCGCCA
TTTGCTTCCTTTGCTGGTGTCCCGGGTAGTGGAATTGGGGGAGGTGTTCAACCTTTTCCAAATGGACTTGTGAACAATATTCCTTTACCTATTTATAGCCTCTCGGACCT
TGAAGGCCTATCTGGAAATCATGGTTTTTGGCAACCGGGGTGGGCTGCATTAGCGTCGATTGGGATAAGAGAAGTCCCGGTTGATCAAACCCCCACTAATCACCTGATTG
ATGATAAAGATGGTAAAACCCCTAACAATAAGACAGGAGACTCAAAAAAAGTTGATAGGGAATTGAGAAGATTGGCGTCATCTGTGAACTATGACAGGAGAAAGACCTCT
AAGGAGGGCAAGGACTTAGGGTGGGTCACCGGTACCTACGGCCCTCTAAGGCCCAAAGGTAGACAAGAGTTTTGGAATGAGCTCGGTGATCTTTTTGGGTTATGTGGGAA
CAACTGGTGCATTTTGGAGGATTTTAATGTCACTCGTTCCCCTTTAGAGAAGGCTTCGGGAGGGAGGGTGTCTAAATCGATGAAGTTTTTCAATGAGTGGATTGAAGGCT
GTAGTCTCTCTCTATCCTACTCTTTTGGCTTCGCGCGGTCTTTAACCGATAGAGATACTACTGACCTTCTATCTCTTCTGTCCTTGATTGAGGAGACCACCTTCAGTACT
TCGAGGAGGGATTTTCGTTTGTGGAACCCCAACCCCTCCATGGGCTTCTCTTGTCGGTCCTTCTTCCATTGCTTACTAAATTCTTCTCCCACCGTGTCGTCCATTTTTTC
CATGTTATGGAAGGTGAAGGTCCCAAAAAAGGTGCAGTTTTTCATCTGGCAGGTTATCCATAGAAGAGTTAATACTCTTGACCGGCTCTCCAGAAAGATCCCTTGCTTGA
TCGGGCCTTTTTGTTGCATTACCTGTCGGATGGCGGAGGAAGACCTCGACCATATTTTGTGGAGGTGCAGTTTTGCTAGGGTTGTGTGGGACATGTTTTTTGTCTCGTTT
GGTTTGCAGTTTGCTAGGCACAGGGGCCTTAGAGAGATGATCGAGGAGTTCCTCCCCCATCCCCCTTTTCGTGAGCAGAGGAGATTCTTGTGGCAAGCAGGGATTTGCGC
TATTATTTGGGGGCTTTGGGGCGAGAGAAATAATAGGACGTTTAGAGGGTTTGAGAGGGACTCGTCTGAGGTCTTGTCCCTAGTGAGATATAATACAGATATTGGAGAAA
AGGTAAATGTTGGTAATGGAAAATCTGAACCTTCAAATGTGGAAGCCAAGTTGCCACAGCAGAAGTTGGAAGATTGGGGTGTTGGATAA
mRNA sequence
ATGGCAGTGGGTCTCTGTGGAAGCGATAAGGAAGCTAAAGATTTGATTCGGATGGGCTCTGTTGCGATTTCCGATTTTTTCTCAGTCCACGAGTGGGATTTTAAGGTGAT
AGCTGAAAGAAAATGCCAAAATTTCAACGGTGGATGGATTACGGCCCTGGATGTTCCACCATTTTTTAGGACCACACAGATGATCTCCAACCTTGCAGGTATGTGCGAGG
GATTAGAGGAAGAAGAAGAGATTTTCGATGAGAATTGCTGGACCGATGAAATATGCTTCAAAGTCAAAGGGAATGATAGTGGTTTCATTCCCACGTCGTGCTACGTTAAT
CACGGGAAGACTTACTTTCCGGTGAGGTTCGTGATCGGAAAAGGGACGGGTTTCGAGCAGGCCGGCGGGGGTTTGGTAAGTCCAAGGCAATCCGGCAAAATCGTACCGCA
GACGCCAGTTGCAGCAGATATGGCAGAGAAGAGGATCCCTTCTCCGAGAAGTCACGTGAAATCTGGAGAAGGGAGAGTTGGGCCTGGGCATGGGTCTAAAAATGGGCTGC
GGGCTGATAATCCAATTTTTAACTTAGAAGACAATGGTATTATTAATTCTGAAGCCCAGAGTGTGGTTTGTAACAAGCTGAATAAGGAGGGAGATGGGCCGGCGAAGATA
AAAGATGGGCAGGGGCCTGACCTTTCCCAGAAGATGGTCTCTTCGACAGAATCTCTTGGGCTCAACTTGTATGACTTAGAGTGGGACGGTGTAATTAACCAAGCGGAGGA
TTTCAGTATTGAGGAAGAGGCGGCATCTGATCCATCAGTCGTGAGTTACTCAGAGGACGAGTCATTCTTATCGACTCCGGCAGCCAAGGAAATAGAATCCATCAACCTGA
ATGCGATGTTTTCAGAGGAGAAATCGCCGACTCAACTTGCCAGCCCGGTACAGAAGGAAGACAATCAGATAATGCCTACTGAAGATTTGCAGAGGGATTCTTTGGCTTTG
ATGGTAGGGGTGCCGCAAGATGTCTCGACTTTAGGGGATCCGACTCAACCGCTGATCAAGGTAAATTCTGACGGGATTCCTATCCAGGTACGCTCTTCTTCTTTGGATCC
TTCACCTTCTAATTCGATTGAGAGGGGAGGGTGGATGTGTGCTGGGGGGTTGTCTTATCCATACCCTTATTCTGTAGCCTTACCGTTGGGAGAAGGAAAGGGAGAGTTCT
TGGGCTTTAATCATAACCTCGGGCATCCTGTTTTTTTGAATACTTTCCAACCTTCTGGAAGTGCCTTTTCCTACCCGTCTGCTTATTCCATAGGAGAGAGAATGCCGCCA
TTTGCTTCCTTTGCTGGTGTCCCGGGTAGTGGAATTGGGGGAGGTGTTCAACCTTTTCCAAATGGACTTGTGAACAATATTCCTTTACCTATTTATAGCCTCTCGGACCT
TGAAGGCCTATCTGGAAATCATGGTTTTTGGCAACCGGGGTGGGCTGCATTAGCGTCGATTGGGATAAGAGAAGTCCCGGTTGATCAAACCCCCACTAATCACCTGATTG
ATGATAAAGATGGTAAAACCCCTAACAATAAGACAGGAGACTCAAAAAAAGTTGATAGGGAATTGAGAAGATTGGCGTCATCTGTGAACTATGACAGGAGAAAGACCTCT
AAGGAGGGCAAGGACTTAGGGTGGGTCACCGGTACCTACGGCCCTCTAAGGCCCAAAGGTAGACAAGAGTTTTGGAATGAGCTCGGTGATCTTTTTGGGTTATGTGGGAA
CAACTGGTGCATTTTGGAGGATTTTAATGTCACTCGTTCCCCTTTAGAGAAGGCTTCGGGAGGGAGGGTGTCTAAATCGATGAAGTTTTTCAATGAGTGGATTGAAGGCT
GTAGTCTCTCTCTATCCTACTCTTTTGGCTTCGCGCGGTCTTTAACCGATAGAGATACTACTGACCTTCTATCTCTTCTGTCCTTGATTGAGGAGACCACCTTCAGTACT
TCGAGGAGGGATTTTCGTTTGTGGAACCCCAACCCCTCCATGGGCTTCTCTTGTCGGTCCTTCTTCCATTGCTTACTAAATTCTTCTCCCACCGTGTCGTCCATTTTTTC
CATGTTATGGAAGGTGAAGGTCCCAAAAAAGGTGCAGTTTTTCATCTGGCAGGTTATCCATAGAAGAGTTAATACTCTTGACCGGCTCTCCAGAAAGATCCCTTGCTTGA
TCGGGCCTTTTTGTTGCATTACCTGTCGGATGGCGGAGGAAGACCTCGACCATATTTTGTGGAGGTGCAGTTTTGCTAGGGTTGTGTGGGACATGTTTTTTGTCTCGTTT
GGTTTGCAGTTTGCTAGGCACAGGGGCCTTAGAGAGATGATCGAGGAGTTCCTCCCCCATCCCCCTTTTCGTGAGCAGAGGAGATTCTTGTGGCAAGCAGGGATTTGCGC
TATTATTTGGGGGCTTTGGGGCGAGAGAAATAATAGGACGTTTAGAGGGTTTGAGAGGGACTCGTCTGAGGTCTTGTCCCTAGTGAGATATAATACAGATATTGGAGAAA
AGGTAAATGTTGGTAATGGAAAATCTGAACCTTCAAATGTGGAAGCCAAGTTGCCACAGCAGAAGTTGGAAGATTGGGGTGTTGGATAA
Protein sequence
MAVGLCGSDKEAKDLIRMGSVAISDFFSVHEWDFKVIAERKCQNFNGGWITALDVPPFFRTTQMISNLAGMCEGLEEEEEIFDENCWTDEICFKVKGNDSGFIPTSCYVN
HGKTYFPVRFVIGKGTGFEQAGGGLVSPRQSGKIVPQTPVAADMAEKRIPSPRSHVKSGEGRVGPGHGSKNGLRADNPIFNLEDNGIINSEAQSVVCNKLNKEGDGPAKI
KDGQGPDLSQKMVSSTESLGLNLYDLEWDGVINQAEDFSIEEEAASDPSVVSYSEDESFLSTPAAKEIESINLNAMFSEEKSPTQLASPVQKEDNQIMPTEDLQRDSLAL
MVGVPQDVSTLGDPTQPLIKVNSDGIPIQVRSSSLDPSPSNSIERGGWMCAGGLSYPYPYSVALPLGEGKGEFLGFNHNLGHPVFLNTFQPSGSAFSYPSAYSIGERMPP
FASFAGVPGSGIGGGVQPFPNGLVNNIPLPIYSLSDLEGLSGNHGFWQPGWAALASIGIREVPVDQTPTNHLIDDKDGKTPNNKTGDSKKVDRELRRLASSVNYDRRKTS
KEGKDLGWVTGTYGPLRPKGRQEFWNELGDLFGLCGNNWCILEDFNVTRSPLEKASGGRVSKSMKFFNEWIEGCSLSLSYSFGFARSLTDRDTTDLLSLLSLIEETTFST
SRRDFRLWNPNPSMGFSCRSFFHCLLNSSPTVSSIFSMLWKVKVPKKVQFFIWQVIHRRVNTLDRLSRKIPCLIGPFCCITCRMAEEDLDHILWRCSFARVVWDMFFVSF
GLQFARHRGLREMIEEFLPHPPFREQRRFLWQAGICAIIWGLWGERNNRTFRGFERDSSEVLSLVRYNTDIGEKVNVGNGKSEPSNVEAKLPQQKLEDWGVG
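
The CDS and protein sequences above are related by codon translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly, and uses a hypothetical `translate` helper to check the first and last codons of the CDS against the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 12 codons and last 4 codons of the CDS listed above.
print(translate("ATGGCAGTGGGTCTCTGTGGAAGCGATAAGGAAGCT"))
print(translate("GGTGTTGGATAA"))
```

To check the full record, join the CDS lines into one string before calling `translate`; the trailing `*` corresponds to the TAA stop codon and is not part of the listed protein sequence.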