CuGenDBv2

Spg025764 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg025764
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold13:32286524..32292446
RNA-Seq Expression: Spg025764
Synteny: Spg025764
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
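
The Genome location field above packs a sequence name and two coordinates into one string. A minimal sketch of splitting it apart, assuming the `seqid:start..end` layout seen in this record and assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("scaffold13:32286524..32292446")
print(seqid, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.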


Homology
GenBank top hits (e value, % identity, alignment)
BBN69274.1 TatD related DNase [Prunus dulcis] (e value: 3.5e-34; % identity: 33.47)
Query:  VGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCR
        VG+   I FWED W+ +  L  LFPRLY LS  KN+ +A          ++ F FRR+LS+ E  +++ LL ++    L   R + R W ++    FSC+
Subjt:  VGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCR

Query:  SFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVK-ALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHR
        SF  FLI  + V    +  +WK K P K+ FF W    GR+NT D + R +  + + P  C+ C++  EN+DHL  +C ++  +W    +     +   +
Subjt:  SFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVK-ALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHR

Query:  RCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIFRG
         C + +   L      KK  +L    + AI W +W ERN RIF+G
Subjt:  RCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIFRG

TYK09969.1 calpain-type cysteine protease DEK1 [Cucumis melo var. makuwa] (e value: 5.8e-45; % identity: 44.12)
Query:  ECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGF
        + I G  RDI     L  G+RPLC LFP LYHLSS+KN  +AD L   G+S SFSF FRR+LSDRET++++AL+SL+E  S R  RR+V  WS  P  GF
Subjt:  ECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGF

Query:  SCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFAS
         C+SFFQ L++ +P  + V S +W++K P+K + F+WQV                               E+LDHLLW+CE    VW  F + F   +A 
Subjt:  SCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFAS

Query:  HRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWG
        HR  R T+EEFLL+ P  ++G  LWH  V A L G  G
Subjt:  HRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWG

VVA33204.1 PREDICTED: ribonuclease H [Prunus dulcis] (e value: 6.0e-34; % identity: 36.44)
Query:  FWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIH
        FWED W     L  +FPRL++LS  +N  ++      G   S+ F FRR+L++ E T+   LL L+EG  L + R + R W LDPS  F+C S    + +
Subjt:  FWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIH

Query:  PSPVRKF-VFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLS-RVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIE
              F  ++ +WK K P KV  F WQ +LG++NT D L  R   L + P  C LC KA E++DHLL  C F+  +W    +     +     C +   
Subjt:  PSPVRKF-VFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLS-RVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIE

Query:  EFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIF
          +      KK K+LW   + A++W LW ERN RIF
Subjt:  EFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIF

VVA39726.1 Hypothetical predicted protein, partial [Prunus dulcis] (e value: 7.1e-35; % identity: 35.51)
Query:  IVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSC
        +VG    + FWED W     L  +FPRL++LS  +N  ++      G   S+ F FRR+L++ E T++  LL L+EG  L + R + R W LDPS  F+C
Subjt:  IVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSC

Query:  RSFFQFLIHPSPVRKF-VFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLS-RVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFAS
         S    + +      F  ++ +WK K P KV  F WQ +LG++NT D L  R   L + P  C LC KA +++DHLL +C F+  +W    +     +  
Subjt:  RSFFQFLIHPSPVRKF-VFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLS-RVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFAS

Query:  HRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIF
           C +     +      KK K+LW   + A++W LW ERN RIF
Subjt:  HRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIF

XP_040371876.1 uncharacterized protein LOC112192237 isoform X1 [Rosa chinensis] (e value: 1.2e-34; % identity: 36.44)
Query:  FWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIH
        FWED W+ D+PL   FPRL+ LS+  N  V  +  P+    S+ F FRR+L+D+ET +  +LL+ +E   L+  + + R W L+P+  FSC+SF  FL  
Subjt:  FWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIH

Query:  PSPVRKF-VFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVK-ALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIE
         +    F     +W VKAP KV    W V+LG+ NT D L + +      P  C+LCR   E+ DH+  +C+    +W   F      + +  +  + + 
Subjt:  PSPVRKF-VFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVK-ALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIE

Query:  EFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIF
        E  L     KK K LW  GV A+ W +W ERN RIF
Subjt:  EFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIF

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FKU6 Reverse transcriptase domain-containing protein (e value: 6.9e-36; % identity: 35.03)
Query:  RRLVKSVWGRRKVEWAGLESFGASGGILF--MCN--ESLMSVKECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTS--FS
        RR+V + +G  +  W      G+ G  L+  +C   E+  S     VG    + FW D W  DRPL  LFPRLY LS  +N TVA++L P GSS S  + 
Subjt:  RRLVKSVWGRRKVEWAGLESFGASGGILF--MCN--ESLMSVKECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTS--FS

Query:  FSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKA
          F R  +D E   + ALLS++     RS   +   W    +  F+ R+F+Q ++H  P   F + C+W+VKAP +V FF W    GR+ T + L + K 
Subjt:  FSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKA

Query:  LVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIFRGIE
         V+  +CCM C+KA E +DHLL +C FAR +WSL F+     +       + +  +       KK   +W++    ++W +W ERN R F  +E
Subjt:  LVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIFRGIE

A0A2N9FKU6 Reverse transcriptase domain-containing protein (e value: 5.2e-07; % identity: 36.56)
Query:  EKRSLVKDLLSQENPDIVVLLETKRQKYCRRLVKSVWGRRKVEWAGLESFGASGGILFMCNESLMSVKECIVGDVRDICFWEDLWVGDRPLCS
        +KR +VK+LL +   DIV L ETK      RL++S+WG + V+W  L++   +GGIL + +  ++   + +VG     CFW  L  G   +CS
Subjt:  EKRSLVKDLLSQENPDIVVLLETKRQKYCRRLVKSVWGRRKVEWAGLESFGASGGILFMCNESLMSVKECIVGDVRDICFWEDLWVGDRPLCS

A0A2N9FKU6 Reverse transcriptase domain-containing protein (e value: 2.0e-35; % identity: 34.69)
Query:  RRLVKSVWGRRKVEWAGLESFGASGGILF--MCN--ESLMSVKECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTS--FS
        RR+V + +G  +  W      G+ G  L+  +C   E+  S     VG    + FW D W  DRPL  LFPRLY LS  +N TVA++L P GSS S  + 
Subjt:  RRLVKSVWGRRKVEWAGLESFGASGGILF--MCN--ESLMSVKECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTS--FS

Query:  FSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKA
          F R  +D E   + A LS++     RS   +   W    +  F+ R+F+Q ++H  P   F + C+W+VKAP +V FF W    GR+ T + L + K 
Subjt:  FSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKA

Query:  LVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIFRGIE
         V+  +CCM C+KA E +DHLL +C FAR +WSL F+     +       + +  +       KK   +W++    ++W +W ERN R F  +E
Subjt:  LVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIFRGIE

A0A2N9H6P8 zf-RVT domain-containing protein (e value: 5.3e-36; % identity: 31.91)
Query:  TKRQKYCRRLVK----SVWGRRKVEWAGLESFGASGGILFMCN-ESLMSVKECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVG
        T+R+   RR+V     S+WG       G  S+G S         +SL      +VGD   + FW+D W G+ PL + +P L+  +     +VAD++  V 
Subjt:  TKRQKYCRRLVK----SVWGRRKVEWAGLESFGASGGILFMCN-ESLMSVKECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVG

Query:  SSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDR
         +  +  SF R + D E   + + ++ I    L+    +   W  DP   FS +S++   + PSP R F++  +WKVKAP +V FF+W  +LG++ T D 
Subjt:  SSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDR

Query:  LSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLL--HPPVRKKGK-LLWHVGVCAILWGLWGERNNRIFR
        L R + L++  +CCM C+K+ EN+DHLL +C  A  VWS+ F  F   +  +     TI++  L       + G+ ++W +    ++W LW ERN R F 
Subjt:  LSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLL--HPPVRKKGK-LLWHVGVCAILWGLWGERNNRIFR

Query:  GIER
          +R
Subjt:  GIER

A0A5D3CI74 Calpain-type cysteine protease DEK1 (e value: 2.8e-45; % identity: 44.12)
Query:  ECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGF
        + I G  RDI     L  G+RPLC LFP LYHLSS+KN  +AD L   G+S SFSF FRR+LSDRET++++AL+SL+E  S R  RR+V  WS  P  GF
Subjt:  ECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGF

Query:  SCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFAS
         C+SFFQ L++ +P  + V S +W++K P+K + F+WQV                               E+LDHLLW+CE    VW  F + F   +A 
Subjt:  SCRSFFQFLIHPSPVRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFAS

Query:  HRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWG
        HR  R T+EEFLL+ P  ++G  LWH  V A L G  G
Subjt:  HRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWG

A0A5E4GJ11 Reverse transcriptase domain-containing protein (Fragment) (e value: 3.4e-35; % identity: 35.51)
Query:  IVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSC
        +VG    + FWED W     L  +FPRL++LS  +N  ++      G   S+ F FRR+L++ E T++  LL L+EG  L + R + R W LDPS  F+C
Subjt:  IVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSC

Query:  RSFFQFLIHPSPVRKF-VFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLS-RVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFAS
         S    + +      F  ++ +WK K P KV  F WQ +LG++NT D L  R   L + P  C LC KA +++DHLL +C F+  +W    +     +  
Subjt:  RSFFQFLIHPSPVRKF-VFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLS-RVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFAS

Query:  HRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIF
           C +     +      KK K+LW   + A++W LW ERN RIF
Subjt:  HRRCRDTIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIF

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G33710.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.3e-05; % identity: 30.65)
Query:  LWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVW---SLFFEVFEFHFASHRRCRD-TIEEFLLHPPVR
        +W   A  K  F  W   L R+ T  RL+     +     C LC   +E+ DHL   CEFA F+W   S+  E+  F F       D T++     PP  
Subjt:  LWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVW---SLFFEVFEFHFASHRRCRD-TIEEFLLHPPVR

Query:  KKGKLLWHVGVCAILWGLWGERNN
        +K      + V ++L+ +W +RNN
Subjt:  KKGKLLWHVGVCAILWGLWGERNN

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 8.7e-07; % identity: 29.86)
Query:  IHPSPVRKFV--FSCLW-KVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRD
        +H +P+ + V  F  +W K K PK   F AW  +  R++T DR+  +    + P  C+ C    E   HL ++CEFAR VW  F      H        D
Subjt:  IHPSPVRKFV--FSCLW-KVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRD

Query:  TIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIFRGIER
         I  +L +P   K    +  +   A ++ +W ERN R+     R
Subjt:  TIEEFLLHPPVRKKGKLLWHVGVCAILWGLWGERNNRIFRGIER

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.0e-07; % identity: 21.22)
Query:  ESLMSVKECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNV---R
        E L      +VG+  DI  W   W+  +P  S   R+  +   +  +V+ IL            +R+ + +    ++   L       LR   R +    
Subjt:  ESLMSVKECIVGDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNV---R

Query:  FWSLDPSAGFSCRSFFQFLIH-----------PSPVRKFVFSCLWKVKAPKKVLFFAWQVI-----LGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLD
         W    S  ++ +S +  L               P    ++  +WK +   K+  F W+ +     +     +  LS+  A       C+ C    E ++
Subjt:  FWSLDPSAGFSCRSFFQFLIH-----------PSPVRKFVFSCLWKVKAPKKVLFFAWQVI-----LGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLD

Query:  HLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLLHPPVRKKGKLLWHVG---VCAILWGLWGERNNRIFRGIE
        HLL+ C FAR  W+    +            D+I   L        G   W      V  +LW LW  RN  +FRG E
Subjt:  HLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLLHPPVRKKGKLLWHVG---VCAILWGLWGERNNRIFRGIE


Sequences
CDS sequence
ATGGATCGGTGGAAGAGGGTGAACATCGAGTGGAAGGTGTTTCACCTTCACAAGGTTGGGGATGGGAGGAAAGTGGTTATGGAGGAAAGACATGGATCGAGAACGAGTCA
TCTAGATGTGGACGTGGACGTGGGTACCTTCGCTTGGATTAGGGATTGTCTTATGGCAGTGTCGAAATCAGAGAAACCAAACGGATTCTGGAGGAGGAGGAAACTAGAGG
CAGAGGTGATCTTATTCCAGGTGCTATCAAACAAGAGAGGAAGATTTGGGTTCCTCTCTTTGGAATCAAACAGGGGGAAGAAATCTTCAATCTTCATCCCTGAGGGTCAA
AATCGCAGAGGTTGGGGATCGCTGGAGACGGCCATGACTGAAATGCTCCCCCCTCCAAAAGAGTCTAGTCGGTTCGATAATGGGTTTCTTAAAAAGAAAGAGAGTGTCGG
AGACCAGATCGACGAAGGGATTCGAAAATCCATCTCCTTTGCGGAAATAGTGAGAAAGGGACCTACGTCAGACATCGAGAGGGATAATTTACCTTCGGAAGTTGGAGAGC
TTGCCTTCGACTGTTCATCGGCGATCATCATCAAAAGGATTAACGTCGATCTGCAATGGATGGGTATAAGAGATTCGTGCTTAGCAATGGCGAATTTAGGTATCATCCTT
AAACCCGTTTGTATTAATCTTGCAGTGGGCCTTTGCGGGAGCGAGAATGAAGCCAAGGAGTTGATCGAGTCGTGTAAGGGGCAACAACAAGAACACTTCAATATTCACCC
GTGGGATTGCAATCTGCTTGTTGAGAGGAAAAGGCTAGCCTTCAAAGGGGGATGGATTACAACTCTAGACGTTCCATCTTTCCTTAGAACAAGACAGATGATTGAGTTGC
TAGCTAAAGAATGCGGCGGGTTCACAGAGGAAGACGAAATTAGAGTTGAGGTTTGTTGGACAGAGGAAATCAGTTTCAAAGCTAAAGGTACCGACAACGGTCTTATTCCG
GCCACCAGATTCCTGATTCATGAAGGAAGATTCTTTCCGGTGAGGTTTCTGGTCGGAAGTGAAGCTGTCGAAAGGATTTCCGGCGAGGCTTCAACAATTCCAAGAGCGCC
GGCGAGGGTTGTCCTTCAGCTACCAAAGTCGGCGTCTTTTTCGGGTCAGATCGTGGCAAGAGATCACTTGAAGATAACGCAAAAAGTGGGACCCAACAATGGAAAAGGCT
CAGTTCAGCTTCTTGGGCCGAAGCTTGATGGGCCGGTTGGCATTGAAGGAAATTTTAATAAGTCGACAACCGATCCTTTTAAGGCTCACCCTTCCTCGCGGAAAAATCAC
TACAGGCCTGAGATGAATGGGCTAGATGGGCCTGCAGCCTCTTCTTTAAGCAATTCCAGTAGAGAGAAAGGGAAAGCTGTGATTAATTATGAAAGGGAAGAGTTCGGTGT
GGAAAAGGAGCATTCGGTTACAATCATCAATCCTATCTACTCTGAGGATGAATCATTCTTATCTACCCCTGCAGCCAAGGATATGGAAGCCCCGAATTTGGGTTCCTCGT
TCATGAATGAAGAGGTTGCCGTCGAGCAAAATCAGCCTGAACCATCTGCCGAGGATCCTTCTCAATCGATAGTCGAGGTTAGCCTTGCTGGAATGCCAAACCAGGTACGC
GCCTTTTCTTCGAAGTCTTTACCTTCCGTTTCGGGTTTTGGGGAATTGAGAGAAATGCCTGTTCAGAGCATGATGCCAATTTATCCTTCGTATGGCGCTCTTCTGGGAAT
GGGGGAGGGGCTTCAAGGCGGGAACCCTTTTGCTTCTTGTTATCCGGTTTTTGTCAGTCCTAGTGCTTTCCCAGGAGGGCATGTAGGGGTTCCTTTTCAGGGGGGTGCTA
TAGGCCAATGGCCCTCGGGTTGGGCGGCGTTAGCTTCGGTGGGTATTCAGGCAACCTCGGCTAAAGGAGACCCTTCGAAATCCATCGATCCCCCAAATGCCCCTATGAAA
GGAAATCAATCTCGTCAAAGGAAAGGGAAAGGGGGTACCAACAAGAGGGATAAGGAGCTAAAGAAGCTGCCAAGCTCAGTTAACTATGATCGTAAGAAGAGGGTTGGGGG
TAGGGAGAAGAGGAGTCTTGTGAAGGATCTGCTGTCCCAAGAAAATCCGGACATAGTGGTCCTTCTCGAGACTAAGCGGCAGAAGTACTGTAGAAGACTCGTGAAAAGTG
TTTGGGGTAGGAGAAAAGTGGAGTGGGCTGGTTTGGAGTCCTTTGGTGCGTCGGGGGGTATTCTTTTTATGTGTAATGAGAGTCTGATGTCGGTCAAGGAGTGCATTGTG
GGGGATGTGAGGGATATTTGTTTTTGGGAGGATTTGTGGGTGGGGGATCGTCCCCTCTGCTCCTTGTTCCCGCGTCTTTATCATTTATCTTCGATGAAAAATCGCACTGT
GGCGGACATCCTTGGTCCTGTGGGGAGTTCCACTTCGTTTTCTTTTAGTTTTCGTCGTTCGTTATCTGATAGAGAGACCACAGACATCATGGCCCTCTTATCTTTGATTG
AGGGGGCTAGTCTTAGATCGGACAGGAGGAATGTGCGCTTTTGGAGTCTTGACCCCTCTGCGGGCTTTTCCTGCAGATCGTTCTTTCAGTTTTTAATCCATCCCTCCCCC
GTTAGAAAGTTTGTCTTTTCGTGTCTCTGGAAGGTTAAGGCTCCGAAGAAGGTCTTGTTCTTTGCTTGGCAGGTCATCTTGGGCCGTGTTAACACTTTTGATAGGCTTTC
GAGAGTGAAGGCTCTTGTGGTCGGTCCTTTTTGTTGCATGCTTTGTCGGAAGGCTGTGGAAAATCTTGATCATTTGTTATGGAATTGTGAGTTTGCTCGTTTTGTCTGGA
GTCTCTTCTTTGAGGTCTTCGAGTTTCATTTTGCGAGCCACCGTCGTTGTAGGGATACGATCGAGGAGTTCCTCCTCCATCCACCGGTCCGGAAGAAAGGAAAGTTACTT
TGGCATGTTGGTGTGTGTGCTATTTTGTGGGGTTTATGGGGGGAAAGGAACAATAGAATTTTTAGAGGTATTGAGAGACATTATTGTATGGTTTGA
mRNA sequence
ATGGATCGGTGGAAGAGGGTGAACATCGAGTGGAAGGTGTTTCACCTTCACAAGGTTGGGGATGGGAGGAAAGTGGTTATGGAGGAAAGACATGGATCGAGAACGAGTCA
TCTAGATGTGGACGTGGACGTGGGTACCTTCGCTTGGATTAGGGATTGTCTTATGGCAGTGTCGAAATCAGAGAAACCAAACGGATTCTGGAGGAGGAGGAAACTAGAGG
CAGAGGTGATCTTATTCCAGGTGCTATCAAACAAGAGAGGAAGATTTGGGTTCCTCTCTTTGGAATCAAACAGGGGGAAGAAATCTTCAATCTTCATCCCTGAGGGTCAA
AATCGCAGAGGTTGGGGATCGCTGGAGACGGCCATGACTGAAATGCTCCCCCCTCCAAAAGAGTCTAGTCGGTTCGATAATGGGTTTCTTAAAAAGAAAGAGAGTGTCGG
AGACCAGATCGACGAAGGGATTCGAAAATCCATCTCCTTTGCGGAAATAGTGAGAAAGGGACCTACGTCAGACATCGAGAGGGATAATTTACCTTCGGAAGTTGGAGAGC
TTGCCTTCGACTGTTCATCGGCGATCATCATCAAAAGGATTAACGTCGATCTGCAATGGATGGGTATAAGAGATTCGTGCTTAGCAATGGCGAATTTAGGTATCATCCTT
AAACCCGTTTGTATTAATCTTGCAGTGGGCCTTTGCGGGAGCGAGAATGAAGCCAAGGAGTTGATCGAGTCGTGTAAGGGGCAACAACAAGAACACTTCAATATTCACCC
GTGGGATTGCAATCTGCTTGTTGAGAGGAAAAGGCTAGCCTTCAAAGGGGGATGGATTACAACTCTAGACGTTCCATCTTTCCTTAGAACAAGACAGATGATTGAGTTGC
TAGCTAAAGAATGCGGCGGGTTCACAGAGGAAGACGAAATTAGAGTTGAGGTTTGTTGGACAGAGGAAATCAGTTTCAAAGCTAAAGGTACCGACAACGGTCTTATTCCG
GCCACCAGATTCCTGATTCATGAAGGAAGATTCTTTCCGGTGAGGTTTCTGGTCGGAAGTGAAGCTGTCGAAAGGATTTCCGGCGAGGCTTCAACAATTCCAAGAGCGCC
GGCGAGGGTTGTCCTTCAGCTACCAAAGTCGGCGTCTTTTTCGGGTCAGATCGTGGCAAGAGATCACTTGAAGATAACGCAAAAAGTGGGACCCAACAATGGAAAAGGCT
CAGTTCAGCTTCTTGGGCCGAAGCTTGATGGGCCGGTTGGCATTGAAGGAAATTTTAATAAGTCGACAACCGATCCTTTTAAGGCTCACCCTTCCTCGCGGAAAAATCAC
TACAGGCCTGAGATGAATGGGCTAGATGGGCCTGCAGCCTCTTCTTTAAGCAATTCCAGTAGAGAGAAAGGGAAAGCTGTGATTAATTATGAAAGGGAAGAGTTCGGTGT
GGAAAAGGAGCATTCGGTTACAATCATCAATCCTATCTACTCTGAGGATGAATCATTCTTATCTACCCCTGCAGCCAAGGATATGGAAGCCCCGAATTTGGGTTCCTCGT
TCATGAATGAAGAGGTTGCCGTCGAGCAAAATCAGCCTGAACCATCTGCCGAGGATCCTTCTCAATCGATAGTCGAGGTTAGCCTTGCTGGAATGCCAAACCAGGTACGC
GCCTTTTCTTCGAAGTCTTTACCTTCCGTTTCGGGTTTTGGGGAATTGAGAGAAATGCCTGTTCAGAGCATGATGCCAATTTATCCTTCGTATGGCGCTCTTCTGGGAAT
GGGGGAGGGGCTTCAAGGCGGGAACCCTTTTGCTTCTTGTTATCCGGTTTTTGTCAGTCCTAGTGCTTTCCCAGGAGGGCATGTAGGGGTTCCTTTTCAGGGGGGTGCTA
TAGGCCAATGGCCCTCGGGTTGGGCGGCGTTAGCTTCGGTGGGTATTCAGGCAACCTCGGCTAAAGGAGACCCTTCGAAATCCATCGATCCCCCAAATGCCCCTATGAAA
GGAAATCAATCTCGTCAAAGGAAAGGGAAAGGGGGTACCAACAAGAGGGATAAGGAGCTAAAGAAGCTGCCAAGCTCAGTTAACTATGATCGTAAGAAGAGGGTTGGGGG
TAGGGAGAAGAGGAGTCTTGTGAAGGATCTGCTGTCCCAAGAAAATCCGGACATAGTGGTCCTTCTCGAGACTAAGCGGCAGAAGTACTGTAGAAGACTCGTGAAAAGTG
TTTGGGGTAGGAGAAAAGTGGAGTGGGCTGGTTTGGAGTCCTTTGGTGCGTCGGGGGGTATTCTTTTTATGTGTAATGAGAGTCTGATGTCGGTCAAGGAGTGCATTGTG
GGGGATGTGAGGGATATTTGTTTTTGGGAGGATTTGTGGGTGGGGGATCGTCCCCTCTGCTCCTTGTTCCCGCGTCTTTATCATTTATCTTCGATGAAAAATCGCACTGT
GGCGGACATCCTTGGTCCTGTGGGGAGTTCCACTTCGTTTTCTTTTAGTTTTCGTCGTTCGTTATCTGATAGAGAGACCACAGACATCATGGCCCTCTTATCTTTGATTG
AGGGGGCTAGTCTTAGATCGGACAGGAGGAATGTGCGCTTTTGGAGTCTTGACCCCTCTGCGGGCTTTTCCTGCAGATCGTTCTTTCAGTTTTTAATCCATCCCTCCCCC
GTTAGAAAGTTTGTCTTTTCGTGTCTCTGGAAGGTTAAGGCTCCGAAGAAGGTCTTGTTCTTTGCTTGGCAGGTCATCTTGGGCCGTGTTAACACTTTTGATAGGCTTTC
GAGAGTGAAGGCTCTTGTGGTCGGTCCTTTTTGTTGCATGCTTTGTCGGAAGGCTGTGGAAAATCTTGATCATTTGTTATGGAATTGTGAGTTTGCTCGTTTTGTCTGGA
GTCTCTTCTTTGAGGTCTTCGAGTTTCATTTTGCGAGCCACCGTCGTTGTAGGGATACGATCGAGGAGTTCCTCCTCCATCCACCGGTCCGGAAGAAAGGAAAGTTACTT
TGGCATGTTGGTGTGTGTGCTATTTTGTGGGGTTTATGGGGGGAAAGGAACAATAGAATTTTTAGAGGTATTGAGAGACATTATTGTATGGTTTGA
Protein sequence
MDRWKRVNIEWKVFHLHKVGDGRKVVMEERHGSRTSHLDVDVDVGTFAWIRDCLMAVSKSEKPNGFWRRRKLEAEVILFQVLSNKRGRFGFLSLESNRGKKSSIFIPEGQ
NRRGWGSLETAMTEMLPPPKESSRFDNGFLKKKESVGDQIDEGIRKSISFAEIVRKGPTSDIERDNLPSEVGELAFDCSSAIIIKRINVDLQWMGIRDSCLAMANLGIIL
KPVCINLAVGLCGSENEAKELIESCKGQQQEHFNIHPWDCNLLVERKRLAFKGGWITTLDVPSFLRTRQMIELLAKECGGFTEEDEIRVEVCWTEEISFKAKGTDNGLIP
ATRFLIHEGRFFPVRFLVGSEAVERISGEASTIPRAPARVVLQLPKSASFSGQIVARDHLKITQKVGPNNGKGSVQLLGPKLDGPVGIEGNFNKSTTDPFKAHPSSRKNH
YRPEMNGLDGPAASSLSNSSREKGKAVINYEREEFGVEKEHSVTIINPIYSEDESFLSTPAAKDMEAPNLGSSFMNEEVAVEQNQPEPSAEDPSQSIVEVSLAGMPNQVR
AFSSKSLPSVSGFGELREMPVQSMMPIYPSYGALLGMGEGLQGGNPFASCYPVFVSPSAFPGGHVGVPFQGGAIGQWPSGWAALASVGIQATSAKGDPSKSIDPPNAPMK
GNQSRQRKGKGGTNKRDKELKKLPSSVNYDRKKRVGGREKRSLVKDLLSQENPDIVVLLETKRQKYCRRLVKSVWGRRKVEWAGLESFGASGGILFMCNESLMSVKECIV
GDVRDICFWEDLWVGDRPLCSLFPRLYHLSSMKNRTVADILGPVGSSTSFSFSFRRSLSDRETTDIMALLSLIEGASLRSDRRNVRFWSLDPSAGFSCRSFFQFLIHPSP
VRKFVFSCLWKVKAPKKVLFFAWQVILGRVNTFDRLSRVKALVVGPFCCMLCRKAVENLDHLLWNCEFARFVWSLFFEVFEFHFASHRRCRDTIEEFLLHPPVRKKGKLL
WHVGVCAILWGLWGERNNRIFRGIERHYCMV
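
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI table 1), ASSUMED to apply to this gene.
# Codons are ordered with each base cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last six codons of the CDS listed above.
print(translate("ATGGATCGGTGGAAGAGGGTGAACATCGAG"))
print(translate("CATTATTGTATGGTTTGA"))
```

Passing the full CDS, with line breaks removed, to `translate` and comparing the result with the full protein sequence gives the complete check.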