; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032563 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032563
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold3:23257453..23261541
RNA-Seq ExpressionSpg032563
SyntenySpg032563
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa]2.0e-4344.19Show/hide
Query:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDS
        +L +L +  V +SDD R WS+E+ G+F+  SLS  L ++  +   LF AI +S SP+RINIL+WI++   + +S++LQ+K P  +  PSICPLCLKA  +
Subjt:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDS

Query:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKS
          H+F  C  S   W + F++FN+ W F +S+  +V+QLL G +    PR++W    KAL+ EIW+ERNQR+F DKA      + +A L A++WC+L K 
Subjt:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKS

Query:  FVAFSPQDICFNWHS
        FV +S QDIC NW++
Subjt:  FVAFSPQDICFNWHS

KAA0047189.1 hypothetical protein E6C27_scaffold83G00690 [Cucumis melo var. makuwa]9.5e-4140Show/hide
Query:  SSVLQKRKVSYWIRKNQEVLNVNFADFWVVSRLFAHNSWKDIIEVLEFHFKSKISINPLFADKALLKFEDDNVVSSLDMVGKWKVFGNFHLLLEKWNKER
        SS L   K   W+ +N EV++ NF + W++++LFA +  + I ++LE +F++KI INPLF + AL+  ++ ++   +   GKW+V G+F+L  EKW+K +
Subjt:  SSVLQKRKVSYWIRKNQEVLNVNFADFWVVSRLFAHNSWKDIIEVLEFHFKSKISINPLFADKALLKFEDDNVVSSLDMVGKWKVFGNFHLLLEKWNKER

Query:  HSHPCFMEGYGGWISIKNLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMPATIELKDSFRGNVYLHFGDVSTLDSPKIIHHS
        +S P  M+GYGGW+ IKNL    W   + E                      SEARIQVK NLCGF+P+TIE+ D  RGN++L+FGD   L+ P     +
Subjt:  HSHPCFMEGYGGWISIKNLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMPATIELKDSFRGNVYLHFGDVSTLDSPKIIHHS

Query:  LELSDFSNPMDLFRIKQVMEDEGFD
        + +SDF   + L RI +V++DEG D
Subjt:  LELSDFSNPMDLFRIKQVMEDEGFD

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa]6.3e-4544.7Show/hide
Query:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDS
        +L +L +  V +SDD R WS+E+ G+F+  SLS  L ++  +   LF AI +S SP+RINIL+WI++   +N+S++LQ+K P  +  PSICPLCLKA  +
Subjt:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDS

Query:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKS
          H+F  C  S   W + F++FN+ W F +S+  +V+QLL G +    PR++W    KAL+ EIW+ERNQR+F DKA      + +A L A++WC+L K 
Subjt:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKS

Query:  FVAFSPQDICFNWHSFI
        FV +S QDIC NW+ F+
Subjt:  FVAFSPQDICFNWHSFI

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida]2.9e-5042.44Show/hide
Query:  GPSPFRFLNSWLNLSECVEIMENSLA---GDRSYGWAVQFRNF---GIVNIILGVLDSLVVGS---SDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDL
        G SPF  L S  +L     +   S+A    +    W++ FR       V+    +L  +V  S   S D RVWS+ N+ Q+TV SL + L     ++  +
Subjt:  GPSPFRFLNSWLNLSECVEIMENSLA---GDRSYGWAVQFRNF---GIVNIILGVLDSLVVGS---SDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDL

Query:  FWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDSALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFS
        F  IWK+KSP+R+NIL+WI+L G LN ++VLQ+K P  S  P++CP CL   + +LHLFF C YS  CW+K    FN+     N  K NV QLL  P+  
Subjt:  FWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDSALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFS

Query:  SRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKSFVAFSPQDICFNWHSFI
           RLLW N VKAL++++W ERNQR+F +KA     RLE+A  +ASSWC LS  F A+S  D   NW +FI
Subjt:  SRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKSFVAFSPQDICFNWHSFI

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]4.5e-4352.9Show/hide
Query:  EDDNVVSSLDMVGKWKVFGNFHLLLEKWNKERHSHPCFMEGYGGWISIKNLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMP
        E +++   +++ GKW+ FG+FHL  E+WN   H  P ++ GYGGWISIKNLPLDYWC+Q+FEAIG+YFGGL +I+ + LN+  V +A I+VK NLCGF+P
Subjt:  EDDNVVSSLDMVGKWKVFGNFHLLLEKWNKERHSHPCFMEGYGGWISIKNLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMP

Query:  ATIELKDSFRGNVYLHFGDVSTLDSPKIIHHSLELSDFSNPMDLFRIKQVMEDEG
        ATIE+ +  RG++YL+FGD+ST + P  +   L  SDF+NP+DL R+ +V   EG
Subjt:  ATIELKDSFRGNVYLHFGDVSTLDSPKIIHHSLELSDFSNPMDLFRIKQVMEDEG

TrEMBL top hitse value%identityAlignment
A0A5A7T2Y0 zf-RVT domain-containing protein9.9e-4444.19Show/hide
Query:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDS
        +L +L +  V +SDD R WS+E+ G+F+  SLS  L ++  +   LF AI +S SP+RINIL+WI++   + +S++LQ+K P  +  PSICPLCLKA  +
Subjt:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDS

Query:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKS
          H+F  C  S   W + F++FN+ W F +S+  +V+QLL G +    PR++W    KAL+ EIW+ERNQR+F DKA      + +A L A++WC+L K 
Subjt:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKS

Query:  FVAFSPQDICFNWHS
        FV +S QDIC NW++
Subjt:  FVAFSPQDICFNWHS

A0A5A7U128 Uncharacterized protein4.6e-4140Show/hide
Query:  SSVLQKRKVSYWIRKNQEVLNVNFADFWVVSRLFAHNSWKDIIEVLEFHFKSKISINPLFADKALLKFEDDNVVSSLDMVGKWKVFGNFHLLLEKWNKER
        SS L   K   W+ +N EV++ NF + W++++LFA +  + I ++LE +F++KI INPLF + AL+  ++ ++   +   GKW+V G+F+L  EKW+K +
Subjt:  SSVLQKRKVSYWIRKNQEVLNVNFADFWVVSRLFAHNSWKDIIEVLEFHFKSKISINPLFADKALLKFEDDNVVSSLDMVGKWKVFGNFHLLLEKWNKER

Query:  HSHPCFMEGYGGWISIKNLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMPATIELKDSFRGNVYLHFGDVSTLDSPKIIHHS
        +S P  M+GYGGW+ IKNL    W   + E                      SEARIQVK NLCGF+P+TIE+ D  RGN++L+FGD   L+ P     +
Subjt:  HSHPCFMEGYGGWISIKNLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMPATIELKDSFRGNVYLHFGDVSTLDSPKIIHHS

Query:  LELSDFSNPMDLFRIKQVMEDEGFD
        + +SDF   + L RI +V++DEG D
Subjt:  LELSDFSNPMDLFRIKQVMEDEGFD

A0A5A7V878 DUF4283 domain-containing protein1.5e-3634.3Show/hide
Query:  GWFLRCVVSPFSGGRQFIHVPIGNSKMGCSLFKELVVDSIRSLMVSNAPVASGEVPPMSFAETLKFPLKSLVQGSGAVGVSKNSSVLQKRKVSYWIRKNQ
        GW LRC V P SGGR ++H+P+G ++ G   F  ++ D    L VS     S E   M   + L+ P         A   S       K K S W+ KN 
Subjt:  GWFLRCVVSPFSGGRQFIHVPIGNSKMGCSLFKELVVDSIRSLMVSNAPVASGEVPPMSFAETLKFPLKSLVQGSGAVGVSKNSSVLQKRKVSYWIRKNQ

Query:  EVLNVNFADFWVVSRLFAHNSWKDIIEVLEFHFKSKISINPLFADKALLKFEDDNVVSSLDMVGKWKVFGNFHLLLEKWNKERHSHPCFMEGYGGWISIK
        EVL  +F               K I++V                                                             ++GYGGWISIK
Subjt:  EVLNVNFADFWVVSRLFAHNSWKDIIEVLEFHFKSKISINPLFADKALLKFEDDNVVSSLDMVGKWKVFGNFHLLLEKWNKERHSHPCFMEGYGGWISIK

Query:  NLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMPATIELKDSFRGNVYLHFGDVSTLDSPKIIHHSLELSDFSNPMDLFRIKQ
        NLPLDYW    ++AIG +FGG  +IS KT+N+   SEA+I+V  NLCGF+PA +EL+D+FR N++L+FGD+  L++PK+I  +L +S  +N +DL RI Q
Subjt:  NLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMPATIELKDSFRGNVYLHFGDVSTLDSPKIIHHSLELSDFSNPMDLFRIKQ

Query:  VMEDEGFDS
        V+ DEG +S
Subjt:  VMEDEGFDS

A0A5D3DE60 zf-RVT domain-containing protein3.1e-4544.7Show/hide
Query:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDS
        +L +L +  V +SDD R WS+E+ G+F+  SLS  L ++  +   LF AI +S SP+RINIL+WI++   +N+S++LQ+K P  +  PSICPLCLKA  +
Subjt:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDS

Query:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKS
          H+F  C  S   W + F++FN+ W F +S+  +V+QLL G +    PR++W    KAL+ EIW+ERNQR+F DKA      + +A L A++WC+L K 
Subjt:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKS

Query:  FVAFSPQDICFNWHSFI
        FV +S QDIC NW+ F+
Subjt:  FVAFSPQDICFNWHSFI

A0A6J1DIE2 uncharacterized protein LOC1110207653.3e-3945.79Show/hide
Query:  VSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDSALHLFFECAYSQLCWSKFFAIFNMQWVF
        V+SL  + GS+  +  + F A+WK+KSP+R+N+  WI+  G LNT+D++Q+K P  +  PS C LC K+G+   HLFF C ++  CW+  F  FN+ W F
Subjt:  VSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDSALHLFFECAYSQLCWSKFFAIFNMQWVF

Query:  SNSVKENVLQLLIG-PSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKSFVAFSPQDICFNWHSFI
             +NV QLL G P  SS  R LW+N VKAL+SE+W ERN R+FE+K         SA  KAS WC+L  SF+  SP  I  NW +FI
Subjt:  SNSVKENVLQLLIG-PSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKSFVAFSPQDICFNWHSFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.6e-0826.7Show/hide
Query:  IMENSLAGDRSYGWAVQFRNFGIVNI--ILGVLDSLVVGSSDDARVWSLE---NSGQFTVSSLSHQLGSSFHIQSDLF-W--AIWKSKSPKRINILMWII
        ++ ++L G   +  + + RN  IV +  +L     L+    DD+ +W  +    S +F+    + +  S+ H QS    W  A+W      +   + W++
Subjt:  IMENSLAGDRSYGWAVQFRNFGIVNI--ILGVLDSLVVGSSDDARVWSLE---NSGQFTVSSLSHQLGSSFHIQSDLF-W--AIWKSKSPKRINILMWII

Query:  LNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDSALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWL
            L+T D LQ    +    P+ C LC    DS  HLFFEC +S + W  F A  N+      +   + L  L+ PS      L+      + +  IW 
Subjt:  LNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDSALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWL

Query:  ERNQRV
        ERNQR+
Subjt:  ERNQRV

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-0425.15Show/hide
Query:  DARVWSLENSGQFTVSSLSHQLGSSFHIQSDLF-WA--IWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDSALHLFFECAYS
        D+ +W    +G +  S  S        + S    WA  +W  +   R +++ W+     L T D L+    +    PS   LC    ++  HLFFEC++S
Subjt:  DARVWSLENSGQFTVSSLSHQLGSSFHIQSDLF-WA--IWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSICPLCLKAGDSALHLFFECAYS

Query:  QLCWSKFFAIFNMQWVFSNSVKEN-VLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVF
           W  F + F     F      + +LQL +    ++  +LL    +++ +  +W ERN R+F
Subjt:  QLCWSKFFAIFNMQWVFSNSVKEN-VLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTGATAAGCCGCAAGATTATGAATTCTTTTTATTGTGTTTGGTCCGAAGAGGAATGCTTTTTTGTGGAAGATGTGGCTTTCAACAAGGTGATTTCTCTTTCTTC
TTCCCTTTTGCTTTGGTTGGAGCGTTCGTTGGTTGAGATTTTATGCCAGCCTATTCAGAAATTTTTTCGTAAGCAGTTTCGGGATGCTTTTGGGTTAATTCGTTTAGGTA
AACTTCGTTCTTCTTCAGGCTGGTTTTTGCGCTGTGTTGTTTCGCCTTTTTCAGGTGGTAGACAATTCATCCATGTGCCAATTGGAAATTCCAAAATGGGTTGTTCTTTA
TTCAAGGAGCTTGTGGTCGATTCCATTAGGAGTTTAATGGTTTCTAATGCTCCAGTTGCTTCTGGGGAGGTACCTCCTATGAGTTTTGCTGAAACTCTAAAGTTTCCCTT
GAAATCATTAGTGCAAGGTTCTGGGGCTGTTGGTGTTTCTAAGAATTCTTCAGTTCTTCAAAAACGAAAGGTTTCTTACTGGATTAGAAAAAATCAGGAGGTTTTGAATG
TTAATTTTGCAGATTTTTGGGTGGTGTCCAGATTATTTGCCCACAATAGTTGGAAAGATATTATTGAGGTTTTGGAGTTTCATTTCAAATCAAAGATTTCGATCAATCCG
CTTTTTGCAGATAAAGCATTACTCAAATTTGAAGATGATAATGTTGTTAGTTCATTAGATATGGTTGGGAAATGGAAGGTCTTTGGTAATTTTCATCTTTTGCTTGAAAA
ATGGAACAAAGAGCGTCATAGTCATCCCTGTTTTATGGAGGGTTATGGTGGTTGGATTTCAATCAAGAATTTGCCTTTAGATTATTGGTGTCGACAATCTTTTGAAGCAA
TTGGAGAATATTTTGGGGGTTTAGTAAACATTTCAAGTAAAACTCTTAACATGACTATTGTTTCAGAAGCTAGAATTCAGGTTAAAACAAATTTATGTGGTTTTATGCCG
GCTACAATTGAACTTAAAGATAGTTTTAGAGGGAATGTTTATCTTCATTTTGGCGATGTGTCTACTCTGGATTCTCCGAAAATTATTCACCATAGTCTTGAGTTGAGCGA
TTTTTCCAACCCAATGGACCTTTTTCGAATCAAACAAGTAATGGAAGATGAAGGGTTTGATTCTCAAATTCAAAATCCAGATGTTGAACACTATGGAAAAATTCTGGAAA
TTCCTTCTGTATCAAGGGACCCTAAGGTAATTTACAAGTTTGTTCAATTACCTTGTGATGATAGTAATATTAAAGGGTTTTCTGTTTTGCCAAATATTTCTGAAGTTGAA
GAGTCAACGAGATTTAATTTTGATGATGCTTTATTATCAAGAAAGTCCTTTTCAGATTCCCCTGTAATTGGAAAGGCCTTTTCGAATATTGTAAAGGCCCGTGGTAATTT
TCAAGAACATGTAGTAATTGAGAAAGAGAAAGAGTTATTTAATGTTGAAGACCATGCAGTAATTGAGAAAGAGAAGGAGTTATTAAATGCCAATGGGCTGCATGTCTTTG
GTTTTCCTTCTGCCGAGAGTATATTTTCCAAGGAGCCTTTGCATGAAGTGGGGGAGGTTTTTAAAAAAAATAAGGAGGATGCTGTATTTATTGAAGAGATAGTAAATGAT
GGGGAGGTTCTTAAGAATAAGGAGGATGCTGCATTTATTGATGAGTTAGTAAATGATATTTTAACCCAAGTTTCAAATATGGCTGATAAGTCTTTAACTCAGGGGCAATC
TTTCAATTGTCCAGCAATTAATAACCCAAAGGCTTCTTCAAATGTGAACTCGGTTCAGGATATTTTCAATTCCGATTTAAATGAAAATTATTCTTTGTCTGAATACTTGG
ACTCCGGTATTCCACCTAGTGTTAAAGAAGTGCATGATAATTTCTGTAAATCTTATTCTAAATATTATGTGCGAAAGAAAGGGCCAAGTGTGGAAGGTGATATTTTAAAG
GTTAATGCTGATGTTTTGGAAGAAGTTGTCTCTAAGGTATTGGCTCCTCAAGAGTCTATTTTAAATCAGGATTCTTCAAAGGCAGCAGACAAGTCAAATGATTTTGCAAT
CAATAGTTGTAATATTGGGCCAGAAGGTGTGCTTTTCACAAGAGTTCTTTTTCCCTCTTCTAAAGATAGTTTGTCGGCAAATGCTTCATCACATTGTGGTAATGAAAATA
CAGATGATGAGTCAGAGGTTAGTATGAGCAGTGAAGAGATTGATTTTCCTCCGGAAAATCTTTTGAATGTTGATAGTTGTGATATCTCGGTAAATGATGATTTGAATTTG
CTCTTTACTTCTCCATCAAAATCTAATGCTTCAAGCAAGAAGATTGTGGATGACTTTAAGACTTCAATTCCAACTGCTTCAGATAATTTAGATACTTTTTCTGCTCTTAT
TAAAGCCAGTGGTCTACAGTTCAAGGAAATTCTATCGGGTGGAATGTTAATTATGTGGGATGAAAGTAGAGTCAATGTGGTTGAGGTGCGTACAATTTCAGATCATTTTC
CTATTCTTTTAGAAGCTGGAGGCTTTTCTTGGGGGCCTTCTCCTTTTCGTTTTTTGAATTCCTGGCTAAATCTGAGTGAATGTGTTGAGATTATGGAGAATTCTCTTGCT
GGAGATCGATCGTATGGATGGGCAGTTCAGTTTCGGAATTTTGGGATTGTCAACATAATTCTTGGTGTTTTGGACTCTTTGGTGGTTGGTTCATCTGATGATGCTCGTGT
TTGGTCTCTTGAAAATTCTGGACAGTTTACCGTTAGCTCTTTGTCTCATCAACTTGGTTCGAGTTTTCACATTCAATCGGATTTGTTTTGGGCCATTTGGAAATCTAAAA
GCCCTAAACGAATAAATATTTTGATGTGGATTATTTTGAATGGTAGTTTGAATACTTCTGATGTTCTTCAAAGAAAATTGCCGTTCTTCAGCTTTTTTCCTTCGATTTGT
CCTCTTTGTTTGAAGGCAGGAGACTCCGCATTGCATTTGTTCTTTGAGTGTGCTTATTCACAACTTTGTTGGTCAAAGTTCTTTGCTATTTTCAATATGCAGTGGGTTTT
TTCAAATTCCGTAAAAGAGAATGTGCTTCAACTGCTTATTGGTCCTTCTTTTTCTTCAAGACCGAGATTATTATGGATTAATGGTGTTAAAGCTTTGATATCAGAAATTT
GGTTGGAAAGAAATCAGAGGGTTTTTGAAGATAAAGCGTGGCATTCTTTAGCTCGTCTGGAATCAGCTTGCTTAAAGGCTTCTTCCTGGTGCACTCTTTCTAAATCTTTT
GTAGCTTTCTCTCCACAGGATATTTGTTTTAATTGGCATTCTTTTATTTTTCCCCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTGATAAGCCGCAAGATTATGAATTCTTTTTATTGTGTTTGGTCCGAAGAGGAATGCTTTTTTGTGGAAGATGTGGCTTTCAACAAGGTGATTTCTCTTTCTTC
TTCCCTTTTGCTTTGGTTGGAGCGTTCGTTGGTTGAGATTTTATGCCAGCCTATTCAGAAATTTTTTCGTAAGCAGTTTCGGGATGCTTTTGGGTTAATTCGTTTAGGTA
AACTTCGTTCTTCTTCAGGCTGGTTTTTGCGCTGTGTTGTTTCGCCTTTTTCAGGTGGTAGACAATTCATCCATGTGCCAATTGGAAATTCCAAAATGGGTTGTTCTTTA
TTCAAGGAGCTTGTGGTCGATTCCATTAGGAGTTTAATGGTTTCTAATGCTCCAGTTGCTTCTGGGGAGGTACCTCCTATGAGTTTTGCTGAAACTCTAAAGTTTCCCTT
GAAATCATTAGTGCAAGGTTCTGGGGCTGTTGGTGTTTCTAAGAATTCTTCAGTTCTTCAAAAACGAAAGGTTTCTTACTGGATTAGAAAAAATCAGGAGGTTTTGAATG
TTAATTTTGCAGATTTTTGGGTGGTGTCCAGATTATTTGCCCACAATAGTTGGAAAGATATTATTGAGGTTTTGGAGTTTCATTTCAAATCAAAGATTTCGATCAATCCG
CTTTTTGCAGATAAAGCATTACTCAAATTTGAAGATGATAATGTTGTTAGTTCATTAGATATGGTTGGGAAATGGAAGGTCTTTGGTAATTTTCATCTTTTGCTTGAAAA
ATGGAACAAAGAGCGTCATAGTCATCCCTGTTTTATGGAGGGTTATGGTGGTTGGATTTCAATCAAGAATTTGCCTTTAGATTATTGGTGTCGACAATCTTTTGAAGCAA
TTGGAGAATATTTTGGGGGTTTAGTAAACATTTCAAGTAAAACTCTTAACATGACTATTGTTTCAGAAGCTAGAATTCAGGTTAAAACAAATTTATGTGGTTTTATGCCG
GCTACAATTGAACTTAAAGATAGTTTTAGAGGGAATGTTTATCTTCATTTTGGCGATGTGTCTACTCTGGATTCTCCGAAAATTATTCACCATAGTCTTGAGTTGAGCGA
TTTTTCCAACCCAATGGACCTTTTTCGAATCAAACAAGTAATGGAAGATGAAGGGTTTGATTCTCAAATTCAAAATCCAGATGTTGAACACTATGGAAAAATTCTGGAAA
TTCCTTCTGTATCAAGGGACCCTAAGGTAATTTACAAGTTTGTTCAATTACCTTGTGATGATAGTAATATTAAAGGGTTTTCTGTTTTGCCAAATATTTCTGAAGTTGAA
GAGTCAACGAGATTTAATTTTGATGATGCTTTATTATCAAGAAAGTCCTTTTCAGATTCCCCTGTAATTGGAAAGGCCTTTTCGAATATTGTAAAGGCCCGTGGTAATTT
TCAAGAACATGTAGTAATTGAGAAAGAGAAAGAGTTATTTAATGTTGAAGACCATGCAGTAATTGAGAAAGAGAAGGAGTTATTAAATGCCAATGGGCTGCATGTCTTTG
GTTTTCCTTCTGCCGAGAGTATATTTTCCAAGGAGCCTTTGCATGAAGTGGGGGAGGTTTTTAAAAAAAATAAGGAGGATGCTGTATTTATTGAAGAGATAGTAAATGAT
GGGGAGGTTCTTAAGAATAAGGAGGATGCTGCATTTATTGATGAGTTAGTAAATGATATTTTAACCCAAGTTTCAAATATGGCTGATAAGTCTTTAACTCAGGGGCAATC
TTTCAATTGTCCAGCAATTAATAACCCAAAGGCTTCTTCAAATGTGAACTCGGTTCAGGATATTTTCAATTCCGATTTAAATGAAAATTATTCTTTGTCTGAATACTTGG
ACTCCGGTATTCCACCTAGTGTTAAAGAAGTGCATGATAATTTCTGTAAATCTTATTCTAAATATTATGTGCGAAAGAAAGGGCCAAGTGTGGAAGGTGATATTTTAAAG
GTTAATGCTGATGTTTTGGAAGAAGTTGTCTCTAAGGTATTGGCTCCTCAAGAGTCTATTTTAAATCAGGATTCTTCAAAGGCAGCAGACAAGTCAAATGATTTTGCAAT
CAATAGTTGTAATATTGGGCCAGAAGGTGTGCTTTTCACAAGAGTTCTTTTTCCCTCTTCTAAAGATAGTTTGTCGGCAAATGCTTCATCACATTGTGGTAATGAAAATA
CAGATGATGAGTCAGAGGTTAGTATGAGCAGTGAAGAGATTGATTTTCCTCCGGAAAATCTTTTGAATGTTGATAGTTGTGATATCTCGGTAAATGATGATTTGAATTTG
CTCTTTACTTCTCCATCAAAATCTAATGCTTCAAGCAAGAAGATTGTGGATGACTTTAAGACTTCAATTCCAACTGCTTCAGATAATTTAGATACTTTTTCTGCTCTTAT
TAAAGCCAGTGGTCTACAGTTCAAGGAAATTCTATCGGGTGGAATGTTAATTATGTGGGATGAAAGTAGAGTCAATGTGGTTGAGGTGCGTACAATTTCAGATCATTTTC
CTATTCTTTTAGAAGCTGGAGGCTTTTCTTGGGGGCCTTCTCCTTTTCGTTTTTTGAATTCCTGGCTAAATCTGAGTGAATGTGTTGAGATTATGGAGAATTCTCTTGCT
GGAGATCGATCGTATGGATGGGCAGTTCAGTTTCGGAATTTTGGGATTGTCAACATAATTCTTGGTGTTTTGGACTCTTTGGTGGTTGGTTCATCTGATGATGCTCGTGT
TTGGTCTCTTGAAAATTCTGGACAGTTTACCGTTAGCTCTTTGTCTCATCAACTTGGTTCGAGTTTTCACATTCAATCGGATTTGTTTTGGGCCATTTGGAAATCTAAAA
GCCCTAAACGAATAAATATTTTGATGTGGATTATTTTGAATGGTAGTTTGAATACTTCTGATGTTCTTCAAAGAAAATTGCCGTTCTTCAGCTTTTTTCCTTCGATTTGT
CCTCTTTGTTTGAAGGCAGGAGACTCCGCATTGCATTTGTTCTTTGAGTGTGCTTATTCACAACTTTGTTGGTCAAAGTTCTTTGCTATTTTCAATATGCAGTGGGTTTT
TTCAAATTCCGTAAAAGAGAATGTGCTTCAACTGCTTATTGGTCCTTCTTTTTCTTCAAGACCGAGATTATTATGGATTAATGGTGTTAAAGCTTTGATATCAGAAATTT
GGTTGGAAAGAAATCAGAGGGTTTTTGAAGATAAAGCGTGGCATTCTTTAGCTCGTCTGGAATCAGCTTGCTTAAAGGCTTCTTCCTGGTGCACTCTTTCTAAATCTTTT
GTAGCTTTCTCTCCACAGGATATTTGTTTTAATTGGCATTCTTTTATTTTTCCCCTGTAA
Protein sequenceShow/hide protein sequence
MEVISRKIMNSFYCVWSEEECFFVEDVAFNKVISLSSSLLLWLERSLVEILCQPIQKFFRKQFRDAFGLIRLGKLRSSSGWFLRCVVSPFSGGRQFIHVPIGNSKMGCSL
FKELVVDSIRSLMVSNAPVASGEVPPMSFAETLKFPLKSLVQGSGAVGVSKNSSVLQKRKVSYWIRKNQEVLNVNFADFWVVSRLFAHNSWKDIIEVLEFHFKSKISINP
LFADKALLKFEDDNVVSSLDMVGKWKVFGNFHLLLEKWNKERHSHPCFMEGYGGWISIKNLPLDYWCRQSFEAIGEYFGGLVNISSKTLNMTIVSEARIQVKTNLCGFMP
ATIELKDSFRGNVYLHFGDVSTLDSPKIIHHSLELSDFSNPMDLFRIKQVMEDEGFDSQIQNPDVEHYGKILEIPSVSRDPKVIYKFVQLPCDDSNIKGFSVLPNISEVE
ESTRFNFDDALLSRKSFSDSPVIGKAFSNIVKARGNFQEHVVIEKEKELFNVEDHAVIEKEKELLNANGLHVFGFPSAESIFSKEPLHEVGEVFKKNKEDAVFIEEIVND
GEVLKNKEDAAFIDELVNDILTQVSNMADKSLTQGQSFNCPAINNPKASSNVNSVQDIFNSDLNENYSLSEYLDSGIPPSVKEVHDNFCKSYSKYYVRKKGPSVEGDILK
VNADVLEEVVSKVLAPQESILNQDSSKAADKSNDFAINSCNIGPEGVLFTRVLFPSSKDSLSANASSHCGNENTDDESEVSMSSEEIDFPPENLLNVDSCDISVNDDLNL
LFTSPSKSNASSKKIVDDFKTSIPTASDNLDTFSALIKASGLQFKEILSGGMLIMWDESRVNVVEVRTISDHFPILLEAGGFSWGPSPFRFLNSWLNLSECVEIMENSLA
GDRSYGWAVQFRNFGIVNIILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFHIQSDLFWAIWKSKSPKRINILMWIILNGSLNTSDVLQRKLPFFSFFPSIC
PLCLKAGDSALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKSF
VAFSPQDICFNWHSFIFPL