; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019837 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019837
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold5:31782207..31785571
RNA-Seq ExpressionSpg019837
SyntenySpg019837
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU38731.1 hypothetical protein TSUD_208420 [Trifolium subterraneum]1.7e-2423.6Show/hide
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        +LS+ S LPW I GDFN+I +  EK+G   R Q  ++ F E +   GL D+ + G  FTW K       + EKLDR + N+    M     V  L    S
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLAN-EGGRW-----------------------KED
        +H  +L  L+       +   +  K E +W   P     +K  W+ + GN + +   K+  C   L +  G +W                       K D
Subjt:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLAN-EGGRW-----------------------KED

Query:  LILAEFCGVDSID-------------------------ILNTPTGGRNYKDEIIWKCDPKGMFSFLEA--------------------------KAIPKA
         I ++   +   D                         IL+TP       D+I W+ +  G+++   A                          +  PK 
Subjt:  LILAEFCGVDSID-------------------------ILNTPTGGRNYKDEIIWKCDPKGMFSFLEA--------------------------KAIPKA

Query:  KISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAIL--ILWQ
        K   WRI ++ +PTRA +  +G+     CV C   +E ++H  +SC+ +   W    +  S +   N+      + ++ +   +++ E   A+   ++W 
Subjt:  KISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAIL--ILWQ

Query:  IWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLESQL---SLVSWIPPPLGSWKINVDASWSVALSAGGI
        IW  RN ++          + R +   R+  L +   N  E   R  +Q        W  P  G+WK NVDAS+S + +  GI
Subjt:  IWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLESQL---SLVSWIPPPLGSWKINVDASWSVALSAGGI

KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora]5.0e-3224.9Show/hide
Query:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN
        L+  S LPWL  GDFNEI   SEK+GG A+  R M  F +    CG  D+GF G  FTW  G +    I+E+LDR     + +       VNH+ +  S+
Subjt:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN

Query:  HRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLK-------------------IQRCLVK-----------------
        H  +         +    R+R  + EE+W      +++I   W+   G D+ S  L                      RC++K                 
Subjt:  HRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLK-------------------IQRCLVK-----------------

Query:  ----------------------------------LANEGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMFSFLEAKAI---------
                                          + +E   WK DLI + F     I IL+ P   RN +D+ IW  +    +S   A  I         
Subjt:  ----------------------------------LANEGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMFSFLEAKAI---------

Query:  -------------------PKAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWI-----YFIILMSSLFRLNMEAWSP
                           PK +I  WR+  +++PT  N+ K+G+   + C  CR   E T H    C  A+ VW      +F IL S+           
Subjt:  -------------------PKAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWI-----YFIILMSSLFRLNMEAWSP

Query:  SDYWDWLSKNVEIEELEL---AILILWQIWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLESQLSLVSWIPPPLGSWKINVDAS
          + DWL   +E E +E+      + W IW+ RN  + NA +  +   +  + S  +E       +L    P + S +S   W  P     K+NVDA+
Subjt:  SDYWDWLSKNVEIEELEL---AILILWQIWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLESQLSLVSWIPPPLGSWKINVDAS

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]1.4e-3125.9Show/hide
Query:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN
        LS   S+PW+  GDFNEI+   EK G   +   +M  F E +  C L  LGF G+ FTW     G   +RE+LDR +  +D   +     V HL  H S+
Subjt:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN

Query:  HRIILAQLQFQGDARSNSR-RRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLANEGGRWKEDLILAEFCGVDSIDILNTPTGGRNY
        H  +L  L F  +A   +R +R  + E  W+  P    II   W       SA    +  +  + +  +   W   L++  F   ++  I + P   R  
Subjt:  HRIILAQLQFQGDARSNSR-RRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLANEGGRWKEDLILAEFCGVDSIDILNTPTGGRNY

Query:  KDEIIWKCDPKGMFSFLEAKAI--------------------------------------PKAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEE
         D+ +W    KG FS   A  +                                      PK KI  W++  + +P RAN+ K+ I   +VC  C    E
Subjt:  KDEIIWKCDPKGMFSFLEAKAI--------------------------------------PKAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEE

Query:  TTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDW---LSKNVEIEELELAILILWQIWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQS
        T  H + +C  A++VW+       S   L  +A S      W   + K+   E L    +I W IW HRN+ + + +     + +     +R+  L +  
Subjt:  TTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDW---LSKNVEIEELELAILILWQIWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQS

Query:  SNLEEPLPRLESQLSLVSWIPPPLGSWKINVDASWSVALSAGGI
         N  +     ES  +  SW+ PP   +K+N+D +  +   + G+
Subjt:  SNLEEPLPRLESQLSLVSWIPPPLGSWKINVDASWSVALSAGGI

MBA0733287.1 hypothetical protein [Gossypium gossypioides]1.7e-2425.52Show/hide
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        RL+    +PWL+  DFNEI    EK+GG  R +R+M+ F + ++ C L D+GF G  FTW +G+   + IRE+LDR + N   + M   + V HL +  S
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGN--------------------DSASFHLKIQRCLVKLA-----NEGGRWKE
        +H  +L     +G    N+     K E  WL       ++K  W++  G+                    D    +    R  +KL      +   +W  
Subjt:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGN--------------------DSASFHLKIQRCLVKLA-----NEGGRWKE

Query:  DLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMFS------------------FLEAKA-----------IP-KAKISAWRIIQDSIPTRANISK
         LI   F       IL  P     + D   WK +  G FS                   L+A+            +P K   + WRI  D IP   N+  
Subjt:  DLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMFS------------------FLEAKA-----------IP-KAKISAWRIIQDSIPTRANISK

Query:  KGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVH
        + + SN  C  C +  E + H    C    +VW     L++  + +N  + +  ++  W+ K    ++       LW IW  RN+++H
Subjt:  KGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVH

XP_023921269.1 uncharacterized protein LOC112032742 [Quercus suber]5.7e-2827.27Show/hide
Query:  MVRLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYH
        M  LSHRS LPW+  GD+NEI +  EKEGG  R++ QM  F E++  C LRDLG+ G  +TW +      W+R++LDR L +   V       + +    
Subjt:  MVRLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYH

Query:  KSNHRI-ILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLANEGGRWKEDLILAEFCGVDSIDILNTPTGG
         S+H I +L +  +Q   R     +    E  WL+                                  A     W  D +   F   D   IL  P   
Subjt:  KSNHRI-ILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLANEGGRWKEDLILAEFCGVDSIDILNTPTGG

Query:  RNYKDEIIWKCDPKGMFSFLEA--------------------------KAI------PKAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTS
         +  D ++W  +  G F+   A                          KAI       K K+ AW+  +D + ++ N++K+ I  + VC FC    ET  
Subjt:  RNYKDEIIWKCDPKGMFSFLEA--------------------------KAI------PKAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTS

Query:  HAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEE-----LELAILILWQIWSHRNKIVHNAINSDLNSIIR
        H +W C  AK+VW       +S F L  E  +   + D L   +  E      LEL I   W IW +RN++          SI+R
Subjt:  HAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEE-----LELAILILWQIWSHRNKIVHNAINSDLNSIIR

TrEMBL top hitse value%identityAlignment
A0A2N9HE04 Reverse transcriptase domain-containing protein6.6e-2232.03Show/hide
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        +L+   SLPWL  GDFNEI   +EK G R R  R+M  F E++  C   DLG+ G  FTW       ++++E+LDR +       + N + V HL   KS
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHL--KIQRCLVKLANEGGRWKEDL-------ILAEFCGVDSIDIL
        +H  IL +   Q  +++ ++RR  + EE W   P    +I+  W+  +G  S  F L  KI+RC + LA    +W + +       I A F   ++++ L
Subjt:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHL--KIQRCLVKLANEGGRWKEDL-------ILAEFCGVDSIDIL

Query:  NTPTGGRN---------------YKDEIIWK
            GG+N                 DEI WK
Subjt:  NTPTGGRN---------------YKDEIIWK

A0A2N9HE04 Reverse transcriptase domain-containing protein2.2e-0929.75Show/hide
Query:  KAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQ
        K K   WR   +++PT+ N+ K+ I +N +C FC    ETTSH +W+C  A  VW     +   L +  +   +  +    L   ++ EE+E   ++ W 
Subjt:  KAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQ

Query:  IWSHRNKIVHNAINSDLNSII
        +W+ RN+ +H  + S L  I+
Subjt:  IWSHRNKIVHNAINSDLNSII

A0A2Z6N4T0 Uncharacterized protein8.3e-2523.6Show/hide
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        +LS+ S LPW I GDFN+I +  EK+G   R Q  ++ F E +   GL D+ + G  FTW K       + EKLDR + N+    M     V  L    S
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLAN-EGGRW-----------------------KED
        +H  +L  L+       +   +  K E +W   P     +K  W+ + GN + +   K+  C   L +  G +W                       K D
Subjt:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLAN-EGGRW-----------------------KED

Query:  LILAEFCGVDSID-------------------------ILNTPTGGRNYKDEIIWKCDPKGMFSFLEA--------------------------KAIPKA
         I ++   +   D                         IL+TP       D+I W+ +  G+++   A                          +  PK 
Subjt:  LILAEFCGVDSID-------------------------ILNTPTGGRNYKDEIIWKCDPKGMFSFLEA--------------------------KAIPKA

Query:  KISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAIL--ILWQ
        K   WRI ++ +PTRA +  +G+     CV C   +E ++H  +SC+ +   W    +  S +   N+      + ++ +   +++ E   A+   ++W 
Subjt:  KISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAIL--ILWQ

Query:  IWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLESQL---SLVSWIPPPLGSWKINVDASWSVALSAGGI
        IW  RN ++          + R +   R+  L +   N  E   R  +Q        W  P  G+WK NVDAS+S + +  GI
Subjt:  IWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLESQL---SLVSWIPPPLGSWKINVDASWSVALSAGGI

A0A6J1DRA0 uncharacterized protein LOC1110224231.3e-2241.67Show/hide
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        RL     LPW++GGDFNEI   SEK  G  R Q  M  F + ++ CGL D GF GDIFTW  G K    I E+LDRFL N  + Q+   L + HL +  S
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSR--RRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKL
        +HR ILA+    G+A    R  RRP + EE W  F   + I++  W          F  KI  CL +L
Subjt:  NHRIILAQLQFQGDARSNSR--RRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKL

A0A7J9BAA2 Uncharacterized protein (Fragment)8.3e-2525.52Show/hide
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        RL+    +PWL+  DFNEI    EK+GG  R +R+M+ F + ++ C L D+GF G  FTW +G+   + IRE+LDR + N   + M   + V HL +  S
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGN--------------------DSASFHLKIQRCLVKLA-----NEGGRWKE
        +H  +L     +G    N+     K E  WL       ++K  W++  G+                    D    +    R  +KL      +   +W  
Subjt:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGN--------------------DSASFHLKIQRCLVKLA-----NEGGRWKE

Query:  DLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMFS------------------FLEAKA-----------IP-KAKISAWRIIQDSIPTRANISK
         LI   F       IL  P     + D   WK +  G FS                   L+A+            +P K   + WRI  D IP   N+  
Subjt:  DLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMFS------------------FLEAKA-----------IP-KAKISAWRIIQDSIPTRANISK

Query:  KGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVH
        + + SN  C  C +  E + H    C    +VW     L++  + +N  + +  ++  W+ K    ++       LW IW  RN+++H
Subjt:  KGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVH

A0A803N338 Uncharacterized protein3.8e-2236.21Show/hide
Query:  MVRLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYH
        MV L  +S+LP ++ GD NEI + SEKEGG  R +R MD F   +++C LRDLGF G IFTW +GS  ++++ E+LDRFL ++  V +       +   +
Subjt:  MVRLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYH

Query:  KSNHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLANEGGR
         S+H  I+  ++ + D  +N   +  + E  WL +     I+   W A L  D    + KI RC V+L++  G+
Subjt:  KSNHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLANEGGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTAGGCTGAGTCACAGATCATCTCTCCCTTGGCTTATAGGGGGAGACTTCAATGAAATATCGAACCTCTCTGAGAAGGAGGGAGGAAGAGCTCGAGTGCAACGCCA
AATGGATTTGTTCAATGAGATGATGGAAAGCTGCGGTTTAAGGGATTTGGGTTTCTCGGGTGACATATTTACTTGGAGGAAAGGTAGCAAGGGTAGTAGTTGGATCAGGG
AGAAGCTAGACAGGTTCCTAGGCAATAATGATATGGTGCAAATGTGTAACAGATTGGGGGTCAATCACTTAGGTTATCACAAATCTAATCATAGGATCATTTTAGCCCAA
CTCCAGTTTCAAGGCGATGCCAGATCAAACTCCCGTAGAAGGCCTTTAAAGCTTGAAGAATCTTGGTTAAAATTCCCTGTAAGCAGGAATATCATTAAAGACTGCTGGAA
GGCTTTCCTTGGAAATGATTCAGCTTCATTCCATCTTAAGATTCAAAGGTGCTTAGTTAAGCTTGCAAATGAGGGGGGTAGATGGAAGGAGGATTTGATCCTAGCCGAAT
TCTGTGGGGTTGACTCAATTGATATCTTAAACACTCCAACAGGGGGAAGAAATTACAAGGATGAGATCATATGGAAGTGTGACCCAAAAGGAATGTTCTCGTTTCTGGAA
GCTAAGGCAATTCCCAAGGCTAAGATCAGTGCTTGGAGAATCATCCAAGACTCCATTCCTACTCGAGCTAATATAAGTAAAAAAGGGATCGACTCTAATCATGTTTGCGT
TTTTTGCAGGACTTGTGAAGAAACTACATCGCACGCTATGTGGAGCTGTAAGTTAGCCAAGAAAGTGTGGATTTATTTCATTATTCTTATGTCCTCCTTGTTTCGTTTGA
ATATGGAAGCTTGGAGCCCCTCGGACTATTGGGATTGGTTGTCTAAGAACGTGGAGATTGAGGAGTTGGAGTTAGCTATCCTAATTCTTTGGCAAATTTGGTCTCACCGA
AACAAGATTGTTCACAACGCAATCAATTCAGATCTCAATTCCATCATCAGAGCTATCGAGTCTAGGAGATCTGAAGGTCTTACCTCACAATCCTCTAATCTAGAGGAGCC
GTTGCCGAGATTGGAGAGCCAGCTGAGTCTGGTGTCGTGGATTCCCCCGCCGTTGGGTTCGTGGAAGATCAATGTGGACGCGTCTTGGAGCGTAGCCCTCTCCGCTGGAG
GAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAGGCTGAGTCACAGATCATCTCTCCCTTGGCTTATAGGGGGAGACTTCAATGAAATATCGAACCTCTCTGAGAAGGAGGGAGGAAGAGCTCGAGTGCAACGCCA
AATGGATTTGTTCAATGAGATGATGGAAAGCTGCGGTTTAAGGGATTTGGGTTTCTCGGGTGACATATTTACTTGGAGGAAAGGTAGCAAGGGTAGTAGTTGGATCAGGG
AGAAGCTAGACAGGTTCCTAGGCAATAATGATATGGTGCAAATGTGTAACAGATTGGGGGTCAATCACTTAGGTTATCACAAATCTAATCATAGGATCATTTTAGCCCAA
CTCCAGTTTCAAGGCGATGCCAGATCAAACTCCCGTAGAAGGCCTTTAAAGCTTGAAGAATCTTGGTTAAAATTCCCTGTAAGCAGGAATATCATTAAAGACTGCTGGAA
GGCTTTCCTTGGAAATGATTCAGCTTCATTCCATCTTAAGATTCAAAGGTGCTTAGTTAAGCTTGCAAATGAGGGGGGTAGATGGAAGGAGGATTTGATCCTAGCCGAAT
TCTGTGGGGTTGACTCAATTGATATCTTAAACACTCCAACAGGGGGAAGAAATTACAAGGATGAGATCATATGGAAGTGTGACCCAAAAGGAATGTTCTCGTTTCTGGAA
GCTAAGGCAATTCCCAAGGCTAAGATCAGTGCTTGGAGAATCATCCAAGACTCCATTCCTACTCGAGCTAATATAAGTAAAAAAGGGATCGACTCTAATCATGTTTGCGT
TTTTTGCAGGACTTGTGAAGAAACTACATCGCACGCTATGTGGAGCTGTAAGTTAGCCAAGAAAGTGTGGATTTATTTCATTATTCTTATGTCCTCCTTGTTTCGTTTGA
ATATGGAAGCTTGGAGCCCCTCGGACTATTGGGATTGGTTGTCTAAGAACGTGGAGATTGAGGAGTTGGAGTTAGCTATCCTAATTCTTTGGCAAATTTGGTCTCACCGA
AACAAGATTGTTCACAACGCAATCAATTCAGATCTCAATTCCATCATCAGAGCTATCGAGTCTAGGAGATCTGAAGGTCTTACCTCACAATCCTCTAATCTAGAGGAGCC
GTTGCCGAGATTGGAGAGCCAGCTGAGTCTGGTGTCGTGGATTCCCCCGCCGTTGGGTTCGTGGAAGATCAATGTGGACGCGTCTTGGAGCGTAGCCCTCTCCGCTGGAG
GAATTTGA
Protein sequenceShow/hide protein sequence
MVRLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSNHRIILAQ
LQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLANEGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMFSFLE
AKAIPKAKISAWRIIQDSIPTRANISKKGIDSNHVCVFCRTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHR
NKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLESQLSLVSWIPPPLGSWKINVDASWSVALSAGGI