; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0103231 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0103231
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:20268683..20273599
RNA-Seq ExpressionCmc04g0103231
SyntenyCmc04g0103231
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068077.1 reverse transcriptase [Cucumis melo var. makuwa]4.5e-13598.82Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMAEESSI
        LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEE KEAQIENRENEMMKDLRKMAEESSI
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMAEESSI

Query:  SSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDDE
        SSRTESSPWSSPGSFSSREYNS+YTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDD+E
Subjt:  SSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDDE

Query:  EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
Subjt:  EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

KAG7036245.1 hypothetical protein SDJN02_03047, partial [Cucurbita argyrosperma subsp. argyrosperma]7.2e-8569.09Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNP------ELEVDFQEPMENFPQETQILPD-----ETEAKT---EEPKEAQIENR---
        LR RTPEEI  F P+EELEAYKIVFETYTF G+EQ  Y G+D+       E+EVD +E MENFP + +  P+     E EAKT   EE  E + E R   
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNP------ELEVDFQEPMENFPQETQILPD-----ETEAKT---EEPKEAQIENR---

Query:  ---ENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKM
           ENEMMK+LRK+ EESSISSR+ESSPWSSPGSFS       Y+LGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYE SE   LQK EK+
Subjt:  ---ENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKM

Query:  NGKLKKGKKIQ-KKTDDDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH
        NGK KKGKKI+ K+ D+D+DE ED EGQLCCLQALKFSAGKMNLGMG+PNL+KM+KALKGFGWL+R+GSRKR +H
Subjt:  NGKLKKGKKIQ-KKTDDDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH

XP_004144391.1 uncharacterized protein LOC101214978 [Cucumis sativus]3.1e-12894.92Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMAEESSI
        LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPY GDDNPELEVDFQE MENFPQETQILPDETEAKTEE KEAQI NRENEMMKDLRK+ EESSI
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMAEESSI

Query:  SSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDDE
        SSRTESSPWSSPGSFSSREYN++YTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGK  KGKKIQKKTDDDD+E
Subjt:  SSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDDE

Query:  EEDGE-GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        EEDGE GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRK+LIHS
Subjt:  EEDGE-GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

XP_016902526.1 PREDICTED: uncharacterized protein LOC103499217 [Cucumis melo]1.9e-13898.85Show/hide
Query:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMA
        MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEE KEAQIENRENEMMKDLRKMA
Subjt:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMA

Query:  EESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
        EESSISSRTESSPWSSPGSFSSREYNS+YTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
Subjt:  EESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD

Query:  DDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        DDD+EEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
Subjt:  DDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

XP_038887002.1 stress response protein NST1 [Benincasa hispida]2.7e-10077.22Show/hide
Query:  LRTRTPEEIHGF-PPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEV--DFQEPMENFPQETQILPDE-----TEAKTEEPKEAQIEN----------
        LR RTPEE  GF PP+EELEAYKIVFETYTF GSEQ PY  DD+PE+EV  DFQEPMENFP+E QIL +       E KTEE KEAQIEN          
Subjt:  LRTRTPEEIHGF-PPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEV--DFQEPMENFPQETQILPDE-----TEAKTEEPKEAQIEN----------

Query:  --------RENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNL
                 ENEMMKDLRK+ EESSISSRTESSPWSSPGSFSSREYNS   LGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYE SESK L
Subjt:  --------RENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNL

Query:  QKKEKMNGKLKKGKKIQKKTDDDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRK-RLIH
        QK+ K+NGKLKKGKKIQKK +++D+EEEDGEGQLCCLQALKFSAGKMNLGMG+PNLLKMTKALKGFGWL+RNGSR+ RLIH
Subjt:  QKKEKMNGKLKKGKKIQKKTDDDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRK-RLIH

TrEMBL top hitse value%identityAlignment
A0A0A0L955 Uncharacterized protein5.5e-13194.64Show/hide
Query:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMA
        MTQ FLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPY GDDNPELEVDFQE MENFPQETQILPDETEAKTEE KEAQI NRENEMMKDLRK+ 
Subjt:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMA

Query:  EESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
        EESSISSRTESSPWSSPGSFSSREYN++YTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGK  KGKKIQKKTD
Subjt:  EESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD

Query:  DDDDEEEDGE-GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        DDD+EEEDGE GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRK+LIHS
Subjt:  DDDDEEEDGE-GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

A0A1S4E2S0 uncharacterized protein LOC1034992179.4e-13998.85Show/hide
Query:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMA
        MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEE KEAQIENRENEMMKDLRKMA
Subjt:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMA

Query:  EESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
        EESSISSRTESSPWSSPGSFSSREYNS+YTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
Subjt:  EESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD

Query:  DDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        DDD+EEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
Subjt:  DDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

A0A5D3DQW2 Reverse transcriptase2.2e-13598.82Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMAEESSI
        LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEE KEAQIENRENEMMKDLRKMAEESSI
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMAEESSI

Query:  SSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDDE
        SSRTESSPWSSPGSFSSREYNS+YTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDD+E
Subjt:  SSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDDE

Query:  EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
Subjt:  EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

A0A6J1E4P5 uncharacterized protein LOC1110260372.4e-7061.87Show/hide
Query:  LRTRTPEEIHGF---PPIEELEAYKIVFETYTFG---GSEQIPYRGD--DNPELE--VDFQEPMENFPQETQILPDE--TEAKTEEP-------KEAQIE
        LR RTP+E+  F   P  EELEAY IVFETYTFG    +E+ PY G   D PE+E  VD +EP+ NFP+  +ILP+      KTEE        KE + +
Subjt:  LRTRTPEEIHGF---PPIEELEAYKIVFETYTFG---GSEQIPYRGD--DNPELE--VDFQEPMENFPQETQILPDE--TEAKTEEP-------KEAQIE

Query:  NRENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMN
        + ENE M   +   ++SSISSR+ESSPWSSPGSF  REY+S   LGSYGSMRKEKEWRRTLACKLFEERHN+EG+EGMDSLWETYE SESK  QK E   
Subjt:  NRENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMN

Query:  -----GKLKKGKKIQKKTDDDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH
              K K+GKK Q++ D+D D     EGQLCCLQALKFSAGKMNLGMG+PNL+KM+KA KGFGWLNR+GSRK LIH
Subjt:  -----GKLKKGKKIQKKTDDDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH

A0A6J1HI04 uncharacterized protein LOC1114638231.3e-7165.53Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGD-DNPELEVDFQEPMENFPQETQILPD------ETEAKTEEPKEAQIENRE--NEMMKDL
        LR RTP+E     P+EELEAYKIVFE YTF GSEQ PY       E+EVDFQEPME+FP++ + LP+      E EAKTEE +EA+ ENR+  +E+M DL
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGD-DNPELEVDFQEPMENFPQETQILPD------ETEAKTEEPKEAQIENRE--NEMMKDL

Query:  RKMAEESSISSRTESS-PWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKI
        +    ESS SSR+ESS PWSSPGSF  R+Y+S   LGSYGSMRKEKEWRRTLACKLFEERH+SE TEGMDSLWETYE        KKEK N K       
Subjt:  RKMAEESSISSRTESS-PWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKI

Query:  QKKTDDDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH
         KK +++++EEE+ EGQLCCLQALKFSAGKMNLGM +PNL+KMTKALKGFGWL+R GSRKRLIH
Subjt:  QKKTDDDDDEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25130.1 unknown protein1.6e-2936.21Show/hide
Query:  GFPPIEELEAYKIVFETYTFGGSEQIPYRGDD--------NPELEVDFQEPMENFPQETQILP------------DET-EAKTEEPKEAQIENRENEMMK
        GF  +EELEAYK+V E  +   + +     D+        + E  V      E   ++ +I P            +ET + + EE +E +++++ + ++ 
Subjt:  GFPPIEELEAYKIVFETYTFGGSEQIPYRGDD--------NPELEVDFQEPMENFPQETQILP------------DET-EAKTEEPKEAQIENRENEMMK

Query:  DLRKMAEESSISSRTESSPWSSPGSF-------------------SSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSE
        +  +  +E S + + +    S+  S+                   +  E   + +L S+GSMRKEKEWRRTLACKLFEERHN++  +GMD LWETYE   
Subjt:  DLRKMAEESSISSRTESSPWSSPGSF-------------------SSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSE

Query:  SKNLQKKEKMNGKLKKGKKIQKKTDDDDD----EEEDGEG----QLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFG-WLNRNGSRKR
         K  Q +E+   KLKK  K   KT   +     EEED +G    QLCCLQALKFS GKM+LG+ +PNLLK++KA KG G + N N   K+
Subjt:  SKNLQKKEKMNGKLKKGKKIQKKTDDDDD----EEEDGEG----QLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFG-WLNRNGSRKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACAACCATTTTTGAGAACAAGAACACCAGAGGAGATCCATGGATTCCCCCCCATCGAAGAGCTTGAAGCTTATAAAATCGTCTTTGAAACTTACACTTTTGGTGG
CTCTGAACAAATCCCGTATAGGGGGGATGACAATCCAGAACTTGAAGTTGATTTTCAAGAACCCATGGAGAATTTCCCTCAGGAAACCCAAATTCTCCCGGATGAAACTG
AAGCAAAAACAGAGGAACCAAAAGAAGCCCAAATCGAAAACAGAGAGAATGAAATGATGAAAGATTTGAGGAAAATGGCAGAAGAATCATCAATATCTTCAAGAACAGAA
TCGAGTCCATGGAGTTCGCCAGGGAGTTTCAGTAGTAGAGAGTATAATAGTAGTTATACATTAGGAAGTTATGGATCGATGAGGAAAGAGAAAGAATGGCGAAGAACACT
TGCTTGTAAGCTGTTTGAAGAACGGCATAATTCAGAAGGAACAGAAGGAATGGATTCATTATGGGAAACATATGAGAATAGTGAATCAAAGAACTTACAGAAGAAAGAGA
AAATGAATGGAAAATTGAAGAAAGGAAAGAAAATTCAAAAGAAAACCGATGATGACGATGACGAAGAAGAAGATGGAGAAGGGCAACTTTGTTGTTTACAAGCATTGAAA
TTCTCAGCAGGGAAGATGAATTTGGGAATGGGAAAACCAAATCTTTTGAAAATGACTAAAGCTTTGAAGGGATTTGGATGGTTGAACAGAAATGGAAGTAGAAAGAGATT
GATCCATTCATGA
mRNA sequenceShow/hide mRNA sequence
GACAAGCAGGAATATAATTATGTAACATTCGCATTGGACAAATACAAATGGAAGAAAAATCGTTCTTTCTTGGGCGATGGGAACAATATCTTAACATTGTCAACCTTGAT
GGACCCATCCTCACAAAAGCATCCGAAGCACCATTATTGTCATGACACAACCATTTTTGAGAACAAGAACACCAGAGGAGATCCATGGATTCCCCCCCATCGAAGAGCTT
GAAGCTTATAAAATCGTCTTTGAAACTTACACTTTTGGTGGCTCTGAACAAATCCCGTATAGGGGGGATGACAATCCAGAACTTGAAGTTGATTTTCAAGAACCCATGGA
GAATTTCCCTCAGGAAACCCAAATTCTCCCGGATGAAACTGAAGCAAAAACAGAGGAACCAAAAGAAGCCCAAATCGAAAACAGAGAGAATGAAATGATGAAAGATTTGA
GGAAAATGGCAGAAGAATCATCAATATCTTCAAGAACAGAATCGAGTCCATGGAGTTCGCCAGGGAGTTTCAGTAGTAGAGAGTATAATAGTAGTTATACATTAGGAAGT
TATGGATCGATGAGGAAAGAGAAAGAATGGCGAAGAACACTTGCTTGTAAGCTGTTTGAAGAACGGCATAATTCAGAAGGAACAGAAGGAATGGATTCATTATGGGAAAC
ATATGAGAATAGTGAATCAAAGAACTTACAGAAGAAAGAGAAAATGAATGGAAAATTGAAGAAAGGAAAGAAAATTCAAAAGAAAACCGATGATGACGATGACGAAGAAG
AAGATGGAGAAGGGCAACTTTGTTGTTTACAAGCATTGAAATTCTCAGCAGGGAAGATGAATTTGGGAATGGGAAAACCAAATCTTTTGAAAATGACTAAAGCTTTGAAG
GGATTTGGATGGTTGAACAGAAATGGAAGTAGAAAGAGATTGATCCATTCATGAATTGTTGCTAATTTATTTTACATTTCTTCTATGTTCTTCATCATCCTAATTCTCTA
TTTTTTTCTTCTCTTTGTTGGTCTGGTTTTTTTTTTCTCTTGGATTCAACTGTTCAAATAGTGAAAGTAAATATTTTGGTTATGGTTGCTAATTGTAGATCTATCTAATC
TTAACCCATTTCACCTCTTTTAATAATCTTTTTTTACCTTTATGAGTCTTTGTTTTTGTGTTTCTCTTTTGTTGCTTTGGAATCGCTCATCATATGTGATGGTAGAGTGA
TGTTCAATCTACATTTCTTTTTTCCTTAATCTCTGTCTTTTCATTCTTAGATAAGAATATATGGTT
Protein sequenceShow/hide protein sequence
MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEEPKEAQIENRENEMMKDLRKMAEESSISSRTE
SSPWSSPGSFSSREYNSSYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDDEEEDGEGQLCCLQALK
FSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS