; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0001286 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0001286
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr04:20448142..20452957
RNA-Seq ExpressionPay0001286
SyntenyPay0001286
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068077.1 reverse transcriptase [Cucumis melo var. makuwa]1.1e-136100Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSI
        LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSI
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSI

Query:  SSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEE
        SSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEE
Subjt:  SSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEE

Query:  EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
Subjt:  EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

KAG7036245.1 hypothetical protein SDJN02_03047, partial [Cucurbita argyrosperma subsp. argyrosperma]4.2e-8569.09Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNP------ELEVDFQEPMENFPQETQILPD-----ETEAKT---EESKEAQIENR---
        LR RTPEEI  F P+EELEAYKIVFETYTF G+EQ  Y G+D+       E+EVD +E MENFP + +  P+     E EAKT   EE  E + E R   
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNP------ELEVDFQEPMENFPQETQILPD-----ETEAKT---EESKEAQIENR---

Query:  ---ENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKM
           ENEMMK+LRK+ EESSISSR+ESSPWSSPGSFS      +Y+LGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYE SE   LQK EK+
Subjt:  ---ENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKM

Query:  NGKLKKGKKIQKKTDDDDEEE-EDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH
        NGK KKGKKI+ K +D+DE+E ED EGQLCCLQALKFSAGKMNLGMG+PNL+KM+KALKGFGWL+R+GSRKR +H
Subjt:  NGKLKKGKKIQKKTDDDDEEE-EDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH

XP_004144391.1 uncharacterized protein LOC101214978 [Cucumis sativus]7.4e-13096.09Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSI
        LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPY GDDNPELEVDFQE MENFPQETQILPDETEAKTEESKEAQI NRENEMMKDLRK+ EESSI
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSI

Query:  SSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEE
        SSRTESSPWSSPGSFSSREYN+NYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGK  KGKKIQKKTDDDDEE
Subjt:  SSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEE

Query:  EEDGE-GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        EEDGE GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRK+LIHS
Subjt:  EEDGE-GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

XP_016902526.1 PREDICTED: uncharacterized protein LOC103499217 [Cucumis melo]6.0e-140100Show/hide
Query:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMA
        MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMA
Subjt:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMA

Query:  EESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
        EESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
Subjt:  EESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD

Query:  DDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        DDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
Subjt:  DDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

XP_038887002.1 stress response protein NST1 [Benincasa hispida]9.4e-10177.58Show/hide
Query:  LRTRTPEEIHGF-PPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEV--DFQEPMENFPQETQILPDE-----TEAKTEESKEAQIEN----------
        LR RTPEE  GF PP+EELEAYKIVFETYTF GSEQ PY  DD+PE+EV  DFQEPMENFP+E QIL +       E KTEE KEAQIEN          
Subjt:  LRTRTPEEIHGF-PPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEV--DFQEPMENFPQETQILPDE-----TEAKTEESKEAQIEN----------

Query:  --------RENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNL
                 ENEMMKDLRK+ EESSISSRTESSPWSSPGSFSSREYNS   LGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYE SESK L
Subjt:  --------RENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNL

Query:  QKKEKMNGKLKKGKKIQKKTDDDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRK-RLIH
        QK+ K+NGKLKKGKKIQKK +++DEEEEDGEGQLCCLQALKFSAGKMNLGMG+PNLLKMTKALKGFGWL+RNGSR+ RLIH
Subjt:  QKKEKMNGKLKKGKKIQKKTDDDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRK-RLIH

TrEMBL top hitse value%identityAlignment
A0A0A0L955 Uncharacterized protein1.7e-13295.79Show/hide
Query:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMA
        MTQ FLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPY GDDNPELEVDFQE MENFPQETQILPDETEAKTEESKEAQI NRENEMMKDLRK+ 
Subjt:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMA

Query:  EESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
        EESSISSRTESSPWSSPGSFSSREYN+NYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGK  KGKKIQKKTD
Subjt:  EESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD

Query:  DDDEEEEDGE-GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        DDDEEEEDGE GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRK+LIHS
Subjt:  DDDEEEEDGE-GQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

A0A1S4E2S0 uncharacterized protein LOC1034992172.9e-140100Show/hide
Query:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMA
        MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMA
Subjt:  MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMA

Query:  EESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
        EESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD
Subjt:  EESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTD

Query:  DDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        DDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
Subjt:  DDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

A0A5D3DQW2 Reverse transcriptase5.2e-137100Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSI
        LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSI
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSI

Query:  SSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEE
        SSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEE
Subjt:  SSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEE

Query:  EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
        EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS
Subjt:  EEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS

A0A6J1E4P5 uncharacterized protein LOC1110260373.2e-7061.9Show/hide
Query:  LRTRTPEEIHGF---PPIEELEAYKIVFETYTFG---GSEQIPYRGD--DNPELE--VDFQEPMENFPQETQILPDE--TEAKTEES-------KEAQIE
        LR RTP+E+  F   P  EELEAY IVFETYTFG    +E+ PY G   D PE+E  VD +EP+ NFP+  +ILP+      KTEE        KE + +
Subjt:  LRTRTPEEIHGF---PPIEELEAYKIVFETYTFG---GSEQIPYRGD--DNPELE--VDFQEPMENFPQETQILPDE--TEAKTEES-------KEAQIE

Query:  NRENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMN
        + ENE M   +   ++SSISSR+ESSPWSSPGSF  REY+S   LGSYGSMRKEKEWRRTLACKLFEERHN+EG+EGMDSLWETYE SESK  QK E   
Subjt:  NRENEMMKDLRKMAEESSISSRTESSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMN

Query:  GKLKKGKKIQKKTDDDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH
         K K  K  + K   ++E+E+  EGQLCCLQALKFSAGKMNLGMG+PNL+KM+KA KGFGWLNR+GSRK LIH
Subjt:  GKLKKGKKIQKKTDDDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH

A0A6J1HI04 uncharacterized protein LOC1114638234.4e-7265.91Show/hide
Query:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGD-DNPELEVDFQEPMENFPQETQILPD------ETEAKTEESKEAQIENRE--NEMMKDL
        LR RTP+E     P+EELEAYKIVFE YTF GSEQ PY       E+EVDFQEPME+FP++ + LP+      E EAKTEE +EA+ ENR+  +E+M DL
Subjt:  LRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGD-DNPELEVDFQEPMENFPQETQILPD------ETEAKTEESKEAQIENRE--NEMMKDL

Query:  RKMAEESSISSRTESS-PWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKI
        +    ESS SSR+ESS PWSSPGSF  R+Y+S   LGSYGSMRKEKEWRRTLACKLFEERH+SE TEGMDSLWETYE        KKEK N K       
Subjt:  RKMAEESSISSRTESS-PWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKI

Query:  QKKTDDDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH
         KK ++++EEEE+ EGQLCCLQALKFSAGKMNLGM +PNL+KMTKALKGFGWL+R GSRKRLIH
Subjt:  QKKTDDDDEEEEDGEGQLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25130.1 unknown protein2.4e-3036.68Show/hide
Query:  GFPPIEELEAYKIVFETYTFGGSEQIPYRGDD--------NPELEVDFQEPMENFPQETQILP------------DET---------EAKTEESKEAQIE
        GF  +EELEAYK+V E  +   + +     D+        + E  V      E   ++ +I P            +ET         E K +   +  ++
Subjt:  GFPPIEELEAYKIVFETYTFGGSEQIPYRGDD--------NPELEVDFQEPMENFPQETQILP------------DET---------EAKTEESKEAQIE

Query:  NRENEMMKDLRKMAEESSISSRTESSPWSSPGSF-----------SSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSE
        NRE    ++ +    +    S  ES       +F           +  E   N +L S+GSMRKEKEWRRTLACKLFEERHN++  +GMD LWETYE   
Subjt:  NRENEMMKDLRKMAEESSISSRTESSPWSSPGSF-----------SSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSE

Query:  SKNLQKKEKMNGKLKKGKKIQKKTDDDDE---EEEDGEG----QLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFG-WLNRNGSRKR
         K  Q +E+     KK K + K    + E   EEED +G    QLCCLQALKFS GKM+LG+ +PNLLK++KA KG G + N N   K+
Subjt:  SKNLQKKEKMNGKLKKGKKIQKKTDDDDE---EEEDGEG----QLCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFG-WLNRNGSRKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACAACCATTTTTGAGAACAAGAACACCAGAGGAGATCCATGGATTCCCCCCCATCGAAGAGCTTGAAGCTTATAAAATCGTCTTTGAAACTTACACTTTTGGTGG
CTCTGAACAAATCCCGTATAGGGGGGATGACAATCCAGAACTTGAAGTTGATTTTCAAGAACCCATGGAGAATTTCCCTCAGGAAACCCAAATTCTCCCGGATGAAACTG
AAGCAAAAACAGAGGAATCAAAAGAAGCCCAAATCGAAAACAGAGAGAATGAAATGATGAAAGATTTGAGGAAAATGGCAGAAGAATCATCAATATCTTCAAGAACAGAA
TCGAGTCCATGGAGTTCGCCAGGGAGTTTCAGTAGTAGAGAGTATAATAGTAATTATACATTAGGAAGTTATGGATCGATGAGGAAAGAGAAAGAATGGAGAAGAACACT
TGCTTGTAAGCTGTTTGAAGAACGGCATAATTCAGAAGGAACAGAAGGAATGGATTCATTATGGGAAACATATGAGAATAGTGAATCAAAGAACTTACAGAAGAAAGAAA
AAATGAATGGAAAATTGAAGAAAGGAAAGAAAATTCAAAAGAAAACCGATGATGACGATGAAGAAGAAGAAGATGGAGAAGGGCAACTTTGTTGTTTACAAGCATTGAAA
TTCTCAGCAGGGAAGATGAATTTGGGAATGGGAAAACCAAATCTTTTGAAAATGACTAAAGCTTTGAAGGGATTTGGATGGTTGAACAGAAATGGAAGTAGAAAGAGATT
GATCCATTCATGA
mRNA sequenceShow/hide mRNA sequence
ATTATGTAACATTCGCATTGGACAAATACAAATGGAAGAAAAATCGTTCTTTCTTGGGCGATGGGAACAATATCTTAACATTGTCAACCTTGATGGGCCCATCCTCACAA
AAGCATCCGAAGCACCATTATTGTCATGACACAACCATTTTTGAGAACAAGAACACCAGAGGAGATCCATGGATTCCCCCCCATCGAAGAGCTTGAAGCTTATAAAATCG
TCTTTGAAACTTACACTTTTGGTGGCTCTGAACAAATCCCGTATAGGGGGGATGACAATCCAGAACTTGAAGTTGATTTTCAAGAACCCATGGAGAATTTCCCTCAGGAA
ACCCAAATTCTCCCGGATGAAACTGAAGCAAAAACAGAGGAATCAAAAGAAGCCCAAATCGAAAACAGAGAGAATGAAATGATGAAAGATTTGAGGAAAATGGCAGAAGA
ATCATCAATATCTTCAAGAACAGAATCGAGTCCATGGAGTTCGCCAGGGAGTTTCAGTAGTAGAGAGTATAATAGTAATTATACATTAGGAAGTTATGGATCGATGAGGA
AAGAGAAAGAATGGAGAAGAACACTTGCTTGTAAGCTGTTTGAAGAACGGCATAATTCAGAAGGAACAGAAGGAATGGATTCATTATGGGAAACATATGAGAATAGTGAA
TCAAAGAACTTACAGAAGAAAGAAAAAATGAATGGAAAATTGAAGAAAGGAAAGAAAATTCAAAAGAAAACCGATGATGACGATGAAGAAGAAGAAGATGGAGAAGGGCA
ACTTTGTTGTTTACAAGCATTGAAATTCTCAGCAGGGAAGATGAATTTGGGAATGGGAAAACCAAATCTTTTGAAAATGACTAAAGCTTTGAAGGGATTTGGATGGTTGA
ACAGAAATGGAAGTAGAAAGAGATTGATCCATTCATGAATTGTTGCTAATTTATTTTACATTTCTTCTATGTTCTTCATCATCCTAATTCTCTATTTTTTTCTTCTCTTT
GTTGGTCTGGTTTTTTTTTTTCTCTTGGATTCAACTGTTCAAATAGTGAAAGTAAATATTTTGGTTATGGTTGCTAATTGTAGATCTATCTAATCTTAACCCCATTTCAC
CTCTTTTAATAATCTTTTTTTACCTTTATGAGTCTTTGTTTTTGTGTTTCTCTTTTATTGCTTTGGAATCGC
Protein sequenceShow/hide protein sequence
MTQPFLRTRTPEEIHGFPPIEELEAYKIVFETYTFGGSEQIPYRGDDNPELEVDFQEPMENFPQETQILPDETEAKTEESKEAQIENRENEMMKDLRKMAEESSISSRTE
SSPWSSPGSFSSREYNSNYTLGSYGSMRKEKEWRRTLACKLFEERHNSEGTEGMDSLWETYENSESKNLQKKEKMNGKLKKGKKIQKKTDDDDEEEEDGEGQLCCLQALK
FSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSRKRLIHS