CuGenDBv2

Pay0006782 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0006782
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag protease polyprotein
Genome location: chr10:13223887..13226684
RNA-Seq Expression: Pay0006782
Synteny: Pay0006782
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0004518 - nuclease activity (molecular function)
GO:0005488 - binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
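
The genome location above is written as `chr:start..end`. The following is a minimal sketch of how such a string could be parsed; the names `Location` and `parse_location` are illustrative, and the 1-based inclusive coordinate convention is an assumption, since the record does not state it.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based inclusive coordinates (an assumption, see lead-in).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into a Location."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr10:13223887..13226684")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span covers the whole gene region on chr10, so it is expected to be longer than the CDS alone.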


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025469.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 5.0e-87; % identity: 58.89
Query:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG--------QTAQQDLPDQLSVEAKHLRD
        M PRR ARRG + GRG G        QP  Q  +P AP+THADL  +EQRF D + ++  + Q A  A A            A Q +PDQLS EAKHLRD
Subjt:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG--------QTAQQDLPDQLSVEAKHLRD

Query:  FRIYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLK
        FR Y+P TF+GSL+DP   +LWLSS+ETIFR+M+CP+DQKVQCAVF+L +R   WW++ ERMLGG+V+QITW QFKESFYAKF  +SLRDAKRQEF+NL+
Subjt:  FRIYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLK

Query:  QGQMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR
        QG MTVE+YD EFDMLS FAPE++ TEAARA  FV GLR D+QGFVRA +PAT  +AL LA+DLS+Q+  +  K + +G   GQKR+
Subjt:  QGQMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR

KAA0031931.1 pol protein [Cucumis melo var. makuwa]; e value: 5.5e-86; % identity: 58.08
Query:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG------------QTAQQDLPDQLSVEAK
        M PRR ARRG Q GRG G        QP  Q  +P AP+THADL  +EQRF D + ++  + Q    A A                A Q +PDQLS EAK
Subjt:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG------------QTAQQDLPDQLSVEAK

Query:  HLRDFRIYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEF
        HLRDFR Y+P TF+GSLEDP   +LWLSS+ETIFR+M+CP+DQKVQCAVF+L +R   WW++ ERMLGG+V+QITW QFKESFYAKF  +SLRDAKRQEF
Subjt:  HLRDFRIYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEF

Query:  INLKQGQMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR
        +NL+QG MTVE+YD EFDMLS FAPE++ TEAARA  FV GLR D+QG VRA +PAT  +AL LA+DLS+Q+  +  K + +G   GQKR+
Subjt:  INLKQGQMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR

KAA0065602.1 pol protein [Cucumis melo var. makuwa]; e value: 3.2e-86; % identity: 58.95
Query:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG------QTAQQDLPDQLSVEAKHLRDFR
        M PRR ARRG Q GRG G        QP  Q  +P AP+THADL  +EQRF D + ++  + Q A  A A          A Q +PDQLS EAKHLRDFR
Subjt:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG------QTAQQDLPDQLSVEAKHLRDFR

Query:  IYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQG
         Y+P TF+GSLEDP   +LWLSS+ETIFR+M+CP+DQKVQCAVF+L +R   WW++ ERMLGG+V+QITW QFKESFYAKF  +SLRDAKRQ+F+NL+QG
Subjt:  IYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQG

Query:  QMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR
         MTVE+YD EFDMLS FAPE++ TE ARA  FV GLR D+QG VRA +PAT  +AL LA+DLS+Q+  +  K + +G   GQKR+
Subjt:  QMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR

XP_008456947.1 PREDICTED: uncharacterized protein LOC103496742 [Cucumis melo]; e value: 1.0e-156; % identity: 99.3
Query:  MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE
        MTPRRSARRGDQ GRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE
Subjt:  MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE

Query:  DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD
        DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD
Subjt:  DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD

Query:  MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
        MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRA KPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
Subjt:  MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET

XP_011655042.1 uncharacterized protein LOC101209878 [Cucumis sativus]; e value: 6.9e-113; % identity: 76.16
Query:  MTPRRSARRGDQRGRGVGCN----QPAEQVANPVAPITHADL----TQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDP
        MT R+ A RG Q GR  G N    QPAEQV NPVAP++ ADL    T+LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTPRRSARRGDQRGRGVGCN----QPAEQVANPVAPITHADL----TQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDP

Query:  QTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTV
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTV

Query:  EEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C5M7 uncharacterized protein LOC103496742; e value: 4.9e-157; % identity: 99.3
Query:  MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE
        MTPRRSARRGDQ GRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE
Subjt:  MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE

Query:  DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD
        DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD
Subjt:  DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD

Query:  MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
        MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRA KPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
Subjt:  MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET

A0A5A7SJH3 Reverse transcriptase; e value: 2.4e-87; % identity: 58.89
Query:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG--------QTAQQDLPDQLSVEAKHLRD
        M PRR ARRG + GRG G        QP  Q  +P AP+THADL  +EQRF D + ++  + Q A  A A            A Q +PDQLS EAKHLRD
Subjt:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG--------QTAQQDLPDQLSVEAKHLRD

Query:  FRIYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLK
        FR Y+P TF+GSL+DP   +LWLSS+ETIFR+M+CP+DQKVQCAVF+L +R   WW++ ERMLGG+V+QITW QFKESFYAKF  +SLRDAKRQEF+NL+
Subjt:  FRIYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLK

Query:  QGQMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR
        QG MTVE+YD EFDMLS FAPE++ TEAARA  FV GLR D+QGFVRA +PAT  +AL LA+DLS+Q+  +  K + +G   GQKR+
Subjt:  QGQMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR

A0A5A7VBD0 Reverse transcriptase; e value: 1.6e-86; % identity: 58.95
Query:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG------QTAQQDLPDQLSVEAKHLRDFR
        M PRR ARRG Q GRG G        QP  Q  +P AP+THADL  +EQRF D + ++  + Q A  A A          A Q +PDQLS EAKHLRDFR
Subjt:  MTPRRSARRGDQRGRGVGCN------QPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQG------QTAQQDLPDQLSVEAKHLRDFR

Query:  IYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQG
         Y+P TF+GSLEDP   +LWLSS+ETIFR+M+CP+DQKVQCAVF+L +R   WW++ ERMLGG+V+QITW QFKESFYAKF  +SLRDAKRQ+F+NL+QG
Subjt:  IYDPQTFNGSLEDPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQG

Query:  QMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR
         MTVE+YD EFDMLS FAPE++ TE ARA  FV GLR D+QG VRA +PAT  +AL LA+DLS+Q+  +  K + +G   GQKR+
Subjt:  QMTVEEYDYEFDMLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRR

A0A5D3BCD2 Gag protease polyprotein; e value: 4.9e-157; % identity: 99.3
Query:  MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE
        MTPRRSARRGDQ GRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE
Subjt:  MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE

Query:  DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD
        DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD
Subjt:  DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD

Query:  MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
        MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRA KPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
Subjt:  MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET

E5GB72 Ty3-gypsy retrotransposon protein; e value: 4.9e-157; % identity: 99.3
Query:  MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE
        MTPRRSARRGDQ GRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE
Subjt:  MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLE

Query:  DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD
        DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD
Subjt:  DPISTKLWLSSVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFD

Query:  MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
        MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRA KPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
Subjt:  MLSLFAPELVETEAARAKMFVWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
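
The % identity values listed with the hits above can be related to the alignment midline, where a letter marks an identical residue and a blank marks a mismatch or gap. The sketch below assumes identity is defined as identical columns divided by alignment length; the record does not say how the database computes the column, and `percent_identity` is an illustrative name. The example strings are the first 20 columns of the XP_008456947.1 alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns where the midline repeats the query residue.

    Assumes identity = identical columns / alignment length (an assumption;
    the source record does not define the statistic).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return 100.0 * identical / len(query)


# First 20 alignment columns of the XP_008456947.1 hit.
query = "MTPRRSARRGDQRGRGVGCN"
midline = "MTPRRSARRGDQ GRGVGCN"
print(round(percent_identity(query, midline), 2))
```

Over short slices like this one the value differs from the figure reported for the whole alignment, which is computed across all columns.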

Sequences
CDS sequence
ATGACACCACGTAGAAGTGCTCGTAGGGGTGATCAAAGAGGCAGAGGAGTAGGGTGTAATCAACCTGCTGAACAAGTTGCTAACCCTGTTGCACCCATTACCCATGCTGA
TCTTACTCAGCTAGAGCAGAGGTTTAATGACACAGTGACTGAAGTGTTGGCACGACATCAGCTAGCCCAAGCAGCCCTTGCTCAAGGTCAAACAGCACAACAAGACCTAC
CAGATCAGTTGTCAGTTGAGGCTAAGCATTTGAGGGATTTCAGAATATATGATCCACAAACTTTTAATGGATCGCTGGAGGATCCCATTAGCACGAAGTTATGGTTATCT
TCTGTGGAGACCATCTTCCGTTTCATGAGATGTCCTGACGACCAGAAGGTCCAGTGTGCCGTGTTCCTACTGAGGGAGAGAGCTGCCATTTGGTGGCAGTCAGTAGAAAG
GATGCTTGGTGGTAATGTTAACCAGATTACGTGGGACCAGTTCAAGGAGAGTTTTTATGCTAAGTTTTTACCTTCTAGCCTCAGGGATGCCAAACGTCAAGAGTTCATAA
ACCTGAAGCAAGGTCAAATGACGGTTGAGGAGTACGACTATGAGTTTGACATGCTATCCCTCTTTGCCCCTGAGTTGGTTGAGACTGAGGCTGCTAGAGCTAAGATGTTT
GTTTGGGGTTTGAGAAAGGATCTTCAAGGTTTTGTTCGAGCCTCCAAACCAGCCACCCAAACTGAAGCACTATACTTGGCGCTGGACTTGAGTGTCCAGAAGGACGATGA
CCTGTTGAAGGTATCAAGGAAGGGACCATTCTTAGGTCAGAAGAGAAGGCTGAGCAGAAGCTTGTTAATGTTCCACAGAGAAACTTGA
mRNA sequence
ATTTTACTAGTTGGGTTATAAATTAGGCCAAAGAAAAAGAAAAAACAACTTTCTTTCTTCTTCTTCTTCTCCGTCAAGCCCGTGCCTCACTCCTCTCCACTGTGAATCTT
TGTTCCGCCGTCCGCTGCTGGTTCTACCCTTTGCCGCCGCCGCTCACCGTCGTAAGTTTGGGTGCTATTGGAATCCTCTTGATTCAATTCTGGATTTTCCATCAAGTTTG
TGATTTAGTTCAACCCCCAATTTGGGAAAATGACACCACGTAGAAGTGCTCGTAGGGGTGATCAAAGAGGCAGAGGAGTAGGGTGTAATCAACCTGCTGAACAAGTTGCT
AACCCTGTTGCACCCATTACCCATGCTGATCTTACTCAGCTAGAGCAGAGGTTTAATGACACAGTGACTGAAGTGTTGGCACGACATCAGCTAGCCCAAGCAGCCCTTGC
TCAAGGTCAAACAGCACAACAAGACCTACCAGATCAGTTGTCAGTTGAGGCTAAGCATTTGAGGGATTTCAGAATATATGATCCACAAACTTTTAATGGATCGCTGGAGG
ATCCCATTAGCACGAAGTTATGGTTATCTTCTGTGGAGACCATCTTCCGTTTCATGAGATGTCCTGACGACCAGAAGGTCCAGTGTGCCGTGTTCCTACTGAGGGAGAGA
GCTGCCATTTGGTGGCAGTCAGTAGAAAGGATGCTTGGTGGTAATGTTAACCAGATTACGTGGGACCAGTTCAAGGAGAGTTTTTATGCTAAGTTTTTACCTTCTAGCCT
CAGGGATGCCAAACGTCAAGAGTTCATAAACCTGAAGCAAGGTCAAATGACGGTTGAGGAGTACGACTATGAGTTTGACATGCTATCCCTCTTTGCCCCTGAGTTGGTTG
AGACTGAGGCTGCTAGAGCTAAGATGTTTGTTTGGGGTTTGAGAAAGGATCTTCAAGGTTTTGTTCGAGCCTCCAAACCAGCCACCCAAACTGAAGCACTATACTTGGCG
CTGGACTTGAGTGTCCAGAAGGACGATGACCTGTTGAAGGTATCAAGGAAGGGACCATTCTTAGGTCAGAAGAGAAGGCTGAGCAGAAGCTTGTTAATGTTCCACAGAGA
AACTTGAGGTAAGGTGGGGCACTTCGCCGTTACCAATAGGAGTTTGCTAAGGCAAGTAGAACTTTGAGAGATACCTTATTGCAATACTCTAGCAATTTTCTTTCACTCAC
TCGTATATACACTTTCAACCTGCATTTTTTTTTCTCTTTTTCCCCAAAAATATTTTTCTTAATTCACACAAAAAACCTAGCCTTTATCTATTTGGACATTTCTTATTTTC
TAATTATTTCATAAATCTGCCAAAACCATAACTAATTTCTTGCCTTCTACCTGAACTCTCTTTCACCATCCACCCGTTTTTCTCCCCTTCACCCCCCAACAAACCTCCAC
GTTTTTTCTCTTCTCCGTATTTTCTGGTTTAAGTAAGTTGTTATTTATTTCCCTTTTATTAGGTTTTTGCTTC
Protein sequence
MTPRRSARRGDQRGRGVGCNQPAEQVANPVAPITHADLTQLEQRFNDTVTEVLARHQLAQAALAQGQTAQQDLPDQLSVEAKHLRDFRIYDPQTFNGSLEDPISTKLWLS
SVETIFRFMRCPDDQKVQCAVFLLRERAAIWWQSVERMLGGNVNQITWDQFKESFYAKFLPSSLRDAKRQEFINLKQGQMTVEEYDYEFDMLSLFAPELVETEAARAKMF
VWGLRKDLQGFVRASKPATQTEALYLALDLSVQKDDDLLKVSRKGPFLGQKRRLSRSLLMFHRET
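
The protein sequence above corresponds to the CDS sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (the record does not name a translation table); `translate` and `CODON_TABLE` are illustrative names. Only the first 30 and last 15 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGACACCACGTAGAAGTGCTCGTAGGGGT"  # first 30 nt of the CDS
cds_end = "CACAGAGAAACTTGA"                   # last 15 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments can be compared with the first ten and last four residues of the protein sequence as a quick consistency check.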