CuGenDBv2

Moc06g15740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g15740
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr6:12406555..12407799
RNA-Seq Expression: Moc06g15740
Synteny: Moc06g15740
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
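The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below parses that string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Locus` and `parse_locus` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count toward the length.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'chrom:start..end' string such as 'chr6:12406555..12407799'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Locus(chrom, start, end)


locus = parse_locus("chr6:12406555..12407799")
print(locus.chrom, locus.length)  # chr6 1245
```

Under the inclusive-coordinate assumption, the gene spans 1245 bp on chr6.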


Homology
GenBank top hits (columns: E value, % identity, alignment)
KAA0036604.1 polyprotein [Cucumis melo var. makuwa] (E value: 1.3e-117; identity: 58.4%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV
        MS++   GK+  DRLVE+EEQ+L+L E+ D++RY+ESRL+EIS K + ID V  R++G  I+ELM RV+ LE  V   RT N ERG+SS+ S+AH+EERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
        +E+D SQKT+++M++ +++DF+AT+D +R EIA +  +++LTMRA+ NQAPAGG I  ++VK+P+PKPFCGARDAKALEN+IFDLEQYF+AT+TVTEE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA
        VTLATMHL++DAKLWWR+R+VD+QEG+CTIDTW+ LK+ELRSQFFPENVEIL RRKLREL+HTG+IR+YVKQF+GLMLDI DMS+KDKVF FVEGLKPWA
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA

Query:  RPNCMNRRYKISLLLTQQPNGC-SILATIHHKSRGRPKLLPVVGTDPLDRGPLKLGELTEMEEETANPSSREREIHGEGRTHRTITI
        +     +R +       QPNGC + L T+  +   +   L  VGT    +  LKL E T+   E  +P +R  E  GE +  R   I
Subjt:  RPNCMNRRYKISLLLTQQPNGC-SILATIHHKSRGRPKLLPVVGTDPLDRGPLKLGELTEMEEETANPSSREREIHGEGRTHRTITI

TYK03044.1 uncharacterized protein E5676_scaffold46G001390 [Cucumis melo var. makuwa] (E value: 1.8e-114; identity: 67.21%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLED--KVKRTDNLERGESSSSSIAHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+ D++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N ERGESSS   AHMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLED--KVKRTDNLERGESSSSSIAHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DFK T+D +R EIA +  +++LTMRA+ NQAPAGG I  +KVKVP+PKPFCGARDAKALEN+IFDLEQYFKAT+T+ EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQFFPENVEILARRKLR+LRHTG IR+YVKQF+GLMLDI DMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA

Query:  RPNCMNRR
        +     +R
Subjt:  RPNCMNRR

TYK03099.1 reverse transcriptase [Cucumis melo var. makuwa] (E value: 1.4e-114; identity: 66.56%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV
        MS++  LGK+  DRLVE+EEQ+L+L E+ D++RY+ESRL+EIS K + ID V  R++G  I+ELM RV+ LE  V   RT N ERG+SS+ S+AH+EERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
        +E+D SQKT+++M++ +++DF+AT+D +R EIA +  +++LTMRA+ NQAPAGG I  ++VK+P+PKPFCGARDAKALEN+IFDLEQYF+AT+TVTEE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA
        VTLATMHL++DAKLWWR+R+VD+QEG+CTIDTW+ LK+ELRSQFFPENVEILARRKLREL+HTG+IR+YVKQF+GLMLDI DMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA

Query:  RPNCMNRR
        +     +R
Subjt:  RPNCMNRR

TYK18566.1 uncharacterized protein E5676_scaffold119G00720 [Cucumis melo var. makuwa] (E value: 3.1e-114; identity: 66.23%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV
        MS++   GK+  DRLVE+EEQ+L+L E+ D++RY+ESRLDEIS K + ID V  R++G  I+ELM RV+ LE  +   RT N ERG+SS+ S+AH+EERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
        +E+D SQKT+++M++ +++DF+AT+D +R EIA +  +++LTMRA+ NQAPAGG I  ++VK+P+PKPFCGARDAKALEN+IFDLEQYF+AT+TVTEE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA
        VTLATMHL++DAKLWWR+R+VD+QEG+CTIDTW+ LK+ELRSQFFPENVEILARRKLREL+HTG+IR+YVKQF+GLMLDI DMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA

Query:  RPNCMNRR
        +     +R
Subjt:  RPNCMNRR

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] (E value: 5.8e-121; identity: 80.94%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVKRTDNLERGESSSSSIAHMEERVEE
        MS TKQLGKSH+DRLVEIEE+LLFLREI DNLRYVESRLDEISTKADGIDVVN RIDGLAIRELMLRVETLEDKVKRTDNLERGESSSSSIAHMEERVEE
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVKRTDNLERGESSSSSIAHMEERVEE

Query:  IDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT
        IDIS+KTIVQMVSELTDDFKAT+DEMRAEIA+LGT                             KPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT
Subjt:  IDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT

Query:  LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWAR
        LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQ    ++  +          TGNIRDYVKQFSGLMLDI DMSEKDKVFAFVEGLKPWAR
Subjt:  LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWAR

TrEMBL top hits (columns: E value, % identity, alignment)
A0A5A7SZ91 Polyprotein (E value: 6.5e-118; identity: 58.4%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV
        MS++   GK+  DRLVE+EEQ+L+L E+ D++RY+ESRL+EIS K + ID V  R++G  I+ELM RV+ LE  V   RT N ERG+SS+ S+AH+EERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
        +E+D SQKT+++M++ +++DF+AT+D +R EIA +  +++LTMRA+ NQAPAGG I  ++VK+P+PKPFCGARDAKALEN+IFDLEQYF+AT+TVTEE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA
        VTLATMHL++DAKLWWR+R+VD+QEG+CTIDTW+ LK+ELRSQFFPENVEIL RRKLREL+HTG+IR+YVKQF+GLMLDI DMS+KDKVF FVEGLKPWA
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA

Query:  RPNCMNRRYKISLLLTQQPNGC-SILATIHHKSRGRPKLLPVVGTDPLDRGPLKLGELTEMEEETANPSSREREIHGEGRTHRTITI
        +     +R +       QPNGC + L T+  +   +   L  VGT    +  LKL E T+   E  +P +R  E  GE +  R   I
Subjt:  RPNCMNRRYKISLLLTQQPNGC-SILATIHHKSRGRPKLLPVVGTDPLDRGPLKLGELTEMEEETANPSSREREIHGEGRTHRTITI

A0A5A7UT87 Retrotrans_gag domain-containing protein (E value: 1.5e-114; identity: 67.53%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLED--KVKRTDNLERGESSSSSIAHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+ D++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N ERGESSS   AHMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLED--KVKRTDNLERGESSSSSIAHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DFK T+D +R EIA +  +++LTMRA+ NQAPAGG I  +KVKVP+PKPFCGARDAKALEN+IFDLEQYFKAT+TV EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQFFPENVEILAR+KLR+LRHTG IR+YVKQF+GLMLDI DMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA

Query:  RPNCMNRR
        R     +R
Subjt:  RPNCMNRR

A0A5D3BV48 Retrotrans_gag domain-containing protein (E value: 8.8e-115; identity: 67.21%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLED--KVKRTDNLERGESSSSSIAHMEERV
        MS++   GK+  DRLVEIEEQ+L+L E+ D++RY+ESR+DEIS KA+ ID V  R++GL I+EL+ RV+ LE+    +RT N ERGESSS   AHMEERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLED--KVKRTDNLERGESSSSSIAHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
         E+D +QKT+++M++ +++DFK T+D +R EIA +  +++LTMRA+ NQAPAGG I  +KVKVP+PKPFCGARDAKALEN+IFDLEQYFKAT+T+ EE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA
        VTLATMHL++DAKLWWR+RYVD+QEG+CT+DTW+ LK+ELRSQFFPENVEILARRKLR+LRHTG IR+YVKQF+GLMLDI DMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA

Query:  RPNCMNRR
        +     +R
Subjt:  RPNCMNRR

A0A5D3BYE6 Reverse transcriptase (E value: 6.8e-115; identity: 66.56%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV
        MS++  LGK+  DRLVE+EEQ+L+L E+ D++RY+ESRL+EIS K + ID V  R++G  I+ELM RV+ LE  V   RT N ERG+SS+ S+AH+EERV
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVK--RTDNLERGESSSSSIAHMEERV

Query:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK
        +E+D SQKT+++M++ +++DF+AT+D +R EIA +  +++LTMRA+ NQAPAGG I  ++VK+P+PKPFCGARDAKALEN+IFDLEQYF+AT+TVTEE+K
Subjt:  EEIDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESK

Query:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA
        VTLATMHL++DAKLWWR+R+VD+QEG+CTIDTW+ LK+ELRSQFFPENVEILARRKLREL+HTG+IR+YVKQF+GLMLDI DMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWA

Query:  RPNCMNRR
        +     +R
Subjt:  RPNCMNRR

A0A6J1DLQ6 uncharacterized protein LOC111022320 (E value: 2.8e-121; identity: 80.94%)
Query:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVKRTDNLERGESSSSSIAHMEERVEE
        MS TKQLGKSH+DRLVEIEE+LLFLREI DNLRYVESRLDEISTKADGIDVVN RIDGLAIRELMLRVETLEDKVKRTDNLERGESSSSSIAHMEERVEE
Subjt:  MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVKRTDNLERGESSSSSIAHMEERVEE

Query:  IDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT
        IDIS+KTIVQMVSELTDDFKAT+DEMRAEIA+LGT                             KPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT
Subjt:  IDISQKTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVT

Query:  LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWAR
        LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQ    ++  +          TGNIRDYVKQFSGLMLDI DMSEKDKVFAFVEGLKPWAR
Subjt:  LATMHLADDAKLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWAR

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGTCCGCGACAAAGCAGTTGGGCAAGTCCCACGTCGACAGACTCGTCGAGATCGAAGAACAGCTGTTGTTCTTGAGGGAAATCCTTGACAACCTTAGATATGTG
GAATCTCGGCTGGATGAGATCTCCACCAAAGCTGACGGAATTGACGTCGTGAATGTCCGCATAGACGGGCTTGCTATACGTGAGTTAATGCTTCGGGTTGAGACC
CTTGAAGACAAGGTTAAGCGTACTGATAACCTTGAGCGTGGCGAAAGCTCATCGAGCTCAATCGCCCACATGGAGGAGCGTGTCGAAGAAATAGACATCTCCCAG
AAGACTATTGTGCAGATGGTCAGTGAGCTGACCGACGACTTCAAAGCCACCATCGACGAAATGAGGGCGGAGATTGCCAAATTAGGCACCAAAGTAAATCTCACC
ATGAGAGCGGTGGGAAATCAGGCCCCAGCTGGGGGACCGATTCAGTTCAACAAGGTGAAAGTTCCCAAACCCAAGCCCTTCTGTGGGGCGCGAGATGCTAAAGCG
CTTGAGAACTTCATCTTCGACCTTGAGCAGTACTTCAAGGCGACAAGCACGGTGACAGAGGAATCGAAAGTCACACTAGCCACAATGCATCTTGCCGACGATGCG
AAGTTGTGGTGGAGAGCGAGATATGTAGATGTACAAGAGGGCAAATGTACCATCGACACGTGGGAAAAACTAAAGCAGGAACTCAGATCCCAATTTTTCCCGGAG
AATGTAGAGATTCTTGCCCGTCGTAAACTGCGTGAATTACGACACACAGGAAATATTCGAGACTATGTAAAGCAGTTTTCAGGGTTGATGTTGGACATCCACGAT
ATGTCCGAGAAAGACAAGGTCTTCGCCTTCGTGGAGGGCTTGAAACCATGGGCCCGGCCAAACTGTATGAACAGAAGGTACAAGATATCCCTACTGCTTACGCAA
CAGCCAAACGGTTGTTCGATCTTAGCAACGATACATCACAAGAGCAGAGGAAGACCCAAGCTTCTTCCAGTAGTGGGAACAGACCCGCTAGATCGGGGTCCCCTA
AAACTGGGGGAGCTGACAGAAATGGAGGAGGAGACCGCAAACCCTTCCAGCAGAGAGAGGGAAATACATGGAGAGGGCCGAACCCACCGAACAATAACAATCAGA
CAGGACCATCCTGCTTCATTTGTAAGGGACCGCACAGAGTATACGAGTGTCCGAATCGAGCTGCTCTCCGAGCCTTCCAAGCCACACTGA
mRNA sequence
ATGTCCGCGACAAAGCAGTTGGGCAAGTCCCACGTCGACAGACTCGTCGAGATCGAAGAACAGCTGTTGTTCTTGAGGGAAATCCTTGACAACCTTAGATATGTG
GAATCTCGGCTGGATGAGATCTCCACCAAAGCTGACGGAATTGACGTCGTGAATGTCCGCATAGACGGGCTTGCTATACGTGAGTTAATGCTTCGGGTTGAGACC
CTTGAAGACAAGGTTAAGCGTACTGATAACCTTGAGCGTGGCGAAAGCTCATCGAGCTCAATCGCCCACATGGAGGAGCGTGTCGAAGAAATAGACATCTCCCAG
AAGACTATTGTGCAGATGGTCAGTGAGCTGACCGACGACTTCAAAGCCACCATCGACGAAATGAGGGCGGAGATTGCCAAATTAGGCACCAAAGTAAATCTCACC
ATGAGAGCGGTGGGAAATCAGGCCCCAGCTGGGGGACCGATTCAGTTCAACAAGGTGAAAGTTCCCAAACCCAAGCCCTTCTGTGGGGCGCGAGATGCTAAAGCG
CTTGAGAACTTCATCTTCGACCTTGAGCAGTACTTCAAGGCGACAAGCACGGTGACAGAGGAATCGAAAGTCACACTAGCCACAATGCATCTTGCCGACGATGCG
AAGTTGTGGTGGAGAGCGAGATATGTAGATGTACAAGAGGGCAAATGTACCATCGACACGTGGGAAAAACTAAAGCAGGAACTCAGATCCCAATTTTTCCCGGAG
AATGTAGAGATTCTTGCCCGTCGTAAACTGCGTGAATTACGACACACAGGAAATATTCGAGACTATGTAAAGCAGTTTTCAGGGTTGATGTTGGACATCCACGAT
ATGTCCGAGAAAGACAAGGTCTTCGCCTTCGTGGAGGGCTTGAAACCATGGGCCCGGCCAAACTGTATGAACAGAAGGTACAAGATATCCCTACTGCTTACGCAA
CAGCCAAACGGTTGTTCGATCTTAGCAACGATACATCACAAGAGCAGAGGAAGACCCAAGCTTCTTCCAGTAGTGGGAACAGACCCGCTAGATCGGGGTCCCCTA
AAACTGGGGGAGCTGACAGAAATGGAGGAGGAGACCGCAAACCCTTCCAGCAGAGAGAGGGAAATACATGGAGAGGGCCGAACCCACCGAACAATAACAATCAGA
CAGGACCATCCTGCTTCATTTGTAAGGGACCGCACAGAGTATACGAGTGTCCGAATCGAGCTGCTCTCCGAGCCTTCCAAGCCACACTGA
Protein sequence
MSATKQLGKSHVDRLVEIEEQLLFLREILDNLRYVESRLDEISTKADGIDVVNVRIDGLAIRELMLRVETLEDKVKRTDNLERGESSSSSIAHMEERVEEIDISQ
KTIVQMVSELTDDFKATIDEMRAEIAKLGTKVNLTMRAVGNQAPAGGPIQFNKVKVPKPKPFCGARDAKALENFIFDLEQYFKATSTVTEESKVTLATMHLADDA
KLWWRARYVDVQEGKCTIDTWEKLKQELRSQFFPENVEILARRKLRELRHTGNIRDYVKQFSGLMLDIHDMSEKDKVFAFVEGLKPWARPNCMNRRYKISLLLTQ
QPNGCSILATIHHKSRGRPKLLPVVGTDPLDRGPLKLGELTEMEEETANPSSREREIHGEGRTHRTITIRQDHPASFVRDRTEYTSVRIELLSEPSKPH
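
The protein sequence can be checked against the CDS by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` and `CODON_TABLE` are hypothetical names introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGTCCGCGACAAAGCAGTTGGGCAAGTCC"))  # MSATKQLGKS
```

Passing the full CDS text (line breaks included) to `translate` should give a string that can be compared directly with the protein sequence after its line breaks are removed.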