CuGenDBv2

Moc04g32270 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g32270
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr4:24291804..24294005
RNA-Seq Expression: Moc04g32270
Synteny: Moc04g32270
Gene Ontology terms: NA
InterPro domains: NA
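
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state explicitly.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a string such as 'chr4:24291804..24294005' into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr4:24291804..24294005")
print(seqid, start, end, span_length(start, end))
```

Under the stated coordinate assumption, the gene spans 2202 bases on chr4.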


Homology
GenBank top hits (e value, % identity, alignment)
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia]; e value 6.8e-129; identity 73.28%
Query:  KQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMD
        +QHKSPAPNGG SDH+DR+SEPISLDKGKPAD+PESSEKRH+ K+KGFDLEELLDQADSPFTEEI+REK                               
Subjt:  KQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMD

Query:  IYGVSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKRE
                                 RTTESL DYV RFN+EKLQVE LTDAVSLLAF+ GVRDEHLSFSFGKRTP+TF EALSRAQRYMSAGEFFYSKRE
Subjt:  IYGVSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKRE

Query:  PDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVE
         +GKRTD KRERSGDKPQGSRWEKRDRS QKDPP+KFEKYT+TTVP+EQVLMEIK+Q+LLKWPERM A S KRSKGRYCLFH DHGHATQDCFDLKEEVE
Subjt:  PDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVE

Query:  GLIRRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP-HRKRIRERK
        GLI RGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP  R+  R+RK
Subjt:  GLIRRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP-HRKRIRERK

XP_022145129.1 uncharacterized protein LOC111014646 [Momordica charantia]; e value 6.7e-100; identity 88.53%
Query:  MTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDKVEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGK
        MTPERSPRRSDDNCSAKRRLNL DPQVGGPEDGT+QPN ERQEGLPE  ALTTPEP QKQFAVLEDKVEGMLQRMTQVLRQ+E+QESDEVPLVRDP+KGK
Subjt:  MTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDKVEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGK

Query:  GPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTE
        GPA+S+TEESTNSAGSKL+IGGNTRRRT+IFD +K KKQHK  APNGG SDHDDRNSEP+SLDKGKPADQPESSEKRH+ KEKGFDLEELLDQADSPFTE
Subjt:  GPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTE

Query:  EIIREKVLPKFKLPTVKR
        EI+REKV PKFKLPTVK+
Subjt:  EIIREKVLPKFKLPTVKR

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]; e value 8.9e-238; identity 84%
Query:  MTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDKVEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGK
        MTPERSPRRSDD+CSAKRRLNLGDPQVGGPEDGT+QPNQERQEGL E  ALTTPEPFQKQFAVLEDK                  ESDEVPLVRDPKKGK
Subjt:  MTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDKVEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGK

Query:  GPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTE
        GP ESDTEESTNS GSKL+IGGNTR+RTRIFDPRKTKKQHKSPAPNGG SDHDDRNSEPISLDKGKPAD+PESSEKRHSHKEKGFDLEELLDQADSPFTE
Subjt:  GPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTE

Query:  EIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMDIYGVSEAVR----------------------------------------GRCRSRPVAYLLTI
        EI+REKV PKFKLPTVK+FD TTDPVDHLDAYREWMDIYGVSEAVR                                        GRCRSRPVAYLLTI
Subjt:  EIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMDIYGVSEAVR----------------------------------------GRCRSRPVAYLLTI

Query:  KQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWE
        KQRTTESLRDYVARFNEEKLQVE LTDAVSLLAFMSGVRDEHLSFSFGKRTPNTF EALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWE
Subjt:  KQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWE

Query:  KRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKATQNGES
        KRDRSSQKDPPRKFEKYT TTVPIEQVLMEIKDQ+LLKWPERMKA SAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKATQNGES
Subjt:  KRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKATQNGES

Query:  DKSPAREIRTIMGGP-HRKRIRERK
        DKSPAREIRTIMGGP  R+  R+RK
Subjt:  DKSPAREIRTIMGGP-HRKRIRERK

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]; e value 1.2e-109; identity 75.17%
Query:  RRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTD
        RRRTRIFD +K KKQHKS APNG  +DHD+RNSEPISL+KGKP D+PESSEKRH+ KEKGFDLEELL QADSPFTEEI+REKV PKFKLPTVK FDG T+
Subjt:  RRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTD

Query:  PVDHLDAYREWMDIYGVSEAVR----------------------------------------GRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVED
        PVDHLDAYREWMDIYGVS+A+R                                        GRCRSRPVAYLLTIKQRT ESL DYVARFNEEKLQ+E 
Subjt:  PVDHLDAYREWMDIYGVSEAVR----------------------------------------GRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVED

Query:  LTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKF
        LTDAVSLLAFMSGVRDEHLSFSF KRTP+TF EALSRAQRYMSA EF YSKREPDGKRTD KRERSGDKPQGSRWEKRDR SQKDPPRKF
Subjt:  LTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKF

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]; e value 1.9e-115; identity 88.16%
Query:  VSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDG
        V++ V GRCRSRPVAYLLTIKQRTTESL DYVARFN+EKLQ+E LTD VSLLAFMSGVRDEHLSFSFGK+TP+TF E LSRAQRYMSAGEFFYSKREPDG
Subjt:  VSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDG

Query:  KRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLI
        KRTD KRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYT TTVP+EQVLMEIKDQ+LLKWPERMK PS KRSKGRYCLFHRDH HATQD FDLKEEVEGLI
Subjt:  KRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLI

Query:  RRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP-HRKRIRERK
        RRGYL+EYVEEPKATQNGES+KSPAREIRTIMGGP  R+  R+RK
Subjt:  RRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP-HRKRIRERK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CNT2 uncharacterized protein LOC111012805; e value 3.3e-129; identity 73.28%
Query:  KQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMD
        +QHKSPAPNGG SDH+DR+SEPISLDKGKPAD+PESSEKRH+ K+KGFDLEELLDQADSPFTEEI+REK                               
Subjt:  KQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMD

Query:  IYGVSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKRE
                                 RTTESL DYV RFN+EKLQVE LTDAVSLLAF+ GVRDEHLSFSFGKRTP+TF EALSRAQRYMSAGEFFYSKRE
Subjt:  IYGVSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKRE

Query:  PDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVE
         +GKRTD KRERSGDKPQGSRWEKRDRS QKDPP+KFEKYT+TTVP+EQVLMEIK+Q+LLKWPERM A S KRSKGRYCLFH DHGHATQDCFDLKEEVE
Subjt:  PDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVE

Query:  GLIRRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP-HRKRIRERK
        GLI RGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP  R+  R+RK
Subjt:  GLIRRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP-HRKRIRERK

A0A6J1CUA7 uncharacterized protein LOC111014646; e value 3.2e-100; identity 88.53%
Query:  MTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDKVEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGK
        MTPERSPRRSDDNCSAKRRLNL DPQVGGPEDGT+QPN ERQEGLPE  ALTTPEP QKQFAVLEDKVEGMLQRMTQVLRQ+E+QESDEVPLVRDP+KGK
Subjt:  MTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDKVEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGK

Query:  GPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTE
        GPA+S+TEESTNSAGSKL+IGGNTRRRT+IFD +K KKQHK  APNGG SDHDDRNSEP+SLDKGKPADQPESSEKRH+ KEKGFDLEELLDQADSPFTE
Subjt:  GPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTE

Query:  EIIREKVLPKFKLPTVKR
        EI+REKV PKFKLPTVK+
Subjt:  EIIREKVLPKFKLPTVKR

A0A6J1DWY0 uncharacterized protein LOC111025293; e value 4.3e-238; identity 84%
Query:  MTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDKVEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGK
        MTPERSPRRSDD+CSAKRRLNLGDPQVGGPEDGT+QPNQERQEGL E  ALTTPEPFQKQFAVLEDK                  ESDEVPLVRDPKKGK
Subjt:  MTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDKVEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGK

Query:  GPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTE
        GP ESDTEESTNS GSKL+IGGNTR+RTRIFDPRKTKKQHKSPAPNGG SDHDDRNSEPISLDKGKPAD+PESSEKRHSHKEKGFDLEELLDQADSPFTE
Subjt:  GPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTE

Query:  EIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMDIYGVSEAVR----------------------------------------GRCRSRPVAYLLTI
        EI+REKV PKFKLPTVK+FD TTDPVDHLDAYREWMDIYGVSEAVR                                        GRCRSRPVAYLLTI
Subjt:  EIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMDIYGVSEAVR----------------------------------------GRCRSRPVAYLLTI

Query:  KQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWE
        KQRTTESLRDYVARFNEEKLQVE LTDAVSLLAFMSGVRDEHLSFSFGKRTPNTF EALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWE
Subjt:  KQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWE

Query:  KRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKATQNGES
        KRDRSSQKDPPRKFEKYT TTVPIEQVLMEIKDQ+LLKWPERMKA SAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKATQNGES
Subjt:  KRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKATQNGES

Query:  DKSPAREIRTIMGGP-HRKRIRERK
        DKSPAREIRTIMGGP  R+  R+RK
Subjt:  DKSPAREIRTIMGGP-HRKRIRERK

A0A6J1DYL6 uncharacterized protein LOC111025785; e value 9.3e-116; identity 88.16%
Query:  VSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDG
        V++ V GRCRSRPVAYLLTIKQRTTESL DYVARFN+EKLQ+E LTD VSLLAFMSGVRDEHLSFSFGK+TP+TF E LSRAQRYMSAGEFFYSKREPDG
Subjt:  VSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDG

Query:  KRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLI
        KRTD KRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYT TTVP+EQVLMEIKDQ+LLKWPERMK PS KRSKGRYCLFHRDH HATQD FDLKEEVEGLI
Subjt:  KRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTVPIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLI

Query:  RRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP-HRKRIRERK
        RRGYL+EYVEEPKATQNGES+KSPAREIRTIMGGP  R+  R+RK
Subjt:  RRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGP-HRKRIRERK

A0A6J1E1E7 uncharacterized protein LOC111025548; e value 5.9e-110; identity 75.17%
Query:  RRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTD
        RRRTRIFD +K KKQHKS APNG  +DHD+RNSEPISL+KGKP D+PESSEKRH+ KEKGFDLEELL QADSPFTEEI+REKV PKFKLPTVK FDG T+
Subjt:  RRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPESSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTD

Query:  PVDHLDAYREWMDIYGVSEAVR----------------------------------------GRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVED
        PVDHLDAYREWMDIYGVS+A+R                                        GRCRSRPVAYLLTIKQRT ESL DYVARFNEEKLQ+E 
Subjt:  PVDHLDAYREWMDIYGVSEAVR----------------------------------------GRCRSRPVAYLLTIKQRTTESLRDYVARFNEEKLQVED

Query:  LTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKF
        LTDAVSLLAFMSGVRDEHLSFSF KRTP+TF EALSRAQRYMSA EF YSKREPDGKRTD KRERSGDKPQGSRWEKRDR SQKDPPRKF
Subjt:  LTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTGAGCGGATTCTACGTTCTTTCCTATGGAAAGGTCGCGAGAGTGGGTGATCAGGGGCTAAGGTGGCTTGGTCTGAGGTGTGTATTCTGTTATGCAATGGAA
ACCGAAAGTCGAGGTACAGGTGAAGATCGGGAGGAAGCCACTGGGTCGCGATGGGCTCGGTCCCGACCCGGGGCTCGGTCCCGACCCGGGGCTCGGTCCCGACCC
GGGGCTCGGTCCCGACCCGGGGTTCGGTCCCGACCCGGGGCTCGGCCCACGGCCCTCCAACGGTCCTTCTTCTTCGATCCACGGTTTCGCCCAGGCCCACCAACG
AAATTAGGGTTAGCTACCTATGGAATGTTCTGGAACAACCTGACGACTATAAGTAGCGATCCAGCCTTAGAGAGTCGCACCGGTTTAGCTCATCTGGTTTTGCAG
TTGAGCAACATGACACCAGAAAGGAGTCCACGACGCTCTGATGATAACTGCTCTGCCAAGAGGAGGCTGAACTTGGGCGACCCCCAGGTTGGAGGACCCGAGGAT
GGGACCAACCAGCCAAACCAAGAGCGTCAGGAGGGGTTGCCCGAGGTGCAGGCATTAACGACCCCTGAGCCATTCCAGAAGCAGTTCGCGGTCTTGGAAGATAAG
GTAGAGGGCATGCTTCAACGCATGACCCAAGTCCTTCGACAATTCGAGCAACAGGAGTCCGACGAAGTGCCCCTTGTCAGAGACCCGAAAAAGGGGAAGGGCCCA
GCGGAAAGCGATACCGAGGAGTCAACGAACAGTGCAGGGAGCAAGTTGCAGATAGGTGGAAATACCAGGCGACGAACTCGAATTTTCGACCCTCGAAAGACGAAA
AAGCAACATAAATCACCGGCACCAAATGGGGGCAATAGCGACCATGATGACAGAAACTCTGAACCGATAAGTCTCGACAAAGGCAAACCGGCAGATCAGCCAGAG
TCCTCGGAGAAGCGACATAGCCACAAGGAGAAGGGATTCGACCTCGAAGAACTACTGGATCAAGCCGACTCACCATTCACGGAGGAAATCATAAGAGAAAAGGTC
CTTCCAAAATTCAAGCTACCTACGGTGAAGCGATTCGACGGGACGACTGACCCAGTGGACCATCTAGATGCTTATCGGGAATGGATGGATATCTACGGGGTGTCG
GAAGCGGTCAGGGGGCGGTGTCGGAGCCGACCCGTGGCTTATCTCTTAACCATTAAGCAGAGGACGACAGAGAGTCTACGCGACTATGTAGCCCGGTTCAACGAG
GAGAAGCTGCAGGTAGAAGACCTTACAGACGCTGTATCTCTGCTGGCCTTCATGTCCGGCGTCAGGGATGAACATTTGTCATTCTCGTTCGGAAAGAGAACACCG
AACACCTTCTTGGAAGCGCTGAGCCGAGCTCAGAGGTACATGAGCGCTGGTGAGTTCTTTTACTCAAAAAGGGAACCTGACGGAAAGCGAACCGACCCAAAGAGG
GAGAGGTCGGGAGATAAACCGCAGGGGTCGAGATGGGAGAAGAGGGATCGGAGTAGCCAGAAAGATCCACCCCGAAAATTTGAAAAGTATACCGCGACCACCGTT
CCAATCGAGCAAGTGCTGATGGAGATTAAAGACCAAAAGTTGCTTAAATGGCCGGAAAGGATGAAGGCCCCGTCAGCTAAGCGCAGCAAAGGCAGGTATTGTCTT
TTCCACAGGGATCACGGCCATGCAACTCAGGATTGTTTTGATCTCAAGGAAGAGGTGGAAGGACTAATCCGAAGGGGCTACCTCAAGGAGTATGTAGAGGAACCT
AAAGCGACCCAGAACGGCGAAAGCGACAAGTCTCCCGCAAGAGAGATTCGAACTATAATGGGGGGCCCCCATAGAAAGAGAATCCGGGAGAGAAAGGAAAGCAGA
TGTGCGTGA
mRNA sequence
ATGTTGAGCGGATTCTACGTTCTTTCCTATGGAAAGGTCGCGAGAGTGGGTGATCAGGGGCTAAGGTGGCTTGGTCTGAGGTGTGTATTCTGTTATGCAATGGAA
ACCGAAAGTCGAGGTACAGGTGAAGATCGGGAGGAAGCCACTGGGTCGCGATGGGCTCGGTCCCGACCCGGGGCTCGGTCCCGACCCGGGGCTCGGTCCCGACCC
GGGGCTCGGTCCCGACCCGGGGTTCGGTCCCGACCCGGGGCTCGGCCCACGGCCCTCCAACGGTCCTTCTTCTTCGATCCACGGTTTCGCCCAGGCCCACCAACG
AAATTAGGGTTAGCTACCTATGGAATGTTCTGGAACAACCTGACGACTATAAGTAGCGATCCAGCCTTAGAGAGTCGCACCGGTTTAGCTCATCTGGTTTTGCAG
TTGAGCAACATGACACCAGAAAGGAGTCCACGACGCTCTGATGATAACTGCTCTGCCAAGAGGAGGCTGAACTTGGGCGACCCCCAGGTTGGAGGACCCGAGGAT
GGGACCAACCAGCCAAACCAAGAGCGTCAGGAGGGGTTGCCCGAGGTGCAGGCATTAACGACCCCTGAGCCATTCCAGAAGCAGTTCGCGGTCTTGGAAGATAAG
GTAGAGGGCATGCTTCAACGCATGACCCAAGTCCTTCGACAATTCGAGCAACAGGAGTCCGACGAAGTGCCCCTTGTCAGAGACCCGAAAAAGGGGAAGGGCCCA
GCGGAAAGCGATACCGAGGAGTCAACGAACAGTGCAGGGAGCAAGTTGCAGATAGGTGGAAATACCAGGCGACGAACTCGAATTTTCGACCCTCGAAAGACGAAA
AAGCAACATAAATCACCGGCACCAAATGGGGGCAATAGCGACCATGATGACAGAAACTCTGAACCGATAAGTCTCGACAAAGGCAAACCGGCAGATCAGCCAGAG
TCCTCGGAGAAGCGACATAGCCACAAGGAGAAGGGATTCGACCTCGAAGAACTACTGGATCAAGCCGACTCACCATTCACGGAGGAAATCATAAGAGAAAAGGTC
CTTCCAAAATTCAAGCTACCTACGGTGAAGCGATTCGACGGGACGACTGACCCAGTGGACCATCTAGATGCTTATCGGGAATGGATGGATATCTACGGGGTGTCG
GAAGCGGTCAGGGGGCGGTGTCGGAGCCGACCCGTGGCTTATCTCTTAACCATTAAGCAGAGGACGACAGAGAGTCTACGCGACTATGTAGCCCGGTTCAACGAG
GAGAAGCTGCAGGTAGAAGACCTTACAGACGCTGTATCTCTGCTGGCCTTCATGTCCGGCGTCAGGGATGAACATTTGTCATTCTCGTTCGGAAAGAGAACACCG
AACACCTTCTTGGAAGCGCTGAGCCGAGCTCAGAGGTACATGAGCGCTGGTGAGTTCTTTTACTCAAAAAGGGAACCTGACGGAAAGCGAACCGACCCAAAGAGG
GAGAGGTCGGGAGATAAACCGCAGGGGTCGAGATGGGAGAAGAGGGATCGGAGTAGCCAGAAAGATCCACCCCGAAAATTTGAAAAGTATACCGCGACCACCGTT
CCAATCGAGCAAGTGCTGATGGAGATTAAAGACCAAAAGTTGCTTAAATGGCCGGAAAGGATGAAGGCCCCGTCAGCTAAGCGCAGCAAAGGCAGGTATTGTCTT
TTCCACAGGGATCACGGCCATGCAACTCAGGATTGTTTTGATCTCAAGGAAGAGGTGGAAGGACTAATCCGAAGGGGCTACCTCAAGGAGTATGTAGAGGAACCT
AAAGCGACCCAGAACGGCGAAAGCGACAAGTCTCCCGCAAGAGAGATTCGAACTATAATGGGGGGCCCCCATAGAAAGAGAATCCGGGAGAGAAAGGAAAGCAGA
TGTGCGTGA
Protein sequence
MLSGFYVLSYGKVARVGDQGLRWLGLRCVFCYAMETESRGTGEDREEATGSRWARSRPGARSRPGARSRPGARSRPGVRSRPGARPTALQRSFFFDPRFRPGPPT
KLGLATYGMFWNNLTTISSDPALESRTGLAHLVLQLSNMTPERSPRRSDDNCSAKRRLNLGDPQVGGPEDGTNQPNQERQEGLPEVQALTTPEPFQKQFAVLEDK
VEGMLQRMTQVLRQFEQQESDEVPLVRDPKKGKGPAESDTEESTNSAGSKLQIGGNTRRRTRIFDPRKTKKQHKSPAPNGGNSDHDDRNSEPISLDKGKPADQPE
SSEKRHSHKEKGFDLEELLDQADSPFTEEIIREKVLPKFKLPTVKRFDGTTDPVDHLDAYREWMDIYGVSEAVRGRCRSRPVAYLLTIKQRTTESLRDYVARFNE
EKLQVEDLTDAVSLLAFMSGVRDEHLSFSFGKRTPNTFLEALSRAQRYMSAGEFFYSKREPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTATTV
PIEQVLMEIKDQKLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKATQNGESDKSPAREIRTIMGGPHRKRIRERKESR
CA
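
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), which this page does not state, and the helper name `translate` is hypothetical. It stops at the first stop codon and ignores line breaks in the input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
# ASSUMPTION: the page does not say which translation table was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last three codons of the CDS listed above.
print(translate("ATGTTGAGCGGATTC"))
print(translate("TGTGCGTGA"))
```

Applied to the full CDS, the result can be compared against the protein sequence above, which begins MLSGF and ends CA.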