; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g03260 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g03260
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:2490472..2492600
RNA-Seq ExpressionMoc03g03260
SyntenyMoc03g03260
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]8.2e-11185.02Show/hide
Query:  STQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEG
        S     S  +A+SSYNPI PEG+ITREEFDQLKSKFDAQVEALKA+C+ K  +FDDGDLGESPFTSD LEA IPLKFKTP+MKPYDGSKDPKDYVEVFEG
Subjt:  STQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEG

Query:  LMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYF
        LMD QAA DAIKCR FQIALT SARLWYR+LPAR ISTYSQLRKEFI QF SRHYDRKTATHLTTIRQ EGETLREYVTRFQEEQLKV HCSD SAMCYF
Subjt:  LMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYF

Query:  LTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK
        LT LADETLTVKL EEAPATF EVLQKAKK+IDGQELL+TKT RPEK
Subjt:  LTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.6e-12066.22Show/hide
Query:  ADRRALAANDGHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCT
        ADRRALAAN GHQREVGAEV E Q  + LGTEPLCRSARITTPVL   HPKPS                                               
Subjt:  ADRRALAANDGHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCT

Query:  TKWCKLQVPSLDLKTERRVTRCVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEA
                                              KA+SSYNPI P G+ITREEFDQLKSKFDAQVEALKARC+ K  +FDDGDLGE  F+SD LEA
Subjt:  TKWCKLQVPSLDLKTERRVTRCVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEA

Query:  PIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEG
         IP KFKTP+MKPYDGSKDPKDYVEVFE LMD QAA DAIKC  FQIALT SARLWYR+LPARLISTYSQLRKEFISQF SRHYDRKT THL TIRQ EG
Subjt:  PIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEG

Query:  ETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK
        ETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADETLTVKL EEAPATFAEVLQK KKVIDGQELL+TKTGRPEK
Subjt:  ETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]2.0e-10970.13Show/hide
Query:  KWCKLQVPSLDLKTERRVTRCVSKEVFI---------------------------------STQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQ
        +WCKL+   L LKT      C SK V                                   S+    S  +A+SSYNPI P+ +ITREEFDQLKSKFDAQ
Subjt:  KWCKLQVPSLDLKTERRVTRCVSKEVFI---------------------------------STQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQ

Query:  VEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTY
        VEALKA C+ K  +FDDGDLGE PFT D LEAPI  KFKTP+MKPYDGSK+PKDYV+VFEGLM+ QAA DAIKCR FQIA T SARLWYR+LPAR ISTY
Subjt:  VEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTY

Query:  SQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQ
        SQLRKEFISQF SR+YDRKTATHL TIRQ +GETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLAD+TLTVKLGEEAPATFAEVLQKAKKVIDGQELL+
Subjt:  SQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQ

Query:  TKTGRPEK
        TKTGRPEK
Subjt:  TKTGRPEK

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]3.8e-10884.23Show/hide
Query:  STPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQA
        S  +A+SSYNPI PEG+ITREEF+QLKSKFDAQVEALK RC+ K  AFDDGDLGESPFTSD LEA IP KFKTP+MK YDGSKDPKDYVEVFEGLMD QA
Subjt:  STPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQA

Query:  AIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLAD
        A DAIKCR FQIALT SARLWYR+LPAR ISTYSQLRKEFISQF+SRHYDRKT THL TIRQ EG+TL+EY+TRFQEEQLKVVHCSDDS+MCYFLTGLAD
Subjt:  AIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLAD

Query:  ETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK
        ET TVKLGEEA ATFAEVLQ  KK IDGQELL+TKT RPEK
Subjt:  ETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]2.3e-13769.82Show/hide
Query:  DRRALAANDGHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCTT
        DRRAL ANDGHQREVGAEV E QI +GLGTEP CRSARITTP L   HPKP KANRGRGGASRRTT G APAP+R+N    +K+      + +  +    
Subjt:  DRRALAANDGHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCTT

Query:  KWCKLQVPSLDLKTERRVTR----------------CVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDD
        +   +Q      ++E R  R                 + K    S     S  +A+SSYNP+VPEG+ITREEFDQLKSKFDAQVE LKARC+VKG  FDD
Subjt:  KWCKLQVPSLDLKTERRVTR----------------CVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDD

Query:  GDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYD
        GDLGESPFTSD LEA IP KFKTP+MKPYDGSKDPKDYVEVFEGLM  QAA DAIK R FQIALT SARLWYR+LPAR ISTYSQLRKEF SQF SRHY+
Subjt:  GDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYD

Query:  RKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK
        RKTATHL TIRQ E ETLREYVT FQEEQLKV H SDDSA+CYFLT L DETLTVKLGEEAPATFAEVLQKAKKVIDGQEL +TKTGR EK
Subjt:  RKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK

TrEMBL top hitse value%identityAlignment
A0A6J1DDW5 uncharacterized protein LOC1110196344.0e-11185.02Show/hide
Query:  STQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEG
        S     S  +A+SSYNPI PEG+ITREEFDQLKSKFDAQVEALKA+C+ K  +FDDGDLGESPFTSD LEA IPLKFKTP+MKPYDGSKDPKDYVEVFEG
Subjt:  STQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEG

Query:  LMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYF
        LMD QAA DAIKCR FQIALT SARLWYR+LPAR ISTYSQLRKEFI QF SRHYDRKTATHLTTIRQ EGETLREYVTRFQEEQLKV HCSD SAMCYF
Subjt:  LMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYF

Query:  LTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK
        LT LADETLTVKL EEAPATF EVLQKAKK+IDGQELL+TKT RPEK
Subjt:  LTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK

A0A6J1DHB3 uncharacterized protein LOC1110204791.2e-12066.22Show/hide
Query:  ADRRALAANDGHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCT
        ADRRALAAN GHQREVGAEV E Q  + LGTEPLCRSARITTPVL   HPKPS                                               
Subjt:  ADRRALAANDGHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCT

Query:  TKWCKLQVPSLDLKTERRVTRCVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEA
                                              KA+SSYNPI P G+ITREEFDQLKSKFDAQVEALKARC+ K  +FDDGDLGE  F+SD LEA
Subjt:  TKWCKLQVPSLDLKTERRVTRCVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEA

Query:  PIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEG
         IP KFKTP+MKPYDGSKDPKDYVEVFE LMD QAA DAIKC  FQIALT SARLWYR+LPARLISTYSQLRKEFISQF SRHYDRKT THL TIRQ EG
Subjt:  PIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEG

Query:  ETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK
        ETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADETLTVKL EEAPATFAEVLQK KKVIDGQELL+TKTGRPEK
Subjt:  ETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK

A0A6J1DM55 uncharacterized protein LOC1110222679.8e-11070.13Show/hide
Query:  KWCKLQVPSLDLKTERRVTRCVSKEVFI---------------------------------STQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQ
        +WCKL+   L LKT      C SK V                                   S+    S  +A+SSYNPI P+ +ITREEFDQLKSKFDAQ
Subjt:  KWCKLQVPSLDLKTERRVTRCVSKEVFI---------------------------------STQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQ

Query:  VEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTY
        VEALKA C+ K  +FDDGDLGE PFT D LEAPI  KFKTP+MKPYDGSK+PKDYV+VFEGLM+ QAA DAIKCR FQIA T SARLWYR+LPAR ISTY
Subjt:  VEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTY

Query:  SQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQ
        SQLRKEFISQF SR+YDRKTATHL TIRQ +GETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLAD+TLTVKLGEEAPATFAEVLQKAKKVIDGQELL+
Subjt:  SQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQ

Query:  TKTGRPEK
        TKTGRPEK
Subjt:  TKTGRPEK

A0A6J1DPN4 uncharacterized protein LOC1110230601.9e-10884.23Show/hide
Query:  STPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQA
        S  +A+SSYNPI PEG+ITREEF+QLKSKFDAQVEALK RC+ K  AFDDGDLGESPFTSD LEA IP KFKTP+MK YDGSKDPKDYVEVFEGLMD QA
Subjt:  STPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQA

Query:  AIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLAD
        A DAIKCR FQIALT SARLWYR+LPAR ISTYSQLRKEFISQF+SRHYDRKT THL TIRQ EG+TL+EY+TRFQEEQLKVVHCSDDS+MCYFLTGLAD
Subjt:  AIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLAD

Query:  ETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK
        ET TVKLGEEA ATFAEVLQ  KK IDGQELL+TKT RPEK
Subjt:  ETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK

A0A6J1DZJ1 uncharacterized protein LOC1110257381.1e-13769.82Show/hide
Query:  DRRALAANDGHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCTT
        DRRAL ANDGHQREVGAEV E QI +GLGTEP CRSARITTP L   HPKP KANRGRGGASRRTT G APAP+R+N    +K+      + +  +    
Subjt:  DRRALAANDGHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCTT

Query:  KWCKLQVPSLDLKTERRVTR----------------CVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDD
        +   +Q      ++E R  R                 + K    S     S  +A+SSYNP+VPEG+ITREEFDQLKSKFDAQVE LKARC+VKG  FDD
Subjt:  KWCKLQVPSLDLKTERRVTR----------------CVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDD

Query:  GDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYD
        GDLGESPFTSD LEA IP KFKTP+MKPYDGSKDPKDYVEVFEGLM  QAA DAIK R FQIALT SARLWYR+LPAR ISTYSQLRKEF SQF SRHY+
Subjt:  GDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGLMDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYD

Query:  RKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK
        RKTATHL TIRQ E ETLREYVT FQEEQLKV H SDDSA+CYFLT L DETLTVKLGEEAPATFAEVLQKAKKVIDGQEL +TKTGR EK
Subjt:  RKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGGTTCGACTTGGAGGGGTTCGATCTGCTCAACCCCGACAGGTTCGACTTGATAAAAGACAGGGATAACCGCCGGTCGTTTATTCCACGGTTCAAGTGGAGTCA
AAGACCAGGTTCAAGTAGAGATCTAAGCCATGGAATTTGGGCTGAGGCCCATGAAGTGGAGGGCATAGCATATCGCGGTCACGTGGAGGACGCTCAGAATGGAGGGGTTC
GACTTGGAGGGGTTCGATCTGCTCAACCCCGACAGGTTTATTCCACGGTTCAAGTGGAGTCAAAGACCAGGTTCGAGCTAGCAGACCGAAGAGCTCTGGCTGCTAACGAT
GGCCACCAGAGAGAGGTCGGGGCAGAAGTGGCAGAGAGCCAGATTCAAAAAGGTCTAGGGACCGAGCCGCTCTGTAGGTCGGCACGTATCACCACGCCTGTTCTGCTGCT
AACACATCCAAAACCATCTAAGGCCAATCGCGGCCGAGGTGGTGCCTCGAGAAGAACCACTCGAGGAGAAGCTCCAGCCCCTACTAGGGATAACTTGATGCACTCCAGAA
AGAAATGGAGGCAATGCGCACCCAGATGCGTACCATGGAAGAGATGTACAACAAAATGGTGTAAGCTGCAGGTGCCGAGTCTCGATCTGAAGACCGAGCGGCGCGTGACG
AGGTGCGTGAGCAAGGAGGTCTTCATCTCGACCCAATCGATGAAGAGCACCCCGAAGGCCAAATCCTCTTACAACCCAATAGTTCCTGAGGGAATGATTACGAGGGAAGA
GTTCGACCAGCTCAAGAGCAAATTTGATGCTCAAGTAGAAGCCTTAAAGGCAAGGTGCAAGGTGAAAGGGAGAGCATTTGATGATGGTGACCTGGGGGAATCGCCATTCA
CCTCGGATACCTTGGAGGCTCCAATCCCTCTAAAGTTCAAAACTCCCAGTATGAAGCCATATGATGGGTCTAAGGACCCAAAGGATTATGTTGAGGTCTTTGAAGGCCTC
ATGGATTTACAAGCAGCAATAGACGCCATTAAATGCCGCGTCTTCCAGATCGCGCTTACCGACAGCGCACGCTTGTGGTACAGAAAATTGCCGGCTAGGTTGATCTCGAC
CTACTCACAGTTGAGGAAAGAATTTATTAGTCAATTCTATTCTCGGCACTATGACAGAAAAACAGCGACCCATCTCACCACCATCAGACAGAATGAGGGCGAGACGCTTA
GAGAATACGTCACAAGGTTCCAGGAGGAGCAGCTGAAGGTTGTGCACTGCTCCGATGATTCGGCCATGTGCTACTTTCTCACCGGCCTGGCCGACGAGACTCTTACCGTG
AAGCTTGGAGAGGAAGCTCCAGCCACGTTTGCTGAGGTTTTGCAAAAGGCGAAGAAAGTTATCGATGGACAGGAGCTCCTCCAGACCAAGACTGGCCGGCCAGAAAAGTA
G
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGGTTCGACTTGGAGGGGTTCGATCTGCTCAACCCCGACAGGTTCGACTTGATAAAAGACAGGGATAACCGCCGGTCGTTTATTCCACGGTTCAAGTGGAGTCA
AAGACCAGGTTCAAGTAGAGATCTAAGCCATGGAATTTGGGCTGAGGCCCATGAAGTGGAGGGCATAGCATATCGCGGTCACGTGGAGGACGCTCAGAATGGAGGGGTTC
GACTTGGAGGGGTTCGATCTGCTCAACCCCGACAGGTTTATTCCACGGTTCAAGTGGAGTCAAAGACCAGGTTCGAGCTAGCAGACCGAAGAGCTCTGGCTGCTAACGAT
GGCCACCAGAGAGAGGTCGGGGCAGAAGTGGCAGAGAGCCAGATTCAAAAAGGTCTAGGGACCGAGCCGCTCTGTAGGTCGGCACGTATCACCACGCCTGTTCTGCTGCT
AACACATCCAAAACCATCTAAGGCCAATCGCGGCCGAGGTGGTGCCTCGAGAAGAACCACTCGAGGAGAAGCTCCAGCCCCTACTAGGGATAACTTGATGCACTCCAGAA
AGAAATGGAGGCAATGCGCACCCAGATGCGTACCATGGAAGAGATGTACAACAAAATGGTGTAAGCTGCAGGTGCCGAGTCTCGATCTGAAGACCGAGCGGCGCGTGACG
AGGTGCGTGAGCAAGGAGGTCTTCATCTCGACCCAATCGATGAAGAGCACCCCGAAGGCCAAATCCTCTTACAACCCAATAGTTCCTGAGGGAATGATTACGAGGGAAGA
GTTCGACCAGCTCAAGAGCAAATTTGATGCTCAAGTAGAAGCCTTAAAGGCAAGGTGCAAGGTGAAAGGGAGAGCATTTGATGATGGTGACCTGGGGGAATCGCCATTCA
CCTCGGATACCTTGGAGGCTCCAATCCCTCTAAAGTTCAAAACTCCCAGTATGAAGCCATATGATGGGTCTAAGGACCCAAAGGATTATGTTGAGGTCTTTGAAGGCCTC
ATGGATTTACAAGCAGCAATAGACGCCATTAAATGCCGCGTCTTCCAGATCGCGCTTACCGACAGCGCACGCTTGTGGTACAGAAAATTGCCGGCTAGGTTGATCTCGAC
CTACTCACAGTTGAGGAAAGAATTTATTAGTCAATTCTATTCTCGGCACTATGACAGAAAAACAGCGACCCATCTCACCACCATCAGACAGAATGAGGGCGAGACGCTTA
GAGAATACGTCACAAGGTTCCAGGAGGAGCAGCTGAAGGTTGTGCACTGCTCCGATGATTCGGCCATGTGCTACTTTCTCACCGGCCTGGCCGACGAGACTCTTACCGTG
AAGCTTGGAGAGGAAGCTCCAGCCACGTTTGCTGAGGTTTTGCAAAAGGCGAAGAAAGTTATCGATGGACAGGAGCTCCTCCAGACCAAGACTGGCCGGCCAGAAAAGTA
G
Protein sequenceShow/hide protein sequence
MEGFDLEGFDLLNPDRFDLIKDRDNRRSFIPRFKWSQRPGSSRDLSHGIWAEAHEVEGIAYRGHVEDAQNGGVRLGGVRSAQPRQVYSTVQVESKTRFELADRRALAAND
GHQREVGAEVAESQIQKGLGTEPLCRSARITTPVLLLTHPKPSKANRGRGGASRRTTRGEAPAPTRDNLMHSRKKWRQCAPRCVPWKRCTTKWCKLQVPSLDLKTERRVT
RCVSKEVFISTQSMKSTPKAKSSYNPIVPEGMITREEFDQLKSKFDAQVEALKARCKVKGRAFDDGDLGESPFTSDTLEAPIPLKFKTPSMKPYDGSKDPKDYVEVFEGL
MDLQAAIDAIKCRVFQIALTDSARLWYRKLPARLISTYSQLRKEFISQFYSRHYDRKTATHLTTIRQNEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADETLTV
KLGEEAPATFAEVLQKAKKVIDGQELLQTKTGRPEK