CuGenDBv2

Moc06g31280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g31280
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:23539212..23540623
RNA-Seq Expression: Moc06g31280
Synteny: Moc06g31280
Gene Ontology terms: NA
InterPro domains: NA
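
The genome location in this record follows a `chromosome:start..end` layout. A minimal Python sketch for splitting such a string and computing the span is given here. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr6:23539212..23540623")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval for Moc06g31280 covers 1412 bp.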


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 4.8e-105; % identity: 78.14)
Query:  QAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVE-------------
        +AE SRNP TPAGVITREEFDQLRGQLDAQVEALKAKC+QKEGPLNDGDLGESPF+SDVLEA IPPKFKAPTVKPYDGSKDPKDYVE             
Subjt:  QAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVE-------------

Query:  ----RAIVVSETAS------QVDLDLLSAEKGVPRPVL--FSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEAL
            RA  ++ T S      ++    +S    + R  L  FS+    KKTATHLATIRQKEGETLREYVTRFQEEQL+VAHCSDDSAMCYFLTGLADEAL
Subjt:  ----RAIVVSETAS------QVDLDLLSAEKGVPRPVL--FSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS
        TVKLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKIGRGRSGKDIE AD KSKDKGSFSSGRAEYRRAENGPTRS
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia] (e value: 1.2e-111; % identity: 64.31)
Query:  MRTKMRSIEEMYNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNP
        MRT+M ++E+MY+EM+ AAGA SRSENRV    + EQRG HLGPV++ HPE  E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAE S NP
Subjt:  MRTKMRSIEEMYNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNP

Query:  VTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVERAIVVSETASQVDL----DLL
        +TP GVITREEFDQL+ + DAQVEALKAKC++KE   +DGDLGESPF+SD+LEA IP KFK PT+KPYDGSKDPKDYVE    + +  +  D     D  
Subjt:  VTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVERAIVVSETASQVDL----DLL

Query:  SAEKGVPR---------------------PVLFSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEA
         A  G  R                      + FS+    +KTATHL TIRQKEGETLREYVTRFQEEQL+VAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  SAEKGVPR---------------------PVLFSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFS-SGRAEYRRAEN
        PATF EVLQKAKK+IDG ELLRTKT RPE+KI +GR+ KD  K D+K++DKG  S S R  YRR++N
Subjt:  PATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFS-SGRAEYRRAEN

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia] (e value: 2.5e-109; % identity: 76.33)
Query:  KRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAE S NP TPAGVITREEFDQLRG+LDAQVEALKAKC+QKEG LNDGDLGESPF+SDVLEA IP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGS

Query:  KDPKDYVE-----------------RAIVVSETAS------QVDLDLLSAEKGVPRPVL--FSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRV
        +DPKDYVE                 RA  ++ T S      ++    +S    + R  L  FS+    K+TATHLATIRQKEGETLREYVTRFQEEQL+V
Subjt:  KDPKDYVE-----------------RAIVVSETAS------QVDLDLLSAEKGVPRPVL--FSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKIGRGRSGKDIE+AD KSKDKGSFSS RA YRRAENGPTRS
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia] (e value: 1.9e-101; % identity: 76)
Query:  NRKRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYD
        + KRGSSLRKGQSPSRSHRSSNQQAE S N   PAG+ITREEFDQLRG+LDAQVEALKAKC+QK+  LNDGDLGE PF+SDVLEA IPPKFKAPTVKPYD
Subjt:  NRKRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYD

Query:  GSKDPKDYVERAIVVSETASQVDLDLLSAEKGVPRPVLFSALCKKTATHLATI--RQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTV
        G+KDPKDYVE          +  +D  +A   +       AL          +  RQKE ETLREYVTRFQEEQL+VAHCSDDSAMCYF TGLADEALTV
Subjt:  GSKDPKDYVERAIVVSETASQVDLDLLSAEKGVPRPVLFSALCKKTATHLATI--RQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTV

Query:  KLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPT
        KLGEEAP TFAEVLQKAKKVIDG ELLRTKTGRPERKIGRGRSGKD+E+AD KSKDKGSFSSGRAEYRRAENGPT
Subjt:  KLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPT

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia] (e value: 3.1e-128; % identity: 58.94)
Query:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQREVGA VVEGQ H+GL TEP  RSARIT P L PAH P+  KA RGRGG S++   G APAP+ EN DALQ+EMEAM
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSIEEMYNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNPV
        RT+M ++EEMYNEM+ A GAGSRSE+R                                +RGDLR+HL+RKR SSLRKG+SPS SH++SNQQAE S NPV
Subjt:  RTKMRSIEEMYNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNPV

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVE-----------------RAIVV
         P GVITREEFDQL+ + DAQVE LKA+C+ K    +DGDLGESPF+SD+LEA IP KFK PT+KPYDGSKDPKDYVE                 RA  +
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVE-----------------RAIVV

Query:  SETAS------QVDLDLLSAEKGVPRPV--LFSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAP
        + T+S      ++    +S    + +     FS+    +KTATHLATIRQKE ETLREYVT FQEEQL+VAH SDDSA+CYFLT L DE LTVKLGEEAP
Subjt:  SETAS------QVDLDLLSAEKGVPRPV--LFSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  ATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS
        ATFAEVLQKAKKVIDG EL RTKTGR E++I + +  ++  KA++KSKDK       AEYRR+++GP+RS
Subjt:  ATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 2.3e-105; % identity: 78.14)
Query:  QAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVE-------------
        +AE SRNP TPAGVITREEFDQLRGQLDAQVEALKAKC+QKEGPLNDGDLGESPF+SDVLEA IPPKFKAPTVKPYDGSKDPKDYVE             
Subjt:  QAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVE-------------

Query:  ----RAIVVSETAS------QVDLDLLSAEKGVPRPVL--FSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEAL
            RA  ++ T S      ++    +S    + R  L  FS+    KKTATHLATIRQKEGETLREYVTRFQEEQL+VAHCSDDSAMCYFLTGLADEAL
Subjt:  ----RAIVVSETAS------QVDLDLLSAEKGVPRPVL--FSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS
        TVKLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKIGRGRSGKDIE AD KSKDKGSFSSGRAEYRRAENGPTRS
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS

A0A6J1DDS5 uncharacterized protein LOC111019842 (e value: 1.2e-109; % identity: 76.33)
Query:  KRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAE S NP TPAGVITREEFDQLRG+LDAQVEALKAKC+QKEG LNDGDLGESPF+SDVLEA IP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGS

Query:  KDPKDYVE-----------------RAIVVSETAS------QVDLDLLSAEKGVPRPVL--FSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRV
        +DPKDYVE                 RA  ++ T S      ++    +S    + R  L  FS+    K+TATHLATIRQKEGETLREYVTRFQEEQL+V
Subjt:  KDPKDYVE-----------------RAIVVSETAS------QVDLDLLSAEKGVPRPVL--FSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKIGRGRSGKDIE+AD KSKDKGSFSS RA YRRAENGPTRS
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS

A0A6J1DDW5 uncharacterized protein LOC111019634 (e value: 5.7e-112; % identity: 64.31)
Query:  MRTKMRSIEEMYNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNP
        MRT+M ++E+MY+EM+ AAGA SRSENRV    + EQRG HLGPV++ HPE  E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAE S NP
Subjt:  MRTKMRSIEEMYNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNP

Query:  VTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVERAIVVSETASQVDL----DLL
        +TP GVITREEFDQL+ + DAQVEALKAKC++KE   +DGDLGESPF+SD+LEA IP KFK PT+KPYDGSKDPKDYVE    + +  +  D     D  
Subjt:  VTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVERAIVVSETASQVDL----DLL

Query:  SAEKGVPR---------------------PVLFSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEA
         A  G  R                      + FS+    +KTATHL TIRQKEGETLREYVTRFQEEQL+VAHCSD SAMCYFLT LADE LTVKL EEA
Subjt:  SAEKGVPR---------------------PVLFSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEA

Query:  PATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFS-SGRAEYRRAEN
        PATF EVLQKAKK+IDG ELLRTKT RPE+KI +GR+ KD  K D+K++DKG  S S R  YRR++N
Subjt:  PATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFS-SGRAEYRRAEN

A0A6J1DXR9 uncharacterized protein LOC111025109 (e value: 9.2e-102; % identity: 76)
Query:  NRKRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYD
        + KRGSSLRKGQSPSRSHRSSNQQAE S N   PAG+ITREEFDQLRG+LDAQVEALKAKC+QK+  LNDGDLGE PF+SDVLEA IPPKFKAPTVKPYD
Subjt:  NRKRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYD

Query:  GSKDPKDYVERAIVVSETASQVDLDLLSAEKGVPRPVLFSALCKKTATHLATI--RQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTV
        G+KDPKDYVE          +  +D  +A   +       AL          +  RQKE ETLREYVTRFQEEQL+VAHCSDDSAMCYF TGLADEALTV
Subjt:  GSKDPKDYVERAIVVSETASQVDLDLLSAEKGVPRPVLFSALCKKTATHLATI--RQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTV

Query:  KLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPT
        KLGEEAP TFAEVLQKAKKVIDG ELLRTKTGRPERKIGRGRSGKD+E+AD KSKDKGSFSSGRAEYRRAENGPT
Subjt:  KLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPT

A0A6J1DZJ1 uncharacterized protein LOC111025738 (e value: 1.5e-128; % identity: 58.94)
Query:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM
        MVQP +STNT DRR L A+D HQREVGA VVEGQ H+GL TEP  RSARIT P L PAH P+  KA RGRGG S++   G APAP+ EN DALQ+EMEAM
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAM

Query:  RTKMRSIEEMYNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNPV
        RT+M ++EEMYNEM+ A GAGSRSE+R                                +RGDLR+HL+RKR SSLRKG+SPS SH++SNQQAE S NPV
Subjt:  RTKMRSIEEMYNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNPV

Query:  TPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVE-----------------RAIVV
         P GVITREEFDQL+ + DAQVE LKA+C+ K    +DGDLGESPF+SD+LEA IP KFK PT+KPYDGSKDPKDYVE                 RA  +
Subjt:  TPAGVITREEFDQLRGQLDAQVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVE-----------------RAIVV

Query:  SETAS------QVDLDLLSAEKGVPRPV--LFSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAP
        + T+S      ++    +S    + +     FS+    +KTATHLATIRQKE ETLREYVT FQEEQL+VAH SDDSA+CYFLT L DE LTVKLGEEAP
Subjt:  SETAS------QVDLDLLSAEKGVPRPV--LFSA--LCKKTATHLATIRQKEGETLREYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  ATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS
        ATFAEVLQKAKKVIDG EL RTKTGR E++I + +  ++  KA++KSKDK       AEYRR+++GP+RS
Subjt:  ATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGPTRS

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
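
The % identity values listed with the hits can be approximated from an alignment block. In the middle line of each block, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below is only an illustration: `percent_identity` is a hypothetical helper, and dividing by the number of aligned columns (gaps included) is an assumption about how the reported figure is defined.

```python
def percent_identity(query, midline):
    """Percentage of aligned columns whose midline character is a letter.

    `query` is one aligned query string (gaps as '-'); `midline` is the
    matching middle line. Trailing spaces are often lost when a page is
    copied, so the midline is padded back to the query length.
    """
    if not query:
        raise ValueError("empty alignment")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Toy example: two identical columns out of four aligned columns.
print(percent_identity("ACDE", "A+ E"))  # 50.0
```

For a multi-block alignment, the query lines and the middle lines would each need to be concatenated before calling the function.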

Sequences
CDS sequence
ATGGTTCAACCAGCAAACTCAACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTAGTGGTAGAGGGGCAAGGTCACGA
TGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATAGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGCGCAGGGTCCCGATCTGAGAACCGAGTGACGTGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCAACCAGCAGGCTGAATGCTCTCGCAACCCAGTAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTTGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTAAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCTCCTCGGACGTTTTGGAAGCATCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTACGTTGAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACC
TACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGCAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGA
GAATATGTCACCAGATTCCAGGAGGAGCAGTTGAGGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAA
ACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACATGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGA
TCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATGCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGGAGGGCGGAGAACGGACCT
ACCAGGAGCTGA
mRNA sequence
ATGGTTCAACCAGCAAACTCAACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTAGTGGTAGAGGGGCAAGGTCACGA
TGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCCCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATAGAGGAAATG
TATAACGAAATGATATTAGCTGCAGGCGCAGGGTCCCGATCTGAGAACCGAGTGACGTGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCC
GCTCACACAGGAGCTCCAACCAGCAGGCTGAATGCTCTCGCAACCCAGTAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTTGACCAGCTGAGGGGCCAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTAAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCTCCTCGGACGTTTTGGAAGCATCGATCCCTCC
GAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTACGTTGAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACC
TACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGCAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGA
GAATATGTCACCAGATTCCAGGAGGAGCAGTTGAGGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAA
ACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACATGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGA
TCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATGCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGGAGGGCGGAGAACGGACCT
ACCAGGAGCTGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAMRTKMRSIEEM
YNEMILAAGAGSRSENRVTCVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAECSRNPVTPAGVITREEFDQLRGQLDA
QVEALKAKCKQKEGPLNDGDLGESPFSSDVLEASIPPKFKAPTVKPYDGSKDPKDYVERAIVVSETASQVDLDLLSAEKGVPRPVLFSALCKKTATHLATIRQKEGETLR
EYVTRFQEEQLRVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDIEKADAKSKDKGSFSSGRAEYRRAENGP
TRS
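
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The sketch below is a minimal illustration that assumes the standard genetic code and an in-frame CDS starting at its first base. `translate` is a hypothetical helper, not part of CuGenDB.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame CDS; stop codons are written as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons with ambiguous bases (e.g. N) raise KeyError in this sketch.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGTTCAACCAGCAAACTCA"))  # MVQPANS
print(translate("ACCAGGAGCTGA"))           # TRS*
```

Both fragments agree with the start (`MVQPANS`) and end (`TRS`) of the protein sequence shown above, with the final `TGA` read as the stop codon.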