; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g30370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g30370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:22877322..22882821
RNA-Seq ExpressionMoc09g30370
SyntenyMoc09g30370
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001995 - Peptidase A2A, retrovirus, catalytic
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.2e-9555.98Show/hide
Query:  GRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSG
        GR ++RR+ +  ++  PYER+TPTTI IS+ILTNIEE+G+EKLLKR +KLRG P++ NKD+YCRFHR+H H+TS+ WELKRQIEDLIQD YFKK++ K  
Subjt:  GRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSG

Query:  LSSAGKKEERKRSRTPPQRDDRPASSTPFL---------------------------------------------------AVLVGP------VRRVLVD
         SSA KKEERK SRTP +R DRPA                                                         A+++ P      VRRVLVD
Subjt:  LSSAGKKEERKRSRTPPQRDDRPASSTPFL---------------------------------------------------AVLVGP------VRRVLVD

Query:  EGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTI
        EG SANI+SL TYLALGWT +QLKKS T LVGF+ ESV PEGC DL +T+    TQ+TQMAEFVVIDGR AYNAIFGRPIIHS RAIPSTLHQV+KYST 
Subjt:  EGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTI

Query:  NGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLL
        NGVG++RGEQ AS ECY SAL GS+VCA E +    G +     + +    + E AAPTEELELVPLL
Subjt:  NGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]7.2e-10357.56Show/hide
Query:  GRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSG
        GR +YRR+++  ++  PYER+TPTTI IS+ILTNIEE+G+EKLLKR +KLRG P++ +KD+YCRFHR+HGH+TS+ WELK QIEDLIQDGYFKK++ K  
Subjt:  GRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSG

Query:  LSSAGKKEERKRSRTPPQRDDRPASSTPFL---------------------------------------------------AVLVGP------VRRVLVD
         SSA KKEERKRSRTPP+R DRPA                                                         A+++ P      VRRVLVD
Subjt:  LSSAGKKEERKRSRTPPQRDDRPASSTPFL---------------------------------------------------AVLVGP------VRRVLVD

Query:  EGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTI
         GASANILSL TYLALGWT +QLKKSPT LVGF+GESV PEGC DL +T+ Q  T++TQMAEFVV+DGR AYNAIFGRPIIHS RAIPSTLHQV+KYST 
Subjt:  EGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTI

Query:  NGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQLSLG
        NGVG +RGEQ AS ECY S L G++VCA E +    G  T   E + P     E AAP EELELVPLLS EKQ+ LG
Subjt:  NGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQLSLG

XP_022152367.1 uncharacterized protein LOC111020111 [Momordica charantia]5.7e-10862.5Show/hide
Query:  RTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGL
        RT+YRR + D ++  PYERYTPTTI IS+IL NIEE+G+EKLLKR +KL+GD +K NKD+YCRF RDHGH+TSNCWELKRQIEDLI+DGYFKK++ K   
Subjt:  RTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGL

Query:  SSAGKK---EERKRSRTPPQRDDRPASSTPFL---------------------------AVLVGP------VRRVLVDEGASANILSLTTYLALGWTMAQ
        SSA KK   EERKRSRTPP+RDDRPA   PF                            A+++ P      VRRVLVD GASANILSL TYLALGWT +Q
Subjt:  SSAGKK---EERKRSRTPPQRDDRPASSTPFL---------------------------AVLVGP------VRRVLVDEGASANILSLTTYLALGWTMAQ

Query:  LKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASWECYTSALI
        LKKSPT LVGF+GESV+PEGC DL +T+ Q  TQ+TQMAEFVVIDGRLAYNAIFGRPIIHS RA+PSTLHQV+KYST NGVG +RGEQK S ECY S L 
Subjt:  LKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASWECYTSALI

Query:  GSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQL
        GS+VC  EE         QA++D+ P+  + + +   EELELVPLLSP + L
Subjt:  GSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.4e-9863.09Show/hide
Query:  RTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGL
        R DYRRS+S  +Q  PYE YTPTTI I +ILTNIEE G+EKLLKR +KLRGDP+K N D+YCRFHRDHGH+TSN WELKRQIEDLIQDGYFKK++ K   
Subjt:  RTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGL

Query:  SSAGKKEERKRSRTPPQRDDRPA-------------------------SSTPFL-------------AVLVGP------VRRVLVDEGASANILSLTTYL
        +S  KKEERKR RTPP+RDDRPA                         SS  F              A+++ P      VRR+LVD GASANILSL+TYL
Subjt:  SSAGKKEERKRSRTPPQRDDRPA-------------------------SSTPFL-------------AVLVGP------VRRVLVDEGASANILSLTTYL

Query:  ALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASW
        ALGWT +QLKKSPT LVGF+GES++ EGC DL ++I Q DTQ+TQMAEFVVIDGR AYNAIFGRPIIHS RA+PSTLHQV+KYST+NGVG +RGE K S 
Subjt:  ALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASW

Query:  ECYTSALIGSAVCAQEE
        ECY S    S+VCA EE
Subjt:  ECYTSALIGSAVCAQEE

XP_022157448.1 uncharacterized protein LOC111024144 [Momordica charantia]3.4e-10076.06Show/hide
Query:  QGRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKS
        QGR+DY+ SDSDS+ RGPYERYTPTTI I KILTNIEENGLEKLLKR DKLRGD +K NKD YCRFHR+H HDTS+CWELKRQIEDLIQDGYFKKY+ K 
Subjt:  QGRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKS

Query:  GLSSAGKKEERKRSRTPPQRDDRPASSTPFLAVLVGP---------VRRVLVDEGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLL
        G SS  KK+ERK SRTPP+RDDRPA          G          VR+V VDEGASANILSLTTYLALGWT A+LKKS T LVGFAGESVT E C DLL
Subjt:  GLSSAGKKEERKRSRTPPQRDDRPASSTPFLAVLVGP---------VRRVLVDEGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLL

Query:  ITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGII
        ITI QGDTQ+ +M EFVVIDGR AYNAIFGRPIIHSLRAIPST+HQVMKYS INGVG+I
Subjt:  ITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGII

TrEMBL top hitse value%identityAlignment
A0A6J1D9E1 uncharacterized protein LOC1110188231.6e-9555.98Show/hide
Query:  GRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSG
        GR ++RR+ +  ++  PYER+TPTTI IS+ILTNIEE+G+EKLLKR +KLRG P++ NKD+YCRFHR+H H+TS+ WELKRQIEDLIQD YFKK++ K  
Subjt:  GRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSG

Query:  LSSAGKKEERKRSRTPPQRDDRPASSTPFL---------------------------------------------------AVLVGP------VRRVLVD
         SSA KKEERK SRTP +R DRPA                                                         A+++ P      VRRVLVD
Subjt:  LSSAGKKEERKRSRTPPQRDDRPASSTPFL---------------------------------------------------AVLVGP------VRRVLVD

Query:  EGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTI
        EG SANI+SL TYLALGWT +QLKKS T LVGF+ ESV PEGC DL +T+    TQ+TQMAEFVVIDGR AYNAIFGRPIIHS RAIPSTLHQV+KYST 
Subjt:  EGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTI

Query:  NGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLL
        NGVG++RGEQ AS ECY SAL GS+VCA E +    G +     + +    + E AAPTEELELVPLL
Subjt:  NGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198993.5e-10357.56Show/hide
Query:  GRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSG
        GR +YRR+++  ++  PYER+TPTTI IS+ILTNIEE+G+EKLLKR +KLRG P++ +KD+YCRFHR+HGH+TS+ WELK QIEDLIQDGYFKK++ K  
Subjt:  GRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSG

Query:  LSSAGKKEERKRSRTPPQRDDRPASSTPFL---------------------------------------------------AVLVGP------VRRVLVD
         SSA KKEERKRSRTPP+R DRPA                                                         A+++ P      VRRVLVD
Subjt:  LSSAGKKEERKRSRTPPQRDDRPASSTPFL---------------------------------------------------AVLVGP------VRRVLVD

Query:  EGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTI
         GASANILSL TYLALGWT +QLKKSPT LVGF+GESV PEGC DL +T+ Q  T++TQMAEFVV+DGR AYNAIFGRPIIHS RAIPSTLHQV+KYST 
Subjt:  EGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTI

Query:  NGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQLSLG
        NGVG +RGEQ AS ECY S L G++VCA E +    G  T   E + P     E AAP EELELVPLLS EKQ+ LG
Subjt:  NGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQLSLG

A0A6J1DG07 uncharacterized protein LOC1110201112.8e-10862.5Show/hide
Query:  RTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGL
        RT+YRR + D ++  PYERYTPTTI IS+IL NIEE+G+EKLLKR +KL+GD +K NKD+YCRF RDHGH+TSNCWELKRQIEDLI+DGYFKK++ K   
Subjt:  RTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGL

Query:  SSAGKK---EERKRSRTPPQRDDRPASSTPFL---------------------------AVLVGP------VRRVLVDEGASANILSLTTYLALGWTMAQ
        SSA KK   EERKRSRTPP+RDDRPA   PF                            A+++ P      VRRVLVD GASANILSL TYLALGWT +Q
Subjt:  SSAGKK---EERKRSRTPPQRDDRPASSTPFL---------------------------AVLVGP------VRRVLVDEGASANILSLTTYLALGWTMAQ

Query:  LKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASWECYTSALI
        LKKSPT LVGF+GESV+PEGC DL +T+ Q  TQ+TQMAEFVVIDGRLAYNAIFGRPIIHS RA+PSTLHQV+KYST NGVG +RGEQK S ECY S L 
Subjt:  LKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASWECYTSALI

Query:  GSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQL
        GS+VC  EE         QA++D+ P+  + + +   EELELVPLLSP + L
Subjt:  GSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204796.8e-9963.09Show/hide
Query:  RTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGL
        R DYRRS+S  +Q  PYE YTPTTI I +ILTNIEE G+EKLLKR +KLRGDP+K N D+YCRFHRDHGH+TSN WELKRQIEDLIQDGYFKK++ K   
Subjt:  RTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGL

Query:  SSAGKKEERKRSRTPPQRDDRPA-------------------------SSTPFL-------------AVLVGP------VRRVLVDEGASANILSLTTYL
        +S  KKEERKR RTPP+RDDRPA                         SS  F              A+++ P      VRR+LVD GASANILSL+TYL
Subjt:  SSAGKKEERKRSRTPPQRDDRPA-------------------------SSTPFL-------------AVLVGP------VRRVLVDEGASANILSLTTYL

Query:  ALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASW
        ALGWT +QLKKSPT LVGF+GES++ EGC DL ++I Q DTQ+TQMAEFVVIDGR AYNAIFGRPIIHS RA+PSTLHQV+KYST+NGVG +RGE K S 
Subjt:  ALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASW

Query:  ECYTSALIGSAVCAQEE
        ECY S    S+VCA EE
Subjt:  ECYTSALIGSAVCAQEE

A0A6J1DTD9 uncharacterized protein LOC1110241441.6e-10076.06Show/hide
Query:  QGRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKS
        QGR+DY+ SDSDS+ RGPYERYTPTTI I KILTNIEENGLEKLLKR DKLRGD +K NKD YCRFHR+H HDTS+CWELKRQIEDLIQDGYFKKY+ K 
Subjt:  QGRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKS

Query:  GLSSAGKKEERKRSRTPPQRDDRPASSTPFLAVLVGP---------VRRVLVDEGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLL
        G SS  KK+ERK SRTPP+RDDRPA          G          VR+V VDEGASANILSLTTYLALGWT A+LKKS T LVGFAGESVT E C DLL
Subjt:  GLSSAGKKEERKRSRTPPQRDDRPASSTPFLAVLVGP---------VRRVLVDEGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLL

Query:  ITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGII
        ITI QGDTQ+ +M EFVVIDGR AYNAIFGRPIIHSLRAIPST+HQVMKYS INGVG+I
Subjt:  ITISQGDTQITQMAEFVVIDGRLAYNAIFGRPIIHSLRAIPSTLHQVMKYSTINGVGII

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCAAGGACGAACTGACTACAGGAGGTCTGATTCGGATTCGAGCCAGAGGGGGCCGTATGAGCGGTACACCCCAACCACCATTTCGATCTCCAAGATTTTAACCAA
TATCGAGGAGAATGGTTTGGAGAAACTTCTCAAGCGGCTTGATAAACTCAGAGGAGACCCGAAAAAGCACAATAAGGACAGGTACTGCCGTTTTCATCGCGATCACGGCC
ATGATACCTCGAATTGCTGGGAGTTAAAGCGTCAAATTGAAGACCTGATTCAAGATGGCTACTTCAAAAAGTACATCGTCAAGTCAGGCTTGAGCTCGGCAGGCAAAAAA
GAAGAGAGGAAGCGTTCAAGAACGCCACCTCAGCGAGATGACCGACCTGCGTCATCAACACCATTTTTGGCTGTTCTAGTGGGGCCAGTCAGAAGAGTGCTAGTGGATGA
GGGTGCTTCTGCCAACATTTTGTCTCTCACTACTTACCTGGCGTTGGGATGGACCATGGCACAGTTGAAAAAGAGCCCAACCTCCTTAGTGGGATTTGCGGGTGAATCTG
TCACCCCAGAAGGTTGCACAGATCTTCTAATCACCATTAGTCAAGGCGACACTCAAATCACTCAGATGGCCGAGTTCGTGGTAATAGATGGTAGGTTGGCCTACAACGCC
ATATTCGGACGACCTATCATCCATTCACTGCGTGCCATCCCGTCGACCTTGCACCAAGTCATGAAATACTCTACGATCAATGGGGTTGGCATAATCCGAGGTGAACAGAA
AGCATCCTGGGAGTGCTATACCTCGGCTTTAATAGGATCAGCAGTCTGCGCCCAGGAAGAAGTGGAAAAACTTCCAGGTCCGGTAACTCAAGCTTCCGAAGACGAGCAGC
CCAAGACCAACCAGTGCGAAGTAGCTGCGCCCACTGAAGAGTTGGAGCTTGTCCCCTTGCTCAGCCCAGAGAAACAACTTTCCCTGGGACCCAAGGATGTCAACCTCCTG
GGAGTCTATCACCCCAGCTTCTTAGGAGGAGTTTACACCGGCAGCAGAAGGAATAGTCTCGTCAGCGTCTTGGGAAGAAGAGTCATCGGCCTCTTCCTCGTCGAGCTCGG
CATCGGAATCAAGCTTCTTCAAGCATTGATCGACGCGGTGCTGGGAGCCAGGGGTCCATTAGGACTCGAAGCCCACTTCTCTGCATATCGCAGTTTGATGGGTGCGAAAA
GCTTCCTCCAGCAGAACTCTGTTGCTAAGCTTGGACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCAAGGACGAACTGACTACAGGAGGTCTGATTCGGATTCGAGCCAGAGGGGGCCGTATGAGCGGTACACCCCAACCACCATTTCGATCTCCAAGATTTTAACCAA
TATCGAGGAGAATGGTTTGGAGAAACTTCTCAAGCGGCTTGATAAACTCAGAGGAGACCCGAAAAAGCACAATAAGGACAGGTACTGCCGTTTTCATCGCGATCACGGCC
ATGATACCTCGAATTGCTGGGAGTTAAAGCGTCAAATTGAAGACCTGATTCAAGATGGCTACTTCAAAAAGTACATCGTCAAGTCAGGCTTGAGCTCGGCAGGCAAAAAA
GAAGAGAGGAAGCGTTCAAGAACGCCACCTCAGCGAGATGACCGACCTGCGTCATCAACACCATTTTTGGCTGTTCTAGTGGGGCCAGTCAGAAGAGTGCTAGTGGATGA
GGGTGCTTCTGCCAACATTTTGTCTCTCACTACTTACCTGGCGTTGGGATGGACCATGGCACAGTTGAAAAAGAGCCCAACCTCCTTAGTGGGATTTGCGGGTGAATCTG
TCACCCCAGAAGGTTGCACAGATCTTCTAATCACCATTAGTCAAGGCGACACTCAAATCACTCAGATGGCCGAGTTCGTGGTAATAGATGGTAGGTTGGCCTACAACGCC
ATATTCGGACGACCTATCATCCATTCACTGCGTGCCATCCCGTCGACCTTGCACCAAGTCATGAAATACTCTACGATCAATGGGGTTGGCATAATCCGAGGTGAACAGAA
AGCATCCTGGGAGTGCTATACCTCGGCTTTAATAGGATCAGCAGTCTGCGCCCAGGAAGAAGTGGAAAAACTTCCAGGTCCGGTAACTCAAGCTTCCGAAGACGAGCAGC
CCAAGACCAACCAGTGCGAAGTAGCTGCGCCCACTGAAGAGTTGGAGCTTGTCCCCTTGCTCAGCCCAGAGAAACAACTTTCCCTGGGACCCAAGGATGTCAACCTCCTG
GGAGTCTATCACCCCAGCTTCTTAGGAGGAGTTTACACCGGCAGCAGAAGGAATAGTCTCGTCAGCGTCTTGGGAAGAAGAGTCATCGGCCTCTTCCTCGTCGAGCTCGG
CATCGGAATCAAGCTTCTTCAAGCATTGATCGACGCGGTGCTGGGAGCCAGGGGTCCATTAGGACTCGAAGCCCACTTCTCTGCATATCGCAGTTTGATGGGTGCGAAAA
GCTTCCTCCAGCAGAACTCTGTTGCTAAGCTTGGACTCTGA
Protein sequenceShow/hide protein sequence
MEQGRTDYRRSDSDSSQRGPYERYTPTTISISKILTNIEENGLEKLLKRLDKLRGDPKKHNKDRYCRFHRDHGHDTSNCWELKRQIEDLIQDGYFKKYIVKSGLSSAGKK
EERKRSRTPPQRDDRPASSTPFLAVLVGPVRRVLVDEGASANILSLTTYLALGWTMAQLKKSPTSLVGFAGESVTPEGCTDLLITISQGDTQITQMAEFVVIDGRLAYNA
IFGRPIIHSLRAIPSTLHQVMKYSTINGVGIIRGEQKASWECYTSALIGSAVCAQEEVEKLPGPVTQASEDEQPKTNQCEVAAPTEELELVPLLSPEKQLSLGPKDVNLL
GVYHPSFLGGVYTGSRRNSLVSVLGRRVIGLFLVELGIGIKLLQALIDAVLGARGPLGLEAHFSAYRSLMGAKSFLQQNSVAKLGL