CuGenDBv2

MS014698 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS014698
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1
Genome location: scaffold659:202882..205535
RNA-Seq Expression: MS014698
Synteny: MS014698
Gene Ontology terms: NA
InterPro domains: NA
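
The genome location above can be split into its parts programmatically. The sketch below is only an illustration: the function name `parse_location` is hypothetical and not part of CuGenDB, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("scaffold659:202882..205535")
# Assumption: 1-based, inclusive coordinates, so both end points are counted.
span = end - start + 1
print(seqid, start, end, span)  # scaffold659 202882 205535 2654
```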


Homology
GenBank top hits    e value    % identity    Alignment
XP_022154414.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Momordica charantia]    8.7e-107    98.99
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQND DMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

XP_022154415.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 [Momordica charantia]    2.0e-98    93.94
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNK          RSEQND DMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

XP_022154416.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X3 [Momordica charantia]    8.7e-107    98.99
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQND DMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima]    4.8e-89    82.83
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQ+M+LGV++LQ+GNSCYCTM+Q QM ++ AD+DM +KDVNNSK L Q SE+N  D+RKHQIGENVSRKDKINFLV TL+DLR SKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        Q+FPIASLK  LAVLEKE QWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMISIYYRNKML++LVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida]    6.3e-89    83.84
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQIM+LGVSRLQVG+ CYCTM+Q QM +QLA +D+KNKD NNSKAL Q SEQN  D+RKHQIG+NV RKDKINFLV TL+DLR SKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        Q+FPI SLK VL VLEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMI+IYYRNKML++LVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

TrEMBL top hits    e value    % identity    Alignment
A0A6J1DJJ2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2    9.4e-99    93.94
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNK          RSEQND DMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

A0A6J1DM10 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1    4.2e-107    98.99
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQND DMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

A0A6J1DNN5 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X3    4.2e-107    98.99
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQND DMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1    1.2e-88    82.32
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQIM+LGV++LQ+GNSCYCTM+Q QM ++  D+DM +KDVNNSK L Q SE+N  D+RKHQIGENVSRKDKI+FLV TL+DLR SKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        Q+FPIASLK  LAVLEKE QWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMISIYYRNKML++LVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1    2.3e-89    82.83
Query:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE
        GQ+M+LGV++LQ+GNSCYCTM+Q QM ++ AD+DM +KDVNNSK L Q SE+N  D+RKHQIGENVSRKDKINFLV TL+DLR SKEAVYGALDAWVAWE
Subjt:  GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE

Query:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        Q+FPIASLK  LAVLEKE QWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMISIYYRNKML++LVK+
Subjt:  QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

SwissProt top hits    e value    % identity    Alignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic    1.9e-27    47.58
Query:  LVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSV
        LV  L  L   KEAVYGAL+ WVAWE  FPI +  + L +L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+E+   W M +     S+
Subjt:  LVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSV

Query:  PWQLCRSMISIYYRNKMLDNLVKV
        P +L   MI++Y  + + D +++V
Subjt:  PWQLCRSMISIYYRNKMLDNLVKV

Q8LG95 Pentatricopeptide repeat-containing protein At4g21190    2.5e-24    36.94
Query:  NSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMR
        N+  +C       R  R  +  + +    K   ++A +  L   KE VYGALD+++AWE  FP+  +K+ L +LE E++W +++QV KWMLSKGQG TM 
Subjt:  NSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMR

Query:  VYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
         Y  L+ AL  D+R +E+ + W       L   P +    MISIYY+  M   L +V
Subjt:  VYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

Arabidopsis top hits    e value    % identity    Alignment
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1)    8.0e-58    62.3
Query:  VQAQMCQQLAD------RDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVL
        VQ+   Q +AD      R +KN+D  +     ++   N  + RKHQIGEN+ +KDKI FLV TL+D+  +KEAVYGALDAWVAWE+NFPIASLK V+A L
Subjt:  VQAQMCQQLAD------RDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVL

Query:  EKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        EKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD RAEE+H  W  K+G DLHSVPWQLC  M+ IY+RN ML  LVK+
Subjt:  EKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4)    3.6e-58    62.84
Query:  VQAQMCQQLAD------RDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVL
        VQ+   Q +AD      R +KN+D  +     ++   N  + RKHQIGEN+ +KDKI FLV TL+D+  +KEAVYGALDAWVAWE+NFPIASLK V+A L
Subjt:  VQAQMCQQLAD------RDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVL

Query:  EKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
        EKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD RAEE+H  W  K+G DLHSVPWQLC  M+ IY+RN ML  LVKV
Subjt:  EKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein    1.3e-28    47.58
Query:  LVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSV
        LV  L  L   KEAVYGAL+ WVAWE  FPI +  + L +L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+E+   W M +     S+
Subjt:  LVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSV

Query:  PWQLCRSMISIYYRNKMLDNLVKV
        P +L   MI++Y  + + D +++V
Subjt:  PWQLCRSMISIYYRNKMLDNLVKV

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein    1.3e-28    47.58
Query:  LVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSV
        LV  L  L   KEAVYGAL+ WVAWE  FPI +  + L +L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+E+   W M +     S+
Subjt:  LVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSV

Query:  PWQLCRSMISIYYRNKMLDNLVKV
        P +L   MI++Y  + + D +++V
Subjt:  PWQLCRSMISIYYRNKMLDNLVKV

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein    1.3e-28    47.58
Query:  LVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSV
        LV  L  L   KEAVYGAL+ WVAWE  FPI +  + L +L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+E+   W M +     S+
Subjt:  LVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSV

Query:  PWQLCRSMISIYYRNKMLDNLVKV
        P +L   MI++Y  + + D +++V
Subjt:  PWQLCRSMISIYYRNKMLDNLVKV
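
The % identity values in the tables above can be checked against the alignment blocks. The sketch below assumes that % identity is the number of identical positions (letters on the middle line of an alignment) divided by the alignment length (the length of the Query line, gap columns included). The function name `percent_identity` is hypothetical. The strings are the two alignment segments of the XP_022154414.1 hit.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter."""
    # Letters on the midline mark identical residues; '+' marks a
    # conservative substitution and a space marks a mismatch or gap.
    identical = sum(ch.isalpha() for ch in midline)
    # Use the query length so a midline with stripped trailing spaces
    # does not shorten the denominator.
    return round(100.0 * identical / len(query), 2)


query = (
    "GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE"
    "QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV"
)
midline = (
    "GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQND DMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWE"
    "QNFPIASLKQVLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVK+"
)

print(percent_identity(query, midline))  # 98.99 (196 identical of 198 columns)
```

Under the same assumption, the isoform X2 hit has 186 identical positions over 198 columns, which matches the 93.94 listed for it.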


Sequences
CDS sequence
GGACAAATAATGGACCTTGGAGTCAGCAGACTGCAAGTTGGGAACTCTTGTTACTGTACAATGGTTCAAGCTCAAATGTGTCAACAGCTTGCTGATAGAGATATGAAAAA
TAAGGATGTTAACAATAGTAAAGCTTTGTGCCAGAGATCAGAGCAAAATGATAGAGACATGAGAAAGCACCAAATTGGGGAAAATGTCTCACGGAAGGACAAAATTAACT
TCCTTGTAGCCACACTTGTTGATCTGAGAGGTAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAAACTTTCCAATAGCATCCCTTAAGCAG
GTATTAGCTGTTCTTGAGAAGGAACAACAGTGGCATAGAGTTGTTCAGGTAATCAAATGGATGTTAAGCAAGGGGCAGGGAACCACTATGAGAGTCTACGGACAGTTAAT
ACGGGCTTTAGACATGGACCATCGAGCAGAAGAATCACACAAATTCTGGGTCATGAAAATTGGTGCAGATCTACATTCGGTCCCTTGGCAATTATGCAGAAGCATGATAT
CAATATACTATCGAAATAAAATGCTAGACAATCTTGTGAAGGTA
mRNA sequence
GGACAAATAATGGACCTTGGAGTCAGCAGACTGCAAGTTGGGAACTCTTGTTACTGTACAATGGTTCAAGCTCAAATGTGTCAACAGCTTGCTGATAGAGATATGAAAAA
TAAGGATGTTAACAATAGTAAAGCTTTGTGCCAGAGATCAGAGCAAAATGATAGAGACATGAGAAAGCACCAAATTGGGGAAAATGTCTCACGGAAGGACAAAATTAACT
TCCTTGTAGCCACACTTGTTGATCTGAGAGGTAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAAACTTTCCAATAGCATCCCTTAAGCAG
GTATTAGCTGTTCTTGAGAAGGAACAACAGTGGCATAGAGTTGTTCAGGTAATCAAATGGATGTTAAGCAAGGGGCAGGGAACCACTATGAGAGTCTACGGACAGTTAAT
ACGGGCTTTAGACATGGACCATCGAGCAGAAGAATCACACAAATTCTGGGTCATGAAAATTGGTGCAGATCTACATTCGGTCCCTTGGCAATTATGCAGAAGCATGATAT
CAATATACTATCGAAATAAAATGCTAGACAATCTTGTGAAGGTA
Protein sequence
GQIMDLGVSRLQVGNSCYCTMVQAQMCQQLADRDMKNKDVNNSKALCQRSEQNDRDMRKHQIGENVSRKDKINFLVATLVDLRGSKEAVYGALDAWVAWEQNFPIASLKQ
VLAVLEKEQQWHRVVQVIKWMLSKGQGTTMRVYGQLIRALDMDHRAEESHKFWVMKIGADLHSVPWQLCRSMISIYYRNKMLDNLVKV
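
The relationship between the CDS and protein sequences can be checked by translating the CDS. The sketch below assumes the standard genetic code and reading frame 0. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tool. Only the first 42 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "GGACAAATAATGGACCTTGGAGTCAGCAGACTGCAAGTTGGG"
print(translate(cds_start))  # GQIMDLGVSRLQVG
```

The output matches the first 14 residues of the protein sequence above. Passing the full CDS to the same function is the way to check the whole protein.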