CuGenDBv2

Sgr025541 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025541
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome location: tig00007724:1356303..1359438
RNA-Seq Expression: Sgr025541
Synteny: Sgr025541
Gene Ontology terms:
GO:0001522 - pseudouridine synthesis (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domains: IPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like
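The genome location above follows a `contig:start..end` pattern. A minimal sketch of how such a string could be split into its parts is shown below. The function name `parse_location` is hypothetical, and the span length assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import Tuple

# Pattern for "contig:start..end" strings such as the one in this record.
LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["contig"], int(match["start"]), int(match["end"])


contig, start, end = parse_location("tig00007724:1356303..1359438")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_length = end - start + 1
```

Under that assumption the gene spans 3136 bp on contig tig00007724.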


Homology
GenBank top hits | e value | % identity | Alignment
XP_022158716.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Momordica charantia] | 8.7e-225 | 85.29
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRWTMDGQGFW+LDVSTP TLDG+ASPVP    LLPLGLSRG RLSRAKQIDFMQRFMAAPFVPSYAPSHGF+L RVF  PF   GS T LGQFNLQ
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKRSGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPD--DTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++SF RSG M+ S SS LQ IGRHLCH S YALG SSD  L PD  D+LLISFDGYG+N +LRTKA+LHHKFLHHDLTMEALSPGLFVD+ GNYWDVP
Subjt:  KFLASFKRSGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPD--DTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSL+IDLGSAA DSGPSYHLSMHHN GSPSQ+G+E+TGAVPFCLLPGLS+KAAFS+K +F+IWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        DNSVRSAAQDSLQEFKGL+MQTS IRSTVLADVFASISFSAQYGMFQR FLDLTRFSA MDFHSGSKFISGA LLIEDLSNS +PRTE+VKA LP+ARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN
        LQQQIAGP+SFRADS VTIDLNK  WG+QVEEPTFALEYALQVLGSAKA+AWYSPK REFMVELRFYEN
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN

XP_022959177.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita moschata] | 5.8e-221 | 84.04
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRWTM+GQ FWDLDVSTPRTLDGSASPVP D  LLPLGLSRG RLSRAKQIDFMQ+FMAAPFVPS+ PSHGF+L RVF+IPF DSGSAT LGQFN+Q
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++S K+S  GEM QS SSLLQGIGRHL  RSLYA G SSD LLTPDD LLISFDGYGD++ILRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVIDLGSA +DSG SYHLSMHHN GSPSQ+GSEQT   PFCLLPGLS KAAF+ KK+ EIWRSNAKKLK VQPYDIFL+NPHVSLSGIIGAVAT+YFG
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        D S  SAA+ SLQEF+GL+MQTS IRSTV ADVFASISFSAQYGMFQR FLDLTRFS R DFHSGSKF+SGAMLLIEDLSNS+ PRTESVKATLPNARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN
         QQQIAGPVSFRADSGV IDL+K  WG +QVEEPTFALEYAL  LGSAKA+AWYSPK REFMVELRFYEN
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN

XP_023006570.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita maxima] | 3.4e-221 | 84.26
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRWTM+GQ FWDLDVSTPRTLDGSASPVP D  LLPLGLSRG RLSRAKQIDFMQ+FMAAPFVPSY PSHGF+L RVF+IPF DSGSAT LGQFN+Q
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++S K+S  GEM QS SSLLQGIGRHL  RSLYA G SSD LLTPDD LLISFDGYGD+++LRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVIDLGSAA+DSG SYHLSMHHN GSPSQ+GSEQT   PFCLLPGLS KAAF+ KK+ EIWRSNAKKLK VQPYDIFLSNPHVSLSGIIGAVAT+YF 
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        D SV SAA+ SLQEFKGLHMQTS IRSTV ADVFASISFSAQYGMFQ  FLDLTRFS R DFHSGSKF+SGAMLLIEDLSNS+ PRTESVKATLPNARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN
         QQQIAGPVSFRAD+GV IDL+K  WG +QVEEPTFALEYAL  LGSAKA+AWYSPK  EFMVELRFYEN
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN

XP_023548884.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo] | 4.8e-223 | 84.89
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRWTM+GQ FWDLDVSTPRTLDGSASPVP D  LLPLGLSRG RLSRAKQIDFMQ+FMAAPFVPSY PSHGF+L RVF+IPF DSGSAT LGQFN+Q
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++S K+S  GEM QS SSLLQGIGRHL  RSLYA G SSD LLTPDD LLISFDGYGD++ILRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVIDLGSAA+DSG SYHLSMHHN GSPSQ+GSEQT   PFCLLPGLS KAAF+ KK+ EIWRSNAKKLK VQPYDIFLSNPHVSLSGIIGAVAT+YFG
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        D S  SAA+ SLQEFKGL+MQTS IRSTV ADVFASISFSAQYGMFQR FLDLTRFS R DFHSGSKF+SGAMLLIEDLSNS+ PRTESVKATLPNARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN
         QQQIAGPVSFRAD+GV IDL+K  WG +QVEEPTFALEYAL VLGSAKA+AWYSPK REFMVELRFYEN
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN

XP_038875869.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida] | 5.6e-224 | 84.22
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRW MDGQGFWDLDVSTPRTLDGSASPVP    LLPLGLSRG RLSRAKQIDFMQ FMAAPFVPSY+PSHGF+L RVF+IPF DSGS T LGQFNLQ
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++S K+S  G+M QS SS LQ IGRHLCHRSLYALG SSD LL PDD+L+ISFDGYGDNEI+RTKAV HHKFLHHDLTMEA SPGLFVDK G YWDVP
Subjt:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        S+LV+DLGSAAS+SG SYHLSMH NTGSPSQ+GSEQ  + P CLLPGLS KAAF++KK+ EIWRSNAKKLKMVQPYDIFLS PHVSLSGIIGAVATTYFG
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        DNS+RSAAQDSL EFKGL++QTS IRSTV ADVFASISFSAQYGMFQR +LDLT FSARMDFHSGSKF+SGAMLLI+DLSNSR PRTESV+ATLP+ARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE
        LQQQIAGPVSFRADSGV IDLNK  WG + V+EPTFALEYALQVLGSAKA+AWYSPKHREFMVELRFYE
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K824 Uncharacterized protein | 2.8e-213 | 79.96
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRW MDGQGFWDLDVST RTLDGSASPVP    LLPLGLSRG RLSRAKQIDFMQRFMAAPFVPSY+PSHGF+L RVF++PF DSGS T LGQFNLQ
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKR--SGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++S  +  SGEM QS SSLLQ IGRHL  RSLYA+G S+D LL PDD+L+ISFDGYGD++I+RTKAV H KFLHHDLT+EALSPGLF++KCG YWDVP
Subjt:  KFLASFKR--SGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLV+DLGS ASDSG SYHLSMH N G PSQ GSE T + PFCLLPGLS KAAF++KK+FEIWRSNAKKLKMVQPYDIFLS PHVSLS IIGAVAT+YFG
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        D+  RSAAQDSL++FKG +M++S IRSTV AD+F SISFSAQYGMFQ+ +LDLTRFSA MDFHSGSKF+SG+MLLI+DLSNSR P+TESVKATLPNARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE
        +QQQIAGPVSFRAD+GV IDLNK  W  ++VEEPTFALEYAL VLGSAKA+AWYSPKHREFMVELRFYE
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE

A0A5D3BY40 Protein TRIGALACTOSYLDIACYLGLYCEROL 4 | 1.5e-209 | 79.53
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRW MD  GFWDLDVST RTLDGSASPVP    LLPLGLSRG RLSRAKQIDFMQ FM APFVPSY+PSHGF+L RVF+IPF DSGS T LGQFNLQ
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKR--SGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++S  +  SGEM QS SS +Q IGRHL  RSLYA+G S+D LL PDD+L+ISFDGYGD++I+RTKAV H KFLHHDLTMEALSPGLF+DK G YWDVP
Subjt:  KFLASFKR--SGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLV+DLGSAASDSG SYHLSMH NTG PS  GSE T + PFCL PGLS KAAF++KK+FEIWRSNAKKLKMVQPYDIFLS PHVSLS IIGAVAT+YFG
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        D+ VRSAAQ SL EFKG +MQTS IRST+ AD+F SISFSAQYGMFQ+ +LDLTRFSA MDFHSGSKF+SG+MLLI+DLSNSR P+TE+VKATLPNARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE
        +QQQIAGPVSFRADSGV IDLNK  W  ++V+EPTFALEYALQVLGSAKA+AWYSPKHREFMVELRFYE
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE

A0A6J1E1S3 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | 4.2e-225 | 85.29
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRWTMDGQGFW+LDVSTP TLDG+ASPVP    LLPLGLSRG RLSRAKQIDFMQRFMAAPFVPSYAPSHGF+L RVF  PF   GS T LGQFNLQ
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPD--DLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKRSGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPD--DTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++SF RSG M+ S SS LQ IGRHLCH S YALG SSD  L PD  D+LLISFDGYG+N +LRTKA+LHHKFLHHDLTMEALSPGLFVD+ GNYWDVP
Subjt:  KFLASFKRSGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPD--DTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSL+IDLGSAA DSGPSYHLSMHHN GSPSQ+G+E+TGAVPFCLLPGLS+KAAFS+K +F+IWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        DNSVRSAAQDSLQEFKGL+MQTS IRSTVLADVFASISFSAQYGMFQR FLDLTRFSA MDFHSGSKFISGA LLIEDLSNS +PRTE+VKA LP+ARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN
        LQQQIAGP+SFRADS VTIDLNK  WG+QVEEPTFALEYALQVLGSAKA+AWYSPK REFMVELRFYEN
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN

A0A6J1H3U0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | 2.8e-221 | 84.04
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRWTM+GQ FWDLDVSTPRTLDGSASPVP D  LLPLGLSRG RLSRAKQIDFMQ+FMAAPFVPS+ PSHGF+L RVF+IPF DSGSAT LGQFN+Q
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++S K+S  GEM QS SSLLQGIGRHL  RSLYA G SSD LLTPDD LLISFDGYGD++ILRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVIDLGSA +DSG SYHLSMHHN GSPSQ+GSEQT   PFCLLPGLS KAAF+ KK+ EIWRSNAKKLK VQPYDIFL+NPHVSLSGIIGAVAT+YFG
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        D S  SAA+ SLQEF+GL+MQTS IRSTV ADVFASISFSAQYGMFQR FLDLTRFS R DFHSGSKF+SGAMLLIEDLSNS+ PRTESVKATLPNARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN
         QQQIAGPVSFRADSGV IDL+K  WG +QVEEPTFALEYAL  LGSAKA+AWYSPK REFMVELRFYEN
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN

A0A6J1KW75 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | 1.7e-221 | 84.26
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ
        MKKLRWTM+GQ FWDLDVSTPRTLDGSASPVP D  LLPLGLSRG RLSRAKQIDFMQ+FMAAPFVPSY PSHGF+L RVF+IPF DSGSAT LGQFN+Q
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDD--LLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSATFLGQFNLQ

Query:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP
        KF++S K+S  GEM QS SSLLQGIGRHL  RSLYA G SSD LLTPDD LLISFDGYGD+++LRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFLASFKRS--GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVP

Query:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVIDLGSAA+DSG SYHLSMHHN GSPSQ+GSEQT   PFCLLPGLS KAAF+ KK+ EIWRSNAKKLK VQPYDIFLSNPHVSLSGIIGAVAT+YF 
Subjt:  SSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
        D SV SAA+ SLQEFKGLHMQTS IRSTV ADVFASISFSAQYGMFQ  FLDLTRFS R DFHSGSKF+SGAMLLIEDLSNS+ PRTESVKATLPNARFS
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN
         QQQIAGPVSFRAD+GV IDL+K  WG +QVEEPTFALEYAL  LGSAKA+AWYSPK  EFMVELRFYEN
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWG-IQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN

SwissProt top hits | e value | % identity | Alignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | 8.1e-133 | 50.94
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFALHRVFTIPFWDSGSATF
        M ++RW  +G   WDLD+STP TL+G+A  VPDD LPLGLSRGTRLSR KQ++F  RFMA+P +PS++P           GF+L RV T+PF ++   + 
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFALHRVFTIPFWDSGSATF

Query:  LGQFNLQKFLASFKRSGEMNQSASSL----LQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGY-GD-NEILRTKAVLHHKFLHHDLTMEALSPGLF
        LGQF++Q+F+    ++    + +SS     L  IG+HL  +SLYALGF S+FLL+PDDTLL+S+D Y GD ++  R KA+ +H+F  H+LT EA+ PGLF
Subjt:  LGQFNLQKFLASFKRSGEMNQSASSL----LQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGY-GD-NEILRTKAVLHHKFLHHDLTMEALSPGLF

Query:  VDKCGNYWDVPSSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSG
        VDK G YWDVP S+ IDL S  ++SGPSYHL +HHN+GSP +  S+     P  LLPGLS+K+A SY+ + ++WR    KL+  +PYD+FLS+PHV++SG
Subjt:  VDKCGNYWDVPSSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSG

Query:  IIGAVATTYFGDNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTES
        IIG+V T  FG+NS+RS  ++  +   G  +   S+ S  +AD     S +AQYG FQ+ F DLTRF AR+DF  G +F++GA  + +DL NSRQP  E+
Subjt:  IIGAVATTYFGDNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTES

Query:  VKATLPNARFSLQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE
         +   P    SLQQQI GP SF+ +SG+ IDL   A  + V++  FA+EYALQVL SAKAV  YSPK  EFMVELRF+E
Subjt:  VKATLPNARFSLQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE

Arabidopsis top hits | e value | % identity | Alignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown | 1.7e-64 | 32.26
Query:  FWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSH-----GFALHRVFTIPFWDSGSATFLGQFNLQKFLASFKRS
        FWD +VS+P+TL+G+A  VP +  PL  +R +R  R +Q+  ++       +PS AP+       F+L+ +   P  ++     +GQF  +K  A  K  
Subjt:  FWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSH-----GFALHRVFTIPFWDSGSATFLGQFNLQKFLASFKRS

Query:  -GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVPSSLVIDLGSA
             +    +++   +H+  +SLY++G  +   L    +LL+S +  GD   LR K +L H    HDLT+EA  P LF+D  G +WDVP SL +D+ S 
Subjt:  -GEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVPSSLVIDLGSA

Query:  ASDSGPSYHLSMHHNTGSP---SQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKK-------LKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
          +SG  Y   +H + G+P   + AG E     P  L+PGL  KAA SYK + ++WR   K+         +  PYD+ L  PH ++SGI+G+    +  
Subjt:  ASDSGPSYHLSMHHNTGSP---SQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKK-------LKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS
           +               +     RS + ADVF S  ++ Q G F +L+ DLTR  AR+D  S       A  L + L ++    ++    + P     
Subjt:  DNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE
         QQQ+AGP+ F+ DS   +         ++E+  ++L Y+L++L S K VAWYSPK +E M+ELR +E
Subjt:  LQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE

AT3G06960.1 pigment defective 320 | 5.8e-134 | 50.94
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFALHRVFTIPFWDSGSATF
        M ++RW  +G   WDLD+STP TL+G+A  VPDD LPLGLSRGTRLSR KQ++F  RFMA+P +PS++P           GF+L RV T+PF ++   + 
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFALHRVFTIPFWDSGSATF

Query:  LGQFNLQKFLASFKRSGEMNQSASSL----LQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGY-GD-NEILRTKAVLHHKFLHHDLTMEALSPGLF
        LGQF++Q+F+    ++    + +SS     L  IG+HL  +SLYALGF S+FLL+PDDTLL+S+D Y GD ++  R KA+ +H+F  H+LT EA+ PGLF
Subjt:  LGQFNLQKFLASFKRSGEMNQSASSL----LQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGY-GD-NEILRTKAVLHHKFLHHDLTMEALSPGLF

Query:  VDKCGNYWDVPSSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSG
        VDK G YWDVP S+ IDL S  ++SGPSYHL +HHN+GSP +  S+     P  LLPGLS+K+A SY+ + ++WR    KL+  +PYD+FLS+PHV++SG
Subjt:  VDKCGNYWDVPSSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSG

Query:  IIGAVATTYFGDNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTES
        IIG+V T  FG+NS+RS  ++  +   G  +   S+ S  +AD     S +AQYG FQ+ F DLTRF AR+DF  G +F++GA  + +DL NSRQP  E+
Subjt:  IIGAVATTYFGDNSVRSAAQDSLQEFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTES

Query:  VKATLPNARFSLQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE
         +   P    SLQQQI GP SF+ +SG+ IDL   A  + V++  FA+EYALQVL SAKAV  YSPK  EFMVELRF+E
Subjt:  VKATLPNARFSLQQQIAGPVSFRADSGVTIDLNKPAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYE

AT3G06960.2 pigment defective 320 | 5.0e-85 | 51.31
Query:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFALHRVFTIPFWDSGSATF
        M ++RW  +G   WDLD+STP TL+G+A  VPDD LPLGLSRGTRLSR KQ++F  RFMA+P +PS++P           GF+L RV T+PF ++   + 
Subjt:  MKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFALHRVFTIPFWDSGSATF

Query:  LGQFNLQKFLASFKRSGEMNQSASSL----LQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGY-GD-NEILRTKAVLHHKFLHHDLTMEALSPGLF
        LGQF++Q+F+    ++    + +SS     L  IG+HL  +SLYALGF S+FLL+PDDTLL+S+D Y GD ++  R KA+ +H+F  H+LT EA+ PGLF
Subjt:  LGQFNLQKFLASFKRSGEMNQSASSL----LQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGY-GD-NEILRTKAVLHHKFLHHDLTMEALSPGLF

Query:  VDKCGNYWDVPSSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSG
        VDK G YWDVP S+ IDL S  ++SGPSYHL +HHN+GSP +  S+     P  LLPGLS+K+A SY+ + ++WR    KL+  +PYD+FLS+PHV++SG
Subjt:  VDKCGNYWDVPSSLVIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSG

Query:  IIGAVA
        IIG ++
Subjt:  IIGAVA
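In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single block. The function name `midline_stats` is hypothetical, the example strings are the final block of the AT3G06960.2 alignment with the 8-character line prefix removed, and the percent identity reported on the page is calculated over the whole alignment, not per block.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, block_length) for one alignment block.

    Assumes the BLAST midline convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces are often trimmed in text dumps, so pad to the query length.
    midline = midline.ljust(len(query))
    identities = sum(1 for char in midline if char.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the AT3G06960.2 alignment, line prefixes stripped.
identities, positives, length = midline_stats("IIGAVA", "IIG ++")
block_identity_percent = 100.0 * identities / length
```

Summing identities and lengths across all blocks of one hit would give an overall figure comparable to the % identity column, provided the dump preserved the midline whitespace.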


Sequences
CDS sequence
ATGCGGGAACGAGAGGAAGAAGGGAAGGACTCTAGGGTTGTTTCTTTCGAGTTCCCAGGGAGGATGAAGAAGCTAAGATGGACAATGGACGGGCAAGGTTTTTGGGACCT
GGATGTTTCAACACCTAGAACACTGGATGGGTCGGCCTCCCCGGTTCCTGATGACTTGCTTCCCCTGGGATTGTCCAGAGGCACCAGACTTTCCAGGGCCAAACAGATCG
ATTTCATGCAGCGCTTCATGGCTGCACCTTTTGTCCCCTCTTATGCTCCCTCCCATGGCTTTGCTCTACATCGCGTGTTTACGATCCCCTTTTGGGACTCCGGGTCTGCT
ACTTTTTTAGGTCAGTTCAATTTGCAGAAGTTCTTGGCCTCTTTTAAGAGATCTGGAGAGATGAATCAATCGGCGTCCTCATTGCTGCAAGGCATTGGAAGGCACCTCTG
CCACCGATCTTTGTATGCCCTTGGTTTCTCTTCTGATTTCTTGTTAACTCCGGATGATACGCTGCTGATCAGCTTCGACGGATACGGCGACAATGAAATACTTCGTACAA
AAGCAGTACTCCACCACAAGTTTCTACATCATGATCTAACAATGGAGGCACTTTCTCCAGGACTTTTTGTGGACAAATGTGGTAACTACTGGGATGTGCCTTCTTCATTA
GTCATTGATCTAGGTTCTGCTGCTTCTGACTCAGGTCCAAGTTATCACTTGTCTATGCACCACAATACCGGGTCTCCCTCACAAGCTGGAAGTGAACAGACCGGTGCGGT
TCCTTTCTGTCTACTTCCTGGTCTATCAGTCAAGGCTGCTTTTTCCTATAAGAAGGACTTCGAAATCTGGAGAAGCAACGCCAAGAAGTTAAAGATGGTGCAACCATATG
ACATTTTCCTATCAAATCCTCATGTTTCATTGTCAGGAATCATTGGTGCTGTGGCTACTACCTACTTTGGAGACAATTCCGTTAGATCAGCGGCACAAGACAGTCTTCAG
GAATTTAAAGGACTTCATATGCAGACTTCCAGTATAAGATCTACTGTTTTAGCGGATGTATTTGCTTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGGCTATT
TCTGGATCTCACCCGATTTTCTGCACGTATGGATTTCCATTCTGGCTCCAAGTTCATTTCTGGAGCCATGCTTTTGATAGAAGATCTTTCCAATTCCCGACAGCCAAGAA
CTGAATCAGTGAAAGCCACTTTGCCTAATGCTAGATTTTCTCTTCAGCAGCAGATCGCTGGACCCGTCAGCTTTAGAGCAGATTCAGGAGTTACAATAGATTTGAATAAA
CCAGCGTGGGGTATACAAGTGGAGGAGCCTACATTTGCCTTGGAATATGCGTTGCAGGTCCTTGGTTCGGCTAAAGCCGTCGCTTGGTACTCACCCAAGCACAGAGAATT
TATGGTAGAGCTCCGTTTCTATGAGAACTGA
mRNA sequence
ATGCGGGAACGAGAGGAAGAAGGGAAGGACTCTAGGGTTGTTTCTTTCGAGTTCCCAGGGAGGATGAAGAAGCTAAGATGGACAATGGACGGGCAAGGTTTTTGGGACCT
GGATGTTTCAACACCTAGAACACTGGATGGGTCGGCCTCCCCGGTTCCTGATGACTTGCTTCCCCTGGGATTGTCCAGAGGCACCAGACTTTCCAGGGCCAAACAGATCG
ATTTCATGCAGCGCTTCATGGCTGCACCTTTTGTCCCCTCTTATGCTCCCTCCCATGGCTTTGCTCTACATCGCGTGTTTACGATCCCCTTTTGGGACTCCGGGTCTGCT
ACTTTTTTAGGTCAGTTCAATTTGCAGAAGTTCTTGGCCTCTTTTAAGAGATCTGGAGAGATGAATCAATCGGCGTCCTCATTGCTGCAAGGCATTGGAAGGCACCTCTG
CCACCGATCTTTGTATGCCCTTGGTTTCTCTTCTGATTTCTTGTTAACTCCGGATGATACGCTGCTGATCAGCTTCGACGGATACGGCGACAATGAAATACTTCGTACAA
AAGCAGTACTCCACCACAAGTTTCTACATCATGATCTAACAATGGAGGCACTTTCTCCAGGACTTTTTGTGGACAAATGTGGTAACTACTGGGATGTGCCTTCTTCATTA
GTCATTGATCTAGGTTCTGCTGCTTCTGACTCAGGTCCAAGTTATCACTTGTCTATGCACCACAATACCGGGTCTCCCTCACAAGCTGGAAGTGAACAGACCGGTGCGGT
TCCTTTCTGTCTACTTCCTGGTCTATCAGTCAAGGCTGCTTTTTCCTATAAGAAGGACTTCGAAATCTGGAGAAGCAACGCCAAGAAGTTAAAGATGGTGCAACCATATG
ACATTTTCCTATCAAATCCTCATGTTTCATTGTCAGGAATCATTGGTGCTGTGGCTACTACCTACTTTGGAGACAATTCCGTTAGATCAGCGGCACAAGACAGTCTTCAG
GAATTTAAAGGACTTCATATGCAGACTTCCAGTATAAGATCTACTGTTTTAGCGGATGTATTTGCTTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGGCTATT
TCTGGATCTCACCCGATTTTCTGCACGTATGGATTTCCATTCTGGCTCCAAGTTCATTTCTGGAGCCATGCTTTTGATAGAAGATCTTTCCAATTCCCGACAGCCAAGAA
CTGAATCAGTGAAAGCCACTTTGCCTAATGCTAGATTTTCTCTTCAGCAGCAGATCGCTGGACCCGTCAGCTTTAGAGCAGATTCAGGAGTTACAATAGATTTGAATAAA
CCAGCGTGGGGTATACAAGTGGAGGAGCCTACATTTGCCTTGGAATATGCGTTGCAGGTCCTTGGTTCGGCTAAAGCCGTCGCTTGGTACTCACCCAAGCACAGAGAATT
TATGGTAGAGCTCCGTTTCTATGAGAACTGA
Protein sequence
MREREEEGKDSRVVSFEFPGRMKKLRWTMDGQGFWDLDVSTPRTLDGSASPVPDDLLPLGLSRGTRLSRAKQIDFMQRFMAAPFVPSYAPSHGFALHRVFTIPFWDSGSA
TFLGQFNLQKFLASFKRSGEMNQSASSLLQGIGRHLCHRSLYALGFSSDFLLTPDDTLLISFDGYGDNEILRTKAVLHHKFLHHDLTMEALSPGLFVDKCGNYWDVPSSL
VIDLGSAASDSGPSYHLSMHHNTGSPSQAGSEQTGAVPFCLLPGLSVKAAFSYKKDFEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFGDNSVRSAAQDSLQ
EFKGLHMQTSSIRSTVLADVFASISFSAQYGMFQRLFLDLTRFSARMDFHSGSKFISGAMLLIEDLSNSRQPRTESVKATLPNARFSLQQQIAGPVSFRADSGVTIDLNK
PAWGIQVEEPTFALEYALQVLGSAKAVAWYSPKHREFMVELRFYEN
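The protein sequence can be checked against the CDS by translating the CDS codon by codon. The sketch below is one way to do that, assuming the standard nuclear genetic code. The `translate` helper and the constant names are illustrative and are not part of CuGenDB. Only the first 30 and last 12 nucleotides of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS listed above.
cds_start = translate("ATGCGGGAACGAGAGGAAGAAGGGAAGGAC")  # "MREREEEGKD"
cds_end = translate("TATGAGAACTGA")                      # "YEN*"
```

Both fragments agree with the start (MREREEEGKD) and end (YEN, followed by the stop codon) of the protein sequence listed above.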