CuGenDBv2

CaUC06G110680 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC06G110680
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome location: Ciama_Chr06:4991964..4994769
RNA-Seq Expression: CaUC06G110680
Synteny: CaUC06G110680
Gene Ontology terms:
GO:0001522 - pseudouridine synthesis (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domains: IPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like
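
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the span can be parsed and its length computed as shown here. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("Ciama_Chr06:4991964..4994769")
# Span length under the assumed 1-based, inclusive convention.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under a 0-based, half-open convention the length would be `end - start` instead, so the convention should be confirmed before the number is used downstream.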


Homology
GenBank top hits
TYK02989.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa] (e value: 1.6e-231; % identity: 87.66)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ
        MKKLRWAMD  GFWDLDVST RTLDGSASPVPS  HLLPLGLSRGVRLSRAKQIDFMQSFM APF+PSYSPSHGFSLQRVFSIPFSDSGS TLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ

Query:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G G+MGQS SSF+QCIGRHL  RSLYA+GIS+DILLPPDDSLMISFDGYGDS+IVRTKA     FLHHDLTMEALSPGLF+DKSG+YWDVP
Subjt:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG
        SS+VVDLGSAAS+SGLSYHLSMHQN G PS  GSE T SAPFCLFPGLSAKAAFAFKKN EIWRSNAKKLK VQPYDIFLSTPHVSLSAIIGAV T+YFG
Subjt:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG

Query:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS
        D+ V SAAQ SL EFKG YMQTS  RSTIFADLF SISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKF+SG+MLLIDDLSNSRHP+TE+V ATLPNARFS
Subjt:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRADSGV IDLNKAGW LL VDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

XP_004138517.2 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic isoform X1 [Cucumis sativus] (e value: 2.8e-228; % identity: 85.74)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ
        MKKLRWAMDGQGFWDLDVST RTLDGSASPVPS LHLLPLGLSRGVRLSRAKQIDFMQ FMAAPF+PSYSPSHGFSLQRVFS+PFSDSGS TLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ

Query:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G G+M QS SS LQ IGRHL  RSLYA+GIS+DILLPPDDSLMISFDGYGDS+IVRTKA     FLHHDLT+EALSPGLF++K G+YWDVP
Subjt:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG
        SS+VVDLGS AS+SGLSYHLSMHQNAG PSQ GSE T SAPFCL PGLSAKAAFAFKKN EIWRSNAKKLK VQPYDIFLSTPHVSLSAIIGAV T+YFG
Subjt:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG

Query:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS
        D+   SAAQDSL++FKG YM++S  RST+FADLF SISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKF+SG+MLLIDDLSNSRHP+TESV ATLPNARFS
Subjt:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRAD+GV IDLNKAGW LL V+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

XP_008458282.1 PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo] (e value: 1.6e-231; % identity: 87.66)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ
        MKKLRWAMD  GFWDLDVST RTLDGSASPVPS  HLLPLGLSRGVRLSRAKQIDFMQSFM APF+PSYSPSHGFSLQRVFSIPFSDSGS TLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ

Query:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G G+MGQS SSF+QCIGRHL  RSLYA+GIS+DILLPPDDSLMISFDGYGDS+IVRTKA     FLHHDLTMEALSPGLF+DKSG+YWDVP
Subjt:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG
        SS+VVDLGSAAS+SGLSYHLSMHQN G PS  GSE T SAPFCLFPGLSAKAAFAFKKN EIWRSNAKKLK VQPYDIFLSTPHVSLSAIIGAV T+YFG
Subjt:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG

Query:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS
        D+ V SAAQ SL EFKG YMQTS  RSTIFADLF SISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKF+SG+MLLIDDLSNSRHP+TE+V ATLPNARFS
Subjt:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRADSGV IDLNKAGW LL VDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

XP_023548884.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo] (e value: 2.2e-228; % identity: 85.74)
Query:  MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNL
        MMKKLRW M+GQ FWDLDVSTPRTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ FMAAPF+PSY+PSHGFSLQRVFSIPFSDSGSATLLGQFN+
Subjt:  MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNL

Query:  QKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDV
        QKF+SSLKKSG G+MGQS+SS LQ IGRHLR RSLYA GISSDILL PDD+L+ISFDGYGDS+I+RTKA     FLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYF
        PSS+V+DLGSAA++SGLSYHLSMH NAGSPSQSGSEQTC APFCL PGLSAKAAFA KKNLEIWRSNAKKLKRVQPYDIFLS PHVSLS IIGAV T+YF
Subjt:  PSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYF

Query:  GDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARF
        GD S  SAA+ SLQEFKGLYMQTS  RST+FAD+FASISFSAQYGMFQR +LDLTRFS   DFHSGSKF+SGAMLLI+DLSNS+HPRTESV ATLPNARF
Subjt:  GDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARF

Query:  SLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE
        S QQQIAGPVSFRAD+GV IDL+KAGWG L V+EPTFALEYAL VLGSAKAIAWYSPK REFMVELRFYE
Subjt:  SLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE

XP_038875869.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida] (e value: 1.8e-246; % identity: 92.57)
Query:  MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNL
        MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPF+PSYSPSHGFSLQRVFSIPFSDSGS TLLGQFNL
Subjt:  MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNL

Query:  QKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDV
        QKFISSLKKSGVGDMGQSLSSFLQCIGRHL HRSLYALGISSDILLPPDDSLMISFDGYGD+EIVRTKA     FLHHDLTMEA SPGLFVDKSGKYWDV
Subjt:  QKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYF
        PS++VVDLGSAASESGLSYHLSMHQN GSPSQSGSEQ  S+P CL PGLSAKAAFAFKKNLEIWRSNAKKLK VQPYDIFLSTPHVSLS IIGAV TTYF
Subjt:  PSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYF

Query:  GDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARF
        GDNS+ SAAQDSL EFKGLY+QTS  RST+FAD+FASISFSAQYGMFQRKYLDLT FSA MDFHSGSKF+SGAMLLIDDLSNSRHPRTESV ATLP+ARF
Subjt:  GDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARF

Query:  SLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        SLQQQIAGPVSFRADSGV IDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  SLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

TrEMBL top hits
A0A0A0K824 Uncharacterized protein (e value: 1.4e-228; % identity: 85.74)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ
        MKKLRWAMDGQGFWDLDVST RTLDGSASPVPS LHLLPLGLSRGVRLSRAKQIDFMQ FMAAPF+PSYSPSHGFSLQRVFS+PFSDSGS TLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ

Query:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G G+M QS SS LQ IGRHL  RSLYA+GIS+DILLPPDDSLMISFDGYGDS+IVRTKA     FLHHDLT+EALSPGLF++K G+YWDVP
Subjt:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG
        SS+VVDLGS AS+SGLSYHLSMHQNAG PSQ GSE T SAPFCL PGLSAKAAFAFKKN EIWRSNAKKLK VQPYDIFLSTPHVSLSAIIGAV T+YFG
Subjt:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG

Query:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS
        D+   SAAQDSL++FKG YM++S  RST+FADLF SISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKF+SG+MLLIDDLSNSRHP+TESV ATLPNARFS
Subjt:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRAD+GV IDLNKAGW LL V+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

A0A1S3C837 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic (e value: 7.8e-232; % identity: 87.66)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ
        MKKLRWAMD  GFWDLDVST RTLDGSASPVPS  HLLPLGLSRGVRLSRAKQIDFMQSFM APF+PSYSPSHGFSLQRVFSIPFSDSGS TLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ

Query:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G G+MGQS SSF+QCIGRHL  RSLYA+GIS+DILLPPDDSLMISFDGYGDS+IVRTKA     FLHHDLTMEALSPGLF+DKSG+YWDVP
Subjt:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG
        SS+VVDLGSAAS+SGLSYHLSMHQN G PS  GSE T SAPFCLFPGLSAKAAFAFKKN EIWRSNAKKLK VQPYDIFLSTPHVSLSAIIGAV T+YFG
Subjt:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG

Query:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS
        D+ V SAAQ SL EFKG YMQTS  RSTIFADLF SISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKF+SG+MLLIDDLSNSRHP+TE+V ATLPNARFS
Subjt:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRADSGV IDLNKAGW LL VDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

A0A5D3BY40 Protein TRIGALACTOSYLDIACYLGLYCEROL 4 (e value: 7.8e-232; % identity: 87.66)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ
        MKKLRWAMD  GFWDLDVST RTLDGSASPVPS  HLLPLGLSRGVRLSRAKQIDFMQSFM APF+PSYSPSHGFSLQRVFSIPFSDSGS TLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNLQ

Query:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G G+MGQS SSF+QCIGRHL  RSLYA+GIS+DILLPPDDSLMISFDGYGDS+IVRTKA     FLHHDLTMEALSPGLF+DKSG+YWDVP
Subjt:  KFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG
        SS+VVDLGSAAS+SGLSYHLSMHQN G PS  GSE T SAPFCLFPGLSAKAAFAFKKN EIWRSNAKKLK VQPYDIFLSTPHVSLSAIIGAV T+YFG
Subjt:  SSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFG

Query:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS
        D+ V SAAQ SL EFKG YMQTS  RSTIFADLF SISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKF+SG+MLLIDDLSNSRHP+TE+V ATLPNARFS
Subjt:  DNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFS

Query:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRADSGV IDLNKAGW LL VDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

A0A6J1H3U0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic (e value: 1.3e-226; % identity: 84.89)
Query:  MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNL
        MMKKLRW M+GQ FWDLDVSTPRTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ FMAAPF+PS++PSHGFSLQRVFSIPFSDSGSATLLGQFN+
Subjt:  MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNL

Query:  QKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDV
        QKF+SSLKKSG G+MGQS+SS LQ IGRHLR RSLYA GISSDILL PDD+L+ISFDGYGDS+I+RTKA     FLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYF
        PSS+V+DLGSA ++SGLSYHLSMH NAGSPSQSGSEQTC APFCL PGLSAKAAFA KKNLEIWRSNAKKLKRVQPYDIFL+ PHVSLS IIGAV T+YF
Subjt:  PSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYF

Query:  GDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARF
        GD S  SAA+ SLQEF+GLYMQTS  RST+FAD+FASISFSAQYGMFQR +LDLTRFS   DFHSGSKF+SGAMLLI+DLSNS+HPRTESV ATLPNARF
Subjt:  GDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARF

Query:  SLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE
        S QQQIAGPVSFRADSGV IDL+KAGWG L V+EPTFALEYAL  LGSAKAIAWYSPK REFMVELRFYE
Subjt:  SLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE

A0A6J1KW75 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic (e value: 8.3e-226; % identity: 84.89)
Query:  MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNL
        MMKKLRW M+GQ FWDLDVSTPRTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ FMAAPF+PSY+PSHGFSLQRVFSIPFSDSGSATLLGQFN+
Subjt:  MMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNL

Query:  QKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDV
        QKF+SSLKKSG G+MGQSLSS LQ IGRHLR RSLYA GISSDILL PDD+L+ISFDGYGDS+++RTKA     FLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKA-----FLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYF
        PSS+V+DLGSAA++SGLSYHLSMH NAGSPSQSGSEQTC APFCL PGLSAKAAFA KKNLEIWRSNAKKLKRVQPYDIFLS PHVSLS IIGAV T+YF
Subjt:  PSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYF

Query:  GDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARF
         D SV SAA+ SLQEFKGL+MQTS  RST+FAD+FASISFSAQYGMFQ  +LDLTRFS   DFHSGSKF+SGAMLLI+DLSNS+HPRTESV ATLPNARF
Subjt:  GDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARF

Query:  SLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE
        S QQQIAGPVSFRAD+GV IDL+KAGWG L V+EPTFALEYAL  LGSAKAIAWYSPK  EFMVELRFYE
Subjt:  SLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE

SwissProt top hits
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic (e value: 2.5e-118; % identity: 47.83)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  +G   WDLD+STP TL+G+A  VP     LPLGLSRG RLSR KQ++F   FMA+P +PS+SP           GFSLQRV ++PFS++   
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNLQKFISSLKKSGVGDMGQS--LSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGY-GD-SEIVRTKAFLHHD-----LTMEALSPG
        +LLGQF++Q+F++ + K+     G S  ++S L  IG+HL+ +SLYALG  S+ LL PDD+L++S+D Y GD  +  R KA  +H+     LT EA+ PG
Subjt:  TLLGQFNLQKFISSLKKSGVGDMGQS--LSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGY-GD-SEIVRTKAFLHHD-----LTMEALSPG

Query:  LFVDKSGKYWDVPSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSL
        LFVDK G+YWDVP S+ +DL S  +ESG SYHL +H N+GSP +  S+     P  L PGLS K+A +++ N+++WR    KL+  +PYD+FLS+PHV++
Subjt:  LFVDKSGKYWDVPSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSL

Query:  SAIIGAVVTTYFGDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRT
        S IIG+V+T  FG+NS+ S  ++  +   G  +      S   AD     S +AQYG FQ+ + DLTRF A +DF  G +F++GA  +  DL NSR P  
Subjt:  SAIIGAVVTTYFGDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRT

Query:  ESVIATLPNARFSLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        E+     P    SLQQQI GP SF+ +SG+ IDL + G   + VD+  FA+EYALQVL SAKA+  YSPK  EFMVELRF+ET
Subjt:  ESVIATLPNARFSLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

Arabidopsis top hits
AT2G44640.1 FUNCTIONS IN: molecular_function unknown (e value: 2.2e-61; % identity: 32.58)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSH-----GFSLQRVFSIPFSDSGSATLLG
        M  L  A+D   FWD +VS+P+TL+G+A  VP      PL  +R  R  R +Q+  ++       +PS +P+       FSL  +   P S++    L+G
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSH-----GFSLQRVFSIPFSDSGSATLLG

Query:  QFNLQKFISSLKKSGVGDMGQSLSSFLQCI---GRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKAFL-----HHDLTMEALSPGLFVDK
        QF  +K  + +K     D+  +    LQ +    +H+  +SLY++G+ + I L    SL++S +  GD   +R K  L      HDLT+EA  P LF+D 
Subjt:  QFNLQKFISSLKKSGVGDMGQSLSSFLQCI---GRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKAFL-----HHDLTMEALSPGLFVDK

Query:  SGKYWDVPSSIVVDLGSAASESGLSYHLSMHQNAGSP---SQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQ-------PYDIFLST
         G++WDVP S+ VD+ S   ESG+ Y   +H++ G+P   + +G E    AP  L PGL AKAA ++K N ++WR   K+    +       PYD+ L  
Subjt:  SGKYWDVPSSIVVDLGSAASESGLSYHLSMHQNAGSP---SQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQ-------PYDIFLST

Query:  PHVSLSAIIGAVVTTYFGDNSVISAAQDSLQEFKGLYMQTSG-TRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSN
        PH ++S I+G+ +  +                  G  M  +G  RS I AD+F S  ++ Q G F + Y DLTR  A +D  S       A  L   L +
Subjt:  PHVSLSAIIGAVVTTYFGDNSVISAAQDSLQEFKGLYMQTSG-TRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSN

Query:  SRHPRTESVIATLPNARFSLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE
        +    ++  + + P      QQQ+AGP+ F+ DS   +       G   +++  ++L Y+L++L S K +AWYSPK +E M+ELR +E
Subjt:  SRHPRTESVIATLPNARFSLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE

AT3G06960.1 pigment defective 320 (e value: 1.8e-119; % identity: 47.83)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  +G   WDLD+STP TL+G+A  VP     LPLGLSRG RLSR KQ++F   FMA+P +PS+SP           GFSLQRV ++PFS++   
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNLQKFISSLKKSGVGDMGQS--LSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGY-GD-SEIVRTKAFLHHD-----LTMEALSPG
        +LLGQF++Q+F++ + K+     G S  ++S L  IG+HL+ +SLYALG  S+ LL PDD+L++S+D Y GD  +  R KA  +H+     LT EA+ PG
Subjt:  TLLGQFNLQKFISSLKKSGVGDMGQS--LSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGY-GD-SEIVRTKAFLHHD-----LTMEALSPG

Query:  LFVDKSGKYWDVPSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSL
        LFVDK G+YWDVP S+ +DL S  +ESG SYHL +H N+GSP +  S+     P  L PGLS K+A +++ N+++WR    KL+  +PYD+FLS+PHV++
Subjt:  LFVDKSGKYWDVPSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSL

Query:  SAIIGAVVTTYFGDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRT
        S IIG+V+T  FG+NS+ S  ++  +   G  +      S   AD     S +AQYG FQ+ + DLTRF A +DF  G +F++GA  +  DL NSR P  
Subjt:  SAIIGAVVTTYFGDNSVISAAQDSLQEFKGLYMQTSGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRT

Query:  ESVIATLPNARFSLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        E+     P    SLQQQI GP SF+ +SG+ IDL + G   + VD+  FA+EYALQVL SAKA+  YSPK  EFMVELRF+ET
Subjt:  ESVIATLPNARFSLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

AT3G06960.2 pigment defective 320 (e value: 2.7e-75; % identity: 48.21)
Query:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  +G   WDLD+STP TL+G+A  VP     LPLGLSRG RLSR KQ++F   FMA+P +PS+SP           GFSLQRV ++PFS++   
Subjt:  MKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNLQKFISSLKKSGVGDMGQS--LSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGY-GD-SEIVRTKAFLHHD-----LTMEALSPG
        +LLGQF++Q+F++ + K+     G S  ++S L  IG+HL+ +SLYALG  S+ LL PDD+L++S+D Y GD  +  R KA  +H+     LT EA+ PG
Subjt:  TLLGQFNLQKFISSLKKSGVGDMGQS--LSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGY-GD-SEIVRTKAFLHHD-----LTMEALSPG

Query:  LFVDKSGKYWDVPSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSL
        LFVDK G+YWDVP S+ +DL S  +ESG SYHL +H N+GSP +  S+     P  L PGLS K+A +++ N+++WR    KL+  +PYD+FLS+PHV++
Subjt:  LFVDKSGKYWDVPSSIVVDLGSAASESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSL

Query:  SAIIGAV
        S IIG +
Subjt:  SAIIGAV
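
Each alignment block above pairs a query line with a midline in the usual BLAST-style notation: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The following is a minimal sketch of how identities and positives could be tallied from one such segment; `midline_stats` is a hypothetical helper. The % identity values reported on this page come from the search tool and cover the whole alignment, so a per-segment figure will generally differ.

```python
def midline_stats(query, midline):
    """Return (identities, positives, length) for one alignment segment."""
    # Trailing blanks are often stripped from a midline, so pad it back out.
    midline = midline.ljust(len(query))
    # A letter on the midline means query and subject share that residue.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Final seven-residue segment of the AT3G06960.2 alignment.
identities, positives, length = midline_stats("SAIIGAV", "S IIG +")
percent_identity = 100.0 * identities / length
print(identities, positives, length, round(percent_identity, 2))
```

To reproduce a whole-alignment figure, the segment counts would need to be summed across every block of a hit before dividing.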


Sequences
CDS sequence
TTCTCTTTTGTTTGTGAAGGTGGTGACTGGTGTGGGATGATGAAGAAGCTAAGATGGGCAATGGACGGCCAAGGCTTCTGGGATCTCGATGTTTCAACGCCTAGAACACT
CGATGGCTCTGCTTCCCCTGTTCCTTCTCATTTACACTTACTTCCTTTGGGATTATCCAGAGGCGTTCGTCTTTCCCGGGCCAAGCAGATCGATTTCATGCAGAGTTTCA
TGGCTGCTCCTTTTCTTCCTTCTTATTCCCCCTCCCATGGCTTCTCCCTCCAACGCGTCTTCTCCATCCCCTTTTCAGATTCTGGCTCTGCTACTCTCTTAGGCCAGTTC
AATTTGCAGAAATTCATCTCATCTCTTAAGAAATCTGGTGTTGGAGACATGGGTCAGTCGCTTTCTTCATTTCTGCAATGCATTGGAAGGCATCTTCGCCATCGGTCTTT
GTATGCCCTTGGTATCTCTTCTGATATCTTGTTACCACCTGATGATTCCTTGATGATTAGCTTCGATGGATATGGCGACAGTGAAATAGTTCGAACAAAAGCATTCCTAC
ATCATGATCTAACTATGGAGGCACTTTCTCCAGGACTTTTCGTGGACAAATCTGGAAAATATTGGGATGTGCCTTCTTCAATAGTTGTTGATCTAGGTTCTGCTGCTTCT
GAATCGGGTTTAAGCTATCATTTGTCTATGCACCAGAATGCTGGGTCTCCCTCGCAATCTGGAAGTGAACAGACTTGTTCGGCTCCTTTCTGTTTATTTCCTGGCCTTTC
AGCCAAGGCTGCTTTTGCCTTTAAGAAGAACTTGGAAATCTGGAGAAGCAACGCCAAGAAGTTAAAGAGGGTGCAACCGTATGACATTTTCCTGTCAACTCCCCATGTTT
CATTGTCAGCAATCATTGGTGCTGTGGTTACTACCTACTTTGGAGACAATTCGGTTATATCAGCAGCACAAGACAGTCTTCAGGAATTTAAAGGACTTTACATGCAGACT
TCTGGAACAAGATCTACTATTTTCGCAGATTTATTTGCCTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGGAAATATTTGGATCTTACCCGTTTTTCTGCATG
TATGGATTTCCATTCTGGCTCAAAGTTTATTTCAGGAGCCATGCTTTTGATAGATGATCTTTCCAACTCCCGGCACCCAAGAACTGAATCTGTGATAGCGACCTTGCCTA
ACGCTAGATTTTCCCTTCAGCAGCAGATCGCTGGACCTGTCAGCTTTAGAGCAGATTCAGGAGTTACAATAGATTTGAATAAAGCAGGGTGGGGTTTATTAGGAGTGGAT
GAGCCTACATTTGCCTTGGAATATGCGTTACAGGTCCTTGGTTCAGCTAAAGCCATTGCTTGGTATTCACCAAAGCACAGAGAATTTATGGTAGAGCTTCGTTTCTATGA
GACCTGA
mRNA sequence
TTCTCTTTTGTTTGTGAAGGTGGTGACTGGTGTGGGATGATGAAGAAGCTAAGATGGGCAATGGACGGCCAAGGCTTCTGGGATCTCGATGTTTCAACGCCTAGAACACT
CGATGGCTCTGCTTCCCCTGTTCCTTCTCATTTACACTTACTTCCTTTGGGATTATCCAGAGGCGTTCGTCTTTCCCGGGCCAAGCAGATCGATTTCATGCAGAGTTTCA
TGGCTGCTCCTTTTCTTCCTTCTTATTCCCCCTCCCATGGCTTCTCCCTCCAACGCGTCTTCTCCATCCCCTTTTCAGATTCTGGCTCTGCTACTCTCTTAGGCCAGTTC
AATTTGCAGAAATTCATCTCATCTCTTAAGAAATCTGGTGTTGGAGACATGGGTCAGTCGCTTTCTTCATTTCTGCAATGCATTGGAAGGCATCTTCGCCATCGGTCTTT
GTATGCCCTTGGTATCTCTTCTGATATCTTGTTACCACCTGATGATTCCTTGATGATTAGCTTCGATGGATATGGCGACAGTGAAATAGTTCGAACAAAAGCATTCCTAC
ATCATGATCTAACTATGGAGGCACTTTCTCCAGGACTTTTCGTGGACAAATCTGGAAAATATTGGGATGTGCCTTCTTCAATAGTTGTTGATCTAGGTTCTGCTGCTTCT
GAATCGGGTTTAAGCTATCATTTGTCTATGCACCAGAATGCTGGGTCTCCCTCGCAATCTGGAAGTGAACAGACTTGTTCGGCTCCTTTCTGTTTATTTCCTGGCCTTTC
AGCCAAGGCTGCTTTTGCCTTTAAGAAGAACTTGGAAATCTGGAGAAGCAACGCCAAGAAGTTAAAGAGGGTGCAACCGTATGACATTTTCCTGTCAACTCCCCATGTTT
CATTGTCAGCAATCATTGGTGCTGTGGTTACTACCTACTTTGGAGACAATTCGGTTATATCAGCAGCACAAGACAGTCTTCAGGAATTTAAAGGACTTTACATGCAGACT
TCTGGAACAAGATCTACTATTTTCGCAGATTTATTTGCCTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGGAAATATTTGGATCTTACCCGTTTTTCTGCATG
TATGGATTTCCATTCTGGCTCAAAGTTTATTTCAGGAGCCATGCTTTTGATAGATGATCTTTCCAACTCCCGGCACCCAAGAACTGAATCTGTGATAGCGACCTTGCCTA
ACGCTAGATTTTCCCTTCAGCAGCAGATCGCTGGACCTGTCAGCTTTAGAGCAGATTCAGGAGTTACAATAGATTTGAATAAAGCAGGGTGGGGTTTATTAGGAGTGGAT
GAGCCTACATTTGCCTTGGAATATGCGTTACAGGTCCTTGGTTCAGCTAAAGCCATTGCTTGGTATTCACCAAAGCACAGAGAATTTATGGTAGAGCTTCGTTTCTATGA
GACCTGA
Protein sequence
FSFVCEGGDWCGMMKKLRWAMDGQGFWDLDVSTPRTLDGSASPVPSHLHLLPLGLSRGVRLSRAKQIDFMQSFMAAPFLPSYSPSHGFSLQRVFSIPFSDSGSATLLGQF
NLQKFISSLKKSGVGDMGQSLSSFLQCIGRHLRHRSLYALGISSDILLPPDDSLMISFDGYGDSEIVRTKAFLHHDLTMEALSPGLFVDKSGKYWDVPSSIVVDLGSAAS
ESGLSYHLSMHQNAGSPSQSGSEQTCSAPFCLFPGLSAKAAFAFKKNLEIWRSNAKKLKRVQPYDIFLSTPHVSLSAIIGAVVTTYFGDNSVISAAQDSLQEFKGLYMQT
SGTRSTIFADLFASISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFISGAMLLIDDLSNSRHPRTESVIATLPNARFSLQQQIAGPVSFRADSGVTIDLNKAGWGLLGVD
EPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
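
The CDS and protein sequences above can be cross-checked by translating the nucleotide sequence. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 1 from the first base; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are shown for brevity.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)

# First 16 codons of the CDS sequence.
print(translate("TTCTCTTTTGTTTGTGAAGGTGGTGACTGGTGTGGGATGATGAAGAAG"))  # FSFVCEGGDWCGMMKK
# Last 6 codons of the CDS sequence, including the stop codon.
print(translate("CGTTTCTATGAGACCTGA"))  # RFYET*
```

Passing the full CDS text (line breaks included) to `translate` and stripping the trailing `*` should give a string that can be compared directly with the protein sequence listed above.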