; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G20700 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G20700
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like
Genome locationClcChr09:34220270..34226461
RNA-Seq ExpressionClc09G20700
SyntenyClc09G20700
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0009610 - response to symbiotic fungus (biological process)
GO:0009706 - chloroplast inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004252 - serine-type endopeptidase activity (molecular function)
GO:0005319 - lipid transporter activity (molecular function)
GO:0005543 - phospholipid binding (molecular function)
InterPro domainsIPR003399 - Mce/MlaD
IPR039342 - Protein TRIGALACTOSYLDIACYLGLYCEROL 2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451206.1 PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic isoform X1 [Cucumis melo]6.5e-16974.51Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M GDIRVQVVTCSV LPSSLVTLPHRSSNRLSYHLPLG KSKVK+IKATSADAGHSQPPSS ERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQA GIS GTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPSAGPLD ECI+EGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTF LAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  LLKE E+L RSLSHATE+LR                 +Q S            IL                  P +T  +                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             ++ TL +  I SLSSEVLG TGDE TKRNLKLLI+SLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

XP_011658640.1 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic [Cucumis sativus]5.1e-16673.33Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRL--SYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWE
        M GDIRVQVVTCSV LPSSLVTLPHRSS+RL  SYHLPLGLKSKVK+IKATSADAGHSQPPSS ERRNPLSLFLDVPRTVWRQTLRPLSNFGFG+RSIWE
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRL--SYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWE

Query:  GGVGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLME
        GGVGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQA GIS GTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLME
Subjt:  GGVGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLME

Query:  TMIDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAE
        TMIDITPRDPIPVPSAGPLD ECI+EGLILCDKQK+KG+QGVSLDALVGIFTRLGREAEEIGLTNTF LAQRVALVIEEAKPLLLKIQAMAEDVQPLLAE
Subjt:  TMIDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAE

Query:  VRDGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPG
        VRD  LLKE E+L RSLSHATE+LR                 +Q S            IL                  P +T  +               
Subjt:  VRDGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPG

Query:  KHRTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
              Q  +Y             ++ TL +  + SLSSEVLG TGDE TKRNLKLLI+SLSRLL
Subjt:  KHRTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

XP_022952170.1 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X1 [Cucurbita moschata]1.1e-16572.14Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M GD RVQV   SV LPSSLVTLPH+SS+RLSYHLP GLKSKVKKIKATSADAGHSQPPSS ER NPL+LFLDVPRTVWR+TLRPLSNFGFGRRSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIET+VEVEDDKIIIP NSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPS GPLDSECIKEGLILCDK KMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALV+EEA+PLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  LLKE ENL  SLSHATE+LR                                   VH   +            P +T  +                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             ++ TL +  I SLSSE+LG TGDE TKRNLKLLIKSLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

XP_023554510.1 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo]2.7e-16773Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M GD RVQV   SV LPSSLVTLPHRSS+RLSYHLP GLKSKVKKIKATSADAGHSQPPSS ERRNPL+LFLDVPRTVWR+TLRPLSNFGFGRRSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIP NSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPSAGPLDSECIKEGLILCDK KMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALV+EEA+PLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  LLKE ENL  SLSHATE+LR                                   VH   +            P +T  +                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             ++ TL +  I SLSSE+LG TGDE TKRNLKLLIKSLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

XP_038890927.1 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic isoform X1 [Benincasa hispida]1.5e-17074.73Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M GD+RVQVVTCSV LPS+LVTLP+RSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSS ER+NPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQA GISTGTPVRIRGVTVGNVIR+NPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMET+
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPSAGPLDSECI+EGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D +LLKEFENL RSLS ATEELR                                   VH                    +SI S  N            
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
               L   S +        ++ TL +  I SLSSEVLG TGDE TKRNLKLLI+SLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

TrEMBL top hitse value%identityAlignment
A0A0A0K2X5 MlaD domain-containing protein2.5e-16673.33Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRL--SYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWE
        M GDIRVQVVTCSV LPSSLVTLPHRSS+RL  SYHLPLGLKSKVK+IKATSADAGHSQPPSS ERRNPLSLFLDVPRTVWRQTLRPLSNFGFG+RSIWE
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRL--SYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWE

Query:  GGVGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLME
        GGVGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQA GIS GTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLME
Subjt:  GGVGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLME

Query:  TMIDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAE
        TMIDITPRDPIPVPSAGPLD ECI+EGLILCDKQK+KG+QGVSLDALVGIFTRLGREAEEIGLTNTF LAQRVALVIEEAKPLLLKIQAMAEDVQPLLAE
Subjt:  TMIDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAE

Query:  VRDGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPG
        VRD  LLKE E+L RSLSHATE+LR                 +Q S            IL                  P +T  +               
Subjt:  VRDGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPG

Query:  KHRTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
              Q  +Y             ++ TL +  + SLSSEVLG TGDE TKRNLKLLI+SLSRLL
Subjt:  KHRTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

A0A1S3BRQ6 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic isoform X13.1e-16974.51Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M GDIRVQVVTCSV LPSSLVTLPHRSSNRLSYHLPLG KSKVK+IKATSADAGHSQPPSS ERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQA GIS GTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPSAGPLD ECI+EGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTF LAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  LLKE E+L RSLSHATE+LR                 +Q S            IL                  P +T  +                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             ++ TL +  I SLSSEVLG TGDE TKRNLKLLI+SLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

A0A5D3C845 Protein TRIGALACTOSYLDIACYLGLYCEROL 23.1e-16974.51Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M GDIRVQVVTCSV LPSSLVTLPHRSSNRLSYHLPLG KSKVK+IKATSADAGHSQPPSS ERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQA GIS GTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPSAGPLD ECI+EGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTF LAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  LLKE E+L RSLSHATE+LR                 +Q S            IL                  P +T  +                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             ++ TL +  I SLSSEVLG TGDE TKRNLKLLI+SLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

A0A6J1GJI3 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X15.5e-16672.14Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M GD RVQV   SV LPSSLVTLPH+SS+RLSYHLP GLKSKVKKIKATSADAGHSQPPSS ER NPL+LFLDVPRTVWR+TLRPLSNFGFGRRSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIET+VEVEDDKIIIP NSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPS GPLDSECIKEGLILCDK KMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALV+EEA+PLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  LLKE ENL  SLSHATE+LR                                   VH   +            P +T  +                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             ++ TL +  I SLSSE+LG TGDE TKRNLKLLIKSLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

A0A6J1HZ36 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X13.0e-16471.71Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M GD  VQV T SV LPSSLVTLPHRSSNRLSYHLP GLKSKVKKIKATSA AGHSQPPSS ERRNPL+LFLDVPRT+WR+TL PLSNFGFGRRSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIP NSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPS GPLDS+CIKEGLILCDK KMKG+QGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALV+EEA+PLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  L+KE ENL  SLSHATE+LR                                   VH   +            P +T  +                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             ++ TL +  I SLSSE+LG TGDE TKRNLKLLIKSLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

SwissProt top hitse value%identityAlignment
P46315 Uncharacterized protein ycf223.0e-1229.93Show/hide
Query:  KFRK---YLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETMIDITPRDPI---PVPSAGPLDSE
        K++K   Y    EF  A GI  GT V +RGV +G +  +  +   +   + ++ +KI+IP+NS++E NQ+ L   T+IDI P + I    +      +  
Subjt:  KFRK---YLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETMIDITPRDPI---PVPSAGPLDSE

Query:  CIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSL
        C  +  I C+ Q + G +G++ D L+   TR+ +  ++    N F L
Subjt:  CIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSL

P51372 Uncharacterized protein ycf224.5e-1635.04Show/hide
Query:  SKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETMIDITPRDPIPVP----SAGPLDSEC
        SK + Y    EF  A GI  GT VR+RG+ +G V+ ++ S   I T +E++    IIP+ SL+E NQ+GLL +T+IDI P   +         GPL   C
Subjt:  SKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETMIDITPRDPIPVP----SAGPLDSEC

Query:  IKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEE
             I+C    ++G +G++ D L+   TR+ +  ++
Subjt:  IKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEE

Q1XDB5 Uncharacterized protein ycf221.7e-1835.19Show/hide
Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        +G   +    LL++SL W        K   Y A  EF  A GI  GT VR+RG+ VG V+ ++ S   I T +E++    IIP+ SL+E NQ+GLL +T+
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVP----SAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEE
        IDI P   + +      AGPL   C     I+C+   +KG +G++ D L+   TR+ +  ++
Subjt:  IDITPRDPIPVP----SAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEE

Q9LTR2 Protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic4.7e-11453.35Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M G+  +QV + S++  SS++  P  S N + Y  P      +    A+++DA H Q PSS   +NPL++ LDVPR +WRQTL+PLS+FGFG+RSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLF+VSGA LL LS AWLRGFQ+RSKFRKY  VFE + A GI TGTPVRIRGVTVG +IRVNPSL+ IE V E+EDDKIIIPRNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDI PR+PIP PS GPL  EC KEGLI+CD+Q +KG QGVSLD LVGIFTR+GRE E IG+ NT+SLA+R A VIEEA+PLL KIQAMAED QPLL+E R
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  LLKE E L RSL+ A+++LR            K++  I                                   P +T  I                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             +V TL +  + S+SS++LG TGDE T++NLKLLIKSLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

Arabidopsis top hitse value%identityAlignment
AT3G20320.1 trigalactosyldiacylglycerol23.4e-11553.35Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M G+  +QV + S++  SS++  P  S N + Y  P      +    A+++DA H Q PSS   +NPL++ LDVPR +WRQTL+PLS+FGFG+RSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLF+VSGA LL LS AWLRGFQ+RSKFRKY  VFE + A GI TGTPVRIRGVTVG +IRVNPSL+ IE V E+EDDKIIIPRNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR
        IDI PR+PIP PS GPL  EC KEGLI+CD+Q +KG QGVSLD LVGIFTR+GRE E IG+ NT+SLA+R A VIEEA+PLL KIQAMAED QPLL+E R
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVR

Query:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH
        D  LLKE E L RSL+ A+++LR            K++  I                                   P +T  I                 
Subjt:  DGDLLKEFENLIRSLSHATEELRYNSEETANRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKH

Query:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL
            Q  +Y             +V TL +  + S+SS++LG TGDE T++NLKLLIKSLSRLL
Subjt:  RTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVLGLTGDEGTKRNLKLLIKSLSRLL

AT3G20320.2 trigalactosyldiacylglycerol22.2e-9866.2Show/hide
Query:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG
        M G+  +QV + S++  SS++  P  S N + Y  P      +    A+++DA H Q PSS   +NPL++ LDVPR +WRQTL+PLS+FGFG+RSIWEGG
Subjt:  MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM
        VGLF+VSGA LL LS AWLRGFQ+RSKFRKY  VFE + A GI TGTPVRIRGVTVG +IRVNPSL+ IE V E+EDDKIIIPRNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLK
        IDI PR+PIP PS GPL  EC KEGLI+CD+Q +KG QGVSLD LVGIFTR+GRE E IG+ NT+SLA+R A VIEEA+PLL K
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGAGATATTCGTGTGCAGGTTGTAACATGTTCTGTGGTGTTACCCTCGTCCTTGGTTACCCTTCCACATAGATCTTCAAATAGGTTGTCTTATCATCTCCCATT
GGGTCTGAAATCAAAAGTAAAGAAGATAAAGGCTACATCTGCTGATGCTGGACATAGTCAGCCCCCCTCATCTTTAGAAAGAAGGAATCCACTTTCCCTTTTCCTTGATG
TTCCTCGTACTGTATGGAGGCAAACATTGCGTCCATTGAGTAATTTTGGGTTCGGTCGAAGGAGCATATGGGAAGGTGGGGTTGGGTTGTTCCTAGTGTCGGGTGCTATA
CTACTCACACTCAGTTTGGCTTGGTTGAGAGGCTTTCAACTGCGGTCTAAATTTAGAAAATACTTGGCTGTCTTTGAGTTTGCTCAGGCTTGTGGTATTTCTACTGGAAC
TCCTGTAAGAATTAGAGGAGTTACTGTTGGCAATGTTATTCGTGTTAACCCTTCCCTGAGATGTATTGAAACTGTTGTTGAGGTTGAAGATGATAAAATAATTATACCCC
GTAATTCATTGGTTGAAGTAAATCAGTCTGGTCTTCTTATGGAGACAATGATTGACATTACTCCCCGGGATCCTATTCCAGTGCCTTCAGCAGGACCACTTGACTCAGAA
TGTATTAAAGAAGGTTTAATATTATGTGATAAGCAGAAAATGAAGGGACATCAAGGGGTAAGTTTAGATGCATTAGTTGGAATATTCACTCGGCTTGGACGTGAAGCAGA
GGAAATAGGGCTTACCAATACGTTTTCATTAGCCCAACGAGTTGCTTTGGTTATTGAAGAAGCAAAGCCTTTGCTTTTAAAGATTCAAGCCATGGCTGAAGATGTTCAAC
CTTTGCTTGCTGAGGTTCGTGATGGTGATCTTCTAAAGGAGTTTGAAAACTTAATTAGAAGTCTTTCACATGCCACGGAAGAGTTAAGGTATAACAGCGAGGAAACTGCG
AATAGAGCTTTACCAAAACTTTCTTGCGATATCCAACTTTCAAATTGGTGTCTTCTTATTGTTATAATGTGCAACTTAATACTGGTACATGGCTTGTTTCTAGTAAGAAA
TGAAAAAGGAAAGAAAAAGAAAAGGATGCCACGGTCTACTTCTTCCATTCCATCAGTCCCAAACAAGCGTGCACGCATCCATTATGACCCCGGAAAACACAGAACTGCTT
CACAAGTCCATTTATACGCTCATTCATACTTTGCAGAATGTAGAGGTGAGATCTTAATTGTTCAAACTCTTCTTGATTTTCCGATCATTAGTTTGAGCTCCGAAGTTCTT
GGACTCACCGGTGATGAAGGGACAAAACGGAATTTAAAACTGCTCATCAAGTCGCTGAGCAGGCTACTATGA
mRNA sequenceShow/hide mRNA sequence
CCGACTCCCACGCTCCCTCTCTCTGAACGGTTTCTGCAGCCTCTTTTATCTCTCTTTTTCCAACGATTTTTGCAGCCACCATATCTTGCTCTCTCTCTCGCACGGTACCA
TTTTCTTTTCCCTCTTGTTCCGTCCATCACAGCTCAGATCGCCAGCGTCGCTGCCGTCGTCCGTTTTCGCAGTCGCCGGCGCCGTCGCACACCAGTCGGCCATTCATCCT
TGAGCAGTAGCAGCGGATAAAGGATTTCGTAGGGGAGGAAGGAATTTGTTGGAAATTGGATTTCAGTCGAGGCCGTTGTTTAAATCTTGTTTTACTGTTGCTGTTGTGCA
AAAATGGCGGGAGATATTCGTGTGCAGGTTGTAACATGTTCTGTGGTGTTACCCTCGTCCTTGGTTACCCTTCCACATAGATCTTCAAATAGGTTGTCTTATCATCTCCC
ATTGGGTCTGAAATCAAAAGTAAAGAAGATAAAGGCTACATCTGCTGATGCTGGACATAGTCAGCCCCCCTCATCTTTAGAAAGAAGGAATCCACTTTCCCTTTTCCTTG
ATGTTCCTCGTACTGTATGGAGGCAAACATTGCGTCCATTGAGTAATTTTGGGTTCGGTCGAAGGAGCATATGGGAAGGTGGGGTTGGGTTGTTCCTAGTGTCGGGTGCT
ATACTACTCACACTCAGTTTGGCTTGGTTGAGAGGCTTTCAACTGCGGTCTAAATTTAGAAAATACTTGGCTGTCTTTGAGTTTGCTCAGGCTTGTGGTATTTCTACTGG
AACTCCTGTAAGAATTAGAGGAGTTACTGTTGGCAATGTTATTCGTGTTAACCCTTCCCTGAGATGTATTGAAACTGTTGTTGAGGTTGAAGATGATAAAATAATTATAC
CCCGTAATTCATTGGTTGAAGTAAATCAGTCTGGTCTTCTTATGGAGACAATGATTGACATTACTCCCCGGGATCCTATTCCAGTGCCTTCAGCAGGACCACTTGACTCA
GAATGTATTAAAGAAGGTTTAATATTATGTGATAAGCAGAAAATGAAGGGACATCAAGGGGTAAGTTTAGATGCATTAGTTGGAATATTCACTCGGCTTGGACGTGAAGC
AGAGGAAATAGGGCTTACCAATACGTTTTCATTAGCCCAACGAGTTGCTTTGGTTATTGAAGAAGCAAAGCCTTTGCTTTTAAAGATTCAAGCCATGGCTGAAGATGTTC
AACCTTTGCTTGCTGAGGTTCGTGATGGTGATCTTCTAAAGGAGTTTGAAAACTTAATTAGAAGTCTTTCACATGCCACGGAAGAGTTAAGGTATAACAGCGAGGAAACT
GCGAATAGAGCTTTACCAAAACTTTCTTGCGATATCCAACTTTCAAATTGGTGTCTTCTTATTGTTATAATGTGCAACTTAATACTGGTACATGGCTTGTTTCTAGTAAG
AAATGAAAAAGGAAAGAAAAAGAAAAGGATGCCACGGTCTACTTCTTCCATTCCATCAGTCCCAAACAAGCGTGCACGCATCCATTATGACCCCGGAAAACACAGAACTG
CTTCACAAGTCCATTTATACGCTCATTCATACTTTGCAGAATGTAGAGGTGAGATCTTAATTGTTCAAACTCTTCTTGATTTTCCGATCATTAGTTTGAGCTCCGAAGTT
CTTGGACTCACCGGTGATGAAGGGACAAAACGGAATTTAAAACTGCTCATCAAGTCGCTGAGCAGGCTACTATGAGACACAGATGTGATGGAGTGGGATTGTATGAACTT
GGTATTAGGCGACACTTTGTTTTCAACATTACTAGACACATTTTCACAACTGTTTTATGGTTTTGGCTTCTGTGGGAAAGGAGGAAGTCAGGCATTGGCAAAAAATGTCA
AAAATTATATCCATAGTTAATTTTAGCATAGAGATATTTGTGAGTGTTCATAGATAATAGCTCTGTTTAAACAATGGGAAATTATTGATTCTTATTAACACATGTTGGTC
CTTCATTTCTG
Protein sequenceShow/hide protein sequence
MAGDIRVQVVTCSVVLPSSLVTLPHRSSNRLSYHLPLGLKSKVKKIKATSADAGHSQPPSSLERRNPLSLFLDVPRTVWRQTLRPLSNFGFGRRSIWEGGVGLFLVSGAI
LLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPRNSLVEVNQSGLLMETMIDITPRDPIPVPSAGPLDSE
CIKEGLILCDKQKMKGHQGVSLDALVGIFTRLGREAEEIGLTNTFSLAQRVALVIEEAKPLLLKIQAMAEDVQPLLAEVRDGDLLKEFENLIRSLSHATEELRYNSEETA
NRALPKLSCDIQLSNWCLLIVIMCNLILVHGLFLVRNEKGKKKKRMPRSTSSIPSVPNKRARIHYDPGKHRTASQVHLYAHSYFAECRGEILIVQTLLDFPIISLSSEVL
GLTGDEGTKRNLKLLIKSLSRLL