CuGenDBv2

Sed0000119 (gene) of Chayote v1 genome

Gene ID: Sed0000119
Organism: Sechium edule (Chayote v1)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like
Genome location: LG08:25187880..25197869
RNA-Seq Expression: Sed0000119
Synteny: Sed0000119
Gene Ontology terms: GO:0005524 - ATP binding (molecular function)
InterPro domains:
IPR003439 - ABC transporter-like, ATP-binding domain
IPR003593 - AAA+ ATPase domain
IPR017871 - ABC transporter-like, conserved site
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
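The genome location field follows a scaffold:start..end pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The function name parse_location is hypothetical (it is not part of CuGenDB), and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("LG08:25187880..25197869")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene region is longer than the CDS listed further down, as expected when the locus includes introns and untranslated regions.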


Homology
GenBank top hits: e value, % identity, alignment
KAG6588286.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.4e-176 | % identity: 92.22
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH
        MVS+SGSVFFPLT P+CS RSRKVAVL+ H+SFC KKKDQRRIVCNCIAPP +FK DGSS     D F S+HLS DNED+D SDVLIECR+VYKSFGEKH
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        I RAVDRL+FL+EGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
Subjt:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

XP_022932753.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like [Cucurbita moschata] | e value: 1.5e-175 | % identity: 91.93
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH
        MVS+SGSVFFPLT P+CS RSRKVAVL+ H+SFC KKKDQRRIVCNCIAPP +FK DGSS     D F S+HLS DNE +D SDVLIECR+VYKSFGEKH
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        I RAVDRL+FL+EGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
Subjt:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

XP_023001882.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like [Cucurbita maxima] | e value: 1.4e-173 | % identity: 90.6
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDN----EDKDGSDVLIECRDVYKSF
        MVS+SGSVFFPLT P+CS RSRKVAVL+ H+SFC KKKDQRRIVCNCIAPP +FK DGSS     D F S+HLS DN    ED+D SDVLIECR+VYKSF
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDN----EDKDGSDVLIECRDVYKSF

Query:  GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSED
        GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSED
Subjt:  GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSED

Query:  QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTH
        QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDA GKPGKIASYI VTH
Subjt:  QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTH

Query:  QHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        QHSTI RAVDRL+FL+EGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
Subjt:  QHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

XP_023520683.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like [Cucurbita pepo subsp. pepo] | e value: 1.2e-175 | % identity: 91.93
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH
        MVS+SGSVFFPLT P+CS RSRKVAVL+ H+SFC KKKDQRRIVCNCIAPP +FK DGSS     D F S+HLS DNED+D SDVLIECR+VYKSFGEKH
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDN RKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        I RAVDRL+FL+EGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
Subjt:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

XP_038876767.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic isoform X1 [Benincasa hispida] | e value: 3.7e-177 | % identity: 92.82
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRI-VCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEK
        MVS+SGSVFFPLT PNCS RSRKVAVL+AH+SFC KKKDQRRI VCNCIAPP YFK DGSSA    D FRS+HLSPDNEDKD SDVLIECR+VYKSFGEK
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRI-VCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKR GLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSE+QIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS

Query:  TISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        TI RAVDRL+FL+EGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI+Y
Subjt:  TISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

TrEMBL top hits: e value, % identity, alignment
A0A0A0LY39 ABC transporter domain-containing protein | e value: 1.3e-172 | % identity: 91.09
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIV-CNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEK
        MVSVSGSVFFPLT P+CS RSRKVAV++AH+SFC K KDQRRIV CNCIAPP YFK D SSA    D FRS+HLS +NEDK+ SDVLIECR+V+KSFGEK
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIV-CNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS

Query:  TISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        TI RAVDRL+FL+EGKVVWQGMT EFTTSTNPIVQQFASGSLDGPI+Y
Subjt:  TISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

A0A1S3B8N5 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic isoform X1 | e value: 2.0e-173 | % identity: 91.09
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIV-CNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEK
        MVS+SGSVFFPLT PNCS RSRKVAVL+AH+SFC K KDQRRIV CNCIAPP YFK D SSA    D FRS+HLS +NEDKD SDVLIECR+V+KSFGEK
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIV-CNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKR+GLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS

Query:  TISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        TI RAVDRL+FL+EG+VVWQGMT EFTTSTNPIVQQFASGSLDGPI+Y
Subjt:  TISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

A0A6J1CBM2 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic | e value: 1.3e-172 | % identity: 90.49
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH
        MVS+SGSV FPLTAPNCS  SRKVAVL+A +SFCFKKK+QRRIVCNCIAPP YFK D SSA    D FRS++L PDNE +D SDVLIECRDVYKSFGEKH
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKRVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS DQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        I RAVDRL+FL+EGKVVWQGMT EFTTSTNPIVQQFASG+LDGPIKY
Subjt:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

A0A6J1F2M5 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like | e value: 7.4e-176 | % identity: 91.93
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH
        MVS+SGSVFFPLT P+CS RSRKVAVL+ H+SFC KKKDQRRIVCNCIAPP +FK DGSS     D F S+HLS DNE +D SDVLIECR+VYKSFGEKH
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        I RAVDRL+FL+EGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
Subjt:  ISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

A0A6J1KHV9 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like | e value: 6.9e-174 | % identity: 90.6
Query:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDN----EDKDGSDVLIECRDVYKSF
        MVS+SGSVFFPLT P+CS RSRKVAVL+ H+SFC KKKDQRRIVCNCIAPP +FK DGSS     D F S+HLS DN    ED+D SDVLIECR+VYKSF
Subjt:  MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSA----DLFRSKHLSPDN----EDKDGSDVLIECRDVYKSF

Query:  GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSED
        GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSED
Subjt:  GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSED

Query:  QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTH
        QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKE+EPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDA GKPGKIASYI VTH
Subjt:  QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTH

Query:  QHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
        QHSTI RAVDRL+FL+EGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY
Subjt:  QHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPIKY

SwissProt top hits: e value, % identity, alignment
P30769 Probable ribonucleotide transport ATP-binding protein mkl | e value: 1.2e-37 | % identity: 35.74
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQN
        V IE + + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G   +     +EL  +R   G++FQ  ALF S+ +  N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQN

Query:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++   E +I  +V E L  VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI
              A+ ++VTH  +      D +  L    +V  G  +   TS  P+V+QF +G   GPI
Subjt:  GKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI

P63358 Probable ribonucleotide transport ATP-binding protein mkl | e value: 7.0e-38 | % identity: 36.12
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G   +     +EL  +R   G++FQ  ALF S+ +  N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQN

Query:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++   E +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI
              A+ ++VTH  +      D +  L    +V  G  +   TS  P+V+QF +G   GPI
Subjt:  GKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI

P9WQL4 Probable ribonucleotide transport ATP-binding protein mkl | e value: 7.0e-38 | % identity: 36.12
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G   +     +EL  +R   G++FQ  ALF S+ +  N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQN

Query:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++   E +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI
              A+ ++VTH  +      D +  L    +V  G  +   TS  P+V+QF +G   GPI
Subjt:  GKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI

P9WQL5 Probable ribonucleotide transport ATP-binding protein mkl | e value: 7.0e-38 | % identity: 36.12
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G   +     +EL  +R   G++FQ  ALF S+ +  N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQN

Query:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++   E +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI
              A+ ++VTH  +      D +  L    +V  G  +   TS  P+V+QF +G   GPI
Subjt:  GKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSLDGPI

Q9AT00 Protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic | e value: 2.0e-133 | % identity: 78.1
Query:  QRRIVCNCIAPPSYFKGDGSSADLFRSK--HLSPDNEDKDGSDVLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKG
        +R++ C CIAPP     D +  D        +  +   ++ SDVLIECRDVYKSFGEKHIL+GVSFKIRHGEAVGVIGPSGTGKSTILKI+AGLL+PDKG
Subjt:  QRRIVCNCIAPPSYFKGDGSSADLFRSK--HLSPDNEDKDGSDVLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKG

Query:  EVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT
        EVYIRG+KR GLI DEE+SGLRIGLVFQSAALFDSL+VR+NVGFLLYE S +SE+QIS LVT+ LAAVGLKGVE+RLPSELSGGMKKRVALARS+IFD T
Subjt:  EVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT

Query:  RKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSL
        ++ +EPEVLLYDEPTAGLDPIASTVVEDLIRSVH+  EDA GKPGKIASY+VVTHQHSTI RAVDRL+FL+EGK+VWQGMT EFTTSTNPIVQQFA+GSL
Subjt:  RKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSL

Query:  DGPIKY
        DGPI+Y
Subjt:  DGPIKY

Arabidopsis top hits: e value, % identity, alignment
AT1G65410.1 non-intrinsic ABC protein 11 | e value: 1.4e-134 | % identity: 78.1
Query:  QRRIVCNCIAPPSYFKGDGSSADLFRSK--HLSPDNEDKDGSDVLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKG
        +R++ C CIAPP     D +  D        +  +   ++ SDVLIECRDVYKSFGEKHIL+GVSFKIRHGEAVGVIGPSGTGKSTILKI+AGLL+PDKG
Subjt:  QRRIVCNCIAPPSYFKGDGSSADLFRSK--HLSPDNEDKDGSDVLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKG

Query:  EVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT
        EVYIRG+KR GLI DEE+SGLRIGLVFQSAALFDSL+VR+NVGFLLYE S +SE+QIS LVT+ LAAVGLKGVE+RLPSELSGGMKKRVALARS+IFD T
Subjt:  EVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT

Query:  RKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSL
        ++ +EPEVLLYDEPTAGLDPIASTVVEDLIRSVH+  EDA GKPGKIASY+VVTHQHSTI RAVDRL+FL+EGK+VWQGMT EFTTSTNPIVQQFA+GSL
Subjt:  RKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQFASGSL

Query:  DGPIKY
        DGPI+Y
Subjt:  DGPIKY

AT1G67940.1 non-intrinsic ABC protein 3 | e value: 5.0e-23 | % identity: 31.91
Query:  GSDVLIECRDVYKSFGE-KHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVR
        GS+  I   D+ +   +   IL+GV+  I  G  VGVIGPSG+GKST L+ +  L  P +  V++ G     +  D      R+G++FQ   LF   TV 
Subjt:  GSDVLIECRDVYKSFGE-KHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVR

Query:  QNVGF-LLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGE
         NV +        LS++++  L+  +LA +     + +  +ELS G  +RVALAR++         EPEVLL DEPT+ LDPI++  +ED+I  +     
Subjt:  QNVGF-LLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGE

Query:  DASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQF
            K  +  + ++V+H    I +  D +  + +G++V      E + +T+P+ Q+F
Subjt:  DASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQQF

AT2G47000.1 ATP binding cassette subfamily B4 | e value: 2.3e-20 | % identity: 30.24
Query:  IECRDVYKSF---GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQ
        IE +DVY ++    ++ I RG S  I  G  V ++G SG+GKST++ +I     P  G+V I G      + + +L  +R  IGLV Q   LF + +++ 
Subjt:  IECRDVYKSF---GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQ

Query:  NVGFLLYENSSLSEDQISALVTENLAAV-----GLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHI
        N+ +   E+++  E + +A +      V     GL  +     ++LSGG K+R+A+AR+I+ D       P +LL DE T+ LD  +  VV++ +  + +
Subjt:  NVGFLLYENSSLSEDQISALVTENLAAV-----GLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHI

Query:  KGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDE
                     + +VV H+ ST+ R  D +  +H+GK+V +G   E
Subjt:  KGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDE

AT3G62150.1 P-glycoprotein 21 | e value: 6.1e-21 | % identity: 30.54
Query:  YKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQNVGFLLYEN
        Y +  E+ I RG S  I  G  V ++G SG+GKST++ +I     P  GEV I G      + + +L  +R  IGLV Q   LF S ++++N+ +   EN
Subjt:  YKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLR--IGLVFQSAALFDSLTVRQNVGFLLYEN

Query:  SSLSE-----DQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKP
        +++ E     +  +A    +    GL  +     ++LSGG K+R+A+AR+I+ D       P +LL DE T+ LD  +  +V++ +  + +         
Subjt:  SSLSE-----DQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKP

Query:  GKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDE
            + +VV H+ ST+ R  D +  +H+GK+V +G   E
Subjt:  GKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDE

AT5G46540.1 P-glycoprotein 7 | e value: 1.8e-20 | % identity: 30.95
Query:  IECRDVYKSFGEK---HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELS----GLRIGLVFQSAALFDSLTV
        IE RDVY  +  +    I  G S  + +G  V ++G SG+GKST++ +I     P+ GEV I G      ID ++        +IGLV Q   LF + T+
Subjt:  IECRDVYKSFGEK---HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELS----GLRIGLVFQSAALFDSLTV

Query:  RQNVGFLLYENSSLSEDQI-SALVTENLA------AVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIR
        R+N+   +Y     S+ +I +AL   N +        GL+ +     ++LSGG K+R+A+AR+I+         P++LL DE T+ LD  +  +V+D + 
Subjt:  RQNVGFLLYENSSLSEDQI-SALVTENLA------AVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIR

Query:  SVHIKGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDE
         + +             + +VV H+ +TI R  D +  + +GKV+ +G  DE
Subjt:  SVHIKGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDE
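In the alignment blocks above, the unlabeled middle line marks identical residues with a letter and conservative substitutions with "+". As a rough sketch, identities and positives for one block can be counted from that line. The function name match_line_stats is hypothetical, and the sketch assumes the usual BLAST pairwise convention of dividing identities by the aligned length (gap columns included).

```python
def match_line_stats(query_segment, match_line):
    """Count identities and positives in one alignment block.

    query_segment: residues from a 'Query:' line (may contain '-' gaps)
    match_line:    the unlabeled middle line, same columns as the query
    """
    length = len(query_segment)
    # Trailing spaces on the match line are often stripped; pad them back.
    padded = match_line.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in padded)   # letters = identical
    positives = identities + padded.count("+")        # '+' = conservative
    return identities, positives, length

# Final block of the AT1G65410.1 hit: query 'DGPIKY', match line 'DGPI+Y'.
identities, positives, length = match_line_stats("DGPIKY", "DGPI+Y")
percent_identity = 100.0 * identities / length
print(identities, positives, length, round(percent_identity, 2))
```

To approximate the % identity reported for a whole hit, the identities and lengths would be summed over all blocks of that hit before dividing.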


Sequences
CDS sequence
ATGGTCTCTGTATCGGGTTCCGTGTTCTTTCCATTAACAGCACCCAATTGTTCCTTTCGATCGCGTAAAGTAGCTGTTCTTAATGCGCACGATTCGTTTTGTTTCAAGAA
AAAGGACCAGAGAAGGATTGTCTGTAATTGCATTGCGCCACCGTCATACTTCAAGGGTGATGGATCATCTGCCGACTTATTTAGATCAAAGCATTTGAGCCCAGATAATG
AGGACAAGGATGGGTCTGATGTTCTCATTGAGTGTAGAGATGTTTACAAATCCTTTGGAGAAAAGCATATATTGAGAGGTGTGAGCTTCAAGATTAGACATGGAGAAGCT
GTGGGTGTAATTGGGCCTTCTGGTACTGGGAAGTCTACAATACTGAAGATCATTGCAGGTCTTCTTTCTCCAGACAAGGGAGAGGTATATATTCGAGGTAGAAAGCGAGT
TGGTTTGATTGATGATGAGGAGCTATCTGGTCTTCGAATTGGACTGGTTTTCCAGAGTGCGGCACTTTTTGACTCATTAACTGTTCGGCAAAATGTTGGTTTCCTTTTGT
ATGAAAATTCAAGCTTGTCTGAAGATCAAATTTCAGCATTGGTAACTGAGAACTTAGCTGCTGTTGGGCTGAAGGGAGTTGAGGACCGGTTACCTTCTGAATTATCAGGT
GGAATGAAGAAACGAGTTGCATTAGCTAGGTCTATAATTTTTGATAACACGAGGAAAGAAGTTGAGCCAGAGGTGCTTTTGTATGATGAACCAACTGCTGGACTTGACCC
AATTGCATCAACCGTTGTTGAAGATCTTATACGTTCTGTACATATTAAGGGTGAGGATGCTAGTGGGAAGCCTGGGAAGATTGCATCTTATATCGTAGTCACCCACCAAC
ATAGTACTATTAGTAGAGCTGTTGACAGGTTAATATTTTTACACGAGGGGAAGGTTGTGTGGCAAGGAATGACTGATGAATTTACAACATCTACAAATCCAATCGTTCAA
CAGTTCGCATCGGGAAGCTTGGATGGTCCGATTAAGTACTAG
mRNA sequence
CTATCTTTCTCTAAAATAGCACCATTATTTTGTAATCAGAACTTTTAAAACTAAATCAAAATCCATGCAAAAACAAAGAAAAAAGGAAGAAATTAAATATTGAGATTTCA
AAGAACAAAAATTGAAGGAAAGGAGGGGAGCCACGATCGAACTTCTCATCGCAAACCAAGCATACGACAGGGAGTTCTCCATAATTCTATGAGATTTTCTTTCAATCTTT
GAAGATTCTGTTCATTCCCATCTCTTTTCTTCATCTTTCCAAACCTAATTCTTGGCCATTTCTGCATTTTGAGTTCATTCTGAAGCACAATGCATTGACCCATCTAGACC
AAATCCAGTTTTGAAGGGGTATTTGCCCATAATCCTTTTTGGGTTTTCTTAAATTTGTTTCGATTGCATTCAAACAAGTTATTAGAGGCTGATTTTTTACTGGTTGCATC
GTGGGTCAGCTTCAAAATCTCAAAATTCATCAACCCTTCAAGGGTCAATCCCCACAACATGGTCTCTGTATCGGGTTCCGTGTTCTTTCCATTAACAGCACCCAATTGTT
CCTTTCGATCGCGTAAAGTAGCTGTTCTTAATGCGCACGATTCGTTTTGTTTCAAGAAAAAGGACCAGAGAAGGATTGTCTGTAATTGCATTGCGCCACCGTCATACTTC
AAGGGTGATGGATCATCTGCCGACTTATTTAGATCAAAGCATTTGAGCCCAGATAATGAGGACAAGGATGGGTCTGATGTTCTCATTGAGTGTAGAGATGTTTACAAATC
CTTTGGAGAAAAGCATATATTGAGAGGTGTGAGCTTCAAGATTAGACATGGAGAAGCTGTGGGTGTAATTGGGCCTTCTGGTACTGGGAAGTCTACAATACTGAAGATCA
TTGCAGGTCTTCTTTCTCCAGACAAGGGAGAGGTATATATTCGAGGTAGAAAGCGAGTTGGTTTGATTGATGATGAGGAGCTATCTGGTCTTCGAATTGGACTGGTTTTC
CAGAGTGCGGCACTTTTTGACTCATTAACTGTTCGGCAAAATGTTGGTTTCCTTTTGTATGAAAATTCAAGCTTGTCTGAAGATCAAATTTCAGCATTGGTAACTGAGAA
CTTAGCTGCTGTTGGGCTGAAGGGAGTTGAGGACCGGTTACCTTCTGAATTATCAGGTGGAATGAAGAAACGAGTTGCATTAGCTAGGTCTATAATTTTTGATAACACGA
GGAAAGAAGTTGAGCCAGAGGTGCTTTTGTATGATGAACCAACTGCTGGACTTGACCCAATTGCATCAACCGTTGTTGAAGATCTTATACGTTCTGTACATATTAAGGGT
GAGGATGCTAGTGGGAAGCCTGGGAAGATTGCATCTTATATCGTAGTCACCCACCAACATAGTACTATTAGTAGAGCTGTTGACAGGTTAATATTTTTACACGAGGGGAA
GGTTGTGTGGCAAGGAATGACTGATGAATTTACAACATCTACAAATCCAATCGTTCAACAGTTCGCATCGGGAAGCTTGGATGGTCCGATTAAGTACTAGCAGATATACG
AATGATGTAGTAGTTTAGTAGTTAAAGAGAACACCAAAGATGGACCAATTCTACACATTACTGATCAAGGAAACTAGTAAATTAAGGCTCATCATGTCTGCATATATACT
CCTCTATTGTAATGTCTATTTCTTTTGTTTTTTCTTATTACGAGGCAAATGTTACAAGATTTTACAATATGTGTGGATCATTTTTGTTTTTCTGCCAAA
Protein sequence
MVSVSGSVFFPLTAPNCSFRSRKVAVLNAHDSFCFKKKDQRRIVCNCIAPPSYFKGDGSSADLFRSKHLSPDNEDKDGSDVLIECRDVYKSFGEKHILRGVSFKIRHGEA
VGVIGPSGTGKSTILKIIAGLLSPDKGEVYIRGRKRVGLIDDEELSGLRIGLVFQSAALFDSLTVRQNVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSG
GMKKRVALARSIIFDNTRKEVEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTISRAVDRLIFLHEGKVVWQGMTDEFTTSTNPIVQ
QFASGSLDGPIKY
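The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the name translate is illustrative and is not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGTCTCTGTATCGGGTTCC"))
print(translate("ATTAAGTACTAG"))
```

Joining the wrapped CDS lines into one string and passing it to translate should reproduce the full protein sequence if the record is internally consistent.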