CuGenDBv2

MC06g1149 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g1149
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic
Genome location: MC06:13034340..13056442
RNA-Seq Expression: MC06g1149
Synteny: MC06g1149
Gene Ontology terms: GO:0005524 - ATP binding (molecular function)
InterPro domains: IPR003439 - ABC transporter-like, ATP-binding domain
IPR003593 - AAA+ ATPase domain
IPR017871 - ABC transporter-like, conserved site
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
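The genome location above is given as `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so explicitly), the string can be split into its parts and the genomic span computed. The names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval such as MC06:13034340..13056442."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse 'SEQID:START..END' into a Location."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("MC06:13034340..13056442")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene region spans 22103 bp, far longer than the mRNA listed further down, which is consistent with the locus containing introns.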


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588286.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.25e-225; identity: 92.8%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH
        MVSISGSV FPLT P+CSS SRKVAVLD  NSFC KKK+QRRIVCNCIAPPP+FKSD SS VN NDSF SE+L  DNE EDESDVLIECR+VYKSFGEKH
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKRVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS DQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST

Query:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        IRRAVDRLLFLYEGKVVWQGMT EFTTSTNPIVQQFASG+LDGPIKY
Subjt:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

XP_022139091.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic [Momordica charantia] (e value: 1.16e-246; identity: 100%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH
        MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST

Query:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
Subjt:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

XP_022932753.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like [Cucurbita moschata] (e value: 1.77e-225; identity: 92.8%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH
        MVSISGSV FPLT P+CSS SRKVAVLD  NSFC KKK+QRRIVCNCIAPPP+FKSD SS VN NDSF SE+L  DNE EDESDVLIECR+VYKSFGEKH
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKRVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS DQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST

Query:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        IRRAVDRLLFLYEGKVVWQGMT EFTTSTNPIVQQFASG+LDGPIKY
Subjt:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

XP_023520683.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value: 1.02e-224; identity: 92.51%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH
        MVSISGSV FPLT P+CSS SRKVAVLD  NSFC KKK+QRRIVCNCIAPPP+FKSD SS VN NDSF SE+L  DNE EDESDVLIECR+VYKSFGEKH
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKRVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS DQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDN RKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST

Query:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        IRRAVDRLLFLYEGKVVWQGMT EFTTSTNPIVQQFASG+LDGPIKY
Subjt:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

XP_038876767.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic isoform X1 [Benincasa hispida] (e value: 6.42e-226; identity: 92.82%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIV-CNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEK
        MVSISGSV FPLT PNCS+ SRKVAVLDA NSFC KKK+QRRIV CNCIAPPPYFKSD SSAVNSNDSFRSE+L PDNE +DESDVLIECR+VYKSFGEK
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIV-CNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKR GLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS +QIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHS

Query:  TIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        TIRRAVDRLLFLYEGKVVWQGMT EFTTSTNPIVQQFASG+LDGPI+Y
Subjt:  TIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LY39 ABC transporter domain-containing protein (e value: 3.47e-223; identity: 91.67%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIV-CNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEK
        MVS+SGSV FPLT P+CSS SRKVAV+DA NSFC K K+QRRIV CNCIAPPPYFKSDESSAVNSNDSFRSE+L  +NE ++ESDVLIECR+V+KSFGEK
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIV-CNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKRVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS DQIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHS

Query:  TIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        TIRRAVDRLLFLYEGKVVWQGMT EFTTSTNPIVQQFASG+LDGPI+Y
Subjt:  TIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

A0A1S3B8N5 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic isoform X1 (e value: 1.48e-224; identity: 92.24%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIV-CNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEK
        MVSISGSV FPLT PNCSS SRKVAVLDA NSFC K K+QRRIV CNCIAPPPYFKSDESSAVNSNDSFRSE+L  +NE +DESDVLIECR+V+KSFGEK
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIV-CNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKR+GLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS DQIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHS

Query:  TIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        TIRRAVDRLLFLYEG+VVWQGMT EFTTSTNPIVQQFASG+LDGPI+Y
Subjt:  TIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

A0A6J1CBM2 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic (e value: 5.61e-247; identity: 100%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH
        MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST

Query:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
Subjt:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

A0A6J1F2M5 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like (e value: 8.59e-226; identity: 92.8%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH
        MVSISGSV FPLT P+CSS SRKVAVLD  NSFC KKK+QRRIVCNCIAPPP+FKSD SS VN NDSF SE+L  DNE EDESDVLIECR+VYKSFGEKH
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKRVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS DQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHST

Query:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        IRRAVDRLLFLYEGKVVWQGMT EFTTSTNPIVQQFASG+LDGPIKY
Subjt:  IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

A0A6J1KHV9 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like (e value: 3.18e-222; identity: 91.17%)
Query:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDE----SDVLIECRDVYKSF
        MVSISGSV FPLT P+CSS SRKVAVLD  NSFC KKK+QRRIVCNCIAPPP+FKSD SS VN NDSF SE+L  DNE EDE    SDVLIECR+VYKSF
Subjt:  MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDE----SDVLIECRDVYKSF

Query:  GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGD
        GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGRKRVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLS D
Subjt:  GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGD

Query:  QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTH
        QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDA GKPG IASYI VTH
Subjt:  QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTH

Query:  QHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        QHSTIRRAVDRLLFLYEGKVVWQGMT EFTTSTNPIVQQFASG+LDGPIKY
Subjt:  QHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

SwissProt top hits (e value, % identity, alignment)
P30769 Probable ribonucleotide transport ATP-binding protein mkl (e value: 3.2e-38; identity: 36.12%)
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN
        V IE + + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G   +     E  EI  L  G++FQ  ALF S+ + +N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN

Query:  VGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++     +I  +V E L  VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPI
              A+ ++VTH  +  R   D +  L+   +V  G      TS  P+V+QF +G   GPI
Subjt:  GKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPI

P63358 Probable ribonucleotide transport ATP-binding protein mkl (e value: 1.9e-38; identity: 36.5%)
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G   +     E  EI  L  G++FQ  ALF S+ + +N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN

Query:  VGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++     +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPI
              A+ ++VTH  +  R   D +  L+   +V  G      TS  P+V+QF +G   GPI
Subjt:  GKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPI

P9WQL4 Probable ribonucleotide transport ATP-binding protein mkl (e value: 1.9e-38; identity: 36.5%)
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G   +     E  EI  L  G++FQ  ALF S+ + +N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN

Query:  VGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++     +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPI
              A+ ++VTH  +  R   D +  L+   +V  G      TS  P+V+QF +G   GPI
Subjt:  GKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPI

P9WQL5 Probable ribonucleotide transport ATP-binding protein mkl (e value: 1.9e-38; identity: 36.5%)
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G   +     E  EI  L  G++FQ  ALF S+ + +N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN

Query:  VGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++     +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPI
              A+ ++VTH  +  R   D +  L+   +V  G      TS  P+V+QF +G   GPI
Subjt:  GKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPI

Q9AT00 Protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic (e value: 1.4e-134; identity: 73.33%)
Query:  SISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKHIL
        S S S L P +     S S +  V+   +   F+    R++ C CIAPP    +D +   +   S     +  +   E++SDVLIECRDVYKSFGEKHIL
Subjt:  SISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKHIL

Query:  RGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISALV
        +GVSFKIRHGEAVGVIGPSGTGKSTILKI+AGLLAPDKGEVYIRG+KR GLI DEEISGLRIGLVFQSAALFDSL+VRENVGFLLYE S +S +QIS LV
Subjt:  RGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISALV

Query:  TENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHSTIR
        T+ LAAVGLKGVE+RLPSELSGGMKKRVALARS+IFD T++ IEPEVLLYDEPTAGLDPIASTVVEDLIRSVH+  EDA GKPG IASY+VVTHQHSTI+
Subjt:  TENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHSTIR

Query:  RAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        RAVDRLLFLYEGK+VWQGMTHEFTTSTNPIVQQFA+G+LDGPI+Y
Subjt:  RAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

Arabidopsis top hits (e value, % identity, alignment)
AT1G65410.1 non-intrinsic ABC protein 11 (e value: 9.8e-136; identity: 73.33%)
Query:  SISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKHIL
        S S S L P +     S S +  V+   +   F+    R++ C CIAPP    +D +   +   S     +  +   E++SDVLIECRDVYKSFGEKHIL
Subjt:  SISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKHIL

Query:  RGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISALV
        +GVSFKIRHGEAVGVIGPSGTGKSTILKI+AGLLAPDKGEVYIRG+KR GLI DEEISGLRIGLVFQSAALFDSL+VRENVGFLLYE S +S +QIS LV
Subjt:  RGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISALV

Query:  TENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHSTIR
        T+ LAAVGLKGVE+RLPSELSGGMKKRVALARS+IFD T++ IEPEVLLYDEPTAGLDPIASTVVEDLIRSVH+  EDA GKPG IASY+VVTHQHSTI+
Subjt:  TENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHSTIR

Query:  RAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY
        RAVDRLLFLYEGK+VWQGMTHEFTTSTNPIVQQFA+G+LDGPI+Y
Subjt:  RAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY

AT1G67940.1 non-intrinsic ABC protein 3 (e value: 3.9e-23; identity: 32.92%)
Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGL--RIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQI
        IL+GV+  I  G  VGVIGPSG+GKST L+ +  L  P +  V++ G      I + ++  L  R+G++FQ   LF   TV +NV +      +L G+++
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGL--RIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQI

Query:  SALVTENLAAVG--LKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTH
        S      L ++         +  +ELS G  +RVALAR++         EPEVLL DEPT+ LDPI++  +ED+I    +K +   G      + ++V+H
Subjt:  SALVTENLAAVG--LKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTH

Query:  QHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQF
            I++  D +  + +G++V      E + +T+P+ Q+F
Subjt:  QHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQF

AT3G47760.1 ABC2 homolog 4 (e value: 2.3e-20; identity: 29.71%)
Query:  CRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLY
        CRD      +K  +RG+S  +  GE  G++GP+G GK++ + ++ GL+ P  G  ++ G   + +  D +I    IG+  Q   L+++LT RE++  L Y
Subjt:  CRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLY

Query:  EN-SSLSGDQISALVTENLAAVGL--KGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKP
            +L G  +   V E+L +V L   GV D+   + SGGMK+R+++A S+I         P+V+  DEP+ GLDP +   +   I+           + 
Subjt:  EN-SSLSGDQISALVTENLAAVGL--KGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKP

Query:  GNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHE
         N  + I+ TH         DRL    +G++   G   E
Subjt:  GNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHE

AT3G62150.1 P-glycoprotein 21 (e value: 2.3e-20; identity: 29.83%)
Query:  YKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLR--IGLVFQSAALFDSLTVRENVGF----L
        Y +  E+ I RG S  I  G  V ++G SG+GKST++ +I     P  GEV I G      + + ++  +R  IGLV Q   LF S +++EN+ +     
Subjt:  YKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLR--IGLVFQSAALFDSLTVRENVGF----L

Query:  LYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG
          E    + +  +A    +    GL  +     ++LSGG K+R+A+AR+I+ D       P +LL DE T+ LD  +  +V++ +  + +          
Subjt:  LYENSSLSGDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG

Query:  NIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHE
           + +VV H+ ST+R A D +  +++GK+V +G   E
Subjt:  NIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHE

AT5G46540.1 P-glycoprotein 7 (e value: 2.3e-20; identity: 31.6%)
Query:  IECRDVYKSFGEK---HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLR--IGLVFQSAALFDSLTVRE
        IE RDVY  +  +    I  G S  + +G  V ++G SG+GKST++ +I     P+ GEV I G      +   ++  +R  IGLV Q   LF + T+RE
Subjt:  IECRDVYKSFGEK---HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLR--IGLVFQSAALFDSLTVRE

Query:  NVGFLLYENSSLSGDQI-SALVTENLA------AVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSV
        N+   +Y     S  +I +AL   N +        GL+ +     ++LSGG K+R+A+AR+I+ +       P++LL DE T+ LD  +  +V+D +  +
Subjt:  NVGFLLYENSSLSGDQI-SALVTENLA------AVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSV

Query:  HIKGEDASGKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHE
         +             + +VV H+ +TIR A D +  + +GKV+ +G TH+
Subjt:  HIKGEDASGKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHE
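The alignments above are laid out in three-line blocks, with an unlabelled middle line between `Query:` and `Subjt:`. A minimal sketch of how identity could be tallied for one block, assuming the usual BLAST pairwise convention (a letter on the middle line marks an identical residue, `+` a conservative substitution, a space a mismatch or gap). The function name `midline_stats` is hypothetical, and the figures it returns apply to a single block, not to the whole-alignment % identity reported per hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    Assumes BLAST-style midline semantics: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query segment")
    # Trailing spaces on the midline are often lost in copy/paste; pad them back.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return {
        "length": len(query),
        "identical": identical,
        "positives": positives,
        "pct_identity": round(100.0 * identical / len(query), 2),
    }


# Last block of the first GenBank hit (KAG6588286.1) on this page.
query = "IRRAVDRLLFLYEGKVVWQGMTHEFTTSTNPIVQQFASGNLDGPIKY"
midline = "IRRAVDRLLFLYEGKVVWQGMT EFTTSTNPIVQQFASG+LDGPIKY"
stats = midline_stats(query, midline)
print(stats)
```

Summing `identical` and `length` over all blocks of a hit, rather than averaging per-block percentages, is the way to approximate the overall identity figure.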


Sequences
CDS sequence
ATGGTTTCTATATCGGGTTCGGTGTTGTTCCCATTAACTGCACCCAATTGTTCCTCTGGATCGCGTAAAGTAGCTGTTCTTGATGCGTCCAATTCGTTTTGTTTTAAGAA
AAAGGAGCAGAGGAGGATTGTTTGTAATTGCATTGCGCCACCGCCGTACTTCAAGAGTGATGAGTCATCTGCCGTAAATTCCAATGACTCGTTTAGATCAGAAAATTTAA
TCCCAGATAATGAGCATGAGGATGAGTCTGATGTTCTCATCGAGTGTAGAGATGTCTACAAATCCTTTGGGGAAAAGCATATACTGAGAGGTGTGAGCTTCAAGATTAGA
CATGGAGAAGCCGTGGGAGTAATTGGGCCTTCTGGTACTGGGAAGTCTACAATACTGAAGATCATTGCCGGTCTTCTTGCTCCAGACAAGGGTGAGGTATATATTCGAGG
TAGAAAGCGAGTTGGTTTGATCGATGATGAGGAGATATCTGGTCTTCGAATTGGATTGGTTTTTCAAAGTGCAGCCCTTTTTGACTCATTAACTGTTCGAGAAAATGTTG
GTTTCCTTTTGTATGAAAATTCAAGCTTGTCTGGAGATCAAATCTCAGCTCTGGTAACTGAGAATTTAGCTGCTGTTGGGCTGAAGGGAGTTGAGGATCGGTTACCTTCT
GAATTATCAGGTGGAATGAAGAAACGAGTTGCTTTAGCTAGGTCTATAATCTTTGATAACACAAGGAAAGAAATTGAGCCAGAGGTACTCTTGTATGATGAACCAACTGC
TGGACTCGATCCAATTGCATCAACTGTCGTTGAAGATCTTATCCGTTCTGTACATATTAAGGGCGAAGATGCTAGCGGGAAGCCCGGAAATATTGCGTCCTATATTGTAG
TCACCCACCAACATAGTACCATTAGGAGAGCTGTTGACAGGTTATTGTTTTTATACGAGGGGAAAGTTGTCTGGCAAGGAATGACTCATGAATTTACTACATCAACAAAT
CCGATTGTTCAACAGTTTGCATCGGGAAACTTGGACGGTCCAATTAAGTACTAG
mRNA sequence
GTATTTATCATAGTATAAGAGAATTATATTGGAAAGAGGAACAGGGCTAACGTCTCATTGCAAGCCCACCTCATCACGTCACCGCGGCAAGAATGCGACGGAGAATCAAT
CAGCAAGCACATCAAGTTTGAGAATTCTTAACAATGCTCTGCTCCCATCTCTGTTCTTTCACGTTTCCAAATCTTTTTCTGGGCGACCTCTGTAGCATTTAGCGTTCATT
CTGAAGCACAATTCGTTAAACCATCCTCGGTTTAGCTTGAGATGCAGGATTTAATCGGCTTTTGAGGGGGTTTTGTTCAAAATCCTATCTGGGTTCTTCGAATTTATGAA
GTTTCGATTGCATTGAAACTAAATTAGTACGAAACTGTTTTTGAGCTGGTTACTATCGCGGGTCAGCTTCAAAATCACGAAGTTCGTTTACCCTCTAACGGGTTGACTCC
GGAAACATGGTTTCTATATCGGGTTCGGTGTTGTTCCCATTAACTGCACCCAATTGTTCCTCTGGATCGCGTAAAGTAGCTGTTCTTGATGCGTCCAATTCGTTTTGTTT
TAAGAAAAAGGAGCAGAGGAGGATTGTTTGTAATTGCATTGCGCCACCGCCGTACTTCAAGAGTGATGAGTCATCTGCCGTAAATTCCAATGACTCGTTTAGATCAGAAA
ATTTAATCCCAGATAATGAGCATGAGGATGAGTCTGATGTTCTCATCGAGTGTAGAGATGTCTACAAATCCTTTGGGGAAAAGCATATACTGAGAGGTGTGAGCTTCAAG
ATTAGACATGGAGAAGCCGTGGGAGTAATTGGGCCTTCTGGTACTGGGAAGTCTACAATACTGAAGATCATTGCCGGTCTTCTTGCTCCAGACAAGGGTGAGGTATATAT
TCGAGGTAGAAAGCGAGTTGGTTTGATCGATGATGAGGAGATATCTGGTCTTCGAATTGGATTGGTTTTTCAAAGTGCAGCCCTTTTTGACTCATTAACTGTTCGAGAAA
ATGTTGGTTTCCTTTTGTATGAAAATTCAAGCTTGTCTGGAGATCAAATCTCAGCTCTGGTAACTGAGAATTTAGCTGCTGTTGGGCTGAAGGGAGTTGAGGATCGGTTA
CCTTCTGAATTATCAGGTGGAATGAAGAAACGAGTTGCTTTAGCTAGGTCTATAATCTTTGATAACACAAGGAAAGAAATTGAGCCAGAGGTACTCTTGTATGATGAACC
AACTGCTGGACTCGATCCAATTGCATCAACTGTCGTTGAAGATCTTATCCGTTCTGTACATATTAAGGGCGAAGATGCTAGCGGGAAGCCCGGAAATATTGCGTCCTATA
TTGTAGTCACCCACCAACATAGTACCATTAGGAGAGCTGTTGACAGGTTATTGTTTTTATACGAGGGGAAAGTTGTCTGGCAAGGAATGACTCATGAATTTACTACATCA
ACAAATCCGATTGTTCAACAGTTTGCATCGGGAAACTTGGACGGTCCAATTAAGTACTAGCAGCAGTAACACTAATGATCATGTACTTTTAGATTAGACACAGCACCATT
CCTAATTCTTTGGAGGCAGCAGAGTAGACCCAGAGAAGCTCATGTCTGTAAATTCTTTTTTTTTTCCTTTTAAAGCTTATATGTCTCCTTTATTCACGTAGCTTTGTCCA
ATGTCTGCATACTCCTGTTAGAATGTTTTTGTTTAGTAATAGGTAGATGTTAAAAGTACAAAATCATGGAATATGTGAGGATCATTTTAGTATTTCTGCCAAACAAAAAG
ATCATTTCAGGATGAGGTTTCATCAAATCCTGATCTGCCTAAACTGATTATTTTTGTTCTAATTTTATACAAAATGTGAAGTACAATATCTTTTTCAATCATTTACAGGG
TTCAAAGAAAGGGTGGGGTATTGATAAAAATCAATTCCAAATCACAATTGTCAAGGCTCACGCTACAACATGAGTTCGAACAACTGGAAACAGATTTGAATACAGAAAGA
GATTCGTACCAGTTGCATTTACTTTCCCATTTGCTTAGTAACAAACAGACAGCTATATTTACTCATTTACGAAGGCTTACCTCAAATTCGATCCCCTTATGATTGCTTCC
ACTTGAAGAGTATTACTTGCACTTGCACCATAAGTACTAGACTTTGATAAAGTAATGGGGAGTTAACATTATTTCAAAGTATTTTTGTTCCTTTTTTTGTTCACTTTGAA
GCATATGGGAATAAAAATGTGAAGTCAAGAAATGTGATGTTTGAAAGTAGAATGGTTGTACTTGGTAATATGTTGGAATGAGATTTGTGAAGTGACAATTTTCTAGAACA
ATGTGCTATAAAATATCCATTCTCTTCCCTTTATCATGGCA
Protein sequence
MVSISGSVLFPLTAPNCSSGSRKVAVLDASNSFCFKKKEQRRIVCNCIAPPPYFKSDESSAVNSNDSFRSENLIPDNEHEDESDVLIECRDVYKSFGEKHILRGVSFKIR
HGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRKRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSGDQISALVTENLAAVGLKGVEDRLPS
ELSGGMKKRVALARSIIFDNTRKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGNIASYIVVTHQHSTIRRAVDRLLFLYEGKVVWQGMTHEFTTSTN
PIVQQFASGNLDGPIKY
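The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first and last few codons of the CDS for brevity; `translate` is a hypothetical helper, and a full check would pass the complete CDS with line breaks removed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS shown on this page.
cds_head = "ATGGTTTCTATATCGGGTTCGGTGTTGTTC"
cds_tail = "GGTCCAATTAAGTACTAG"
print(translate(cds_head))  # start of the protein
print(translate(cds_tail))  # end of the protein, before the TAG stop
```

The two fragments translate to `MVSISGSVLF` and `GPIKY`, matching the start and end of the protein sequence given on the page.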