; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016720 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016720
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic
Genome locationtig00152985:1549826..1561809
RNA-Seq ExpressionSgr016720
SyntenySgr016720
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
InterPro domainsIPR003439 - ABC transporter-like, ATP-binding domain
IPR003593 - AAA+ ATPase domain
IPR017871 - ABC transporter-like, conserved site
IPR027417 - P-loop containing nucleoside triphosphate hydrolase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588286.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.7e-15191.23Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH
        MVSI GSVF PLT+P+CSSRSRKVAV D  NSFC KKK+QR+I+CNCIAPP + KSDGSS VN NDSF SE LS DN+ EDESDVLIECR+VYKSFGEKH
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGR RVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLSEDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  IRRAVDRL
        IRRAVDRL
Subjt:  IRRAVDRL

XP_022139091.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic [Momordica charantia]7.4e-15593.51Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH
        MVSI GSV  PLT PNCSS SRKVAV D  NSFCFKKKEQR+I+CNCIAPP Y KSD SSAVNSNDSFRSE+L PDN+HEDESDVLIECRDVYKSFGEKH
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGR RVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLS DQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  IRRAVDRL
        IRRAVDRL
Subjt:  IRRAVDRL

XP_022932753.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like [Cucurbita moschata]2.2e-15191.23Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH
        MVSI GSVF PLT+P+CSSRSRKVAV D  NSFC KKK+QR+I+CNCIAPP + KSDGSS VN NDSF SE LS DN+ EDESDVLIECR+VYKSFGEKH
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGR RVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLSEDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  IRRAVDRL
        IRRAVDRL
Subjt:  IRRAVDRL

XP_023520683.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like [Cucurbita pepo subsp. pepo]8.5e-15190.91Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH
        MVSI GSVF PLT+P+CSSRSRKVAV D  NSFC KKK+QR+I+CNCIAPP + KSDGSS VN NDSF SE LS DN+ EDESDVLIECR+VYKSFGEKH
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGR RVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLSEDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDN +KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  IRRAVDRL
        IRRAVDRL
Subjt:  IRRAVDRL

XP_038876767.1 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic isoform X1 [Benincasa hispida]1.0e-15191.26Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKI-ICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEK
        MVSI GSVF PLT+PNCS+RSRKVAV D  NSFC KKK+QR+I +CNCIAPP Y KSDGSSAVNSNDSFRSE LSPDN+ +DESDVLIECR+VYKSFGEK
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKI-ICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGR R GLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLSE+QIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS

Query:  TIRRAVDRL
        TIRRAVDRL
Subjt:  TIRRAVDRL

TrEMBL top hitse value%identityAlignment
A0A0A0LY39 ABC transporter domain-containing protein2.5e-14889.64Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKII-CNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEK
        MVS+ GSVF PLT+P+CSSRSRKVAV D  NSFC K K+QR+I+ CNCIAPP Y KSD SSAVNSNDSFRSE LS +N+ ++ESDVLIECR+V+KSFGEK
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKII-CNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGR RVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLSEDQIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS

Query:  TIRRAVDRL
        TIRRAVDRL
Subjt:  TIRRAVDRL

A0A1S3B8N5 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic isoform X11.7e-14990.29Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKII-CNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEK
        MVSI GSVF PLT+PNCSSRSRKVAV D  NSFC K K+QR+I+ CNCIAPP Y KSD SSAVNSNDSFRSE LS +N+ +DESDVLIECR+V+KSFGEK
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKII-CNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEK

Query:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQIS
        HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGR R+GLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLSEDQIS
Subjt:  HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQIS

Query:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
         LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS
Subjt:  ALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHS

Query:  TIRRAVDRL
        TIRRAVDRL
Subjt:  TIRRAVDRL

A0A6J1CBM2 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic3.6e-15593.51Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH
        MVSI GSV  PLT PNCSS SRKVAV D  NSFCFKKKEQR+I+CNCIAPP Y KSD SSAVNSNDSFRSE+L PDN+HEDESDVLIECRDVYKSFGEKH
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGR RVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLS DQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPG IASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  IRRAVDRL
        IRRAVDRL
Subjt:  IRRAVDRL

A0A6J1F2M5 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like1.1e-15191.23Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH
        MVSI GSVF PLT+P+CSSRSRKVAV D  NSFC KKK+QR+I+CNCIAPP + KSDGSS VN NDSF SE LS DN+ EDESDVLIECR+VYKSFGEKH
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKH

Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA
        ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGR RVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLSEDQISA
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISA

Query:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
        LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST
Subjt:  LVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHST

Query:  IRRAVDRL
        IRRAVDRL
Subjt:  IRRAVDRL

A0A6J1KHV9 protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic-like5.0e-14989.42Show/hide
Query:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDN----KHEDESDVLIECRDVYKSF
        MVSI GSVF PLT+P+CSSRSRKVAV D  NSFC KKK+QR+I+CNCIAPP + KSDGSS VN NDSF SE LS DN    + EDESDVLIECR+VYKSF
Subjt:  MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDN----KHEDESDVLIECRDVYKSF

Query:  GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSED
        GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLL+PDKGEVYIRGR RVGLIDDEE+SGLRIGLVFQSAALFDSLTVR+NVGFLLYENSSLSED
Subjt:  GEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSED

Query:  QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTH
        QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNT+KEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDA GKPGKIASYI VTH
Subjt:  QISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTH

Query:  QHSTIRRAVDRL
        QHSTIRRAVDRL
Subjt:  QHSTIRRAVDRL

SwissProt top hitse value%identityAlignment
P30769 Probable ribonucleotide transport ATP-binding protein mkl4.8e-3236Show/hide
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN
        V IE + + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G + +     E  EI  L  G++FQ  ALF S+ + +N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN

Query:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++   E +I  +V E L  VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV
              A+ ++VTH  +  R   D +      +++R+  LV    REVL+
Subjt:  GKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV

P63358 Probable ribonucleotide transport ATP-binding protein mkl2.8e-3236.4Show/hide
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G + +     E  EI  L  G++FQ  ALF S+ + +N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN

Query:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++   E +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV
              A+ ++VTH  +  R   D +      +++R+  LV    REVL+
Subjt:  GKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV

P9WQL4 Probable ribonucleotide transport ATP-binding protein mkl2.8e-3236.4Show/hide
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G + +     E  EI  L  G++FQ  ALF S+ + +N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN

Query:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++   E +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV
              A+ ++VTH  +  R   D +      +++R+  LV    REVL+
Subjt:  GKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV

P9WQL5 Probable ribonucleotide transport ATP-binding protein mkl2.8e-3236.4Show/hide
Query:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN
        V IE   + KSFG   I   V+  I  GE   ++GPSGTGKS  LK + GLL P++G + I G + +     E  EI  L  G++FQ  ALF S+ + +N
Subjt:  VLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDE--EISGLRIGLVFQSAALFDSLTVREN

Query:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS
          F L E++   E +I  +V E LA VGL G E + P E+SGGM+KR  LAR+++ D       P+++L DEP +GLDP+ +  +  LI  ++ + +   
Subjt:  VGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDAS

Query:  GKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV
              A+ ++VTH  +  R   D +      +++R+  LV    REVL+
Subjt:  GKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV

Q9AT00 Protein TRIGALACTOSYLDIACYLGLYCEROL 3, chloroplastic4.3e-11379.55Show/hide
Query:  QRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPD
        +RK+ C CIAPPQ L +D +   +   S     +  +   E++SDVLIECRDVYKSFGEKHIL+GVSFKIRHGEAVGVIGPSGTGKSTILKI+AGLLAPD
Subjt:  QRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPD

Query:  KGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFD
        KGEVYIRG+ R GLI DEEISGLRIGLVFQSAALFDSL+VRENVGFLLYE S +SE+QIS LVT+ LAAVGLKGVE+RLPSELSGGMKKRVALARS+IFD
Subjt:  KGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFD

Query:  NTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTIRRAVDRL
         TK+ IEPEVLLYDEPTAGLDPIASTVVEDLIRSVH+  EDA GKPGKIASY+VVTHQHSTI+RAVDRL
Subjt:  NTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTIRRAVDRL

Arabidopsis top hitse value%identityAlignment
AT1G65410.1 non-intrinsic ABC protein 113.0e-11479.55Show/hide
Query:  QRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPD
        +RK+ C CIAPPQ L +D +   +   S     +  +   E++SDVLIECRDVYKSFGEKHIL+GVSFKIRHGEAVGVIGPSGTGKSTILKI+AGLLAPD
Subjt:  QRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPD

Query:  KGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFD
        KGEVYIRG+ R GLI DEEISGLRIGLVFQSAALFDSL+VRENVGFLLYE S +SE+QIS LVT+ LAAVGLKGVE+RLPSELSGGMKKRVALARS+IFD
Subjt:  KGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFD

Query:  NTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTIRRAVDRL
         TK+ IEPEVLLYDEPTAGLDPIASTVVEDLIRSVH+  EDA GKPGKIASY+VVTHQHSTI+RAVDRL
Subjt:  NTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTIRRAVDRL

AT1G67940.1 non-intrinsic ABC protein 33.7e-1934.45Show/hide
Query:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGL--RIGLVFQSAALFDSLTVRENVGF-LLYENSSLSEDQ
        IL+GV+  I  G  VGVIGPSG+GKST L+ +  L  P +  V++ G +    I + ++  L  R+G++FQ   LF   TV +NV +        LS+++
Subjt:  ILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGL--RIGLVFQSAALFDSLTVRENVGF-LLYENSSLSEDQ

Query:  ISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQ
        +  L+  +LA +     + +  +ELS G  +RVALAR++         EPEVLL DEPT+ LDPI++  +ED+I  +         K  +  + ++V+H 
Subjt:  ISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQ

Query:  HSTIRRAVD
           I++  D
Subjt:  HSTIRRAVD

AT2G13610.1 ABC-2 type transporter family protein4.8e-1935.2Show/hide
Query:  KHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQ-
        KH+L+GV+ + +  E + ++GPSG GKS++L+I+A  L P  G VY+  R  V   + ++IS    G V Q   LF  LTV E + F       L  D+ 
Subjt:  KHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQ-

Query:  ---ISALVTE-NLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIR
           + +LV E  L AV    V D     +SGG ++RV++   +I D       P+VL+ DEPT+GLD  ++ ++ D+++
Subjt:  ---ISALVTE-NLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIR

AT3G62150.1 P-glycoprotein 211.8e-1831.05Show/hide
Query:  YKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLR--IGLVFQSAALFDSLTVRENVGFLLYEN
        Y +  E+ I RG S  I  G  V ++G SG+GKST++ +I     P  GEV I G N    + + ++  +R  IGLV Q   LF S +++EN+ +   EN
Subjt:  YKSFGEKHILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLR--IGLVFQSAALFDSLTVRENVGFLLYEN

Query:  SSLSE-----DQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKP
        +++ E     +  +A    +    GL  +     ++LSGG K+R+A+AR+I+ D       P +LL DE T+ LD  +  +V++ +  + +         
Subjt:  SSLSE-----DQISALVTENLAAVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKP

Query:  GKIASYIVVTHQHSTIRRA
            + +VV H+ ST+R A
Subjt:  GKIASYIVVTHQHSTIRRA

AT5G46540.1 P-glycoprotein 72.4e-1827.24Show/hide
Query:  IECRDVYKSFGEK---HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLR--IGLVFQSAALFDSLTVRE
        IE RDVY  +  +    I  G S  + +G  V ++G SG+GKST++ +I     P+ GEV I G +    +   ++  +R  IGLV Q   LF + T+RE
Subjt:  IECRDVYKSFGEK---HILRGVSFKIRHGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLR--IGLVFQSAALFDSLTVRE

Query:  NVGFLLYENSSLSEDQI-SALVTENLA------AVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSV
        N+   +Y     S+ +I +AL   N +        GL+ +     ++LSGG K+R+A+AR+I+ +       P++LL DE T+ LD  +  +V+D +  +
Subjt:  NVGFLLYENSSLSEDQI-SALVTENLA------AVGLKGVEDRLPSELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSV

Query:  HIKGEDASGKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV-------DQISKLSKKTSKAHLEDRPSDILRLKTPHTSLDSQ
         +             + +VV H+ +TIR       T   + + +Q  ++ + T + ++        Q+ +L + + K    D+  +   +     S DSQ
Subjt:  HIKGEDASGKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREVLV-------DQISKLSKKTSKAHLEDRPSDILRLKTPHTSLDSQ

Query:  Q-MHLETARIPA
          +H  T   P+
Subjt:  Q-MHLETARIPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTATATTGGGTTCGGTGTTCCTGCCGTTAACCATACCCAATTGTTCCTCTCGATCGCGTAAAGTAGCTGTTTTTGATGTGTGCAATTCGTTTTGTTTTAAGAA
AAAGGAGCAGAGAAAGATTATTTGTAATTGCATTGCGCCTCCTCAGTACTTGAAGAGTGATGGGTCATCTGCCGTAAATTCCAATGACTCGTTTAGATCAGAAAGTTTAA
GCCCAGATAATAAGCATGAGGATGAGTCTGATGTTCTCATCGAGTGTAGAGATGTCTACAAATCCTTTGGGGAAAAGCATATATTGAGAGGTGTGAGCTTCAAGATTAGA
CACGGAGAAGCTGTGGGAGTAATTGGGCCTTCTGGTACTGGGAAGTCTACAATACTGAAGATCATTGCAGGTCTTCTTGCTCCAGACAAGGGAGAGGTATATATTCGAGG
TAGAAATCGAGTTGGTTTGATCGATGATGAGGAGATATCTGGTCTTCGAATTGGATTGGTTTTTCAGAGTGCAGCGCTTTTTGATTCGTTAACTGTTCGAGAAAATGTTG
GTTTCCTTTTGTATGAAAACTCAAGCTTGTCTGAAGATCAAATTTCAGCACTCGTTACTGAGAACTTAGCTGCTGTTGGGCTGAAGGGAGTTGAGGATCGGTTACCTTCT
GAGTTATCAGGTGGAATGAAGAAACGAGTTGCTTTAGCTCGATCTATAATTTTTGATAACACAAAGAAAGAAATTGAGCCAGAGGTACTCTTGTATGATGAACCAACTGC
TGGACTCGATCCAATTGCATCAACTGTAGTTGAAGATCTTATCCGTTCTGTACATATTAAGGGTGAGGATGCCAGCGGGAAGCCTGGGAAGATTGCATCTTATATTGTAG
TCACCCACCAACATAGTACCATTAGGAGAGCTGTTGACAGATTAGACACAGTACCTTTCCTGATTATTTGGAGGCAGCAGAGTTTAGTTACAGAGAAGACCAGAGAAGTT
CTTGTGGACCAGATCTCAAAGCTGAGCAAGAAAACTAGTAAAGCTCATTTGGAAGACCGGCCGAGCGACATCCTTCGCCTGAAGACTCCACACACAAGTCTGGATTCCCA
GCAAATGCACCTGGAAACCGCTCGAATCCCTGCTTCTCCGAAACCAAACCCGAGAAGCAGTTATACGACAGATCGAGAAGAGTTAAATCCTCGAGGCTGGAAATATTTCC
TGGGATCTCACCAGACAGATAATTGTGTGACAAATCTAAGGCTCTTACTCTCTGCATTTTCTCTAGGCCAGGAAGCTGACCTTCAAGTTTCCAATGAACTTGTTGCCAGC
GAGGGAGAGATATCTCAGGTTCGACCATTTTGTTATTGCATCGTTCAAATTTCCCGAGAGATTGTTGGAGCTAAAATCAACGATCTCCAACGACTTACATCCTGCCAGAG
TCAGTGGGACTTCACCAGAAATCATGTTGTTGCTAATGTCCAATATCTTCAGACTATCCAATGCATCAAGCTCAGGTTGAATTTCACCAGAAAGATTATTATGTTAAGTA
TCAAAGCAAGTCAAGCAGAACAAGCCCTGATTTCTCTATCGTCTCCACAATCTTGCTTGGAAGAGGGCCAGATAAATCATTGTTACTCAAGTCCAACACAAGTAGCTGCT
CTGAAAACAGGAGTCGAGGCGACATCTTGTCCACTGATTGGGAAGGGTTTAAGCAGAGAATCAACAAGAACAAAAGCAGAAGGGCATGCCAGTCGAGGGACTTACAGGAA
GAGAAAGATGAAACCCATTTCTGGTCCATCACTGAAGAACGAACCCAAAAGACCCTCCTACAGGAGACTCGAAGCAATTTCCCTAAAACCCCACAGTAACAGCAGACCAA
CTGTTCGAAATTACAGCAATGGAAATCCAATCCCCATCTGCCTAGATAGTAGAGAAAGCTCTAAAAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCTATATTGGGTTCGGTGTTCCTGCCGTTAACCATACCCAATTGTTCCTCTCGATCGCGTAAAGTAGCTGTTTTTGATGTGTGCAATTCGTTTTGTTTTAAGAA
AAAGGAGCAGAGAAAGATTATTTGTAATTGCATTGCGCCTCCTCAGTACTTGAAGAGTGATGGGTCATCTGCCGTAAATTCCAATGACTCGTTTAGATCAGAAAGTTTAA
GCCCAGATAATAAGCATGAGGATGAGTCTGATGTTCTCATCGAGTGTAGAGATGTCTACAAATCCTTTGGGGAAAAGCATATATTGAGAGGTGTGAGCTTCAAGATTAGA
CACGGAGAAGCTGTGGGAGTAATTGGGCCTTCTGGTACTGGGAAGTCTACAATACTGAAGATCATTGCAGGTCTTCTTGCTCCAGACAAGGGAGAGGTATATATTCGAGG
TAGAAATCGAGTTGGTTTGATCGATGATGAGGAGATATCTGGTCTTCGAATTGGATTGGTTTTTCAGAGTGCAGCGCTTTTTGATTCGTTAACTGTTCGAGAAAATGTTG
GTTTCCTTTTGTATGAAAACTCAAGCTTGTCTGAAGATCAAATTTCAGCACTCGTTACTGAGAACTTAGCTGCTGTTGGGCTGAAGGGAGTTGAGGATCGGTTACCTTCT
GAGTTATCAGGTGGAATGAAGAAACGAGTTGCTTTAGCTCGATCTATAATTTTTGATAACACAAAGAAAGAAATTGAGCCAGAGGTACTCTTGTATGATGAACCAACTGC
TGGACTCGATCCAATTGCATCAACTGTAGTTGAAGATCTTATCCGTTCTGTACATATTAAGGGTGAGGATGCCAGCGGGAAGCCTGGGAAGATTGCATCTTATATTGTAG
TCACCCACCAACATAGTACCATTAGGAGAGCTGTTGACAGATTAGACACAGTACCTTTCCTGATTATTTGGAGGCAGCAGAGTTTAGTTACAGAGAAGACCAGAGAAGTT
CTTGTGGACCAGATCTCAAAGCTGAGCAAGAAAACTAGTAAAGCTCATTTGGAAGACCGGCCGAGCGACATCCTTCGCCTGAAGACTCCACACACAAGTCTGGATTCCCA
GCAAATGCACCTGGAAACCGCTCGAATCCCTGCTTCTCCGAAACCAAACCCGAGAAGCAGTTATACGACAGATCGAGAAGAGTTAAATCCTCGAGGCTGGAAATATTTCC
TGGGATCTCACCAGACAGATAATTGTGTGACAAATCTAAGGCTCTTACTCTCTGCATTTTCTCTAGGCCAGGAAGCTGACCTTCAAGTTTCCAATGAACTTGTTGCCAGC
GAGGGAGAGATATCTCAGGTTCGACCATTTTGTTATTGCATCGTTCAAATTTCCCGAGAGATTGTTGGAGCTAAAATCAACGATCTCCAACGACTTACATCCTGCCAGAG
TCAGTGGGACTTCACCAGAAATCATGTTGTTGCTAATGTCCAATATCTTCAGACTATCCAATGCATCAAGCTCAGGTTGAATTTCACCAGAAAGATTATTATGTTAAGTA
TCAAAGCAAGTCAAGCAGAACAAGCCCTGATTTCTCTATCGTCTCCACAATCTTGCTTGGAAGAGGGCCAGATAAATCATTGTTACTCAAGTCCAACACAAGTAGCTGCT
CTGAAAACAGGAGTCGAGGCGACATCTTGTCCACTGATTGGGAAGGGTTTAAGCAGAGAATCAACAAGAACAAAAGCAGAAGGGCATGCCAGTCGAGGGACTTACAGGAA
GAGAAAGATGAAACCCATTTCTGGTCCATCACTGAAGAACGAACCCAAAAGACCCTCCTACAGGAGACTCGAAGCAATTTCCCTAAAACCCCACAGTAACAGCAGACCAA
CTGTTCGAAATTACAGCAATGGAAATCCAATCCCCATCTGCCTAGATAGTAGAGAAAGCTCTAAAAAGTGA
Protein sequenceShow/hide protein sequence
MVSILGSVFLPLTIPNCSSRSRKVAVFDVCNSFCFKKKEQRKIICNCIAPPQYLKSDGSSAVNSNDSFRSESLSPDNKHEDESDVLIECRDVYKSFGEKHILRGVSFKIR
HGEAVGVIGPSGTGKSTILKIIAGLLAPDKGEVYIRGRNRVGLIDDEEISGLRIGLVFQSAALFDSLTVRENVGFLLYENSSLSEDQISALVTENLAAVGLKGVEDRLPS
ELSGGMKKRVALARSIIFDNTKKEIEPEVLLYDEPTAGLDPIASTVVEDLIRSVHIKGEDASGKPGKIASYIVVTHQHSTIRRAVDRLDTVPFLIIWRQQSLVTEKTREV
LVDQISKLSKKTSKAHLEDRPSDILRLKTPHTSLDSQQMHLETARIPASPKPNPRSSYTTDREELNPRGWKYFLGSHQTDNCVTNLRLLLSAFSLGQEADLQVSNELVAS
EGEISQVRPFCYCIVQISREIVGAKINDLQRLTSCQSQWDFTRNHVVANVQYLQTIQCIKLRLNFTRKIIMLSIKASQAEQALISLSSPQSCLEEGQINHCYSSPTQVAA
LKTGVEATSCPLIGKGLSRESTRTKAEGHASRGTYRKRKMKPISGPSLKNEPKRPSYRRLEAISLKPHSNSRPTVRNYSNGNPIPICLDSRESSKK