; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021338 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021338
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic-like isoform X2
Genome locationtig00153654:1208996..1212260
RNA-Seq ExpressionSgr021338
SyntenySgr021338
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045770.1 uncharacterized protein E6C27_scaffold243G002910 [Cucumis melo var. makuwa]6.4e-18292.75Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTR-GGSIISPGAKQGIIPLAIPLAKNSSGTITALLR
        MKIGGGVVCGSPRAAALPSLLL R GVT+RCS+SSST+DHVSFIKD+AAT+PPQHLFHLLKMLKTR G SIISPGAKQGIIPL +PLAKNS+GTITALLR
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTR-GGSIISPGAKQGIIPLAIPLAKNSSGTITALLR

Query:  WPTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEE
        WPTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAA AGQKLY +GDF+ESQ+TN+D YLLKKVGLFPDVIERKILRHFEE
Subjt:  WPTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEE

Query:  GDLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQV
        GDLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQV
Subjt:  GDLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQV

Query:  ALDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        ALDQAAFLLDLASVDGTWDN +ERIAQCYEEAGLHEIA F+LYRD
Subjt:  ALDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

XP_004149691.1 protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic isoform X1 [Cucumis sativus]1.3e-18293.02Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAALPSLLL R GVT+RCS+SSSTSDHVSFIKD+AAT+PPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAA AGQKLY +GDF+ESQ+TN+D YLLKKVGLFPD+IERKILRHFEEG
Subjt:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        LDQAAFLLDLASVDGTWDN +ERIAQCYEEAGL EIA F+LYRD
Subjt:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

XP_008457752.1 PREDICTED: uncharacterized protein LOC103497369 [Cucumis melo]2.6e-18393.02Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAALPSLLL R GVT+RCS+SSST+DHVSFIKD+AAT+PPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNS+GTITALLRW
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAA AGQKLY +GDF+ESQ+TN+D YLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        LDQAAFLLDLASVDGTWDN +ERIAQCYEEAGLHEIA F+LYRD
Subjt:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

XP_023533903.1 uncharacterized protein LOC111795609 [Cucurbita pepo subsp. pepo]1.9e-18193.02Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAA LPSLLLGR GVTIRCSSSSSTSDHVSFIKD+AAT+PPQHL +LLKMLKTRG SIISPGAKQGIIPLAIPLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA+GSGEQNDELFLAAA AGQKLYE+GDFAESQ+ N+D YLLKKVG+FPD+IERKILRHFEEG
Subjt:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEY KEKVTEEGK EDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        LDQAAFLLDLASVDGTWD S+ERIAQCYEEAGL EIARF+LYRD
Subjt:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

XP_038900520.1 protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic-like [Benincasa hispida]1.2e-18393.31Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAALPSLLL R GVT+RCS+SSSTSDHVSF+KDIAAT+PPQHLFHLLKMLKTRG SIISPGAKQGIIPL IPLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSG+QNDELFLAAA AGQKLY +GDF+ES++TN+D YLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGR TEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEK+TEEGKQEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        LDQAAFLLDLASVDGTWDNS++RIAQCYEEAGLHEIARFILYRD
Subjt:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

TrEMBL top hitse value%identityAlignment
A0A0A0LJE6 Uncharacterized protein6.3e-18393.02Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAALPSLLL R GVT+RCS+SSSTSDHVSFIKD+AAT+PPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAA AGQKLY +GDF+ESQ+TN+D YLLKKVGLFPD+IERKILRHFEEG
Subjt:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        LDQAAFLLDLASVDGTWDN +ERIAQCYEEAGL EIA F+LYRD
Subjt:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

A0A1S3C5T9 uncharacterized protein LOC1034973691.3e-18393.02Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAALPSLLL R GVT+RCS+SSST+DHVSFIKD+AAT+PPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNS+GTITALLRW
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAA AGQKLY +GDF+ESQ+TN+D YLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        LDQAAFLLDLASVDGTWDN +ERIAQCYEEAGLHEIA F+LYRD
Subjt:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

A0A5A7TRJ7 Uncharacterized protein3.1e-18292.75Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTR-GGSIISPGAKQGIIPLAIPLAKNSSGTITALLR
        MKIGGGVVCGSPRAAALPSLLL R GVT+RCS+SSST+DHVSFIKD+AAT+PPQHLFHLLKMLKTR G SIISPGAKQGIIPL +PLAKNS+GTITALLR
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTR-GGSIISPGAKQGIIPLAIPLAKNSSGTITALLR

Query:  WPTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEE
        WPTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAA AGQKLY +GDF+ESQ+TN+D YLLKKVGLFPDVIERKILRHFEE
Subjt:  WPTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEE

Query:  GDLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQV
        GDLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQV
Subjt:  GDLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQV

Query:  ALDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        ALDQAAFLLDLASVDGTWDN +ERIAQCYEEAGLHEIA F+LYRD
Subjt:  ALDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

A0A5D3BJN5 Uncharacterized protein1.3e-18393.02Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAALPSLLL R GVT+RCS+SSST+DHVSFIKD+AAT+PPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNS+GTITALLRW
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGM+MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAA AGQKLY +GDF+ESQ+TN+D YLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        LDQAAFLLDLASVDGTWDN +ERIAQCYEEAGLHEIA F+LYRD
Subjt:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

A0A6J1D7C4 uncharacterized protein LOC111017955 isoform X24.5e-18193.31Show/hide
Query:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAA LPSLLL R G TIR SSSSSTSDHVSFI DIAAT+PPQHL  LLKMLKTRGGSIISPGAKQGIIPLA+PLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMDMPVVDVNRNGVWLLAKNVDQFI+RLLVEEDARGSGEQ+DELFLAAA AGQKLYE+G FAES+VTNVD+YLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGR+TEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGK+EDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD
        LDQAAFLLDLASVDGTWDNS+ERIAQCYEEAGLHEIA+F+LYRD
Subjt:  LDQAAFLLDLASVDGTWDNSMERIAQCYEEAGLHEIARFILYRD

SwissProt top hitse value%identityAlignment
Q94JY0 Protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic6.6e-12970.48Show/hide
Query:  RCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQF
        R   S  +S HVSFIKD+AAT+PP HL HLLK+L+TRG +IISPGAKQG+IPLAIPL+KNSSG++TALLRWPTAP GMDMPVV+V R+GV L+A+NVD++
Subjt:  RCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQF

Query:  IHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNA
        IHR+LVEEDA    ++  EL+ A+  AG+KLYEKG FAES++ N+D Y+LKKVGLFPD++ERK+LRHF+EGD VSA+VTGEFYTKK+ FPGF RPFV+ A
Subjt:  IHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNA

Query:  EVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYE
         +L KVGR  EAKDAAR AL+SPWWTLGC YEEVA+IAQWEDEQIE+ +EKV++EG+ EDL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CYE
Subjt:  EVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYE

Query:  EAGLHEIARFILYRD
        EAGLH I+ F+LY D
Subjt:  EAGLHEIARFILYRD

Arabidopsis top hitse value%identityAlignment
AT2G23370.1 unknown protein6.1e-12266.03Show/hide
Query:  SSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHR
        SSSS S+H  FIKDIA  +PP+HL  LL +   RG SI+SPGAKQG++PL IPL K S G+  ALLRWPTAP+ M+MPVV+V ++GVW LA NVDQFIHR
Subjt:  SSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHR

Query:  LLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNAEVL
        +LVEED     E + E+F AA  AG+KLY KGDFA S++ ++DAYLL+KVGLFPD +ERK++RH E GD VSALV  EFYTK+ +FPGFARPF FNA+VL
Subjt:  LLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNAEVL

Query:  LKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYEEAG
        LK+GR  EAKDAARGALKS WWTLGC+YEE+A IA+W +EQI  +KE+VT EGKQ D+ +GK  AQ +LD+AAFLL+LAS++GTWD S+ER+AQCY+EAG
Subjt:  LKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYEEAG

Query:  LHEIARFILYRD
        L++IA+F+LYRD
Subjt:  LHEIARFILYRD

AT4G34090.1 unknown protein4.7e-13070.48Show/hide
Query:  RCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQF
        R   S  +S HVSFIKD+AAT+PP HL HLLK+L+TRG +IISPGAKQG+IPLAIPL+KNSSG++TALLRWPTAP GMDMPVV+V R+GV L+A+NVD++
Subjt:  RCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQF

Query:  IHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNA
        IHR+LVEEDA    ++  EL+ A+  AG+KLYEKG FAES++ N+D Y+LKKVGLFPD++ERK+LRHF+EGD VSA+VTGEFYTKK+ FPGF RPFV+ A
Subjt:  IHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNA

Query:  EVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYE
         +L KVGR  EAKDAAR AL+SPWWTLGC YEEVA+IAQWEDEQIE+ +EKV++EG+ EDL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CYE
Subjt:  EVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYE

Query:  EAGLHEIARFILYRD
        EAGLH I+ F+LY D
Subjt:  EAGLHEIARFILYRD

AT4G34090.2 unknown protein1.2e-12870.25Show/hide
Query:  RCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSS-GTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQ
        R   S  +S HVSFIKD+AAT+PP HL HLLK+L+TRG +IISPGAKQG+IPLAIPL+KNSS G++TALLRWPTAP GMDMPVV+V R+GV L+A+NVD+
Subjt:  RCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSS-GTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQ

Query:  FIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFN
        +IHR+LVEEDA    ++  EL+ A+  AG+KLYEKG FAES++ N+D Y+LKKVGLFPD++ERK+LRHF+EGD VSA+VTGEFYTKK+ FPGF RPFV+ 
Subjt:  FIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFN

Query:  AEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCY
        A +L KVGR  EAKDAAR AL+SPWWTLGC YEEVA+IAQWEDEQIE+ +EKV++EG+ EDL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CY
Subjt:  AEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCY

Query:  EEAGLHEIARFILYRD
        EEAGLH I+ F+LY D
Subjt:  EEAGLHEIARFILYRD

AT4G34090.3 unknown protein2.2e-12770.42Show/hide
Query:  HVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
        HVSFIKD+AAT+PP HL HLLK+L+TRG +IISPGAKQG+IPLAIPL+KNSSG++TALLRWPTAP GMDMPVV+V R+GV L+A+NVD++IHR+LVEEDA
Subjt:  HVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLK-----
            ++  EL+ A+  AG+KLYEKG FAES++ N+D Y+LKKVGLFPD++ERK+LRHF+EGD VSA+VTGEFYTKK+ FPGF RPFV+ A +L K     
Subjt:  RGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLK-----

Query:  -VGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYEEAGL
         VGR  EAKDAAR AL+SPWWTLGC YEEVA+IAQWEDEQIE+ +EKV++EG+ EDL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CYEEAGL
Subjt:  -VGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYEEAGL

Query:  HEIARFILYRD
        H I+ F+LY D
Subjt:  HEIARFILYRD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATTGGTGGTGGAGTGGTATGCGGAAGTCCACGCGCCGCCGCTCTGCCCTCACTGCTTCTCGGACGCGGTGGAGTTACCATTCGCTGCTCTTCATCTTCCTCTAC
TTCCGACCATGTATCGTTCATTAAGGATATTGCGGCAACTAAGCCTCCTCAGCATTTGTTTCATTTGCTGAAAATGCTGAAGACAAGAGGTGGATCCATTATTTCTCCTG
GAGCCAAGCAAGGGATAATTCCTCTTGCCATTCCACTAGCGAAAAACAGCTCAGGTACTATAACTGCACTGCTGCGCTGGCCTACTGCACCCGCTGGGATGGATATGCCA
GTAGTGGACGTCAATAGAAATGGAGTGTGGCTTCTAGCCAAGAACGTGGATCAATTTATTCATAGACTTCTAGTTGAAGAAGATGCCAGAGGAAGTGGAGAACAAAATGA
TGAGCTATTTCTTGCAGCAGCTGGTGCTGGGCAGAAACTTTATGAAAAGGGTGATTTTGCTGAATCTCAGGTCACGAATGTAGATGCGTATCTGCTTAAAAAGGTTGGGT
TGTTTCCAGATGTCATAGAACGTAAAATATTGCGCCATTTTGAGGAAGGCGACCTTGTTTCAGCTTTGGTGACTGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGA
TTTGCACGGCCATTCGTATTCAATGCAGAGGTTTTGCTCAAGGTGGGACGAAAAACAGAGGCTAAGGATGCTGCGAGGGGAGCCTTAAAATCACCATGGTGGACCCTAGG
CTGTAAATATGAGGAAGTTGCTAATATTGCGCAATGGGAAGATGAGCAAATTGAGTATTTCAAAGAGAAGGTCACAGAAGAAGGAAAGCAAGAGGATCTTAAGAAGGGAA
AGGCTCCTGCCCAGGTTGCCTTGGACCAAGCAGCCTTTTTGTTGGATTTAGCTTCTGTTGATGGAACTTGGGACAACTCTATGGAGCGCATTGCTCAATGTTATGAAGAG
GCAGGCCTTCATGAGATTGCGAGATTCATACTTTACAGAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATTGGTGGTGGAGTGGTATGCGGAAGTCCACGCGCCGCCGCTCTGCCCTCACTGCTTCTCGGACGCGGTGGAGTTACCATTCGCTGCTCTTCATCTTCCTCTAC
TTCCGACCATGTATCGTTCATTAAGGATATTGCGGCAACTAAGCCTCCTCAGCATTTGTTTCATTTGCTGAAAATGCTGAAGACAAGAGGTGGATCCATTATTTCTCCTG
GAGCCAAGCAAGGGATAATTCCTCTTGCCATTCCACTAGCGAAAAACAGCTCAGGTACTATAACTGCACTGCTGCGCTGGCCTACTGCACCCGCTGGGATGGATATGCCA
GTAGTGGACGTCAATAGAAATGGAGTGTGGCTTCTAGCCAAGAACGTGGATCAATTTATTCATAGACTTCTAGTTGAAGAAGATGCCAGAGGAAGTGGAGAACAAAATGA
TGAGCTATTTCTTGCAGCAGCTGGTGCTGGGCAGAAACTTTATGAAAAGGGTGATTTTGCTGAATCTCAGGTCACGAATGTAGATGCGTATCTGCTTAAAAAGGTTGGGT
TGTTTCCAGATGTCATAGAACGTAAAATATTGCGCCATTTTGAGGAAGGCGACCTTGTTTCAGCTTTGGTGACTGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGA
TTTGCACGGCCATTCGTATTCAATGCAGAGGTTTTGCTCAAGGTGGGACGAAAAACAGAGGCTAAGGATGCTGCGAGGGGAGCCTTAAAATCACCATGGTGGACCCTAGG
CTGTAAATATGAGGAAGTTGCTAATATTGCGCAATGGGAAGATGAGCAAATTGAGTATTTCAAAGAGAAGGTCACAGAAGAAGGAAAGCAAGAGGATCTTAAGAAGGGAA
AGGCTCCTGCCCAGGTTGCCTTGGACCAAGCAGCCTTTTTGTTGGATTTAGCTTCTGTTGATGGAACTTGGGACAACTCTATGGAGCGCATTGCTCAATGTTATGAAGAG
GCAGGCCTTCATGAGATTGCGAGATTCATACTTTACAGAGACTGA
Protein sequenceShow/hide protein sequence
MKIGGGVVCGSPRAAALPSLLLGRGGVTIRCSSSSSTSDHVSFIKDIAATKPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMDMP
VVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAAGAGQKLYEKGDFAESQVTNVDAYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPG
FARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQEDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSMERIAQCYEE
AGLHEIARFILYRD