; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006572 (gene) of Snake gourd v1 genome

Gene IDTan0006572
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic-like isoform X2
Genome locationLG05:41893752..41898652
RNA-Seq ExpressionTan0006572
SyntenyTan0006572
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015967.1 hypothetical protein SDJN02_21071 [Cucurbita argyrosperma subsp. argyrosperma]4.8e-18595.64Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAVLPSLLL RRGVT+RCSSSSSTSDHVSFIKD+AATEPPQHL +LLKMLKTRG SIISPGAKQGIIPLAIPLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGE  DELFLAAADAGQKLY RGDFAESQIKNIDGYLLKKVG+FPD+IERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWD SVERIAQCYEEAGLQEIARFVLYRD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

XP_004149691.1 protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic isoform X1 [Cucumis sativus]1.4e-18494.19Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAA LPSLLLRRRGVTVRCS+SSSTSDHVSFIKD+AATEPPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA+GSGEQ DELFLAAADAGQKLYGRGDF+ESQI N+DGYLLKKVGLFPD+IERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEY KEKVTEEGK EDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWDN VERIAQCYEEAGL EIA FVLYRD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

XP_008457752.1 PREDICTED: uncharacterized protein LOC103497369 [Cucumis melo]3.1e-18493.9Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAA LPSLLLRRRGVTVRCS+SSST+DHVSFIKD+AATEPPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNS+GTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA+GSGEQ DELFLAAADAGQKLYGRGDF+ESQI N+DGYLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEY KEKVTEEGK EDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWDN VERIAQCYEEAGL EIA FVLYRD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

XP_023533903.1 uncharacterized protein LOC111795609 [Cucurbita pepo subsp. pepo]7.4e-18695.93Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAVLPSLLL RRGVT+RCSSSSSTSDHVSFIKD+AATEPPQHL +LLKMLKTRG SIISPGAKQGIIPLAIPLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQ DELFLAAADAGQKLY RGDFAESQIKNIDGYLLKKVG+FPD+IERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWD SVERIAQCYEEAGLQEIARFVLYRD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

XP_038900520.1 protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic-like [Benincasa hispida]2.4e-18493.6Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAA LPSLLLRRRGVTVRCS+SSSTSDHVSF+KDIAATEPPQHLFHLLKMLKTRG SIISPGAKQGIIPL IPLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA+GSG+Q DELFLAAADAGQKLYGRGDF+ES+I N+DGYLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGR TEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEY KEK+TEEGK EDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWDNSV+RIAQCYEEAGL EIARF+LYRD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

TrEMBL top hitse value%identityAlignment
A0A0A0LJE6 Uncharacterized protein6.7e-18594.19Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAA LPSLLLRRRGVTVRCS+SSSTSDHVSFIKD+AATEPPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA+GSGEQ DELFLAAADAGQKLYGRGDF+ESQI N+DGYLLKKVGLFPD+IERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEY KEKVTEEGK EDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWDN VERIAQCYEEAGL EIA FVLYRD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

A0A1S3C5T9 uncharacterized protein LOC1034973691.5e-18493.9Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAA LPSLLLRRRGVTVRCS+SSST+DHVSFIKD+AATEPPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNS+GTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA+GSGEQ DELFLAAADAGQKLYGRGDF+ESQI N+DGYLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEY KEKVTEEGK EDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWDN VERIAQCYEEAGL EIA FVLYRD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

A0A5D3BJN5 Uncharacterized protein1.5e-18493.9Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAA LPSLLLRRRGVTVRCS+SSST+DHVSFIKD+AATEPPQHLFHLLKMLKTRG SIISPGAKQGIIPL +PLAKNS+GTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA+GSGEQ DELFLAAADAGQKLYGRGDF+ESQI N+DGYLLKKVGLFPDVIERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEY KEKVTEEGK EDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWDN VERIAQCYEEAGL EIA FVLYRD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

A0A6J1G585 uncharacterized protein LOC111451004 isoform X22.0e-18494.77Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAVLPSLLL RRGVT+RCSSSSST DHVSFIKD+AATEPPQHL +LLKMLKTRG SIISPGAKQGIIPLAIPLAKNSSGTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQ DELFLAAADAGQKLY RGD AESQIKNIDGYLLKKVG+FPD+IERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LDQAAFLLDLASVD TWD SVERIAQCYEEAGLQE+ARFVL+RD
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

A0A6J1L0D2 uncharacterized protein LOC1114998921.7e-18394.75Show/hide
Query:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW
        MKIGGGVVCGSPRAAVLPSLLL RRGVT+RC+SSSSTS+HVSFIKD+AATEPPQHL +LLKMLKTRG SIISPGAKQGIIPLAIPLAKN+SGTITALLRW
Subjt:  MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRW

Query:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG
        PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQ DELFLAAADAG+KLY RGDFAESQIKNIDGYLLKKVG+FPD+IERKILRHFEEG
Subjt:  PTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEG

Query:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
        DLVSALVTGEFYTKKEHFPGFARP+VFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVAN+AQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA
Subjt:  DLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVA

Query:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYR
        LDQAAFLLDLASVD TWD SVERIAQCYEEAGLQEIARFVLYR
Subjt:  LDQAAFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYR

SwissProt top hitse value%identityAlignment
Q94JY0 Protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic9.5e-12867.46Show/hide
Query:  GSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMEM
        GS    + PS  L  R    R S  S  S HVSFIKD+AATEPP HL HLLK+L+TRG +IISPGAKQG+IPLAIPL+KNSSG++TALLRWPTAP GM+M
Subjt:  GSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMEM

Query:  PVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVTG
        PVV+V R+GV L+A+NVD++IHR+LVEEDA    ++  EL+ A+ +AG+KLY +G FAES+I N+D Y+LKKVGLFPD++ERK+LRHF+EGD VSA+VTG
Subjt:  PVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVTG

Query:  EFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLD
        EFYTKK+ FPGF RPFV+ A +L KVGR  EAKDAAR AL+SPWWTLGC YEEVA++AQWEDEQIE+++EKV++EG+ EDL KGKAP QVALD AAFLLD
Subjt:  EFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLD

Query:  LASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LAS++ TW  S+  IA+CYEEAGL  I+ FVLY D
Subjt:  LASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

Arabidopsis top hitse value%identityAlignment
AT2G23370.1 unknown protein3.4e-12061.76Show/hide
Query:  GGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAP
        G  V G  R  +   LL   R       SSSS S+H  FIKDIA  +PP+HL  LL +   RG SI+SPGAKQG++PL IPL K S G+  ALLRWPTAP
Subjt:  GGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAP

Query:  AGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVS
        + MEMPVV+V ++GVW LA NVDQFIHR+LVEED     E   E+F AA +AG+KLY +GDFA S++ ++D YLL+KVGLFPD +ERK++RH E GD VS
Subjt:  AGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVS

Query:  ALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQA
        ALV  EFYTK+ +FPGFARPF FNA+VLLK+GR  EAKDAARGALKS WWTLGC+YEE+A +A+W +EQI   KE+VT EGK  D+ +GK  AQ +LD+A
Subjt:  ALVTGEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQA

Query:  AFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        AFLL+LAS++ TWD S+ER+AQCY+EAGL +IA+FVLYRD
Subjt:  AFLLDLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

AT4G34090.1 unknown protein6.7e-12967.46Show/hide
Query:  GSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMEM
        GS    + PS  L  R    R S  S  S HVSFIKD+AATEPP HL HLLK+L+TRG +IISPGAKQG+IPLAIPL+KNSSG++TALLRWPTAP GM+M
Subjt:  GSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMEM

Query:  PVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVTG
        PVV+V R+GV L+A+NVD++IHR+LVEEDA    ++  EL+ A+ +AG+KLY +G FAES+I N+D Y+LKKVGLFPD++ERK+LRHF+EGD VSA+VTG
Subjt:  PVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVTG

Query:  EFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLD
        EFYTKK+ FPGF RPFV+ A +L KVGR  EAKDAAR AL+SPWWTLGC YEEVA++AQWEDEQIE+++EKV++EG+ EDL KGKAP QVALD AAFLLD
Subjt:  EFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLD

Query:  LASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        LAS++ TW  S+  IA+CYEEAGL  I+ FVLY D
Subjt:  LASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

AT4G34090.2 unknown protein1.7e-12767.26Show/hide
Query:  GSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSS-GTITALLRWPTAPAGME
        GS    + PS  L  R    R S  S  S HVSFIKD+AATEPP HL HLLK+L+TRG +IISPGAKQG+IPLAIPL+KNSS G++TALLRWPTAP GM+
Subjt:  GSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSS-GTITALLRWPTAPAGME

Query:  MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVT
        MPVV+V R+GV L+A+NVD++IHR+LVEEDA    ++  EL+ A+ +AG+KLY +G FAES+I N+D Y+LKKVGLFPD++ERK+LRHF+EGD VSA+VT
Subjt:  MPVVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVT

Query:  GEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLL
        GEFYTKK+ FPGF RPFV+ A +L KVGR  EAKDAAR AL+SPWWTLGC YEEVA++AQWEDEQIE+++EKV++EG+ EDL KGKAP QVALD AAFLL
Subjt:  GEFYTKKEHFPGFARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLL

Query:  DLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD
        DLAS++ TW  S+  IA+CYEEAGL  I+ FVLY D
Subjt:  DLASVDATWDNSVERIAQCYEEAGLQEIARFVLYRD

AT4G34090.3 unknown protein1.6e-12569.45Show/hide
Query:  HVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
        HVSFIKD+AATEPP HL HLLK+L+TRG +IISPGAKQG+IPLAIPL+KNSSG++TALLRWPTAP GM+MPVV+V R+GV L+A+NVD++IHR+LVEEDA
Subjt:  HVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  KGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLK-----
            ++  EL+ A+ +AG+KLY +G FAES+I N+D Y+LKKVGLFPD++ERK+LRHF+EGD VSA+VTGEFYTKK+ FPGF RPFV+ A +L K     
Subjt:  KGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPGFARPFVFNAEVLLK-----

Query:  -VGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLDLASVDATWDNSVERIAQCYEEAGL
         VGR  EAKDAAR AL+SPWWTLGC YEEVA++AQWEDEQIE+++EKV++EG+ EDL KGKAP QVALD AAFLLDLAS++ TW  S+  IA+CYEEAGL
Subjt:  -VGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLDLASVDATWDNSVERIAQCYEEAGL

Query:  QEIARFVLYRD
          I+ FVLY D
Subjt:  QEIARFVLYRD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATTGGTGGTGGAGTGGTCTGTGGAAGTCCACGCGCCGCCGTTCTGCCCTCACTGCTTCTCCGCCGCCGTGGAGTCACTGTTCGCTGCTCTTCATCTTCCTCTAC
TTCCGACCATGTATCGTTCATTAAGGATATTGCGGCAACGGAGCCTCCTCAGCATTTGTTTCATTTGCTGAAAATGCTGAAGACTAGAGGTGGATCCATAATTTCTCCTG
GAGCCAAGCAAGGCATTATTCCTCTTGCCATTCCACTGGCAAAAAACAGCTCGGGTACTATAACTGCACTGCTGCGCTGGCCTACCGCACCCGCTGGGATGGAGATGCCA
GTAGTGGACGTCAATAGGAATGGAGTGTGGCTACTAGCCAAGAACGTGGATCAATTTATTCACAGGCTTCTAGTGGAAGAAGATGCCAAAGGAAGTGGAGAGCAAAGGGA
TGAGCTATTTCTTGCAGCAGCTGATGCTGGGCAGAAACTTTATGGAAGGGGTGATTTTGCTGAATCTCAGATCAAAAACATAGATGGGTATTTGCTGAAAAAGGTTGGGT
TATTTCCAGATGTCATAGAACGTAAAATATTGCGCCATTTTGAGGAAGGCGACCTTGTTTCAGCTTTAGTGACTGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGA
TTTGCACGGCCATTTGTATTCAACGCAGAGGTTTTGCTGAAGGTGGGACGAAAAACAGAAGCAAAGGATGCCGCGAGGGGAGCCTTAAAATCACCATGGTGGACCCTAGG
CTGTAAGTATGAGGAAGTTGCTAATGTCGCACAATGGGAGGATGAGCAAATTGAGTATTTAAAAGAGAAGGTCACAGAAGAAGGAAAGCTAGAAGACCTTAAAAAGGGAA
AGGCTCCTGCCCAGGTTGCCTTGGACCAAGCTGCCTTTTTGTTGGATTTAGCGTCTGTTGATGCAACTTGGGACAACTCTGTGGAGCGCATTGCTCAATGTTACGAAGAG
GCAGGCCTTCAGGAGATTGCCAGATTCGTACTTTACAGAGACTGA
mRNA sequenceShow/hide mRNA sequence
GTATCATCGGTGAGAGAGAATAAAGAAAATTAGGAAAAGAAAAGCAGGAACTTGGGGATGTAGTAAGAAGTAACGGTAGTCCCTCAATTTTTTATTTCATATTTTGTTCA
CAGAAAAAGGAAGAGATAAAATGCGTTATTGATGAGGCCCGTGTAGCTAACGCCTGCGTTTAGCAATCGTATTGAAACGTTAGAAAACGAAGATGAAAATTGGTGGTGGA
GTGGTCTGTGGAAGTCCACGCGCCGCCGTTCTGCCCTCACTGCTTCTCCGCCGCCGTGGAGTCACTGTTCGCTGCTCTTCATCTTCCTCTACTTCCGACCATGTATCGTT
CATTAAGGATATTGCGGCAACGGAGCCTCCTCAGCATTTGTTTCATTTGCTGAAAATGCTGAAGACTAGAGGTGGATCCATAATTTCTCCTGGAGCCAAGCAAGGCATTA
TTCCTCTTGCCATTCCACTGGCAAAAAACAGCTCGGGTACTATAACTGCACTGCTGCGCTGGCCTACCGCACCCGCTGGGATGGAGATGCCAGTAGTGGACGTCAATAGG
AATGGAGTGTGGCTACTAGCCAAGAACGTGGATCAATTTATTCACAGGCTTCTAGTGGAAGAAGATGCCAAAGGAAGTGGAGAGCAAAGGGATGAGCTATTTCTTGCAGC
AGCTGATGCTGGGCAGAAACTTTATGGAAGGGGTGATTTTGCTGAATCTCAGATCAAAAACATAGATGGGTATTTGCTGAAAAAGGTTGGGTTATTTCCAGATGTCATAG
AACGTAAAATATTGCGCCATTTTGAGGAAGGCGACCTTGTTTCAGCTTTAGTGACTGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGATTTGCACGGCCATTTGTA
TTCAACGCAGAGGTTTTGCTGAAGGTGGGACGAAAAACAGAAGCAAAGGATGCCGCGAGGGGAGCCTTAAAATCACCATGGTGGACCCTAGGCTGTAAGTATGAGGAAGT
TGCTAATGTCGCACAATGGGAGGATGAGCAAATTGAGTATTTAAAAGAGAAGGTCACAGAAGAAGGAAAGCTAGAAGACCTTAAAAAGGGAAAGGCTCCTGCCCAGGTTG
CCTTGGACCAAGCTGCCTTTTTGTTGGATTTAGCGTCTGTTGATGCAACTTGGGACAACTCTGTGGAGCGCATTGCTCAATGTTACGAAGAGGCAGGCCTTCAGGAGATT
GCCAGATTCGTACTTTACAGAGACTGAATACATAAACAGAGGCATTTTTCTTCTCTACACTCTCCTCTTCATTTTCATTGTCTTGTTCCTGTTTCCCTTCTTCTTTTTGA
TGTGTAATTATTTCTGTATTTCTTCATTATCTAAACATCATTCTTTTTTCTCTCCCCCCTGGATGAGATGATCCTTTTACTACAAGAATACTCAAAACTTCCTCTTGATA
AATCATAATGGTTCTTTCCGAGCAAGTGGTTAATATAGGAGAAATATGTTCACTCTTAGATTAATAATTTCAAGGTTAAAATACTATTTTGATCCCTATACTTTGGGTCT
CATTCTATTTTAGCCCCTACTTTCGATGAAGCTTCAAACTGGTCCCTATTAGTAGTTTATATGGTTAACTTTTCTTTAAAAAAAAATTAGTCTCTATTAGTCTTCAATTT
ACGTCTTTTCTGGAATGAAAACTATTATTATTATACTTCTGGGTGTATATTTCTTACACTAGTCAAAGTCATGTCTAGTATTACTCTCTTATTGGTATAAGCCATGGTTG
GTTGAATGGAGATCTAACATAACCTTCTCACCTCCTAGACATTTTTTGGGTGTTCGAGAGATAGGTAGCTTGGGTTGGATTAACAAGTATTCAGAATATGTAAATTATTA
TGAAAATTAAGGAAATAGATGCCCATGCCATGTGTTATCCATATGTCCAAATTTAGCATTGTCAAGTGGCAAACTCCCCATAACCAATGTGTCCCCCATGTGTCATCTAA
ACACCATTGGTTAGTGTCCAAAATGAGTCACATCATGCTCTCCATTTGCCTATAAATAGTGGTTGTAATGGTCTTTGTAAGACACATTCATTTGGTAGATTTTTTCCCAA
TCTTCTTCATTCTTTCCCACTTTTCCTTATGCCATAATTTT
Protein sequenceShow/hide protein sequence
MKIGGGVVCGSPRAAVLPSLLLRRRGVTVRCSSSSSTSDHVSFIKDIAATEPPQHLFHLLKMLKTRGGSIISPGAKQGIIPLAIPLAKNSSGTITALLRWPTAPAGMEMP
VVDVNRNGVWLLAKNVDQFIHRLLVEEDAKGSGEQRDELFLAAADAGQKLYGRGDFAESQIKNIDGYLLKKVGLFPDVIERKILRHFEEGDLVSALVTGEFYTKKEHFPG
FARPFVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANVAQWEDEQIEYLKEKVTEEGKLEDLKKGKAPAQVALDQAAFLLDLASVDATWDNSVERIAQCYEE
AGLQEIARFVLYRD