; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g06050 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g06050
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr7:5007101..5012644
RNA-Seq ExpressionMoc07g06050
SyntenyMoc07g06050
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]7.5e-10653.19Show/hide
Query:  VSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRT
        +SIKPIPEL QATFDTLK+YKD FP+GRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGF+ SVKRKSKGRAHALK V  ++ VTP V + 
Subjt:  VSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRT

Query:  EAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRM
         AQ  +GPSSA PTPVIELD +G RS EKR R ESEALDVSPL EVR                                                     
Subjt:  EAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRM

Query:  ESSSSGVKDQVSRISATCLDRCLRRASKFVSDPGRFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRAKLLKEKDDLT
                                                          + +  L  +E E   A L AA  +   L K        + +LLKEKDD+ 
Subjt:  ESSSSGVKDQVSRISATCLDRCLRRASKFVSDPGRFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRAKLLKEKDDLT

Query:  QVLEERDASIGRLTTELKELKERLTNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------
        Q LE +DA+IGRL  ELK  KERLTNGALLE +FRQHPDFDGFAKDFSDAGFKFLMKGI AD+PHL++DL DLKK+Y+EK                    
Subjt:  QVLEERDASIGRLTTELKELKERLTNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------

Query:  ELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVNLLGSQGELSSHLGSS
        +LDSDYSD++E + PSQEPTEVGTTQE  PSQQ GS+EVNLLGSQGELSSHLGSS
Subjt:  ELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVNLLGSQGELSSHLGSS

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.3e-11575.44Show/hide
Query:  MFEYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYG RLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA+LL VDQLL CFEA+RIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSG
        KWF+ASGEWLAKDESGR FFDVP RFGNLVSI+P+PELTQA+FDTLKYYK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE SRPNS LAMVC F+ 
Subjt:  KWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSG

Query:  SVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEAL-------DVSPLNE
         VKRKSKGRAHAL+    ++  TP V         GP+S  P PVIEL+ SGG S EKRPR+++EA+       DV PL E
Subjt:  SVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEAL-------DVSPLNE

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]2.3e-9969.52Show/hide
Query:  MGGTSDVRMRFRMESSSSGVKDQVSRISATCLDRCLRRASKFVSDPG------------RFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLK
        MGGT DVR RFRME SSSGVKDQVSRISATCLDRCL+RASKFVSDPG             F+ASIHSA+MVKAELDGREAL AKERENSSAALEAATTLK
Subjt:  MGGTSDVRMRFRMESSSSGVKDQVSRISATCLDRCLRRASKFVSDPG------------RFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLK

Query:  GELLKAQGEVDVLRA-----------------------------------KLLKEKDDLTQVLEERDASIGRLTTELKELKERLTNGALLEESFRQHPDF
        GELLKAQGEV +LRA                                   +LLKEKDDL QVLE +D SIGRLT ELK+LKERLTNG+LLEESFRQH DF
Subjt:  GELLKAQGEVDVLRA-----------------------------------KLLKEKDDLTQVLEERDASIGRLTTELKELKERLTNGALLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------ELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVN
        DGFAKDFSDAGFKFLMKGI ADMPHLQIDL++LKKKYSEK                    ELDSDYSDMEE DAPSQEP E+GTTQEE PSQQ GS+EVN
Subjt:  DGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------ELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]6.3e-14578.33Show/hide
Query:  SDSGEGLEYPSRMPEHYLGPLRRGFKIPNDILNRIPEEGERADNPPEGWVTLYLKMFEYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWL
        S SG+GLEYPSR+PEHYLG LRRGF IP +IL R+PEEGERADNPPEGWVTLY KMFEYG RLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWL
Subjt:  SDSGEGLEYPSRMPEHYLGPLRRGFKIPNDILNRIPEEGERADNPPEGWVTLYLKMFEYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWL

Query:  RARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDT
        RARD +EA+L  VDQLL CFEA+RIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASGEWLAKDESGR FFDVP RFGNLVSI+P+PELTQA+FDT
Subjt:  RARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDT

Query:  LKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPV
        LKYYK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF+  VKRKSKGRAHAL+    ++  TP V         GP+S  P  V
Subjt:  LKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPV

Query:  IELDLSGGRSEEKRPREESEALD
        IEL+ SGG S EKRPR+++EA+D
Subjt:  IELDLSGGRSEEKRPREESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.6e-21980.15Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASGEWLAKDESGR FFDVP RFGNLVSIK IPEL QATFDTLK+YKD FP+ RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGE
        LVR IEASRPNSELAMVCGF+GSVKRKSKGRAHALKTVVGTE VTPTVPRT AQGNSGPSSAVPTPVIELDLSGGRS EKR REESEALDVSPLNEVRGE
Subjt:  LVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGE

Query:  SPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRMESSSSGVKDQVSRISATCLDRCLRRASKFVSDPG------------RFI
        SPLRRRRKKKKTSSSSEAGARGTLP SH DLVDDP ARM GTS+VRMRF ME SSSGVKDQVSRISATCLDR LRRASKFVSDPG             FI
Subjt:  SPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRMESSSSGVKDQVSRISATCLDRCLRRASKFVSDPG------------RFI

Query:  ASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRA-----------------------------------KLLKEKDDLTQV
        ASIH AVMVKAELDGREAL AKERENS AALEAATTLKGELLKAQGEVD+LRA                                   +LLKEKDDL QV
Subjt:  ASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRA-----------------------------------KLLKEKDDLTQV

Query:  LEERDASIGRLTTELKELKERLTNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------EL
        LEE+DASIGRLTTELK+LKERLTNG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGI ADMPHLQIDLN LKKKYSEK                    EL
Subjt:  LEERDASIGRLTTELKELKERLTNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------EL

Query:  DSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGS
        DSDYSDMEE DAPSQEP EVGTTQEE PSQQGGS
Subjt:  DSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124673.6e-10653.19Show/hide
Query:  VSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRT
        +SIKPIPEL QATFDTLK+YKD FP+GRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGF+ SVKRKSKGRAHALK V  ++ VTP V + 
Subjt:  VSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRT

Query:  EAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRM
         AQ  +GPSSA PTPVIELD +G RS EKR R ESEALDVSPL EVR                                                     
Subjt:  EAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRM

Query:  ESSSSGVKDQVSRISATCLDRCLRRASKFVSDPGRFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRAKLLKEKDDLT
                                                          + +  L  +E E   A L AA  +   L K        + +LLKEKDD+ 
Subjt:  ESSSSGVKDQVSRISATCLDRCLRRASKFVSDPGRFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRAKLLKEKDDLT

Query:  QVLEERDASIGRLTTELKELKERLTNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------
        Q LE +DA+IGRL  ELK  KERLTNGALLE +FRQHPDFDGFAKDFSDAGFKFLMKGI AD+PHL++DL DLKK+Y+EK                    
Subjt:  QVLEERDASIGRLTTELKELKERLTNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------

Query:  ELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVNLLGSQGELSSHLGSS
        +LDSDYSD++E + PSQEPTEVGTTQE  PSQQ GS+EVNLLGSQGELSSHLGSS
Subjt:  ELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVNLLGSQGELSSHLGSS

A0A6J1CR42 uncharacterized protein LOC1110138261.1e-11575.44Show/hide
Query:  MFEYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYG RLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA+LL VDQLL CFEA+RIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSG
        KWF+ASGEWLAKDESGR FFDVP RFGNLVSI+P+PELTQA+FDTLKYYK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE SRPNS LAMVC F+ 
Subjt:  KWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSG

Query:  SVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEAL-------DVSPLNE
         VKRKSKGRAHAL+    ++  TP V         GP+S  P PVIEL+ SGG S EKRPR+++EA+       DV PL E
Subjt:  SVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEAL-------DVSPLNE

A0A6J1DF31 uncharacterized protein LOC1110199091.1e-9969.52Show/hide
Query:  MGGTSDVRMRFRMESSSSGVKDQVSRISATCLDRCLRRASKFVSDPG------------RFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLK
        MGGT DVR RFRME SSSGVKDQVSRISATCLDRCL+RASKFVSDPG             F+ASIHSA+MVKAELDGREAL AKERENSSAALEAATTLK
Subjt:  MGGTSDVRMRFRMESSSSGVKDQVSRISATCLDRCLRRASKFVSDPG------------RFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLK

Query:  GELLKAQGEVDVLRA-----------------------------------KLLKEKDDLTQVLEERDASIGRLTTELKELKERLTNGALLEESFRQHPDF
        GELLKAQGEV +LRA                                   +LLKEKDDL QVLE +D SIGRLT ELK+LKERLTNG+LLEESFRQH DF
Subjt:  GELLKAQGEVDVLRA-----------------------------------KLLKEKDDLTQVLEERDASIGRLTTELKELKERLTNGALLEESFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------ELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVN
        DGFAKDFSDAGFKFLMKGI ADMPHLQIDL++LKKKYSEK                    ELDSDYSDMEE DAPSQEP E+GTTQEE PSQQ GS+EVN
Subjt:  DGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------ELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

A0A6J1DXS5 uncharacterized protein LOC1110255023.1e-14578.33Show/hide
Query:  SDSGEGLEYPSRMPEHYLGPLRRGFKIPNDILNRIPEEGERADNPPEGWVTLYLKMFEYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWL
        S SG+GLEYPSR+PEHYLG LRRGF IP +IL R+PEEGERADNPPEGWVTLY KMFEYG RLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWL
Subjt:  SDSGEGLEYPSRMPEHYLGPLRRGFKIPNDILNRIPEEGERADNPPEGWVTLYLKMFEYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWL

Query:  RARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDT
        RARD +EA+L  VDQLL CFEA+RIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASGEWLAKDESGR FFDVP RFGNLVSI+P+PELTQA+FDT
Subjt:  RARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDT

Query:  LKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPV
        LKYYK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF+  VKRKSKGRAHAL+    ++  TP V         GP+S  P  V
Subjt:  LKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPV

Query:  IELDLSGGRSEEKRPREESEALD
        IEL+ SGG S EKRPR+++EA+D
Subjt:  IELDLSGGRSEEKRPREESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256653.2e-21980.15Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASGEWLAKDESGR FFDVP RFGNLVSIK IPEL QATFDTLK+YKD FP+ RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFGNLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGE
        LVR IEASRPNSELAMVCGF+GSVKRKSKGRAHALKTVVGTE VTPTVPRT AQGNSGPSSAVPTPVIELDLSGGRS EKR REESEALDVSPLNEVRGE
Subjt:  LVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGPSSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGE

Query:  SPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRMESSSSGVKDQVSRISATCLDRCLRRASKFVSDPG------------RFI
        SPLRRRRKKKKTSSSSEAGARGTLP SH DLVDDP ARM GTS+VRMRF ME SSSGVKDQVSRISATCLDR LRRASKFVSDPG             FI
Subjt:  SPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRMESSSSGVKDQVSRISATCLDRCLRRASKFVSDPG------------RFI

Query:  ASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRA-----------------------------------KLLKEKDDLTQV
        ASIH AVMVKAELDGREAL AKERENS AALEAATTLKGELLKAQGEVD+LRA                                   +LLKEKDDL QV
Subjt:  ASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRA-----------------------------------KLLKEKDDLTQV

Query:  LEERDASIGRLTTELKELKERLTNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------EL
        LEE+DASIGRLTTELK+LKERLTNG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGI ADMPHLQIDLN LKKKYSEK                    EL
Subjt:  LEERDASIGRLTTELKELKERLTNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEK--------------------EL

Query:  DSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGS
        DSDYSDMEE DAPSQEP EVGTTQEE PSQQGGS
Subjt:  DSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGS

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic1.2e-0526.73Show/hide
Query:  SRMPEHYLGPLRRGFKIPNDILNRIPEEGERADNPPEGWVTLYLKMFEYGPR--LPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEA
        S   E  L  L+  F +   +  R+P   ERAD+PP G+ TLY + F YG    LP+     E++    +A +Q+       + +L  L  +  R  +  
Subjt:  SRMPEHYLGPLRRGFKIPNDILNRIPEEGERADNPPEGWVTLYLKMFEYGPR--LPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEA

Query:  DLLSVDQLLGCFEARRIAK-KPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFG----NLVSIKPIPELTQATFDTLKY
          +++  L    E RR+ K +  RYY+   KG   I   P+  + +   +FF + E    ++   L   V  R+G     L  ++PIP+   + F  L  
Subjt:  DLLSVDQLLGCFEARRIAK-KPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFG----NLVSIKPIPELTQATFDTLKY

Query:  YK
         K
Subjt:  YK

Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related1.9e-0627.41Show/hide
Query:  IPNDILNRIPEEGERADNPPEGWVTLYLKMF-EYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRI
        +P  +  RIP + +R  + PEG++ L+   F E G R P+  F   F     +A +Q+       I   A L  L AR       LSV+ +       ++
Subjt:  IPNDILNRIPEEGERADNPPEGWVTLYLKMF-EYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRI

Query:  AKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA
          K G++Y+ + +G   +  GP+  + W+G +F+A
Subjt:  AKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA

AT5G38190.1 INVOLVED IN: biological_process unknown1.6e-0526.67Show/hide
Query:  IPNDILNRIPEEGERADNPPEGWVTLYLKMF-EYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRI
        +P  +  RIP + +R  + PEG++ L+   F E G R P+  F   F     +A +Q+       I   A L  L AR       LSV+ +       ++
Subjt:  IPNDILNRIPEEGERADNPPEGWVTLYLKMF-EYGPRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRI

Query:  AKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA
          K G++Y+ + +G   +   P+  + W+G +F+A
Subjt:  AKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAACGAGACTTTGGAATTGCCTTGCAGCTGCGCAGAAGAAGCAAGAACTTGCGGAACAGGTGAAAGATCTTCTTTCAAGAACCAGTAATGCAGAGCGGAATTCGCT
ACCGCAACATGATGTAAATCCTTTTCCAAAATCCAAAATCCGTATTTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTCTCCGGTTCCGACCTAAACACTAGAG
TGGACTTGCACAAGAGGATAAGCACTCCGACGCTCAAGTCAAACCTTACGTTTCCTGAATTTCTGGAGTTCGATCTGAAACCAGCTCGAACCCTTTGTGTAGGTCAGTCA
CTTTTCCTTGCTCTTATTCTTCTTTCAAACATGGTAGTTTTCTTGTCTTCCCCCTCCAGTAGTGATAGCCTAGGTAGTGTAGGTCGGACTATAAGTAGTTCACCCCCCAA
ACCAAGTGATTCTGGGGAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTATCTCGGACCCCTCCGTAGGGGGTTTAAAATTCCGAATGACATCCTCAATAGGATTC
CGGAGGAAGGGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTAAAAATGTTTGAGTACGGCCCCAGACTTCCTCTTCACCCTTTTGCTCAAGAGTTT
CTCAATCGAACTGGTTTGGCGCCGGCACAAGTGGCCCCTAATGGGTGGGGTGTCATTTTTGCCTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAGGATGAGGCCGA
TCTACTAAGCGTTGACCAACTTCTTGGGTGTTTTGAGGCCAGGAGGATAGCTAAAAAACCTGGTCGGTATTATATGTGCGCGAGGAAGGGCGCAGGTGGTATAGTTAAGG
GGCCGACCTCCATCAAAGGATGGGTAGGTAAGTGGTTCTTTGCCTCAGGTGAATGGCTGGCAAAGGACGAGTCAGGTCGTCTCTTTTTTGACGTGCCTGCTAGGTTTGGG
AACCTAGTATCGATCAAGCCAATTCCTGAGCTCACTCAAGCCACCTTCGACACCCTCAAATATTACAAGGACACCTTCCCCAAGGGCCGGAAGATCGGAACCTTGGTCAC
CGACAAGCTGCTCTTGGAGTCGGGGTTACTTGACTACAACCCTCTGGTTCGACCAATTGAAGCTTCGAGGCCAAACTCCGAGCTCGCAATGGTGTGCGGATTCTCTGGAA
GTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTCAAAACTGTGGTGGGAACTGAATCAGTGACGCCTACGGTGCCACGAACTGAGGCTCAGGGTAACTCTGGGCCG
TCTTCTGCAGTCCCCACCCCCGTGATCGAACTAGACTTGTCTGGGGGTCGATCTGAAGAGAAACGCCCAAGGGAGGAGTCCGAGGCGCTTGACGTATCTCCCTTGAATGA
AGTGAGGGGAGAGTCTCCTTTGAGGAGACGAAGAAAGAAGAAGAAGACTTCCTCCTCCTCGGAGGCTGGGGCTCGTGGGACTCTGCCTATGAGTCATGGTGATTTGGTAG
ATGACCCCGCAGCTCGGATGGGGGGAACATCCGATGTGCGAATGCGGTTCAGGATGGAATCGTCAAGTTCCGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCCACGTGC
TTGGACCGCTGTCTGAGGAGAGCATCCAAGTTCGTGAGTGATCCTGGGCGTTTTATCGCTTCCATTCACTCAGCAGTTATGGTTAAGGCCGAACTAGATGGAAGAGAGGC
TTTGACAGCAAAGGAAAGGGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACGCTGAAGGGTGAACTGCTAAAGGCCCAAGGCGAGGTGGATGTTTTAAGGGCCAAGC
TCCTGAAGGAGAAGGACGACCTTACTCAGGTCCTTGAGGAGAGGGACGCCTCAATTGGGCGTCTTACGACTGAGCTCAAAGAGCTGAAAGAGCGCCTCACCAATGGAGCT
CTGTTGGAGGAGTCATTCAGGCAACACCCAGACTTCGACGGGTTTGCCAAAGACTTCAGTGATGCTGGCTTCAAGTTCCTGATGAAAGGCATTGTTGCCGACATGCCTCA
TCTTCAGATCGATCTCAACGATCTCAAGAAGAAGTATTCTGAGAAGGAGCTTGACTCTGACTACTCCGACATGGAGGAAGGGGATGCCCCAAGCCAAGAGCCTACCGAGG
TCGGCACAACTCAAGAGGAGGCTCCATCACAGCAGGGTGGATCCGAGGAGGTCAACCTTCTGGGGTCCCAGGGCGAGCTGTCCTCCCACCTCGGAAGTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAACGAGACTTTGGAATTGCCTTGCAGCTGCGCAGAAGAAGCAAGAACTTGCGGAACAGGTGAAAGATCTTCTTTCAAGAACCAGTAATGCAGAGCGGAATTCGCT
ACCGCAACATGATGTAAATCCTTTTCCAAAATCCAAAATCCGTATTTGCACAACGGTTCTTCACGAATCGAGCTCGAACCCGGTCTCCGGTTCCGACCTAAACACTAGAG
TGGACTTGCACAAGAGGATAAGCACTCCGACGCTCAAGTCAAACCTTACGTTTCCTGAATTTCTGGAGTTCGATCTGAAACCAGCTCGAACCCTTTGTGTAGGTCAGTCA
CTTTTCCTTGCTCTTATTCTTCTTTCAAACATGGTAGTTTTCTTGTCTTCCCCCTCCAGTAGTGATAGCCTAGGTAGTGTAGGTCGGACTATAAGTAGTTCACCCCCCAA
ACCAAGTGATTCTGGGGAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTATCTCGGACCCCTCCGTAGGGGGTTTAAAATTCCGAATGACATCCTCAATAGGATTC
CGGAGGAAGGGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTAAAAATGTTTGAGTACGGCCCCAGACTTCCTCTTCACCCTTTTGCTCAAGAGTTT
CTCAATCGAACTGGTTTGGCGCCGGCACAAGTGGCCCCTAATGGGTGGGGTGTCATTTTTGCCTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAGGATGAGGCCGA
TCTACTAAGCGTTGACCAACTTCTTGGGTGTTTTGAGGCCAGGAGGATAGCTAAAAAACCTGGTCGGTATTATATGTGCGCGAGGAAGGGCGCAGGTGGTATAGTTAAGG
GGCCGACCTCCATCAAAGGATGGGTAGGTAAGTGGTTCTTTGCCTCAGGTGAATGGCTGGCAAAGGACGAGTCAGGTCGTCTCTTTTTTGACGTGCCTGCTAGGTTTGGG
AACCTAGTATCGATCAAGCCAATTCCTGAGCTCACTCAAGCCACCTTCGACACCCTCAAATATTACAAGGACACCTTCCCCAAGGGCCGGAAGATCGGAACCTTGGTCAC
CGACAAGCTGCTCTTGGAGTCGGGGTTACTTGACTACAACCCTCTGGTTCGACCAATTGAAGCTTCGAGGCCAAACTCCGAGCTCGCAATGGTGTGCGGATTCTCTGGAA
GTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTCAAAACTGTGGTGGGAACTGAATCAGTGACGCCTACGGTGCCACGAACTGAGGCTCAGGGTAACTCTGGGCCG
TCTTCTGCAGTCCCCACCCCCGTGATCGAACTAGACTTGTCTGGGGGTCGATCTGAAGAGAAACGCCCAAGGGAGGAGTCCGAGGCGCTTGACGTATCTCCCTTGAATGA
AGTGAGGGGAGAGTCTCCTTTGAGGAGACGAAGAAAGAAGAAGAAGACTTCCTCCTCCTCGGAGGCTGGGGCTCGTGGGACTCTGCCTATGAGTCATGGTGATTTGGTAG
ATGACCCCGCAGCTCGGATGGGGGGAACATCCGATGTGCGAATGCGGTTCAGGATGGAATCGTCAAGTTCCGGGGTGAAGGACCAGGTGTCCCGCATCTCGGCCACGTGC
TTGGACCGCTGTCTGAGGAGAGCATCCAAGTTCGTGAGTGATCCTGGGCGTTTTATCGCTTCCATTCACTCAGCAGTTATGGTTAAGGCCGAACTAGATGGAAGAGAGGC
TTTGACAGCAAAGGAAAGGGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACGCTGAAGGGTGAACTGCTAAAGGCCCAAGGCGAGGTGGATGTTTTAAGGGCCAAGC
TCCTGAAGGAGAAGGACGACCTTACTCAGGTCCTTGAGGAGAGGGACGCCTCAATTGGGCGTCTTACGACTGAGCTCAAAGAGCTGAAAGAGCGCCTCACCAATGGAGCT
CTGTTGGAGGAGTCATTCAGGCAACACCCAGACTTCGACGGGTTTGCCAAAGACTTCAGTGATGCTGGCTTCAAGTTCCTGATGAAAGGCATTGTTGCCGACATGCCTCA
TCTTCAGATCGATCTCAACGATCTCAAGAAGAAGTATTCTGAGAAGGAGCTTGACTCTGACTACTCCGACATGGAGGAAGGGGATGCCCCAAGCCAAGAGCCTACCGAGG
TCGGCACAACTCAAGAGGAGGCTCCATCACAGCAGGGTGGATCCGAGGAGGTCAACCTTCTGGGGTCCCAGGGCGAGCTGTCCTCCCACCTCGGAAGTAGCTGA
Protein sequenceShow/hide protein sequence
MLTRLWNCLAAAQKKQELAEQVKDLLSRTSNAERNSLPQHDVNPFPKSKIRICTTVLHESSSNPVSGSDLNTRVDLHKRISTPTLKSNLTFPEFLEFDLKPARTLCVGQS
LFLALILLSNMVVFLSSPSSSDSLGSVGRTISSSPPKPSDSGEGLEYPSRMPEHYLGPLRRGFKIPNDILNRIPEEGERADNPPEGWVTLYLKMFEYGPRLPLHPFAQEF
LNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEADLLSVDQLLGCFEARRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRLFFDVPARFG
NLVSIKPIPELTQATFDTLKYYKDTFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFSGSVKRKSKGRAHALKTVVGTESVTPTVPRTEAQGNSGP
SSAVPTPVIELDLSGGRSEEKRPREESEALDVSPLNEVRGESPLRRRRKKKKTSSSSEAGARGTLPMSHGDLVDDPAARMGGTSDVRMRFRMESSSSGVKDQVSRISATC
LDRCLRRASKFVSDPGRFIASIHSAVMVKAELDGREALTAKERENSSAALEAATTLKGELLKAQGEVDVLRAKLLKEKDDLTQVLEERDASIGRLTTELKELKERLTNGA
LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIVADMPHLQIDLNDLKKKYSEKELDSDYSDMEEGDAPSQEPTEVGTTQEEAPSQQGGSEEVNLLGSQGELSSHLGSS