; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g18110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g18110
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr8:13716491..13724796
RNA-Seq ExpressionMoc08g18110
SyntenyMoc08g18110
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.7e-12554.8Show/hide
Query:  DVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNS-
        + S    + +SIK IPEL QATFDTLKFYK+NFP+GRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFT SVKRKSKGRAHALK V +S 
Subjt:  DVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNS-

Query:  ---------------GPSSAVPTPVIKLDVSGGRSEEKRPREESEALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGT
                       GPSSA PTPVI+LD +G RS EKR R ESEALDVSPL EVR                                            
Subjt:  ---------------GPSSAVPTPVIKLDVSGGRSEEKRPREESEALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGT

Query:  FDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGSAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRA
                                                                                                            
Subjt:  FDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGSAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRA

Query:  EVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFL
          +AKAELLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ QALE KDA+IGRL AELK  KER+T G LLE +F+QHPDFDGFAKDFSDAGFKFL
Subjt:  EVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFL

Query:  MKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS
        MKGIA+D+PHL +DL DLKK+Y+EKWASGPN T GP  LVDKYVR+LDSDYSD++E++ P QEP E+GTTQE VPSQQDGSQEVNLLGSQGELSSHLGSS
Subjt:  MKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]2.3e-12283.92Show/hide
Query:  RFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGE
        RFRME SSS VKDQVSRISATCLDRCLRRAS+FVSDPGS           AF+ASIHSA+MVKAELDGREAL AKEREN S  LEAATT KGELLKAQGE
Subjt:  RFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGE

Query:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSD
        V ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQ LE+KDASIGRLT ELKDLKER+T G LLEESF+QHP+FDGFAKDFSD
Subjt:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSD

Query:  AGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDG
        AGFKFLMKGIA+DMPHL IDL DLKK+YSE WASGPN TPGPQ LVDKYVRELDSDYSDMEEEDAP QEP ++GTTQEE PSQ  G
Subjt:  AGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]2.2e-14490.16Show/hide
Query:  MGGTFDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQK
        MGGTFDVRTRFRMEPSSS VKDQVSRISATCLDRCL+RASKFVSDPGS           AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATT K
Subjt:  MGGTFDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQK

Query:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDF
        GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQ LE KD SIGRLTAELKDLKER+T G LLEESF+QH DF
Subjt:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDF

Query:  DGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVN
        DGFAKDFSDAGFKFLMKGIA+DMPHL IDL +LKKKYSEKWASGPN TPGPQ LV KYVRELDSDYSDMEEEDAP QEPNEIGTTQEEVPSQQDGSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.1e-15380.06Show/hide
Query:  LARRLESELEEIENFGFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLN
        LARRLES+LEEIEN   SDD EDSD STSGQGLEYPSR+PEHYLG LRRGF IP NILLR+PEEGERADN PEGWVTLY KMFEYGLRLPLHPF QEFL 
Subjt:  LARRLESELEEIENFGFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLN

Query:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAF
        RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAEL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+F
Subjt:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAF

Query:  FDVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNS
        FDV  RFGNLVSI+ +PEL QA+FDTLK+YKE FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF   VKRKSKGRAHAL+   +S
Subjt:  FDVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNS

Query:  --------GPSSAVPTPVIKLDVSGGRSEEKRPREESEALD
                GP+S  P  VI+L+ SGG S EKRPR+++EA+D
Subjt:  --------GPSSAVPTPVIKLDVSGGRSEEKRPREESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.6e-24085.02Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDV  RFGNLVSIK IPEL QATFDTLK YK++FP+ RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTV----------------GNSGPSSAVPTPVIKLDVSGGRSEEKRPREESEALDVSPLNEVRGE
        LVR IEASRPNSELAMVCGFT SVKRKSKGRAHALKTV                GNSGPSSAVPTPVI+LD+SGGRS EKR REESEALDVSPLNEVRGE
Subjt:  LVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTV----------------GNSGPSSAVPTPVIKLDVSGGRSEEKRPREESEALDVSPLNEVRGE

Query:  SPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGTFDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFV
        SPLRR+RKKKKTSS SEAGARGTLP SHADLVDDP ARM GT +VR RF MEPSSS VKDQVSRISATCLDR LRRASKFVSDPGS           AF+
Subjt:  SPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGTFDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFV

Query:  ASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQA
        ASIH A+MVKAELDGREALAAKERENS AALEAATT KGELLKAQGEV ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQ 
Subjt:  ASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQA

Query:  LEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVREL
        LEEKDASIGRLT ELKDLKER+T G LLEESF+QHPDFDGFAKDFSDAGFKFLMKGIA+DMPHL IDL  LKKKYSEKWASGPN TP PQ LVDKYVREL
Subjt:  LEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVREL

Query:  DSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGS
        DSDYSDMEEEDAP QEP E+GTTQEEVPSQQ GS
Subjt:  DSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124678.3e-12654.8Show/hide
Query:  DVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNS-
        + S    + +SIK IPEL QATFDTLKFYK+NFP+GRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFT SVKRKSKGRAHALK V +S 
Subjt:  DVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNS-

Query:  ---------------GPSSAVPTPVIKLDVSGGRSEEKRPREESEALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGT
                       GPSSA PTPVI+LD +G RS EKR R ESEALDVSPL EVR                                            
Subjt:  ---------------GPSSAVPTPVIKLDVSGGRSEEKRPREESEALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGT

Query:  FDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGSAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRA
                                                                                                            
Subjt:  FDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGSAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRA

Query:  EVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFL
          +AKAELLK+E E+HKAHLRAAHAITKGLEKEKFQLLKEKDD+ QALE KDA+IGRL AELK  KER+T G LLE +F+QHPDFDGFAKDFSDAGFKFL
Subjt:  EVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFL

Query:  MKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS
        MKGIA+D+PHL +DL DLKK+Y+EKWASGPN T GP  LVDKYVR+LDSDYSD++E++ P QEP E+GTTQE VPSQQDGSQEVNLLGSQGELSSHLGSS
Subjt:  MKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS

A0A6J1D1N9 uncharacterized protein LOC1110161931.1e-12283.92Show/hide
Query:  RFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGE
        RFRME SSS VKDQVSRISATCLDRCLRRAS+FVSDPGS           AF+ASIHSA+MVKAELDGREAL AKEREN S  LEAATT KGELLKAQGE
Subjt:  RFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGE

Query:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSD
        V ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQ LE+KDASIGRLT ELKDLKER+T G LLEESF+QHP+FDGFAKDFSD
Subjt:  VGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSD

Query:  AGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDG
        AGFKFLMKGIA+DMPHL IDL DLKK+YSE WASGPN TPGPQ LVDKYVRELDSDYSDMEEEDAP QEP ++GTTQEE PSQ  G
Subjt:  AGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDG

A0A6J1DF31 uncharacterized protein LOC1110199091.0e-14490.16Show/hide
Query:  MGGTFDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQK
        MGGTFDVRTRFRMEPSSS VKDQVSRISATCLDRCL+RASKFVSDPGS           AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATT K
Subjt:  MGGTFDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQK

Query:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDF
        GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQ LE KD SIGRLTAELKDLKER+T G LLEESF+QH DF
Subjt:  GELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDF

Query:  DGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVN
        DGFAKDFSDAGFKFLMKGIA+DMPHL IDL +LKKKYSEKWASGPN TPGPQ LV KYVRELDSDYSDMEEEDAP QEPNEIGTTQEEVPSQQDGSQEVN
Subjt:  DGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

A0A6J1DXS5 uncharacterized protein LOC1110255025.5e-15480.06Show/hide
Query:  LARRLESELEEIENFGFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLN
        LARRLES+LEEIEN   SDD EDSD STSGQGLEYPSR+PEHYLG LRRGF IP NILLR+PEEGERADN PEGWVTLY KMFEYGLRLPLHPF QEFL 
Subjt:  LARRLESELEEIENFGFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFLN

Query:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAF
        RTGLAPAQVAPNGWGVIFALAILFWLRARD +EAEL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+F
Subjt:  RTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAF

Query:  FDVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNS
        FDV  RFGNLVSI+ +PEL QA+FDTLK+YKE FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF   VKRKSKGRAHAL+   +S
Subjt:  FDVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNS

Query:  --------GPSSAVPTPVIKLDVSGGRSEEKRPREESEALD
                GP+S  P  VI+L+ SGG S EKRPR+++EA+D
Subjt:  --------GPSSAVPTPVIKLDVSGGRSEEKRPREESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256651.7e-24085.02Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDV  RFGNLVSIK IPEL QATFDTLK YK++FP+ RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDVSARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTV----------------GNSGPSSAVPTPVIKLDVSGGRSEEKRPREESEALDVSPLNEVRGE
        LVR IEASRPNSELAMVCGFT SVKRKSKGRAHALKTV                GNSGPSSAVPTPVI+LD+SGGRS EKR REESEALDVSPLNEVRGE
Subjt:  LVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTV----------------GNSGPSSAVPTPVIKLDVSGGRSEEKRPREESEALDVSPLNEVRGE

Query:  SPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGTFDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFV
        SPLRR+RKKKKTSS SEAGARGTLP SHADLVDDP ARM GT +VR RF MEPSSS VKDQVSRISATCLDR LRRASKFVSDPGS           AF+
Subjt:  SPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGTFDVRTRFRMEPSSSRVKDQVSRISATCLDRCLRRASKFVSDPGS-----------AFV

Query:  ASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQA
        ASIH A+MVKAELDGREALAAKERENS AALEAATT KGELLKAQGEV ILRAEVDAK +LLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQ 
Subjt:  ASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEKFQLLKEKDDLAQA

Query:  LEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVREL
        LEEKDASIGRLT ELKDLKER+T G LLEESF+QHPDFDGFAKDFSDAGFKFLMKGIA+DMPHL IDL  LKKKYSEKWASGPN TP PQ LVDKYVREL
Subjt:  LEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPLVDKYVREL

Query:  DSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGS
        DSDYSDMEEEDAP QEP E+GTTQEEVPSQQ GS
Subjt:  DSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G38190.1 INVOLVED IN: biological_process unknown2.6e-0725.84Show/hide
Query:  FSDDE-EDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNNILLRIPEEGERADNRPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQ
        ++DDE E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F   F     +A +Q
Subjt:  FSDDE-EDSDTSTSGQGLEY------PSRMPEHYLGPLRRGFNIPNNILLRIPEEGERADNRPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQ

Query:  VAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA
        +       I   A L  L AR       LSV+ +       ++  K G++Y+ + +G   +   P+  + W+G +F+A
Subjt:  VAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAGAGTAGCTGCAGATGGTTCAGGGCGAGACCTTAGAGAACCTGCAGCATTGATGGCCAGAACAGATCAGAAGAATCTGCCATCAGCTCAAGTTAAACAG
TTGAGAAGTACAGAAAAGGGAAACGAGAATTTGATAGGCCATCGAGTTCATACCTCAGCTGTCAGACGTTTAGGCGAGCTGGTGAAGTCGCATAGGCGAATTAGT
GCATTGAAGGGTACGAGCTCTGTTTCTAGTGTGGCAACAGACTTGGGTGGGTACGCCAAGTCATTAGGGGAATCTTCCTTCAGAGGTAATGGGCCCGACAGCACG
CGCGACCGGCGGTTACATGTCTTTTCTCATATTGGACCTGTCGGGTTCCGAGCAGGTCGGATCATAATCAGGTCGAGCTTTGGTGCCCATACTTCATCTTTTAAG
GGGCAAACTCGGTTGTCGGAGTACTCAAGCGTTTCGTCGTTGCGTATTCCGAGGAGATCTCAGCCGCTCGTTGATTACACGTGTACGGTGCAGAGGTTTTTCCGA
TCAGCTATAAATAGTGCCGAAACTTCAGTTTTCTTATCTTCCCCCTCCAGTAGTGATAGCCTGGGTAGTATAGGTCGGACAATAAGTAGTTCGCCCCCCAAGCCA
AGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCTGAGCTGGAGGAAATAGAGAACTTTGGGTTTTCAGATGATGAAGAGGATAGCGATACCTCCACCTCG
GGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTACCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATAACATCCTCCTTAGGATTCCGGAGGAA
GGGGAAAGAGCTGACAATCGTCCAGAGGGATGGGTCACTCTTTATTTGAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCCTTTGCTCAGGAGTTCTTA
AACCGAACCGGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGATGAGGATGAGGCC
GAGTTGCTAAGTGTCGACCAGCTTCTTGGGTGTTTTGAGGCTAAGAGGATAGCCAAGAAACCAGGTCGGTACTATATGTGCGCAAGGAAGGGCGCGGGTGGCATA
GTTAAGGGGCCGACCTCCATCAAAGGATGGGTAGGCAAGTGGTTCTTTGCCTCTGGAGAGTGGCTGGCAAAGGACGAATCAGGTCGTGCCTTTTTTGACGTGTCT
GCTAGGTTTGGGAACCTAGTGTCGATCAAGTCGATCCCCGAGCTCGATCAAGCCACTTTCGACACACTCAAGTTCTACAAAGAGAACTTCCCCAAGGGCAGGAAG
ATCGGAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTGACTACAACCCTCTAGTTCGGCCAATTGAAGCTTCAAGGCCGAACTCTGAACTCGCA
ATGGTGTGCGGATTCACCAGAAGTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTCAAGACCGTGGGTAACTCCGGCCCATCCTCTGCAGTTCCCACCCCC
GTGATCAAACTGGACGTGTCCGGGGGTCGATCTGAAGAGAAGCGTCCAAGGGAGGAGTCCGAGGCGCTTGATGTATCTCCCCTGAACGAGGTGAGGGGAGAGTCT
CCTTTGAGGAGAAAGAGAAAGAAGAAGAAGACCTCCTCCCCCTCGGAGGCTGGGGCTCGTGGGACCCTGCCCGCGAGCCATGCTGACCTGGTGGACGACCCCGTA
GCTCGGATGGGGGGAACATTCGACGTGCGAACGCGGTTCAGGATGGAACCGTCAAGCTCTAGGGTGAAGGACCAGGTGTCCCGCATCTCGGCCACGTGCTTGGAC
CGCTGTCTGAGGAGAGCATCCAAGTTCGTGAGTGATCCTGGGTCCGCGTTTGTCGCTTCCATTCATTCAGCTATTATGGTCAAGGCTGAGCTGGATGGAAGGGAG
GCTCTGGCAGCCAAGGAGAGGGAGAACTCTTCTGCTGCCTTAGAGGCTGCCACCACGCAGAAGGGCGAGCTACTAAAGGCCCAAGGCGAGGTGGGTATCTTGAGG
GCCGAGGTGGATGCCAAGGCCGAACTTTTGAAGAAGGAGGGTGAGAAGCACAAGGCCCACCTCCGAGCAGCCCATGCGATCACCAAGGGGCTGGAGAAGGAGAAA
TTCCAGCTCCTAAAGGAGAAGGACGATCTCGCCCAAGCTCTTGAGGAGAAGGATGCCTCTATTGGGCGTCTCACAGCCGAGCTCAAAGACCTGAAGGAGCGCATC
ACCTGCGGACCTCTGCTGGAGGAGTCGTTCCAGCAACACCCAGACTTTGATGGGTTCGCCAAGGACTTTAGCGACGCCGGCTTCAAGTTCTTGATGAAGGGTATT
GCTTCCGACATGCCTCACCTCCATATCGATCTCATCGACCTCAAGAAGAAATACTCTGAGAAGTGGGCTTCTGGGCCTAACGAGACTCCTGGCCCCCAACCGCTG
GTGGACAAGTATGTCAGGGAGCTGGACTCTGACTACTCCGACATGGAAGAAGAGGACGCTCCTAGACAAGAGCCCAATGAGATCGGCACAACGCAAGAGGAGGTT
CCTTCTCAGCAGGATGGGTCTCAGGAGGTTAACCTTCTAGGCTCCCAAGGCGAGCTGTCCTCCCACCTCGGAAGTAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACAGAGTAGCTGCAGATGGTTCAGGGCGAGACCTTAGAGAACCTGCAGCATTGATGGCCAGAACAGATCAGAAGAATCTGCCATCAGCTCAAGTTAAACAG
TTGAGAAGTACAGAAAAGGGAAACGAGAATTTGATAGGCCATCGAGTTCATACCTCAGCTGTCAGACGTTTAGGCGAGCTGGTGAAGTCGCATAGGCGAATTAGT
GCATTGAAGGGTACGAGCTCTGTTTCTAGTGTGGCAACAGACTTGGGTGGGTACGCCAAGTCATTAGGGGAATCTTCCTTCAGAGGTAATGGGCCCGACAGCACG
CGCGACCGGCGGTTACATGTCTTTTCTCATATTGGACCTGTCGGGTTCCGAGCAGGTCGGATCATAATCAGGTCGAGCTTTGGTGCCCATACTTCATCTTTTAAG
GGGCAAACTCGGTTGTCGGAGTACTCAAGCGTTTCGTCGTTGCGTATTCCGAGGAGATCTCAGCCGCTCGTTGATTACACGTGTACGGTGCAGAGGTTTTTCCGA
TCAGCTATAAATAGTGCCGAAACTTCAGTTTTCTTATCTTCCCCCTCCAGTAGTGATAGCCTGGGTAGTATAGGTCGGACAATAAGTAGTTCGCCCCCCAAGCCA
AGTGACTCTGGGGAGGTCTTAGCTCGTAGGTTAGAGTCTGAGCTGGAGGAAATAGAGAACTTTGGGTTTTCAGATGATGAAGAGGATAGCGATACCTCCACCTCG
GGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCACTACCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATAACATCCTCCTTAGGATTCCGGAGGAA
GGGGAAAGAGCTGACAATCGTCCAGAGGGATGGGTCACTCTTTATTTGAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCCTTTGCTCAGGAGTTCTTA
AACCGAACCGGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGATGAGGATGAGGCC
GAGTTGCTAAGTGTCGACCAGCTTCTTGGGTGTTTTGAGGCTAAGAGGATAGCCAAGAAACCAGGTCGGTACTATATGTGCGCAAGGAAGGGCGCGGGTGGCATA
GTTAAGGGGCCGACCTCCATCAAAGGATGGGTAGGCAAGTGGTTCTTTGCCTCTGGAGAGTGGCTGGCAAAGGACGAATCAGGTCGTGCCTTTTTTGACGTGTCT
GCTAGGTTTGGGAACCTAGTGTCGATCAAGTCGATCCCCGAGCTCGATCAAGCCACTTTCGACACACTCAAGTTCTACAAAGAGAACTTCCCCAAGGGCAGGAAG
ATCGGAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTGACTACAACCCTCTAGTTCGGCCAATTGAAGCTTCAAGGCCGAACTCTGAACTCGCA
ATGGTGTGCGGATTCACCAGAAGTGTGAAGCGCAAGTCCAAGGGCCGTGCTCACGCCCTCAAGACCGTGGGTAACTCCGGCCCATCCTCTGCAGTTCCCACCCCC
GTGATCAAACTGGACGTGTCCGGGGGTCGATCTGAAGAGAAGCGTCCAAGGGAGGAGTCCGAGGCGCTTGATGTATCTCCCCTGAACGAGGTGAGGGGAGAGTCT
CCTTTGAGGAGAAAGAGAAAGAAGAAGAAGACCTCCTCCCCCTCGGAGGCTGGGGCTCGTGGGACCCTGCCCGCGAGCCATGCTGACCTGGTGGACGACCCCGTA
GCTCGGATGGGGGGAACATTCGACGTGCGAACGCGGTTCAGGATGGAACCGTCAAGCTCTAGGGTGAAGGACCAGGTGTCCCGCATCTCGGCCACGTGCTTGGAC
CGCTGTCTGAGGAGAGCATCCAAGTTCGTGAGTGATCCTGGGTCCGCGTTTGTCGCTTCCATTCATTCAGCTATTATGGTCAAGGCTGAGCTGGATGGAAGGGAG
GCTCTGGCAGCCAAGGAGAGGGAGAACTCTTCTGCTGCCTTAGAGGCTGCCACCACGCAGAAGGGCGAGCTACTAAAGGCCCAAGGCGAGGTGGGTATCTTGAGG
GCCGAGGTGGATGCCAAGGCCGAACTTTTGAAGAAGGAGGGTGAGAAGCACAAGGCCCACCTCCGAGCAGCCCATGCGATCACCAAGGGGCTGGAGAAGGAGAAA
TTCCAGCTCCTAAAGGAGAAGGACGATCTCGCCCAAGCTCTTGAGGAGAAGGATGCCTCTATTGGGCGTCTCACAGCCGAGCTCAAAGACCTGAAGGAGCGCATC
ACCTGCGGACCTCTGCTGGAGGAGTCGTTCCAGCAACACCCAGACTTTGATGGGTTCGCCAAGGACTTTAGCGACGCCGGCTTCAAGTTCTTGATGAAGGGTATT
GCTTCCGACATGCCTCACCTCCATATCGATCTCATCGACCTCAAGAAGAAATACTCTGAGAAGTGGGCTTCTGGGCCTAACGAGACTCCTGGCCCCCAACCGCTG
GTGGACAAGTATGTCAGGGAGCTGGACTCTGACTACTCCGACATGGAAGAAGAGGACGCTCCTAGACAAGAGCCCAATGAGATCGGCACAACGCAAGAGGAGGTT
CCTTCTCAGCAGGATGGGTCTCAGGAGGTTAACCTTCTAGGCTCCCAAGGCGAGCTGTCCTCCCACCTCGGAAGTAGTTGA
Protein sequenceShow/hide protein sequence
MHRVAADGSGRDLREPAALMARTDQKNLPSAQVKQLRSTEKGNENLIGHRVHTSAVRRLGELVKSHRRISALKGTSSVSSVATDLGGYAKSLGESSFRGNGPDST
RDRRLHVFSHIGPVGFRAGRIIIRSSFGAHTSSFKGQTRLSEYSSVSSLRIPRRSQPLVDYTCTVQRFFRSAINSAETSVFLSSPSSSDSLGSIGRTISSSPPKP
SDSGEVLARRLESELEEIENFGFSDDEEDSDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNNILLRIPEEGERADNRPEGWVTLYLKMFEYGLRLPLHPFAQEFL
NRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLAKDESGRAFFDVS
ARFGNLVSIKSIPELDQATFDTLKFYKENFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTRSVKRKSKGRAHALKTVGNSGPSSAVPTP
VIKLDVSGGRSEEKRPREESEALDVSPLNEVRGESPLRRKRKKKKTSSPSEAGARGTLPASHADLVDDPVARMGGTFDVRTRFRMEPSSSRVKDQVSRISATCLD
RCLRRASKFVSDPGSAFVASIHSAIMVKAELDGREALAAKERENSSAALEAATTQKGELLKAQGEVGILRAEVDAKAELLKKEGEKHKAHLRAAHAITKGLEKEK
FQLLKEKDDLAQALEEKDASIGRLTAELKDLKERITCGPLLEESFQQHPDFDGFAKDFSDAGFKFLMKGIASDMPHLHIDLIDLKKKYSEKWASGPNETPGPQPL
VDKYVRELDSDYSDMEEEDAPRQEPNEIGTTQEEVPSQQDGSQEVNLLGSQGELSSHLGSS