; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g34090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g34090
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr8:24859705..24862188
RNA-Seq ExpressionMoc08g34090
SyntenyMoc08g34090
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]2.5e-13759.73Show/hide
Query:  FASGKWLAKDKSVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQ
        FA    +  + ++SIKPIPEL QATFDTLKFYKDNFP+GRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALK VQ
Subjt:  FASGKWLAKDKSVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQ

Query:  SSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARM
        SSDP T  VDQNAAQDQ GPSSAAPTPVIELDST ERSREKRSRSESEALDVSPLREVR                                         
Subjt:  SSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARM

Query:  GGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKG
                                                                                                            
Subjt:  GGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKG

Query:  ELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFD
                        EAKAELLKREDERHKAHL+AAHAITKGLEKEKFQLLKEKDDMLQALE KDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFD
Subjt:  ELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFD

Query:  GFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGSQEVNL
        GFAKDFSDA FKFLMKGIAAD+PHL+VDLGDLKK                        RDLDSDYS+LDEDEVPSQEPTEVGTTQEGVPSQQ+GSQEVNL
Subjt:  GFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGSQEVNL

Query:  LGSQGELSSHLGSS
        LGSQGELSSHLGSS
Subjt:  LGSQGELSSHLGSS

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]7.4e-10573.88Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +E ELL VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGKWLAKDKS--------------VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTS
        KWF+ASG+WLAKD+S              VSI+P+PEL QA+FDTLK+YK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE SRPNS LAMVC F S
Subjt:  KWFFASGKWLAKDKS--------------VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTS

Query:  SVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALD
         VKRKSKGRAHAL+  QSS PPT  V        VGP+S  P PVIEL+S+   SREKR R ++EA+D
Subjt:  SVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALD

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]3.3e-12176.83Show/hide
Query:  MGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLK
        MGGT DV+ RFRMEP SSGVKDQVSRISA CLDRCL+RASKFVSDPGSVLQRTID+A EAF ASIHSA+M+KAELDGRE LAAKERENSSAA EAATTLK
Subjt:  MGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLK

Query:  GELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDF
        GELLKA+ EV ILRAEV+AKAELLK+E E+HKAHL+AAHAITKGLEKEKFQLLKEKDD+ Q LEGKD +IGRL AELK  KERLTNG+LLE +FRQH DF
Subjt:  GELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDF

Query:  DGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGSQEVN
        DGFAKDFSDA FKFLMKGIAADMPHLQ+DL +LKK                        R+LDSDYS+++E++ PSQEP E+GTTQE VPSQQ+GSQEVN
Subjt:  DGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGSQEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.7e-13374.31Show/hide
Query:  DSGEVLARQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAI
        DS    + QGLEYPSR+PEHYLG LRRGF IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAI
Subjt:  DSGEVLARQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAI

Query:  LFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKS--------------VSIKPIPELDQA
        LFWLRARD +E EL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASG+WLAKD+S              VSI+P+PEL QA
Subjt:  LFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKS--------------VSIKPIPELDQA

Query:  TFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAA
        +FDTLK+YK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF S VKRKSKGRAHAL+  QSS P T  V        VGP+S  
Subjt:  TFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAA

Query:  PTPVIELDSTRERSREKRSRSESEALD
        P  VIEL+S+   SREKR R ++EA+D
Subjt:  PTPVIELDSTRERSREKRSRSESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.4e-20674.72Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKS--------------VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASG+WLAKD+S              VSIK IPEL QATFDTLK YKD+FP+ RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKS--------------VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREG
        LVR IEASRPNSELAMVCGFT SVKRKSKGRAHALKTV  ++P T  V +  AQ   GPSSA PTPVIELD +  RS EKRSR ESEALDVSPL EVR  
Subjt:  LVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREG

Query:  SPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARMGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFT
        SPL+RR+KKKK ++SSE G RG LP+SHADL+DDPEARM GTS+V+MRF MEP SSGVKDQVSRISA CLDR LRRASKFVSDPGSVLQRTID+  EAF 
Subjt:  SPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARMGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFT

Query:  ASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQA
        ASIH AVM+KAELDGRE LAAKERENS AA EAATTLKGELLKA+ EVDILRAEV+AK +LLK+E E+HKAHL+AAHAITKGLEKEKFQLLKEKDD+ Q 
Subjt:  ASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQA

Query:  LEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDL
        LE KDA+IGRL  ELK  KERLTNG LLE +FRQHPDFDGFAKDFSDA FKFLMKGIAADMPHLQ+DL  LKK                        R+L
Subjt:  LEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDL

Query:  DSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGS
        DSDYS+++E++ PSQEP EVGTTQE VPSQQ GS
Subjt:  DSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.2e-13759.73Show/hide
Query:  FASGKWLAKDKSVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQ
        FA    +  + ++SIKPIPEL QATFDTLKFYKDNFP+GRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALK VQ
Subjt:  FASGKWLAKDKSVSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQ

Query:  SSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARM
        SSDP T  VDQNAAQDQ GPSSAAPTPVIELDST ERSREKRSRSESEALDVSPLREVR                                         
Subjt:  SSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARM

Query:  GGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKG
                                                                                                            
Subjt:  GGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKG

Query:  ELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFD
                        EAKAELLKREDERHKAHL+AAHAITKGLEKEKFQLLKEKDDMLQALE KDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFD
Subjt:  ELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFD

Query:  GFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGSQEVNL
        GFAKDFSDA FKFLMKGIAAD+PHL+VDLGDLKK                        RDLDSDYS+LDEDEVPSQEPTEVGTTQEGVPSQQ+GSQEVNL
Subjt:  GFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGSQEVNL

Query:  LGSQGELSSHLGSS
        LGSQGELSSHLGSS
Subjt:  LGSQGELSSHLGSS

A0A6J1CR42 uncharacterized protein LOC1110138263.6e-10573.88Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +E ELL VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGKWLAKDKS--------------VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTS
        KWF+ASG+WLAKD+S              VSI+P+PEL QA+FDTLK+YK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE SRPNS LAMVC F S
Subjt:  KWFFASGKWLAKDKS--------------VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTS

Query:  SVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALD
         VKRKSKGRAHAL+  QSS PPT  V        VGP+S  P PVIEL+S+   SREKR R ++EA+D
Subjt:  SVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALD

A0A6J1DF31 uncharacterized protein LOC1110199091.6e-12176.83Show/hide
Query:  MGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLK
        MGGT DV+ RFRMEP SSGVKDQVSRISA CLDRCL+RASKFVSDPGSVLQRTID+A EAF ASIHSA+M+KAELDGRE LAAKERENSSAA EAATTLK
Subjt:  MGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLK

Query:  GELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDF
        GELLKA+ EV ILRAEV+AKAELLK+E E+HKAHL+AAHAITKGLEKEKFQLLKEKDD+ Q LEGKD +IGRL AELK  KERLTNG+LLE +FRQH DF
Subjt:  GELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDF

Query:  DGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGSQEVN
        DGFAKDFSDA FKFLMKGIAADMPHLQ+DL +LKK                        R+LDSDYS+++E++ PSQEP E+GTTQE VPSQQ+GSQEVN
Subjt:  DGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGSQEVN

Query:  LLGSQGELSSHLGSS
        LLGS+GELSSHLGSS
Subjt:  LLGSQGELSSHLGSS

A0A6J1DXS5 uncharacterized protein LOC1110255028.2e-13474.31Show/hide
Query:  DSGEVLARQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAI
        DS    + QGLEYPSR+PEHYLG LRRGF IP +ILLR+PEEGERADNPPEGWVTLY KMFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAI
Subjt:  DSGEVLARQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAI

Query:  LFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKS--------------VSIKPIPELDQA
        LFWLRARD +E EL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASG+WLAKD+S              VSI+P+PEL QA
Subjt:  LFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKS--------------VSIKPIPELDQA

Query:  TFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAA
        +FDTLK+YK+ FP+GRK+GTLVTD+LLLESGLLDYNP VRPIE+SRPNSELAMVCGF S VKRKSKGRAHAL+  QSS P T  V        VGP+S  
Subjt:  TFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAA

Query:  PTPVIELDSTRERSREKRSRSESEALD
        P  VIEL+S+   SREKR R ++EA+D
Subjt:  PTPVIELDSTRERSREKRSRSESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256652.1e-20674.72Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKS--------------VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASG+WLAKD+S              VSIK IPEL QATFDTLK YKD+FP+ RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKS--------------VSIKPIPELDQATFDTLKFYKDNFPKGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREG
        LVR IEASRPNSELAMVCGFT SVKRKSKGRAHALKTV  ++P T  V +  AQ   GPSSA PTPVIELD +  RS EKRSR ESEALDVSPL EVR  
Subjt:  LVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSREKRSRSESEALDVSPLREVREG

Query:  SPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARMGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFT
        SPL+RR+KKKK ++SSE G RG LP+SHADL+DDPEARM GTS+V+MRF MEP SSGVKDQVSRISA CLDR LRRASKFVSDPGSVLQRTID+  EAF 
Subjt:  SPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARMGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVLQRTIDHAVEAFT

Query:  ASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQA
        ASIH AVM+KAELDGRE LAAKERENS AA EAATTLKGELLKA+ EVDILRAEV+AK +LLK+E E+HKAHL+AAHAITKGLEKEKFQLLKEKDD+ Q 
Subjt:  ASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDMLQA

Query:  LEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDL
        LE KDA+IGRL  ELK  KERLTNG LLE +FRQHPDFDGFAKDFSDA FKFLMKGIAADMPHLQ+DL  LKK                        R+L
Subjt:  LEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKK------------------------RDL

Query:  DSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGS
        DSDYS+++E++ PSQEP EVGTTQE VPSQQ GS
Subjt:  DSDYSELDEDEVPSQEPTEVGTTQEGVPSQQNGS

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic1.4e-0528.57Show/hide
Query:  SRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYG--LRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEV
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+     E++    +A +Q+       +  + I  +     E E 
Subjt:  SRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYG--LRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEV

Query:  ELLSVDQLLGCFEAKRIAK-KPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFAS
        E +++  L    E +R+ K +  RYY+   KG   I   P+  + +   +FF +
Subjt:  ELLSVDQLLGCFEAKRIAK-KPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFAS

Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related1.3e-0928.48Show/hide
Query:  IPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRI
        +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F   F     +A +Q+       I   A L  L AR       LSV+ +       ++
Subjt:  IPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRI

Query:  AKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFAS-GKWLAKDKSVSIKPIPELDQA
          K G++Y+ + +G   +  GP+  + W+G +F+A   + L +D SV  +    LD A
Subjt:  AKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFAS-GKWLAKDKSVSIKPIPELDQA

AT2G15420.1 myosin heavy chain-related3.5e-0423.41Show/hide
Query:  NIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKR
        N P +I L  P+  +R   PPEG++ LY   F   GL  PL  F  E+  R  +A +Q+          LAIL       E  +E +  D         R
Subjt:  NIPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKR

Query:  IAKKPGRYYMCARKGAGGIVKGPTS-IKGWVGKWFFASGKWLAKDK--SVSIKPIPELDQATFDTLKF---YKDNFPKGRKIGTLVTDKLLLESGLLDYN
        + + PG YY  A K    IV G  S I GW  ++FF      + +   +V +       +  F    F   + DN  + R++G      L   +G     
Subjt:  IAKKPGRYYMCARKGAGGIVKGPTS-IKGWVGKWFFASGKWLAKDK--SVSIKPIPELDQATFDTLKF---YKDNFPKGRKIGTLVTDKLLLESGLLDYN

Query:  PLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHV-VD-QNAAQDQVGPSSA--APTPVIELDSTRERSREKRSRSESEALDVSPLR
           RP    R    +  +  F              +  V+ S   T   +D +N   + +G  SA  + T  + +D + +R   +   ++ + +  S L 
Subjt:  PLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHV-VD-QNAAQDQVGPSSA--APTPVIELDSTRERSREKRSRSESEALDVSPLR

Query:  EVREGSPLKRRKKKKKATTSSEVGPRGP-LPSSHADLIDDPEARMGGTSDVKMRFRMEPLSSGVKDQVSRISAA-----CLDRCLRR------ASKFVSD
         +   +P K+R  +  A   S   P+ P   ++    +D   +      D+         ++   D VSRI  A      +DR + R      A K    
Subjt:  EVREGSPLKRRKKKKKATTSSEVGPRGP-LPSSHADLIDDPEARMGGTSDVKMRFRMEPLSSGVKDQVSRISAA-----CLDRCLRR------ASKFVSD

Query:  PGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLA---AKERENSSAAFEAATTLKGELLKARSEVDILRAEVEAKAELLKREDER-HKAHLQAAHAI
         G+  + +     +A  ++   A    AE +  + LA   A ERE S+   + ++ L  ++   +S VD  R ++EA  +    E  R  K+ ++   A 
Subjt:  PGSVLQRTIDHAVEAFTASIHSAVMIKAELDGRETLA---AKERENSSAAFEAATTLKGELLKARSEVDILRAEVEAKAELLKREDER-HKAHLQAAHAI

Query:  TKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLN-AELKAEKERLTNGA-LLEAAFRQHPDFDGFAKDFSDASFKFLMKGIA
         K  + +    L+     L+ L  K  AI      EL+  +  L NG   LE A     D D F +  + A    L+ GI+
Subjt:  TKGLEKEKFQLLKEKDDMLQALEGKDAAIGRLN-AELKAEKERLTNGA-LLEAAFRQHPDFDGFAKDFSDASFKFLMKGIA

AT5G38190.1 INVOLVED IN: biological_process unknown8.1e-0926.7Show/hide
Query:  IPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRI
        +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F   F     +A +Q+       I   A L  L AR       LSV+ +       ++
Subjt:  IPNDILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRI

Query:  AKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFAS-GKWLAKDKSV-SIKPIPELDQATFDTLKFYKDNFPKGRK
          K G++Y+ + +G   +   P+  + W+G +F+A   + L +D S  +++    +D      LK  K N  K +K
Subjt:  AKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFAS-GKWLAKDKSV-SIKPIPELDQATFDTLKFYKDNFPKGRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGTTTTCTTATCTTCCCCCTCCAGTAGTGATAGCCTGGGTAGTGTAGGTCGGACGATAAGTAGTTCGCCCCCCAAACCAAGTGACTCTGGGGAGGTCTTAGCTCG
CCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCATTATCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAA
GAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTATTTGAAAATGTTTGAGTACGGCCTCAGACTTCCTCTTCATCCCTTTGCTCAGGAGTTCTTAAACCGAACTGGA
CTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGTGATGAGGACGAGGTCGAGCTGCTAAGTGTTGA
TCAGCTCCTTGGGTGTTTTGAGGCTAAGAGGATAGCCAAAAAACCAGGTCGGTACTATATGTGTGCAAGGAAAGGCGCAGGTGGCATAGTCAAGGGGCCGACCTCCATCA
AGGGATGGGTAGGAAAGTGGTTCTTTGCCTCGGGAAAGTGGCTGGCAAAGGATAAGTCAGTATCGATCAAGCCGATTCCCGAGCTCGATCAAGCCACTTTTGACACCCTC
AAATTCTACAAGGACAACTTCCCAAAGGGCCGGAAGATCGGGACCTTGGTCACCGACAAACTGCTATTAGAATCTGGGCTATTGGACTACAATCCTTTAGTTCGTCCGAT
TGAAGCTTCGAGGCCAAACTCCGAGCTTGCCATGGTGTGTGGATTCACGAGCAGCGTGAAACGCAAGTCTAAGGGTCGTGCTCACGCCCTTAAGACAGTTCAGAGCTCTG
ATCCACCTACACATGTTGTGGATCAGAATGCAGCTCAAGACCAGGTTGGTCCATCTTCTGCAGCTCCAACTCCGGTGATTGAGTTGGATTCTACTAGGGAACGCTCCAGG
GAGAAGCGCTCGAGGAGCGAGTCCGAAGCCTTGGACGTGTCACCTCTTCGTGAGGTGAGAGAAGGCTCTCCTCTGAAGAGGAGAAAGAAAAAGAAGAAGGCCACCACCTC
CTCGGAGGTTGGACCTCGTGGCCCCCTGCCCTCAAGCCATGCCGACCTGATAGACGACCCTGAAGCTCGGATGGGGGGCACATCCGACGTGAAGATGCGGTTCAGAATGG
AACCGTTGAGCTCTGGGGTGAAAGACCAGGTGTCACGCATCTCGGCTGCCTGCTTGGATCGCTGTCTCAGGAGAGCCTCCAAGTTTGTGAGCGACCCAGGGTCCGTGCTG
CAGCGGACTATCGACCACGCCGTCGAGGCGTTCACTGCCTCCATCCATTCAGCAGTCATGATCAAGGCCGAGCTGGATGGAAGGGAGACCTTGGCAGCGAAGGAGAGGGA
GAACTCCTCTGCTGCCTTTGAGGCTGCCACTACGCTCAAGGGCGAGCTACTGAAGGCTCGGAGCGAGGTGGATATACTGAGGGCCGAGGTTGAAGCCAAAGCCGAGCTGC
TGAAGAGGGAGGATGAGAGGCATAAGGCCCACCTCCAAGCTGCCCACGCCATCACAAAAGGGTTGGAGAAGGAAAAGTTCCAACTCCTTAAGGAGAAGGACGACATGCTC
CAGGCCCTTGAAGGGAAGGACGCTGCAATTGGGCGTCTCAATGCTGAGCTGAAGGCGGAGAAGGAGCGCCTTACCAACGGGGCTCTCCTTGAAGCAGCCTTCAGGCAACA
CCCAGATTTTGATGGGTTTGCCAAAGATTTTAGCGACGCCAGCTTCAAATTTTTGATGAAGGGGATTGCTGCTGATATGCCACACCTCCAGGTTGACCTCGGCGATCTGA
AGAAGAGAGATCTGGACTCTGACTACTCCGAACTGGATGAAGACGAGGTCCCAAGTCAGGAACCTACTGAGGTCGGCACCACTCAAGAAGGAGTCCCTTCTCAGCAGAAC
GGATCTCAGGAGGTCAACCTTCTGGGGTCCCAGGGCGAGCTATCGTCCCACCTCGGAAGTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGTTTTCTTATCTTCCCCCTCCAGTAGTGATAGCCTGGGTAGTGTAGGTCGGACGATAAGTAGTTCGCCCCCCAAACCAAGTGACTCTGGGGAGGTCTTAGCTCG
CCAGGGTCTGGAGTACCCTTCTAGGATGCCCGAGCATTATCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAA
GAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTATTTGAAAATGTTTGAGTACGGCCTCAGACTTCCTCTTCATCCCTTTGCTCAGGAGTTCTTAAACCGAACTGGA
CTGGCTCCTGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGTGATGAGGACGAGGTCGAGCTGCTAAGTGTTGA
TCAGCTCCTTGGGTGTTTTGAGGCTAAGAGGATAGCCAAAAAACCAGGTCGGTACTATATGTGTGCAAGGAAAGGCGCAGGTGGCATAGTCAAGGGGCCGACCTCCATCA
AGGGATGGGTAGGAAAGTGGTTCTTTGCCTCGGGAAAGTGGCTGGCAAAGGATAAGTCAGTATCGATCAAGCCGATTCCCGAGCTCGATCAAGCCACTTTTGACACCCTC
AAATTCTACAAGGACAACTTCCCAAAGGGCCGGAAGATCGGGACCTTGGTCACCGACAAACTGCTATTAGAATCTGGGCTATTGGACTACAATCCTTTAGTTCGTCCGAT
TGAAGCTTCGAGGCCAAACTCCGAGCTTGCCATGGTGTGTGGATTCACGAGCAGCGTGAAACGCAAGTCTAAGGGTCGTGCTCACGCCCTTAAGACAGTTCAGAGCTCTG
ATCCACCTACACATGTTGTGGATCAGAATGCAGCTCAAGACCAGGTTGGTCCATCTTCTGCAGCTCCAACTCCGGTGATTGAGTTGGATTCTACTAGGGAACGCTCCAGG
GAGAAGCGCTCGAGGAGCGAGTCCGAAGCCTTGGACGTGTCACCTCTTCGTGAGGTGAGAGAAGGCTCTCCTCTGAAGAGGAGAAAGAAAAAGAAGAAGGCCACCACCTC
CTCGGAGGTTGGACCTCGTGGCCCCCTGCCCTCAAGCCATGCCGACCTGATAGACGACCCTGAAGCTCGGATGGGGGGCACATCCGACGTGAAGATGCGGTTCAGAATGG
AACCGTTGAGCTCTGGGGTGAAAGACCAGGTGTCACGCATCTCGGCTGCCTGCTTGGATCGCTGTCTCAGGAGAGCCTCCAAGTTTGTGAGCGACCCAGGGTCCGTGCTG
CAGCGGACTATCGACCACGCCGTCGAGGCGTTCACTGCCTCCATCCATTCAGCAGTCATGATCAAGGCCGAGCTGGATGGAAGGGAGACCTTGGCAGCGAAGGAGAGGGA
GAACTCCTCTGCTGCCTTTGAGGCTGCCACTACGCTCAAGGGCGAGCTACTGAAGGCTCGGAGCGAGGTGGATATACTGAGGGCCGAGGTTGAAGCCAAAGCCGAGCTGC
TGAAGAGGGAGGATGAGAGGCATAAGGCCCACCTCCAAGCTGCCCACGCCATCACAAAAGGGTTGGAGAAGGAAAAGTTCCAACTCCTTAAGGAGAAGGACGACATGCTC
CAGGCCCTTGAAGGGAAGGACGCTGCAATTGGGCGTCTCAATGCTGAGCTGAAGGCGGAGAAGGAGCGCCTTACCAACGGGGCTCTCCTTGAAGCAGCCTTCAGGCAACA
CCCAGATTTTGATGGGTTTGCCAAAGATTTTAGCGACGCCAGCTTCAAATTTTTGATGAAGGGGATTGCTGCTGATATGCCACACCTCCAGGTTGACCTCGGCGATCTGA
AGAAGAGAGATCTGGACTCTGACTACTCCGAACTGGATGAAGACGAGGTCCCAAGTCAGGAACCTACTGAGGTCGGCACCACTCAAGAAGGAGTCCCTTCTCAGCAGAAC
GGATCTCAGGAGGTCAACCTTCTGGGGTCCCAGGGCGAGCTATCGTCCCACCTCGGAAGTAGCTGA
Protein sequenceShow/hide protein sequence
MVVFLSSPSSSDSLGSVGRTISSSPPKPSDSGEVLARQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTG
LAPAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGKWLAKDKSVSIKPIPELDQATFDTL
KFYKDNFPKGRKIGTLVTDKLLLESGLLDYNPLVRPIEASRPNSELAMVCGFTSSVKRKSKGRAHALKTVQSSDPPTHVVDQNAAQDQVGPSSAAPTPVIELDSTRERSR
EKRSRSESEALDVSPLREVREGSPLKRRKKKKKATTSSEVGPRGPLPSSHADLIDDPEARMGGTSDVKMRFRMEPLSSGVKDQVSRISAACLDRCLRRASKFVSDPGSVL
QRTIDHAVEAFTASIHSAVMIKAELDGRETLAAKERENSSAAFEAATTLKGELLKARSEVDILRAEVEAKAELLKREDERHKAHLQAAHAITKGLEKEKFQLLKEKDDML
QALEGKDAAIGRLNAELKAEKERLTNGALLEAAFRQHPDFDGFAKDFSDASFKFLMKGIAADMPHLQVDLGDLKKRDLDSDYSELDEDEVPSQEPTEVGTTQEGVPSQQN
GSQEVNLLGSQGELSSHLGSS