; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g19730 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g19730
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionSWIM-type domain-containing protein
Genome locationchr6:15471961..15488055
RNA-Seq ExpressionMoc06g19730
SyntenyMoc06g19730
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK00436.1 uncharacterized protein E5676_scaffold169G00400 [Cucumis melo var. makuwa]6.3e-8542.73Show/hide
Query:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNA----MPLEESSVINSESENSGLVVASSAII
        M+F+NF +LIL EV  D S   +Q S+LL +G   +QTVV+I EDKDV WF +LVK   TR+PLVAH  A    M LE SSV++S  +N   ++ SS+ I
Subjt:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNA----MPLEESSVINSESENSGLVVASSAII

Query:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS
        + DF++  D+ ++S+S  F LKEKD+FA KE+LSK+F+++AIKN+F+FKT+RSN KS+EF+C Q+ C WYV+ASRY+G ELW +RKY+++HNC + ++Q+
Subjt:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS

Query:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------
        +H+Q S+S I +C+ +   FSS DRST  DI+ HMRT  GV VSY KAWRAK  V   L G+A +SYALIP FF+KLKE+NP                  
Subjt:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------

Query:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL
                    CR ++                            N+ + V    +  R  F                              +RPVGNH 
Subjt:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL

Query:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK
        +W S+ I+ N+LPP  KR AGRPRKQ+I SIG+
Subjt:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK

TYK22454.1 uncharacterized protein E5676_scaffold3009G00020 [Cucumis melo var. makuwa]7.0e-8442.26Show/hide
Query:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNA----MPLEESSVINSESENSGLVVASSAII
        M F+NF +LIL  V+ D S   +Q S+LL +G   +QTVV+I EDKDV WF +LVK   TR+PLVAH  A    M LE SSV++S  +N   ++ SS+I 
Subjt:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNA----MPLEESSVINSESENSGLVVASSAII

Query:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS
        + DF++  D+ ++S+SF F LKEKD+FA K++LSK+F+++AIKN+F+FKT++SN KS+EF+C Q+ C WYV+ASRY+G ELW +RKY+++HNC + ++Q+
Subjt:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS

Query:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------
        +H+Q S+S I +C+ +   FSS DRST  DI+ HMRT  GV VSY KAWRAK  V   L G+A +SYALIP FF+KLKE+NP                  
Subjt:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------

Query:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL
                    CR ++                            N+ + V    +  R  F                              +RPVGNH 
Subjt:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL

Query:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK
        +W S+ I+ N+LPP  KR AGRPRKQ+I SIG+
Subjt:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK

TYK28211.1 MuDR family transposase [Cucumis melo var. makuwa]2.0e-8642.73Show/hide
Query:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPL----VAHVNAMPLEESSVINSESENSGLVVASSAII
        M+F+NF +LIL EV+ D S   +Q S+LL +G   +QT V+I EDKDV WF +LVK   TR+PL    +AHV+ M LE SSV++S  +N   +++SS+ I
Subjt:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPL----VAHVNAMPLEESSVINSESENSGLVVASSAII

Query:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS
        + DF++  D+ ++S+S  F LKEKD+FA KE+LSK+F+++AIKN+F+FKT+RSN KS+EF+C Q+ C WYV+ASRY+G ELW +RKY+++HNC + ++Q+
Subjt:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS

Query:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------
        +H+Q SSS I +C+V+   FSS DRST  DI+ HMRT  GV VSY KAWRAK  V   L G+A +SYALIP FF+KLKE+NP                  
Subjt:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------

Query:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL
                    CR ++                            N+ + V    +  R  F                              +RPVGNH 
Subjt:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL

Query:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK
        +W S+ I+ N+LPP  KR AGRPRKQ+I SIG+
Subjt:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.7e-9081.02Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTG----------------------------EEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTG                            EEAELLDVDQLLACFEAKRI KKPGRF+MCARKGA  IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTG----------------------------EEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVR

Query:  KWFYASGEWLEKDESGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWL KDESGRSFFDVPTRFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLES LLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLEKDESGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAA
         VKRKSKGRAHALEAA
Subjt:  NVKRKSKGRAHALEAA

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.1e-14083.6Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGKDSDAFTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDG+DSDA TSGQGLEYPSRIPEHYLGSLRR FAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGKDSDAFTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTG----------------------------EEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTG                            EEAEL DVDQLLACFEAKRI KKPGRF+MCARKGA  IVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTG----------------------------EEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYA

Query:  SGEWLEKDESGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWL KDESGRSFFDVPTRFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLES LLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLEKDESGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAA
        SKGRAHALEAA
Subjt:  SKGRAHALEAA

TrEMBL top hitse value%identityAlignment
A0A5D3C8T8 SWIM-type domain-containing protein3.1e-8542.73Show/hide
Query:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNA----MPLEESSVINSESENSGLVVASSAII
        M+F+NF +LIL EV  D S   +Q S+LL +G   +QTVV+I EDKDV WF +LVK   TR+PLVAH  A    M LE SSV++S  +N   ++ SS+ I
Subjt:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNA----MPLEESSVINSESENSGLVVASSAII

Query:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS
        + DF++  D+ ++S+S  F LKEKD+FA KE+LSK+F+++AIKN+F+FKT+RSN KS+EF+C Q+ C WYV+ASRY+G ELW +RKY+++HNC + ++Q+
Subjt:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS

Query:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------
        +H+Q S+S I +C+ +   FSS DRST  DI+ HMRT  GV VSY KAWRAK  V   L G+A +SYALIP FF+KLKE+NP                  
Subjt:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------

Query:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL
                    CR ++                            N+ + V    +  R  F                              +RPVGNH 
Subjt:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL

Query:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK
        +W S+ I+ N+LPP  KR AGRPRKQ+I SIG+
Subjt:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK

A0A5D3DFS5 SWIM-type domain-containing protein3.4e-8442.26Show/hide
Query:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNA----MPLEESSVINSESENSGLVVASSAII
        M F+NF +LIL  V+ D S   +Q S+LL +G   +QTVV+I EDKDV WF +LVK   TR+PLVAH  A    M LE SSV++S  +N   ++ SS+I 
Subjt:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNA----MPLEESSVINSESENSGLVVASSAII

Query:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS
        + DF++  D+ ++S+SF F LKEKD+FA K++LSK+F+++AIKN+F+FKT++SN KS+EF+C Q+ C WYV+ASRY+G ELW +RKY+++HNC + ++Q+
Subjt:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS

Query:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------
        +H+Q S+S I +C+ +   FSS DRST  DI+ HMRT  GV VSY KAWRAK  V   L G+A +SYALIP FF+KLKE+NP                  
Subjt:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------

Query:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL
                    CR ++                            N+ + V    +  R  F                              +RPVGNH 
Subjt:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL

Query:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK
        +W S+ I+ N+LPP  KR AGRPRKQ+I SIG+
Subjt:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK

A0A5D3DX57 MuDR family transposase9.5e-8742.73Show/hide
Query:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPL----VAHVNAMPLEESSVINSESENSGLVVASSAII
        M+F+NF +LIL EV+ D S   +Q S+LL +G   +QT V+I EDKDV WF +LVK   TR+PL    +AHV+ M LE SSV++S  +N   +++SS+ I
Subjt:  MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPL----VAHVNAMPLEESSVINSESENSGLVVASSAII

Query:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS
        + DF++  D+ ++S+S  F LKEKD+FA KE+LSK+F+++AIKN+F+FKT+RSN KS+EF+C Q+ C WYV+ASRY+G ELW +RKY+++HNC + ++Q+
Subjt:  ERDFEVYNDVDITSMSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQS

Query:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------
        +H+Q SSS I +C+V+   FSS DRST  DI+ HMRT  GV VSY KAWRAK  V   L G+A +SYALIP FF+KLKE+NP                  
Subjt:  SHRQVSSSFIGECMVEKFRFSSSDRSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNP------------------

Query:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL
                    CR ++                            N+ + V    +  R  F                              +RPVGNH 
Subjt:  ----------VICRIDL----------------------------NSYSYVSEYYLSSRLLFTYN-------------------------GSIRPVGNHL

Query:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK
        +W S+ I+ N+LPP  KR AGRPRKQ+I SIG+
Subjt:  DWGSVAIDVNVLPPIIKRPAGRPRKQKIPSIGK

A0A6J1CR42 uncharacterized protein LOC1110138268.3e-9181.02Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTG----------------------------EEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTG                            EEAELLDVDQLLACFEAKRI KKPGRF+MCARKGA  IVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTG----------------------------EEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVR

Query:  KWFYASGEWLEKDESGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWL KDESGRSFFDVPTRFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLES LLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLEKDESGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAA
         VKRKSKGRAHALEAA
Subjt:  NVKRKSKGRAHALEAA

A0A6J1DXS5 uncharacterized protein LOC1110255021.5e-14083.6Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGKDSDAFTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDG+DSDA TSGQGLEYPSRIPEHYLGSLRR FAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGKDSDAFTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTG----------------------------EEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTG                            EEAEL DVDQLLACFEAKRI KKPGRF+MCARKGA  IVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTG----------------------------EEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYA

Query:  SGEWLEKDESGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWL KDESGRSFFDVPTRFGNLVSI+PVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLES LLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLEKDESGRSFFDVPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAA
        SKGRAHALEAA
Subjt:  SKGRAHALEAA

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic8.0e-0627.4Show/hide
Query:  SRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEF----------------------LFRTGEEAELLDVD
        S   E  L  L+ RF +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E+                      L R+ E    + + 
Subjt:  SRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEF----------------------LFRTGEEAELLDVD

Query:  QLLACFEAKRITK-KPGRFHMCARKGADSIVKGPTSIKGWVRKWFY-ASGEWLEKDESGRSFFDVPTRFG----NLVSIQPVPELTQASFDTLKYYK---
         L    E +R+ K +  R+++   KG   I   P+  + +   +F+ A  + + +D  G     V TR+G     L  ++P+P+   ++F  L   K   
Subjt:  QLLACFEAKRITK-KPGRFHMCARKGADSIVKGPTSIKGWVRKWFY-ASGEWLEKDESGRSFFDVPTRFG----NLVSIQPVPELTQASFDTLKYYK---

Query:  -EHFPRGR
         +HF R R
Subjt:  -EHFPRGR

Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related1.1e-0522.54Show/hide
Query:  RFSDDGKDS-DAFTSGQGLEY------PSRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFL--------
        R++DD ++S D   SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F         
Subjt:  RFSDDGKDS-DAFTSGQGLEY------PSRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFL--------

Query:  ------FRTGEEAEL--------LDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYA
               RT    ++        L V+ +       ++  K G+ ++ + +G   +  GP+  + W+  +FYA
Subjt:  ------FRTGEEAEL--------LDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYA

AT2G15420.1 myosin heavy chain-related2.6e-0428.91Show/hide
Query:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFR---------------------TGEEAEL-LDVDQLLACFEAKRITKKPGRF
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R                      G E  + +D D         R+ + PG +
Subjt:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFR---------------------TGEEAEL-LDVDQLLACFEAKRITKKPGRF

Query:  HMCARKGADSIVKGPTS-IKGWVRKWFY
        +  A K    IV G  S I GW R++F+
Subjt:  HMCARKGADSIVKGPTS-IKGWVRKWFY

AT5G38190.1 INVOLVED IN: biological_process unknown1.2e-0421.97Show/hide
Query:  RFSDDGKDS-DAFTSGQGLEY------PSRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFL--------
        R++DD ++S D   SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F         
Subjt:  RFSDDGKDS-DAFTSGQGLEY------PSRIPEHYLGSLRRRFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFL--------

Query:  ------FRTGEEAEL--------LDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYA
               RT    ++        L V+ +       ++  K G+ ++ + +G   +   P+  + W+  +FYA
Subjt:  ------FRTGEEAEL--------LDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTTCAAAATTTTAGGAATTTGATATTGAATGAAGTGAAGTTAGATCCATCAATTTGTTTTGTCCAATTTTCAGTTTTATTAAAGTTTGGTAGTAATAGAGTTCA
AACTGTTGTTGAGATTAATGAAGATAAAGATGTTGCTTGGTTTTTTACTTTGGTTAAGGATGACAGCACAAGATATCCTTTAGTTGCTCATGTAAATGCTATGCCTTTGG
AGGAGTCTAGTGTTATTAATTCTGAAAGTGAGAATTCAGGTCTGGTTGTTGCTTCTTCTGCTATAATTGAAAGAGATTTTGAAGTTTATAACGATGTTGATATTACTAGT
ATGTCTTTTGCATTTTATCTGAAGGAGAAAGATCTATTTGCATATAAGGAAGTTTTGTCAAAAGCTTTTCACTTTATGGCTATAAAGAATAGCTTTCAGTTCAAGACTTT
GAGATCAAATTTCAAATCAGTTGAGTTTGAATGTGTTCAGAATGGCTGCCAATGGTATGTTAAAGCATCTCGTTATAGAGGTAGTGAACTATGGATGATTAGAAAGTATG
TTAGTGATCATAATTGTCGTATGACGATAGTTCAATCTTCTCATAGACAAGTTTCATCATCTTTTATTGGTGAATGTATGGTTGAGAAATTTAGATTTAGTTCTTCTGAT
CGTTCTACTCTGAAAGATATCATTCATCATATGCGCACAAACTATGGAGTTGGTGTTAGTTATAATAAAGCTTGGAGGGCAAAAACAGCTGTTAATAAATTGTTGACAGG
AGATGCTGATGATTCTTATGCATTGATTCCGAAGTTTTTTGTGAAGTTGAAAGAAATGAATCCAGTTATTTGTAGGATTGATTTGAATTCTTATTCATATGTGTCTGAGT
ATTATCTGTCTAGTAGATTGTTATTTACTTATAATGGTTCAATTCGTCCTGTCGGTAACCATTTGGATTGGGGATCTGTAGCAATTGATGTTAATGTATTACCTCCTATC
ATTAAACGTCCAGCAGGAAGGCCTCGAAAACAAAAGATTCCTTCAATTGGTAAGGATCATCTTTGCTCTTGTGAAGAGCAGTGTACCGGTTTGACTCCTACTGTGGAAAA
AGTCAGAATTTCAAGTTACAACCCAAGAGGTGCTTGGGAAGTGGATAATGCGTGGACAGAAACTCGACAAATTCATCGTGAGAACAATGATGCTCTGCCATCTGTAGAAA
TTGAATGCACAAAAACTGGAACAATAATTCCAAATCCAACCCATGAAAAGACATCTCGAGACTCTTGGAAGTCGTTAGAAGACCTCTATGGAGCAGCCAACAAGGCTTCA
CTCAAGATGAGTGATTACATTTCTACTATGAAACTGGTTGATTGTTTAACTCTTGCAAGTGAACCCATTAGTATTGCAGCTCGAACTCGGCCTCCGAACCGATCTGAATA
CTTGGGCGGACCTGCACAAAAAGGTGAGCACTTCGACGATCAAGTCAGTATAGGGTATTCTTTCCCCCAAACATTGGCCCCCTCTCTGTCTGATCCGATCTCGACCTGGC
AGAGAAGTTCATTTGATTTGCTTCGGACATGTGGCGACTCCCTATTCGTGGGAAAACATAACCGTTGCGGTGGATTTATCATCGCAATATTCAAATATTCCGACGCTTCG
GACCTCCGGGAGGATCCTAGCCGCTCGTTGATTACACATCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTTTTTCGAACGTAGTTGCCATGTCGTCCTCTTT
TAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGAAGGATAGTGATGCCTTCA
CCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGAGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAG
GGGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCCGGCTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCG
GACTGGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAACTAAGAAGCCTGGTCGGTTCCATATGTGCGCAAGGAAAGGCG
CAGACAGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGAAAGTGGTTCTACGCTTCTGGGGAATGGCTTGAAAAGGACGAGTCAGGTCGTTCCTTCTTTGAC
GTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCAACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAA
GGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCAGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGG
TTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTTTCAAAATTTTAGGAATTTGATATTGAATGAAGTGAAGTTAGATCCATCAATTTGTTTTGTCCAATTTTCAGTTTTATTAAAGTTTGGTAGTAATAGAGTTCA
AACTGTTGTTGAGATTAATGAAGATAAAGATGTTGCTTGGTTTTTTACTTTGGTTAAGGATGACAGCACAAGATATCCTTTAGTTGCTCATGTAAATGCTATGCCTTTGG
AGGAGTCTAGTGTTATTAATTCTGAAAGTGAGAATTCAGGTCTGGTTGTTGCTTCTTCTGCTATAATTGAAAGAGATTTTGAAGTTTATAACGATGTTGATATTACTAGT
ATGTCTTTTGCATTTTATCTGAAGGAGAAAGATCTATTTGCATATAAGGAAGTTTTGTCAAAAGCTTTTCACTTTATGGCTATAAAGAATAGCTTTCAGTTCAAGACTTT
GAGATCAAATTTCAAATCAGTTGAGTTTGAATGTGTTCAGAATGGCTGCCAATGGTATGTTAAAGCATCTCGTTATAGAGGTAGTGAACTATGGATGATTAGAAAGTATG
TTAGTGATCATAATTGTCGTATGACGATAGTTCAATCTTCTCATAGACAAGTTTCATCATCTTTTATTGGTGAATGTATGGTTGAGAAATTTAGATTTAGTTCTTCTGAT
CGTTCTACTCTGAAAGATATCATTCATCATATGCGCACAAACTATGGAGTTGGTGTTAGTTATAATAAAGCTTGGAGGGCAAAAACAGCTGTTAATAAATTGTTGACAGG
AGATGCTGATGATTCTTATGCATTGATTCCGAAGTTTTTTGTGAAGTTGAAAGAAATGAATCCAGTTATTTGTAGGATTGATTTGAATTCTTATTCATATGTGTCTGAGT
ATTATCTGTCTAGTAGATTGTTATTTACTTATAATGGTTCAATTCGTCCTGTCGGTAACCATTTGGATTGGGGATCTGTAGCAATTGATGTTAATGTATTACCTCCTATC
ATTAAACGTCCAGCAGGAAGGCCTCGAAAACAAAAGATTCCTTCAATTGGTAAGGATCATCTTTGCTCTTGTGAAGAGCAGTGTACCGGTTTGACTCCTACTGTGGAAAA
AGTCAGAATTTCAAGTTACAACCCAAGAGGTGCTTGGGAAGTGGATAATGCGTGGACAGAAACTCGACAAATTCATCGTGAGAACAATGATGCTCTGCCATCTGTAGAAA
TTGAATGCACAAAAACTGGAACAATAATTCCAAATCCAACCCATGAAAAGACATCTCGAGACTCTTGGAAGTCGTTAGAAGACCTCTATGGAGCAGCCAACAAGGCTTCA
CTCAAGATGAGTGATTACATTTCTACTATGAAACTGGTTGATTGTTTAACTCTTGCAAGTGAACCCATTAGTATTGCAGCTCGAACTCGGCCTCCGAACCGATCTGAATA
CTTGGGCGGACCTGCACAAAAAGGTGAGCACTTCGACGATCAAGTCAGTATAGGGTATTCTTTCCCCCAAACATTGGCCCCCTCTCTGTCTGATCCGATCTCGACCTGGC
AGAGAAGTTCATTTGATTTGCTTCGGACATGTGGCGACTCCCTATTCGTGGGAAAACATAACCGTTGCGGTGGATTTATCATCGCAATATTCAAATATTCCGACGCTTCG
GACCTCCGGGAGGATCCTAGCCGCTCGTTGATTACACATCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTTTTTCGAACGTAGTTGCCATGTCGTCCTCTTT
TAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGAAGGATAGTGATGCCTTCA
CCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGAGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAG
GGGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCCGGCTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCG
GACTGGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAACTAAGAAGCCTGGTCGGTTCCATATGTGCGCAAGGAAAGGCG
CAGACAGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGAAAGTGGTTCTACGCTTCTGGGGAATGGCTTGAAAAGGACGAGTCAGGTCGTTCCTTCTTTGAC
GTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCAACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAA
GGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCAGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGG
TTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCTAG
Protein sequenceShow/hide protein sequence
MNFQNFRNLILNEVKLDPSICFVQFSVLLKFGSNRVQTVVEINEDKDVAWFFTLVKDDSTRYPLVAHVNAMPLEESSVINSESENSGLVVASSAIIERDFEVYNDVDITS
MSFAFYLKEKDLFAYKEVLSKAFHFMAIKNSFQFKTLRSNFKSVEFECVQNGCQWYVKASRYRGSELWMIRKYVSDHNCRMTIVQSSHRQVSSSFIGECMVEKFRFSSSD
RSTLKDIIHHMRTNYGVGVSYNKAWRAKTAVNKLLTGDADDSYALIPKFFVKLKEMNPVICRIDLNSYSYVSEYYLSSRLLFTYNGSIRPVGNHLDWGSVAIDVNVLPPI
IKRPAGRPRKQKIPSIGKDHLCSCEEQCTGLTPTVEKVRISSYNPRGAWEVDNAWTETRQIHRENNDALPSVEIECTKTGTIIPNPTHEKTSRDSWKSLEDLYGAANKAS
LKMSDYISTMKLVDCLTLASEPISIAARTRPPNRSEYLGGPAQKGEHFDDQVSIGYSFPQTLAPSLSDPISTWQRSSFDLLRTCGDSLFVGKHNRCGGFIIAIFKYSDAS
DLREDPSRSLITHLEPLVGRSLPSLSFSNVVAMSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGKDSDAFTSGQGLEYPSRIPEHYLGSLRRRFAIPENILLRIPEE
GERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGEEAELLDVDQLLACFEAKRITKKPGRFHMCARKGADSIVKGPTSIKGWVRKWFYASGEWLEKDESGRSFFD
VPTRFGNLVSIQPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAA