; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g15570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g15570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr4:11696342..11700391
RNA-Seq ExpressionMoc04g15570
SyntenyMoc04g15570
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.1e-10651.08Show/hide
Query:  RRRKKKKTTSPLEAGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQ
        +RRKKKK  S  E GA  VLPA F DRVDDP ARMGGTSDVTARFR+EPSSSGVRDQVSRISAASLDRCLRRASK VS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKTTSPLEAGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQ

Query:  SALAVKAELDGREVMAAREKEEFSAALEAASSAMKDELLKAHSEVGILKAEVET----------------------------------------------
        SALAVKAELDGREV+AAREKEEFSAALEAASS MKDELLKAHSEV  LKAEVE+                                              
Subjt:  SALAVKAELDGREVMAAREKEEFSAALEAASSAMKDELLKAHSEVGILKAEVET----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------KAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVK
                                              KAELLK+E++R KA LRAA+AITKGLEKEKFQLLKEKDDMLQALE K+  +    AEL+  K
Subjt:  --------------------------------------KAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVK

Query:  ERLSNGALLEESFRQHPDFDGFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------Q
        ERL+NGALLE +FRQHPDFDGF KDFSDAGFKFLMKGIA+D+  L++DLG LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDYSDL+ED        +
Subjt:  ERLSNGALLEESFRQHPDFDGFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------Q

Query:  VGTTQEGVP
        VGTTQEGVP
Subjt:  VGTTQEGVP

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.3e-11179.12Show/hide
Query:  MFEYGLRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDRLLACFEAKMIAKKPGRFYMCARKGAG--------------
        MFEYGLR  LHPFVQEFLFRTGLAPAQVAPNGWGV FAL ILFWLRARD+EEAELLDVD+LLACFEAK IAKKPGRFYMCARKGAG              
Subjt:  MFEYGLRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDRLLACFEAKMIAKKPGRFYMCARKGAG--------------

Query:  ---------------GRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFAC
                       GRSFFDVPTRFGNLVS+RPVPELTQASFDTLKYYK+ FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS L MVC FA 
Subjt:  ---------------GRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFAC

Query:  NVKRKSKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGR+HALEAAQSSKP TPAVVG ASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.3e-12385.96Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALEAASSAMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASK VS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREV+AAREKEEFSAALE ASS MKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALEAASSAMKD

Query:  ELLKAHSEVGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFD
        ELLKAHSEV  LKAEVE++AELLKKEEDRR+AQLRAA+AIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFD

Query:  GFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGVPQAGS
        GF KDFSDAGFKFLMKGIASDM DLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEG    GS
Subjt:  GFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGVPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.7e-16083.38Show/hide
Query:  MSSSFSSNLGFDEDLARRLKSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRIPDEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRL+S+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGF+IPENILLR+P+EGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGFDEDLARRLKSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRIPDEGERADNPPEGWVTLYFKMFEYG

Query:  LRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDRLLACFEAKMIAKKPGRFYMCARKGAG-------------------
        LR  LHPFVQEFLFRTGLAPAQVAPNGWGV FAL ILFWLRARD+EEAEL DVD+LLACFEAK IAKKPGRFYMCARKGAG                   
Subjt:  LRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDRLLACFEAKMIAKKPGRFYMCARKGAG-------------------

Query:  ----------GRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFACNVKRK
                  GRSFFDVPTRFGNLVS+RPVPELTQASFDTLKYYK+ FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL MVCGFA  VKRK
Subjt:  ----------GRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFACNVKRK

Query:  SKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGR+HALEAAQSSKPATPAVVG ASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.9e-17669.18Show/hide
Query:  KGAGGRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFACNVKRKSKGRSH
        K   GR+FFDVPTRFGNLVS++ +PEL QA+FDTLK+YKDHFPR RK+ TLVTDKLLLESGLLDYNP VR IE+SRPNSEL MVCGF  +VKRKSKGR+H
Subjt:  KGAGGRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFACNVKRKSKGRSH

Query:  ALEAAQSSKPATPAV--------VGLASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEAGARGVLPASFTDRV
        AL+    ++P TP V         G +S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E PL+RRRKKKKT+S  EAGARG LP S  D V
Subjt:  ALEAAQSSKPATPAV--------VGLASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEAGARGVLPASFTDRV

Query:  DDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALE
        DDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASK VSDPGSVLQRTID  AEAF+ASI  A+ VKAELDGRE +AA+E+E   AALE
Subjt:  DDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALE

Query:  AASSAMKDELLKAHSEVGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEES
        AA++ +K ELLKA  EV IL+AEV+ K +LLKKE ++ KA LRAA+AITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KERL+NG LLEES
Subjt:  AASSAMKDELLKAHSEVGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEES

Query:  FRQHPDFDGFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGVP--Q
        FRQHPDFDGF KDFSDAGFKFLMKGIA+DM  LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+EE+        +VGTTQE VP  Q
Subjt:  FRQHPDFDGFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGVP--Q

Query:  AGS
         GS
Subjt:  AGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124675.3e-10751.08Show/hide
Query:  RRRKKKKTTSPLEAGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQ
        +RRKKKK  S  E GA  VLPA F DRVDDP ARMGGTSDVTARFR+EPSSSGVRDQVSRISAASLDRCLRRASK VS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKTTSPLEAGARGVLPASFTDRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQ

Query:  SALAVKAELDGREVMAAREKEEFSAALEAASSAMKDELLKAHSEVGILKAEVET----------------------------------------------
        SALAVKAELDGREV+AAREKEEFSAALEAASS MKDELLKAHSEV  LKAEVE+                                              
Subjt:  SALAVKAELDGREVMAAREKEEFSAALEAASSAMKDELLKAHSEVGILKAEVET----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------KAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVK
                                              KAELLK+E++R KA LRAA+AITKGLEKEKFQLLKEKDDMLQALE K+  +    AEL+  K
Subjt:  --------------------------------------KAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVK

Query:  ERLSNGALLEESFRQHPDFDGFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------Q
        ERL+NGALLE +FRQHPDFDGF KDFSDAGFKFLMKGIA+D+  L++DLG LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDYSDL+ED        +
Subjt:  ERLSNGALLEESFRQHPDFDGFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------Q

Query:  VGTTQEGVP
        VGTTQEGVP
Subjt:  VGTTQEGVP

A0A6J1CR42 uncharacterized protein LOC1110138261.6e-11179.12Show/hide
Query:  MFEYGLRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDRLLACFEAKMIAKKPGRFYMCARKGAG--------------
        MFEYGLR  LHPFVQEFLFRTGLAPAQVAPNGWGV FAL ILFWLRARD+EEAELLDVD+LLACFEAK IAKKPGRFYMCARKGAG              
Subjt:  MFEYGLRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDRLLACFEAKMIAKKPGRFYMCARKGAG--------------

Query:  ---------------GRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFAC
                       GRSFFDVPTRFGNLVS+RPVPELTQASFDTLKYYK+ FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS L MVC FA 
Subjt:  ---------------GRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFAC

Query:  NVKRKSKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGR+HALEAAQSSKP TPAVVG ASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

A0A6J1D971 uncharacterized protein LOC1110185386.3e-12485.96Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALEAASSAMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASK VS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREV+AAREKEEFSAALE ASS MKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALEAASSAMKD

Query:  ELLKAHSEVGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFD
        ELLKAHSEV  LKAEVE++AELLKKEEDRR+AQLRAA+AIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFD

Query:  GFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGVPQAGS
        GF KDFSDAGFKFLMKGIASDM DLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEG    GS
Subjt:  GFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGVPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255028.4e-16183.38Show/hide
Query:  MSSSFSSNLGFDEDLARRLKSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRIPDEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRL+S+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGF+IPENILLR+P+EGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGFDEDLARRLKSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRIPDEGERADNPPEGWVTLYFKMFEYG

Query:  LRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDRLLACFEAKMIAKKPGRFYMCARKGAG-------------------
        LR  LHPFVQEFLFRTGLAPAQVAPNGWGV FAL ILFWLRARD+EEAEL DVD+LLACFEAK IAKKPGRFYMCARKGAG                   
Subjt:  LRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDRLLACFEAKMIAKKPGRFYMCARKGAG-------------------

Query:  ----------GRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFACNVKRK
                  GRSFFDVPTRFGNLVS+RPVPELTQASFDTLKYYK+ FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL MVCGFA  VKRK
Subjt:  ----------GRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFACNVKRK

Query:  SKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGR+HALEAAQSSKPATPAVVG ASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.4e-17669.18Show/hide
Query:  KGAGGRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFACNVKRKSKGRSH
        K   GR+FFDVPTRFGNLVS++ +PEL QA+FDTLK+YKDHFPR RK+ TLVTDKLLLESGLLDYNP VR IE+SRPNSEL MVCGF  +VKRKSKGR+H
Subjt:  KGAGGRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFACNVKRKSKGRSH

Query:  ALEAAQSSKPATPAV--------VGLASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEAGARGVLPASFTDRV
        AL+    ++P TP V         G +S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E PL+RRRKKKKT+S  EAGARG LP S  D V
Subjt:  ALEAAQSSKPATPAV--------VGLASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEAGARGVLPASFTDRV

Query:  DDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALE
        DDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASK VSDPGSVLQRTID  AEAF+ASI  A+ VKAELDGRE +AA+E+E   AALE
Subjt:  DDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALE

Query:  AASSAMKDELLKAHSEVGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEES
        AA++ +K ELLKA  EV IL+AEV+ K +LLKKE ++ KA LRAA+AITKGLEKEKFQLLKEKDD+ Q LE K+  +   T EL+ +KERL+NG LLEES
Subjt:  AASSAMKDELLKAHSEVGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEES

Query:  FRQHPDFDGFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGVP--Q
        FRQHPDFDGF KDFSDAGFKFLMKGIA+DM  LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+EE+        +VGTTQE VP  Q
Subjt:  FRQHPDFDGFGKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGVP--Q

Query:  AGS
         GS
Subjt:  AGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42060.1 myosin heavy chain-related8.4e-0426.06Show/hide
Query:  SRIPEHYLGSLRRGFSIPENILLRIPDEGERADNPPEGWVTLYFKMF-EYGLRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAE
        SR    + G        PE +   IP+  +R  + PEG++ L+   F E GL F L  F+  +  R  +A +Q++         LVIL        EE  
Subjt:  SRIPEHYLGSLRRGFSIPENILLRIPDEGERADNPPEGWVTLYFKMF-EYGLRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAE

Query:  LLDVDRLLACFEAKMIAKKPGRFYMCARKGAGGRSFFDVPTR
        ++D+D         +  K   R  +CA    G + F+   +R
Subjt:  LLDVDRLLACFEAKMIAKKPGRFYMCARKGAGGRSFFDVPTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCAGTTCATAGATGGATCCACCAAACCTGAAACCAAAAAACTCAATGCCGGTGCGGTGTCCCGCCTAATATACTCCGGCGCTCAAGTTAGTAATCAGAGTAAAAA
TAAGAATAAGACACCAGAGGGTTCGATGTTTTTCACTTACTTGGTGTGGAGGATCGCCCTTCCTTTTATGGCCCTGAAGGACCGACCAAGAAGGACAACAGTTGGAAGTG
GTAGGGACATCGCACGCGGTACGGTAGGACAGAGAGGTGCACATCCCCGACAGTGCAGAGTAGACTCGCTACCTGTGTTACCTGAGCATGTCGCCCAGGGCTGTTGGATG
TGCCCATCATACGCCTTCTGCTTGGTGATATCTCCAATGATAATTGATTCCCAGTGGGCCCGCGTCCGTGAAGGGGCGAACACGTGTCCTATTCGGAAGCTATCCATAAC
AATTGCAGCTCGAACTCGGCTTCCAGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAG
TTCGAGGACCGTCGGTTACACCCGGGGTCATCCGCGTGTCCAGGGTATTCTCTTTCCCAAACATTGGCCCCTCTCTGTCTGGTCTGATCTCGACCTGGCAGAGAAGTTCG
TTCGACTTGCTTTGGATGCGTGGCGACTTCCTATTCGTGGGAAAATATAACTGTTGCAGTAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGA
GGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCGCTTCCCTCACTTTCTCTTTCGAACGTGATTGCCATGTCGTCCTCTTTTAGCAGCAACT
TAGGATTCGATGAGGATTTAGCTCGTAGGTTAAAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAG
GGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCTCTATCCCTGAGAACATCCTCCTTAGGATTCCGGATGAGGGGGAGAGAGC
TGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGATTTACCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGTTGG
CTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCACTTTCGCTTTGGTCATCCTTTTTTGGCTACGAGCTCGGGATAATGAAGAGGCCGAGCTGTTAGACGTAGACCGG
CTCCTCGCGTGCTTCGAAGCGAAAATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTCGTTCCTTCTTTGACGTTCCCACTAGGTT
TGGGAACCTAGTTTCAGTCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGACCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTAG
TGACCGACAAGCTGTTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGTCATGGTTTGCGGATTTGCA
TGCAACGTGAAGCGCAAGTCCAAGGGCCGATCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCTGTGGTAGGGCTAGCCTCGGAAGATCCAGCCCC
AGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCC
CTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGCCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCACAGATCGGGTCGACGATCCTGAGGCCAGG
ATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGCGGGACCAGGTGTCTCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAG
GAGGGCGTCCAAATTGGTAAGTGACCCGGGGTCCGTTCTACAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCG
AGCTGGATGGGAGGGAAGTTATGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAG
GTGGGAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCAATGCTATAACCAAGGGCTTGGA
GAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTAGAGACGGTGAAGGAGC
GTCTCAGCAATGGAGCCTTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGGCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATT
GCTTCCGACATGTCTGACCTTCAGATCGATCTCGGTGGTCTAAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCCAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGA
TAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGTTCCTCAAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCCAGTTCATAGATGGATCCACCAAACCTGAAACCAAAAAACTCAATGCCGGTGCGGTGTCCCGCCTAATATACTCCGGCGCTCAAGTTAGTAATCAGAGTAAAAA
TAAGAATAAGACACCAGAGGGTTCGATGTTTTTCACTTACTTGGTGTGGAGGATCGCCCTTCCTTTTATGGCCCTGAAGGACCGACCAAGAAGGACAACAGTTGGAAGTG
GTAGGGACATCGCACGCGGTACGGTAGGACAGAGAGGTGCACATCCCCGACAGTGCAGAGTAGACTCGCTACCTGTGTTACCTGAGCATGTCGCCCAGGGCTGTTGGATG
TGCCCATCATACGCCTTCTGCTTGGTGATATCTCCAATGATAATTGATTCCCAGTGGGCCCGCGTCCGTGAAGGGGCGAACACGTGTCCTATTCGGAAGCTATCCATAAC
AATTGCAGCTCGAACTCGGCTTCCAGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAG
TTCGAGGACCGTCGGTTACACCCGGGGTCATCCGCGTGTCCAGGGTATTCTCTTTCCCAAACATTGGCCCCTCTCTGTCTGGTCTGATCTCGACCTGGCAGAGAAGTTCG
TTCGACTTGCTTTGGATGCGTGGCGACTTCCTATTCGTGGGAAAATATAACTGTTGCAGTAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGGGA
GGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCGCTTCCCTCACTTTCTCTTTCGAACGTGATTGCCATGTCGTCCTCTTTTAGCAGCAACT
TAGGATTCGATGAGGATTTAGCTCGTAGGTTAAAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAG
GGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCTCTATCCCTGAGAACATCCTCCTTAGGATTCCGGATGAGGGGGAGAGAGC
TGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGATTTACCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGTTGG
CTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCACTTTCGCTTTGGTCATCCTTTTTTGGCTACGAGCTCGGGATAATGAAGAGGCCGAGCTGTTAGACGTAGACCGG
CTCCTCGCGTGCTTCGAAGCGAAAATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTCGTTCCTTCTTTGACGTTCCCACTAGGTT
TGGGAACCTAGTTTCAGTCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGACCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTAG
TGACCGACAAGCTGTTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGTCATGGTTTGCGGATTTGCA
TGCAACGTGAAGCGCAAGTCCAAGGGCCGATCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCTGTGGTAGGGCTAGCCTCGGAAGATCCAGCCCC
AGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCC
CTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGCCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCACAGATCGGGTCGACGATCCTGAGGCCAGG
ATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGCGGGACCAGGTGTCTCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAG
GAGGGCGTCCAAATTGGTAAGTGACCCGGGGTCCGTTCTACAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCG
AGCTGGATGGGAGGGAAGTTATGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAG
GTGGGAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAGACGCAAGGCCCAGCTCCGAGCTGCCAATGCTATAACCAAGGGCTTGGA
GAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTAGAGACGGTGAAGGAGC
GTCTCAGCAATGGAGCCTTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGGCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATT
GCTTCCGACATGTCTGACCTTCAGATCGATCTCGGTGGTCTAAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCCAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGA
TAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGTTCCTCAAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSQFIDGSTKPETKKLNAGAVSRLIYSGAQVSNQSKNKNKTPEGSMFFTYLVWRIALPFMALKDRPRRTTVGSGRDIARGTVGQRGAHPRQCRVDSLPVLPEHVAQGCWM
CPSYAFCLVISPMIIDSQWARVREGANTCPIRKLSITIAARTRLPDRSEYLGGPAQKGEHSDDQVSIGRIPSLVRGPSVTPGVIRVSRVFSFPNIGPSLSGLISTWQRSS
FDLLWMRGDFLFVGKYNCCSRFIVGIFKYSDASDLREDPSRSLITRLEPLVGRSLPSLSLSNVIAMSSSFSSNLGFDEDLARRLKSELEEIENFRFSDDGEDSDASTSGQ
GLEYPSRIPEHYLGSLRRGFSIPENILLRIPDEGERADNPPEGWVTLYFKMFEYGLRFTLHPFVQEFLFRTGLAPAQVAPNGWGVTFALVILFWLRARDNEEAELLDVDR
LLACFEAKMIAKKPGRFYMCARKGAGGRSFFDVPTRFGNLVSVRPVPELTQASFDTLKYYKDHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFA
CNVKRKSKGRSHALEAAQSSKPATPAVVGLASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEAGARGVLPASFTDRVDDPEAR
MGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKLVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVMAAREKEEFSAALEAASSAMKDELLKAHSE
VGILKAEVETKAELLKKEEDRRKAQLRAANAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFGKDFSDAGFKFLMKGI
ASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGVPQAGS