; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g26860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g26860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr4:19737038..19739466
RNA-Seq ExpressionMoc04g26860
SyntenyMoc04g26860
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]3.0e-11252.46Show/hide
Query:  RRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEVGACRVLP  FADRVDDPAARMGGTSDVT RFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQ

Query:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEED--------Q
        ERL+NG LLE +FRQHPDFDGFAKDFSD GFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP  LV++YVRDLDSDYSD +ED        +
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEED--------Q

Query:  VGSTQEGAP
        VG+TQEG P
Subjt:  VGSTQEGAP

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.0e-13691.67Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGSIVKGPTSIKGWV
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEA       QLL CFEAKRI KKPGRF+MCARKGAG IVKGPTSIKGWV
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGSIVKGPTSIKGWV

Query:  RKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFA
        RKWFYASGEWLAKDESGRSFFDVPTRF NLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FA
Subjt:  RKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFA

Query:  SGVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPLKGEEA
        SGVKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIEL SSGGPSREKRPRDQTEAVDAQTEA DVP  GE A
Subjt:  SGVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPLKGEEA

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.7e-13492.98Show/hide
Query:  GTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   +  + RIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHA AELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS
        GFAKDFSD GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP+ LV+QYVRDLDSDYSDPEEDQVGSTQEGA   GS
Subjt:  GFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.1e-18594.35Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENVLLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPEN+LLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENVLLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGSIVKGPTSIKGWVRKWFYAS
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEA       QLL CFEAKRI KKPGRF+MCARKGAG IVKGPTSIKGWVRKWFYAS
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGSIVKGPTSIKGWVRKWFYAS

Query:  GEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKS
        GEWLAKDESGRSFFDVPTRF NLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKS
Subjt:  GEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKS

Query:  KGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVD
        KGRAHALEAAQSSKPATPAV GPASEDPA VIEL SSGGPSREKRPRDQTEAVD
Subjt:  KGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.4e-18868.14Show/hide
Query:  MCARKGAGSIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG G IVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRF NLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGSIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPL
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL  SGG S EKR R+++EA+D        PL
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPL

Query:  KGEEAREEAPLKRRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTI
           E R E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTI
Subjt:  KGEEAREEAPLKRRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTI

Query:  DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL
        D  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL
Subjt:  DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL

Query:  KEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEV
        KEKDD+ Q LE KD  +     EL+  KERL+NG LLEESFRQHPDFDGFAKDFSD GFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP P+ 
Subjt:  KEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEV

Query:  LVEQYVRDLDSDYSDPEED--------QVGSTQEGAP--QAGS
        LV++YVR+LDSDYSD EE+        +VG+TQE  P  Q GS
Subjt:  LVEQYVRDLDSDYSDPEED--------QVGSTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.5e-11252.46Show/hide
Query:  RRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEVGACRVLP  FADRVDDPAARMGGTSDVT RFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQ

Query:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEED--------Q
        ERL+NG LLE +FRQHPDFDGFAKDFSD GFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP  LV++YVRDLDSDYSD +ED        +
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEED--------Q

Query:  VGSTQEGAP
        VG+TQEG P
Subjt:  VGSTQEGAP

A0A6J1CR42 uncharacterized protein LOC1110138265.0e-13791.67Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGSIVKGPTSIKGWV
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEA       QLL CFEAKRI KKPGRF+MCARKGAG IVKGPTSIKGWV
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGSIVKGPTSIKGWV

Query:  RKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFA
        RKWFYASGEWLAKDESGRSFFDVPTRF NLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FA
Subjt:  RKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFA

Query:  SGVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPLKGEEA
        SGVKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIEL SSGGPSREKRPRDQTEAVDAQTEA DVP  GE A
Subjt:  SGVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPLKGEEA

A0A6J1D971 uncharacterized protein LOC1110185388.0e-13592.98Show/hide
Query:  GTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   +  + RIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHA AELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS
        GFAKDFSD GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP+ LV+QYVRDLDSDYSDPEEDQVGSTQEGA   GS
Subjt:  GFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255021.0e-18594.35Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENVLLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPEN+LLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENVLLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGSIVKGPTSIKGWVRKWFYAS
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEA       QLL CFEAKRI KKPGRF+MCARKGAG IVKGPTSIKGWVRKWFYAS
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGSIVKGPTSIKGWVRKWFYAS

Query:  GEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKS
        GEWLAKDESGRSFFDVPTRF NLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKS
Subjt:  GEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKS

Query:  KGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVD
        KGRAHALEAAQSSKPATPAV GPASEDPA VIEL SSGGPSREKRPRDQTEAVD
Subjt:  KGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.6e-18868.14Show/hide
Query:  MCARKGAGSIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG G IVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRF NLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGSIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPL
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL  SGG S EKR R+++EA+D        PL
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPL

Query:  KGEEAREEAPLKRRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTI
           E R E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTI
Subjt:  KGEEAREEAPLKRRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTI

Query:  DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL
        D  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL
Subjt:  DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL

Query:  KEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEV
        KEKDD+ Q LE KD  +     EL+  KERL+NG LLEESFRQHPDFDGFAKDFSD GFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP P+ 
Subjt:  KEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDGGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEV

Query:  LVEQYVRDLDSDYSDPEED--------QVGSTQEGAP--QAGS
        LV++YVR+LDSDYSD EE+        +VG+TQE  P  Q GS
Subjt:  LVEQYVRDLDSDYSDPEED--------QVGSTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related4.5e-0524.48Show/hide
Query:  PENVLLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRI
        P  + L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL         E G         E     R+
Subjt:  PENVLLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRI

Query:  VKKPGRFHMCARKGAGSIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPR
         + PG ++  A K    IV G  S I GW R++F+                 +W    E      D P  F++  +I  + EL    + T  + + R  R
Subjt:  VKKPGRFHMCARKGAGSIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPR

Query:  GRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVAGPASED---PAPVIE---LASSGG--PS
         R +G ++           +    +  +E S   +E  +          +S GR  A E+A        +   P +ED      V+    L S GG  PS
Subjt:  GRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVAGPASED---PAPVIE---LASSGG--PS

Query:  REKRPRDQTEAVDAQTEAVDVPLKGEEAREEAPLKRRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTS--DVTTRFRIEPSSSGVRDQVSRISA
        +++  RD     DA+  +  VP            K  R++       + G       S+  +  D A     TS  D+ +R R      G  D  S    
Subjt:  REKRPRDQTEAVDAQTEAVDVPLKGEEAREEAPLKRRRKKKKAISPSEVGACRVLPVSFADRVDDPAARMGGTS--DVTTRFRIEPSSSGVRDQVSRISA

Query:  ASLDRCLRR------ASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ
         S+DR + R      A K     G+  + +     +A V++ + A    AE +  + LA     + E SA LE  SS + +++    S V+    E   Q
Subjt:  ASLDRCLRR------ASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ

Query:  AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKEL----EHAIAELETAKERLSNGV
         E L K      A+LR +       ER+K     +    LQ LE   K+        I ELE  +  L NGV
Subjt:  AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKEL----EHAIAELETAKERLSNGV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCAATCCTTCAGATCATACCTTACGTTCCTTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGT
AATTGCCATGTCATCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGG
ATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACGTCCTCCTC
AGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTAAGACTTCCCCTTCACCCTTTTGTCCA
AGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTCTGGCTACGAACTCGGGATAGTGAGG
AGGCCGGGTGTCATTTTCGTTTTGGCCAGCTCCTCGAGTGCTTCGAGGCGAAAAGGATAGTTAAGAAGCCTGGTCGGTTTCATATGTGCGCAAGGAAAGGCGCAGGCAGT
ATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCAC
TAGGTTTGTAAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAA
CCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGA
TTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCC
AGCCCCGGTGATCGAGCTGGCGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGTCCCGCTTA
AGGGCGAAGAGGCGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGTAAGTTTCGCA
GATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGACACGATTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTC
AGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCA
TTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAG
GATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGC
TGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGA
TTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTTTCTGACGGG
GGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGTCCTGGCGG
CACCCCTGGCCCCGAAGTGTTGGTGGAACAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCCCGAAGAGGATCAGGTCGGCTCCACTCAAGAGGGCGCTCCTCAAG
CGGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCCCAATCCTTCAGATCATACCTTACGTTCCTTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGT
AATTGCCATGTCATCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGG
ATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACGTCCTCCTC
AGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTAAGACTTCCCCTTCACCCTTTTGTCCA
AGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTCTGGCTACGAACTCGGGATAGTGAGG
AGGCCGGGTGTCATTTTCGTTTTGGCCAGCTCCTCGAGTGCTTCGAGGCGAAAAGGATAGTTAAGAAGCCTGGTCGGTTTCATATGTGCGCAAGGAAAGGCGCAGGCAGT
ATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCAC
TAGGTTTGTAAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAA
CCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGA
TTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCC
AGCCCCGGTGATCGAGCTGGCGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGTCCCGCTTA
AGGGCGAAGAGGCGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGTAAGTTTCGCA
GATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGACACGATTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTC
AGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCA
TTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAG
GATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGC
TGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGA
TTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTTTCTGACGGG
GGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGTCCTGGCGG
CACCCCTGGCCCCGAAGTGTTGGTGGAACAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCCCGAAGAGGATCAGGTCGGCTCCACTCAAGAGGGCGCTCCTCAAG
CGGGCTCTTAG
Protein sequenceShow/hide protein sequence
MPQSFRSYLTFLEFLEFDLKAARTLGRSVSSLSLSNVIAMSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENVLL
RLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRTRDSEEAGCHFRFGQLLECFEAKRIVKKPGRFHMCARKGAGS
IVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFVNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCG
FASGVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELASSGGPSREKRPRDQTEAVDAQTEAVDVPLKGEEAREEAPLKRRRKKKKAISPSEVGACRVLPVSFA
DRVDDPAARMGGTSDVTTRFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMK
DELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAIAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDG
GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPEVLVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS