CuGenDBv2

Moc09g10620 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g10620
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr9:9023006..9030783
RNA-Seq Expression: Moc09g10620
Synteny: Moc09g10620
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domains: NA
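
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, the snippet below splits such a string into its parts and reports the span. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (illustrative type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr9:9023006..9030783'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("chr9:9023006..9030783")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 7778 bp on chr9; with half-open coordinates the figure would be 7777.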


Homology
GenBank top hits
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] (e value: 4.7e-110; % identity: 85.43)
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYAS EWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRKR
        AVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSSKP TPAV  PA EDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE PLKR+R
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRKR

Query:  KKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDD EARMGGT DVTTRFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value: 4.0e-138; % identity: 93.41)
Query:  MFEYGLRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLF TGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQL ACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYAS EWLAKDESGRSFFDV TRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
        GVKRKSKGRAHALEAAQSSKP TPAV  PA EDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  GVKRKSKGRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value: 1.1e-132; % identity: 92.28)
Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS P SVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAH EVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKEL+HATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQESAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQE A   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQESAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 1.7e-165; % identity: 86.97)
Query:  MSSSFSSNLGSDLAR----------------------------------RILEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSS SSNL SDLAR                                  RI EHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSFSSNLGSDLAR----------------------------------RILEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASR
        LPLHPFVQEFLF TGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQL ACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAS 
Subjt:  LPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASR

Query:  EWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDV TRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKPATPAV  PA EDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 1.4e-191; % identity: 69.78)
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDV TRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVAR--------PALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V R        P+   P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVAR--------PALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPLKRKRKKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAF
        E+PL+R+RKKKK  S SE GA   LP S AD VDD EARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDP SVLQRTID  AEAF
Subjt:  EAPLKRKRKKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDSDYSDPEED--------QVGSTQESAP--QAGS
        +LDSDYSD EE+        +VG+TQE  P  Q GS
Subjt:  DLDSDYSDPEED--------QVGSTQESAP--QAGS

TrEMBL top hits
A0A6J1C8K9 uncharacterized protein LOC111009298 (e value: 2.3e-110; % identity: 85.43)
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYAS EWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRKR
        AVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSSKP TPAV  PA EDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREE PLKR+R
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRKR

Query:  KKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDD EARMGGT DVTTRFRVEPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 (e value: 2.0e-138; % identity: 93.41)
Query:  MFEYGLRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLF TGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQL ACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYAS EWLAKDESGRSFFDV TRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE
        GVKRKSKGRAHALEAAQSSKP TPAV  PA EDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE
Subjt:  GVKRKSKGRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGE

A0A6J1D971 uncharacterized protein LOC111018538 (e value: 5.5e-133; % identity: 92.28)
Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS P SVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAH EVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKEL+HATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQESAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQE A   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQESAPQAGS

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 8.4e-166; % identity: 86.97)
Query:  MSSSFSSNLGSDLAR----------------------------------RILEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSS SSNL SDLAR                                  RI EHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSFSSNLGSDLAR----------------------------------RILEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASR
        LPLHPFVQEFLF TGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQL ACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAS 
Subjt:  LPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASR

Query:  EWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDV TRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALEAAQSSKPATPAV  PA EDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEAAQSSKPATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 6.9e-192; % identity: 69.78)
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDV TRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVAR--------PALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V R        P+   P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVAR--------PALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPLKRKRKKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAF
        E+PL+R+RKKKK  S SE GA   LP S AD VDD EARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDP SVLQRTID  AEAF
Subjt:  EAPLKRKRKKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDSDYSDPEED--------QVGSTQESAP--QAGS
        +LDSDYSD EE+        +VG+TQE  P  Q GS
Subjt:  DLDSDYSDPEED--------QVGSTQESAP--QAGS

SwissProt top hits
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic (e value: 8.0e-04; % identity: 23.32)
Query:  EHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLD
        E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E    + 
Subjt:  EHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLD

Query:  VDQLFACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY-ASREWLAKDESGRSFFDVSTRFG----NLVSIRPVPELTQASFDTLKYYK-
        +  L    E +R+ K +  R+Y+   KG   I   P+  + +   +F+ A  + + +D  G     V TR+G     L  + P+P+   ++F  L   K 
Subjt:  VDQLFACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY-ASREWLAKDESGRSFFDVSTRFG----NLVSIRPVPELTQASFDTLKYYK-

Query:  ---ERFPRGRK--------------------------VGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKP
           + F R R                           V    T   L E          R +   R      ++   A     +         A Q++  
Subjt:  ---ERFPRGRK--------------------------VGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKP

Query:  ATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDAPP-----LGEEVREEAPLKRKRKKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDV
        A+     P    P      E+ G       P    EAV A P      G+ +R +    +K+KKKK  S SEV   ++LP  F DR   +    G    +
Subjt:  ATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDAPP-----LGEEVREEAPLKRKRKKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDV

Query:  TTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAFVAS--IQSALAVKAELDGREVLAAREKEEFSAALEAASST---MKD
             + P  + +  +    +A+   R +   ++ V    S ++  ++ A +   A   IQ+    K E       A  EKEE              M +
Subjt:  TTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAFVAS--IQSALAVKAELDGREVLAAREKEEFSAALEAASST---MKD

Query:  ELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD
        + LKA+ E+  LK  + S+A  L+  E  R  Q          + K K    + K  +L  +  +   L  A A  +   E L  G +LE    Q    D
Subjt:  ELLKAHFEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQ
         + KDF+DA  +  +    S++ D       LK    E     PGG    ++L D+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQ

Arabidopsis top hits
AT3G42060.1 myosin heavy chain-related (e value: 4.8e-04; % identity: 27.03)
Query:  AMSSSFSSNLGSDLARRILEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFAL
        A S S S  +G   A R    + G        PE +   +PE  +R  + PEG++ L+   F E GL  PL  F+  +     +A +Q++         L
Subjt:  AMSSSFSSNLGSDLARRILEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFAL

Query:  AILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAG-GIVKGPTS-IKGWVRKWFYASREWLAKDESGRSFFDV
         IL       +EE  ++D+D LF    +  I  K  R  +CA    G  I  G TS ++ W + +F+A    ++ D++  S  ++
Subjt:  AILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAG-GIVKGPTS-IKGWVRKWFYASREWLAKDESGRSFFDV
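
In the alignment blocks above, the unlabeled middle row repeats a residue where query and subject agree, shows `+` for a similar but non-identical pair, and is blank otherwise. As a rough sketch, percent identity for one block can be estimated by counting the columns where that middle row repeats the query residue. The function name `percent_identity` is hypothetical, and the result is not guaranteed to reproduce the reported % identity values, which are computed over the full alignment by the search tool.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate percent identity from a query row and its match (middle) row.

    A column counts as identical when the middle row repeats the query
    residue. '+' (similar residues), blanks (mismatches) and gap columns
    ('-' in the query) do not count as identities.
    """
    if not query:
        raise ValueError("empty alignment row")
    # The middle row may have lost trailing spaces; pad it to the query length.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if q != "-" and q == m
    )
    return 100.0 * identical / len(query)


# First nine columns of the first GenBank alignment on this page.
print(round(percent_identity("MCARKGAGG", "MCARKGA G"), 2))
```

The denominator here is the number of alignment columns, gaps included; dividing by the ungapped query length instead would give a slightly different figure.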


Sequences
CDS sequence
ATGAATCCTGATAAGAGCCATCGGTGCTTCGAAGAGAAGAGCCTGAAAGAAGCGGCCGATCACTTAGATGAGGCAAGACCGAGCCCTGATCATCAACTTAATCCGACCAC
CCTTGACGGGGGAGAACCAGTGTCGGGCTTGAATAAAGCGACAGACCTCTCGGGAAAGGCGAGGCCGAGCCCTGGTTTTCGACTTAATCCGACCACCCTTGGTAAGGGAG
AACCAGTGGTGGGCCTGATTAAAGCGACAAACCTTTCGGGAGAGGCAAGGTCGAGCCCTGGTCATCGACTTAATTCGACCACCCTTGACAGGGGAGAACCGGCTGAAGAG
GTACTAGTCCCGGATCAACAAGTTTGGTGGATGCGAGGTAGTGAGCCGGAGGACCGAAATCCACGCATCAACAAAGCCAAAGTCAAACTTCATCTTTACAACAGTTGCAT
TGCACAATCATATTGTTTATGCAAGGATATGCACAACAGTGTGTTCCAGATTGTAGCTCGAACTCGGCCTCCGGACCGACCTGAACACTTGGGCGGACCTGCACAAAAAG
GGTATTCCCTTCCCCAAACATTGGCCCCCTCCCTGTCTGGTCCGACCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGA
AAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCCCAGGGAGGATCCTAGCCGCTCGTTGACTACACGTGCAGCTCGAACCCT
TGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGATACTTGAGCACTACCTCG
GATCCCTTCGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAA
ATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCTGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTT
CGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTATTGGACGTAGACCAGCTCTTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGC
CTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCAGGGAATGGCTC
GCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCTCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAA
ATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTG
AATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAA
CCTGCCACCCCTGCCGTGGCAAGGCCTGCCTTGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGA
GGCGGTGGACGCCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAAAAGGAAGAAAAAGAAGGCGATTTCCCCCTCGGAGGTCGGAGCTTGCAGGG
TCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATTCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACGACACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGG
GACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTAGGTCTGTTCTGCAGAGGACCATCGACTACGCCGC
CGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAGGAGGAGTTCTCTGCTGCCTTGGAGG
CTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTTTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGG
CGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGA
TAAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTG
CCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGATCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCTGAGAAG
TGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCAC
TCAAGAGAGCGCTCCTCAAGCAGGCTCTTAA
mRNA sequence
ATGAATCCTGATAAGAGCCATCGGTGCTTCGAAGAGAAGAGCCTGAAAGAAGCGGCCGATCACTTAGATGAGGCAAGACCGAGCCCTGATCATCAACTTAATCCGACCAC
CCTTGACGGGGGAGAACCAGTGTCGGGCTTGAATAAAGCGACAGACCTCTCGGGAAAGGCGAGGCCGAGCCCTGGTTTTCGACTTAATCCGACCACCCTTGGTAAGGGAG
AACCAGTGGTGGGCCTGATTAAAGCGACAAACCTTTCGGGAGAGGCAAGGTCGAGCCCTGGTCATCGACTTAATTCGACCACCCTTGACAGGGGAGAACCGGCTGAAGAG
GTACTAGTCCCGGATCAACAAGTTTGGTGGATGCGAGGTAGTGAGCCGGAGGACCGAAATCCACGCATCAACAAAGCCAAAGTCAAACTTCATCTTTACAACAGTTGCAT
TGCACAATCATATTGTTTATGCAAGGATATGCACAACAGTGTGTTCCAGATTGTAGCTCGAACTCGGCCTCCGGACCGACCTGAACACTTGGGCGGACCTGCACAAAAAG
GGTATTCCCTTCCCCAAACATTGGCCCCCTCCCTGTCTGGTCCGACCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGA
AAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCCCAGGGAGGATCCTAGCCGCTCGTTGACTACACGTGCAGCTCGAACCCT
TGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATTTAGCTCGTAGGATACTTGAGCACTACCTCG
GATCCCTTCGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAA
ATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCTGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTT
CGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTATTGGACGTAGACCAGCTCTTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGC
CTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCAGGGAATGGCTC
GCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCTCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAA
ATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTG
AATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAA
CCTGCCACCCCTGCCGTGGCAAGGCCTGCCTTGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGA
GGCGGTGGACGCCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAAAAGGAAGAAAAAGAAGGCGATTTCCCCCTCGGAGGTCGGAGCTTGCAGGG
TCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATTCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACGACACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGG
GACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTAGGTCTGTTCTGCAGAGGACCATCGACTACGCCGC
CGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAGGAGGAGTTCTCTGCTGCCTTGGAGG
CTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTTTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGG
CGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGA
TAAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTG
CCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGATCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCTGAGAAG
TGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCAC
TCAAGAGAGCGCTCCTCAAGCAGGCTCTTAA
Protein sequence
MNPDKSHRCFEEKSLKEAADHLDEARPSPDHQLNPTTLDGGEPVSGLNKATDLSGKARPSPGFRLNPTTLGKGEPVVGLIKATNLSGEARSSPGHRLNSTTLDRGEPAEE
VLVPDQQVWWMRGSEPEDRNPRINKAKVKLHLYNSCIAQSYCLCKDMHNSVFQIVARTRPPDRPEHLGGPAQKGYSLPQTLAPSLSGPTSTWQRSSFDLLWTRGDFLFVG
KYNRRGRFIVGIFKYSDASDPREDPSRSLTTRAARTLGRSVSSLSLSNVVAMSSSFSSNLGSDLARRILEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFK
MFEYGLRLPLHPFVQEFLFWTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLFACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASREWL
AKDESGRSFFDVSTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSK
PATPAVARPALEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKRKRKKKKAISPSEVGACRVLPASFADRVDDSEARMGGTSDVTTRFRVEPSSSGVR
DQVSRISAASLDRCLRRASKFVSDPRSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEILKAEVESQAELLKKEEDR
RKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEK
WASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQESAPQAGS
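
The protein sequence above should correspond to a translation of the CDS. The sketch below is one way to spot-check that, assuming the standard genetic code applies (the page does not name a translation table). `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        if amino == "*":  # stop codon: end of the protein
            break
        protein.append(amino)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed on this page.
print(translate("ATGAATCCTGATAAGAGCCATCGGTGCTTC"))
print(translate("CAAGCAGGCTCTTAA"))
```

The two fragments translate to `MNPDKSHRCF` and `QAGS`, which agree with the start and end of the listed protein sequence. Passing the whole CDS, with line breaks joined, would allow a full comparison.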