; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g40510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g40510
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr8:31074079..31076772
RNA-Seq ExpressionMoc08g40510
SyntenyMoc08g40510
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.1e-11788.19Show/hide
Query:  MCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNP
        MCARKG  GIVKGPTSIKGWVRKWFYASGEWL KDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE GLLDYNP
Subjt:  MCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVCGF SNVK KSKG+AHALEAAQSSKP TPAVVGPAS+DPAP+IELESS GPSREKRPRDQTEA+D SPLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQ
        KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGT DV TRFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.8e-13690.84Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRISKKPDRFYMCARKGTGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRI+KKP RFYMCARKG GGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRISKKPDRFYMCARKGTGGIVKGPTSIKGWVR

Query:  KWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNPAVRPIESSRPNSELAMVCGFVS
        KWFYASGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLE GLLDYNPAVRPIE SRPNS LAMVC F S
Subjt:  KWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNPAVRPIESSRPNSELAMVCGFVS

Query:  NVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDAS-------PLGE
         VK KSKGRAHALEAAQSSKP TPAVVGPAS+DPAP+IELESSGGPSREKRPRDQTEA+DA        PLGE
Subjt:  NVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDAS-------PLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]2.1e-12988.42Show/hide
Query:  GTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAFVASIQSALAVKAELDGREVLAAREKEEFSSALEAASSTMKD
        G   ++ + R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDY AEAFVASIQSALAVKAELDGREVLAAREKEEFS+ALE ASSTMKD
Subjt:  GTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAFVASIQSALAVKAELDGREVLAAREKEEFSSALEAASSTMKD

Query:  ELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEMLQALEAKDEELKHATAELETAKEHLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LK EVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKD+MLQALEAKD+EL+HATAELETAKE LSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEMLQALEAKDEELKHATAELETAKEHLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTKEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+RYAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+T+EGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTKEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]6.0e-15694.48Show/hide
Query:  AIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRI
        AIPENILLR+PEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRI
Subjt:  AIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRI

Query:  SKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE
        +KKP RFYMCARKG GGIVKGPTSIKGWVRKWFYASGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLE
Subjt:  SKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE

Query:  FGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMD
         GLLDYNPAVRPIESSRPNSELAMVCGF S VK KSKGRAHALEAAQSSKPATPAVVGPAS+DPA +IELESSGGPSREKRPRDQTEA+D
Subjt:  FGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.0e-20071.08Show/hide
Query:  MCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNP
        MCARKGTGGIVKGPTSIKGWV KWF+ASGEWL KDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLE GLLDYNP
Subjt:  MCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAV--------VGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVRE
         VR IE+SRPNSELAMVCGF  +VK KSKGRAHAL+    ++P TP V         GP+S  P P+IEL+ SGG S EKR R+++EA+D SPL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAV--------VGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID VAEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSSALEAASSTMKDELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEML
        +ASI  A+ VKAELDGRE LAA+E+E   +ALEAA +T+K ELLKA  EV+IL+ EV+ K +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKD++ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSSALEAASSTMKDELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEML

Query:  QALEAKDEELKHATAELETAKEHLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVR
        Q LE KD  +   T EL+  KE L+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKDEELKHATAELETAKEHLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTKEGAP--QAGS
        +LDSDYSD+EE+        +VGTT+E  P  Q GS
Subjt:  DLDSDYSDLEED--------QVGTTKEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092985.4e-11888.19Show/hide
Query:  MCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNP
        MCARKG  GIVKGPTSIKGWVRKWFYASGEWL KDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE GLLDYNP
Subjt:  MCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVCGF SNVK KSKG+AHALEAAQSSKP TPAVVGPAS+DPAP+IELESS GPSREKRPRDQTEA+D SPLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQ
        KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGT DV TRFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138268.8e-13790.84Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRISKKPDRFYMCARKGTGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRI+KKP RFYMCARKG GGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRISKKPDRFYMCARKGTGGIVKGPTSIKGWVR

Query:  KWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNPAVRPIESSRPNSELAMVCGFVS
        KWFYASGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLE GLLDYNPAVRPIE SRPNS LAMVC F S
Subjt:  KWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNPAVRPIESSRPNSELAMVCGFVS

Query:  NVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDAS-------PLGE
         VK KSKGRAHALEAAQSSKP TPAVVGPAS+DPAP+IELESSGGPSREKRPRDQTEA+DA        PLGE
Subjt:  NVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDAS-------PLGE

A0A6J1D971 uncharacterized protein LOC1110185381.0e-12988.42Show/hide
Query:  GTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAFVASIQSALAVKAELDGREVLAAREKEEFSSALEAASSTMKD
        G   ++ + R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDY AEAFVASIQSALAVKAELDGREVLAAREKEEFS+ALE ASSTMKD
Subjt:  GTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAFVASIQSALAVKAELDGREVLAAREKEEFSSALEAASSTMKD

Query:  ELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEMLQALEAKDEELKHATAELETAKEHLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LK EVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKD+MLQALEAKD+EL+HATAELETAKE LSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEMLQALEAKDEELKHATAELETAKEHLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTKEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+RYAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+T+EGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTKEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255022.9e-15694.48Show/hide
Query:  AIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRI
        AIPENILLR+PEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRI
Subjt:  AIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRI

Query:  SKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE
        +KKP RFYMCARKG GGIVKGPTSIKGWVRKWFYASGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLE
Subjt:  SKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE

Query:  FGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMD
         GLLDYNPAVRPIESSRPNSELAMVCGF S VK KSKGRAHALEAAQSSKPATPAVVGPAS+DPA +IELESSGGPSREKRPRDQTEA+D
Subjt:  FGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMD

A0A6J1DZB3 uncharacterized protein LOC1110256651.5e-20071.08Show/hide
Query:  MCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNP
        MCARKGTGGIVKGPTSIKGWV KWF+ASGEWL KDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLE GLLDYNP
Subjt:  MCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLEFGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAV--------VGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVRE
         VR IE+SRPNSELAMVCGF  +VK KSKGRAHAL+    ++P TP V         GP+S  P P+IEL+ SGG S EKR R+++EA+D SPL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAV--------VGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID VAEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSSALEAASSTMKDELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEML
        +ASI  A+ VKAELDGRE LAA+E+E   +ALEAA +T+K ELLKA  EV+IL+ EV+ K +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKD++ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSSALEAASSTMKDELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEML

Query:  QALEAKDEELKHATAELETAKEHLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVR
        Q LE KD  +   T EL+  KE L+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKDEELKHATAELETAKEHLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTKEGAP--QAGS
        +LDSDYSD+EE+        +VGTT+E  P  Q GS
Subjt:  DLDSDYSDLEED--------QVGTTKEGAP--QAGS

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic2.4e-0622.9Show/hide
Query:  ILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRISK-
        + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E    + +  L    E +R+ K 
Subjt:  ILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRISK-

Query:  KPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFG----NLVSIRPVPELTQASFDTLKYYK----EHFPRGRKVGTL--
        + DR+Y+   KG   I   P+  + +   +F+ + E  + ++       V TR+G     L  + P+P+   ++F  L   K    +HF R R    L  
Subjt:  KPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFG----NLVSIRPVPELTQASFDTLKYYK----EHFPRGRKVGTL--

Query:  -------------------VTDKLLLEFGLLDYNPAVRPIESSRPNSELAMV-CGFVS---NVKCKSKGRA-HALEAAQSSKPATPAVVGPASKDPAPMI
                             D    +  L +     +  +  R   E  +V  G +S   + +    G       A Q++  A+   V P +  P    
Subjt:  -------------------VTDKLLLEFGLLDYNPAVRPIESSRPNSELAMV-CGFVS---NVKCKSKGRA-HALEAAQSSKPATPAVVGPASKDPAPMI

Query:  ELESSGGPSREKRPRDQTEAMDASP-----LGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQV
          E+ G       P    EA+ A P      G+ +R +    +++KKKK  S  EV    +LP  F DR       +GG    +    + P  + +  + 
Subjt:  ELESSGGPSREKRPRDQTEAMDASP-----LGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQV

Query:  SRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAFVAS--IQSALAVKAELDGREVLAAREKEEFSSALEAASST---MKDELLKAHSEVEILKVEVE
           +A+   R +   ++ V    S ++  ++   +   A   IQ+    K E       A  EKEE              M ++ LKA+SE+  LK    
Subjt:  SRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAFVAS--IQSALAVKAELDGREVLAAREKEEFSSALEAASST---MKDELLKAHSEVEILKVEVE

Query:  TKAELLKKEEDRRKAQLRAAHAITRGLEKEKF-QLLKEKDEMLQALEAKDEELKHATAELETAK--EHLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFL
            +L +    R +++  A   TR    E F   +K  +  +  L+  ++   + +     A+  E L  G +LE    Q    D + KDF+DA  +  
Subjt:  TKAELLKKEEDRRKAQLRAAHAITRGLEKEKF-QLLKEKDEMLQALEAKDEELKHATAELETAK--EHLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFL

Query:  MKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDK
        +    S++ D       LK    E     PGG    ++L D+
Subjt:  MKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDK

Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related5.1e-0423.16Show/hide
Query:  FVAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEA
        F  +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F     +A +Q+       I   A L  L AR       L V+ +      
Subjt:  FVAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEA

Query:  KRISKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEW-LVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDK
         ++  K  + Y+ + +G   +  GP+  + W+  +FYA  +  LV+D S    F +        ++R +   ++   D  +  +E  P   K    +  K
Subjt:  KRISKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEW-LVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDK

Query:  LLLEFGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRP-RDQTEAMDASPL-
           +        + RP + S  N  LA +   +   + + +  + A  A         A +  A K+   + +    G    E+R   D T    A P+ 
Subjt:  LLLEFGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRP-RDQTEAMDASPL-

Query:  -GEEVREEVPLKRRRKKKKTT-SPLEVGARGVLPASFADRVDDPEARMG-GTSDVMT--RFRVEPSSSGVRDQVSRISAASLDR--CLRRASKFVSDPGS
         G++      + R R    T+ S     ++ V+PA       D   R+G G  D+    R       S   +   R S  S DR  C R   K V  PG 
Subjt:  -GEEVREEVPLKRRRKKKKTT-SPLEVGARGVLPASFADRVDDPEARMG-GTSDVMT--RFRVEPSSSGVRDQVSRISAASLDR--CLRRASKFVSDPGS

Query:  VLQRTIDYVAEAFV------ASIQSALAVKAELD--GREVLAAREK-EEFSSALEAASSTMK----------DELLKAHSEVEILKVEVETKAELLK---
          +   +++AEA +      A  Q+  A    +    R + +AREK  E    L+ A  T+           +EL     ++ +LK E   +A  L+   
Subjt:  VLQRTIDYVAEAFV------ASIQSALAVKAELD--GREVLAAREK-EEFSSALEAASSTMK----------DELLKAHSEVEILKVEVETKAELLK---

Query:  KEEDRRKAQLRAAHAITRGLEKEKF--QLLKEKDEMLQALEAKDEELKHATAELETA
        +++   +A+L     I R   +EKF   + K + E+L     + E++    AE E A
Subjt:  KEEDRRKAQLRAAHAITRGLEKEKF--QLLKEKDEMLQALEAKDEELKHATAELETA

AT2G15420.1 myosin heavy chain-related1.2e-0524.54Show/hide
Query:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIS
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIS

Query:  KKPDRFYMCARKGTGGIVKGPTS-IKGWVRKWFYAS--------------GEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRG
        + P  +Y  A K    IV G  S I GW R++F+                 +W +  E      D P  F  L +I  + EL    + T  + +    R 
Subjt:  KKPDRFYMCARKGTGGIVKGPTS-IKGWVRKWFYAS--------------GEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRG

Query:  RKVGTLVTD--KLLLEFGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAP------MIELESSGG--P
        R +G ++    + + EFG    NP +  +    P+  +      V N   +S GR  A E+A        +   P ++D            L S GG  P
Subjt:  RKVGTLVTD--KLLLEFGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAP------MIELESSGG--P

Query:  SREKRPRDQTEAMDASPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFA--DRVDDPEARMGGTS--DVMTRFRVEPSSSGVRDQVSRISAASLD
        S+++  RD           E+   +VP   RR         E   RG +   F+   +  D       TS  D+++R R      G  D  S     S+D
Subjt:  SREKRPRDQTEAMDASPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFA--DRVDDPEARMGGTS--DVMTRFRVEPSSSGVRDQVSRISAASLD

Query:  RCLRR------ASKFVSDPGSVLQRTIDYVAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSSALEAASSTMKDELLKAHSEVEILKVEVETKAELL
        R + R      A K     G+  + +     +A V++ + A    AE +  + LA     + E S+ LE  SS + +++    S V+  ++++E   +  
Subjt:  RCLRR------ASKFVSDPGSVLQRTIDYVAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSSALEAASSTMKDELLKAHSEVEILKVEVETKAELL

Query:  KKEEDR-RKAQLRAAHAITRGLEKEKFQLLKEKDEMLQALEAKDEELKHAT-AELETAKEHLSNGV-LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA
          E  R RK+++    A  +  + +    L    + L+ L  K   +  AT  ELE  +  L NGV  LE +     D D F +  + A    L+ GI+
Subjt:  KKEEDR-RKAQLRAAHAITRGLEKEKFQLLKEKDEMLQALEAKDEELKHAT-AELETAKEHLSNGV-LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA

AT3G42060.1 myosin heavy chain-related1.3e-0428.1Show/hide
Query:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIS
        PE +   IPE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++         L IL       +EE  ++D+D L     +  I 
Subjt:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIS

Query:  KKPDRFYMCARKGTG-GIVKGPTS-IKGWVRKWFYASGEWLVKDESGRSFFDV
         K +R  +CA    G  I  G TS ++ W + +F+A    +  D++  S  ++
Subjt:  KKPDRFYMCARKGTG-GIVKGPTS-IKGWVRKWFYASGEWLVKDESGRSFFDV

AT5G38190.1 INVOLVED IN: biological_process unknown3.2e-0626.32Show/hide
Query:  FVAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEA
        F  +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F     +A +Q+       I   A L  L AR       L V+ +      
Subjt:  FVAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEA

Query:  KRISKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEW-LVKDESGRS
         ++  K  + Y+ + +G   +   P+  + W+  +FYA  +  LV+D S  +
Subjt:  KRISKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEW-LVKDESGRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCACCTTGGGGCACCAATGGCTGTCCTCCACGTGTCCAGGGTATTCTTTCCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCCCGACCTGGCAGAGAAGTTC
ATTTGATTTGCTTCGGACACGTGGCGACTCCCTATTCGTGAGAAAACATAACCGTTGCGGTGGATTTATCATCGGAATATTCAAATATTTCGACGCTTCGGATCTCCGGG
AGGATCCTAGCCGCTCGTTGATTACACGTGTACTGTGGGGAAATTTTCCGACAAGCTATAAATACCCACAACCCTTCAGGTCATACCTTACGTTTCCTGAATTCTTGGAG
TTCGATCTAAAGGTAGCTCGAACCCTTGGATACCTGAGCACTACCTCGGATCCCTTCGTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGC
TGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGG
CTCCAGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAACTGTTGGACGTAGACCAG
CTTCTCGCGTGCTTCGAAGCAAAAAGGATATCTAAGAAGCCTGATCGGTTCTATATGTGCGCAAGGAAAGGCACAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGG
ATGGGTGAGAAAGTGGTTCTACGCTTCTGGGGAATGGCTTGTAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGAC
CAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTCGAG
TTCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGTGAGTAACGTGAAATGCAAGTCCAA
GGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGAAAGATCCAGCCCCAATGATCGAGCTGGAGTCTTCTG
GGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAAGCGATGGACGCCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGAAAGAAG
AAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGGGCGGGACGTCCGACGTGAT
GACACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTG
ATCCAGGGTCCGTTCTGCAAAGGACCATCGACTACGTCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTAGATGGGAGGGAAGTTCTG
GCAGCGAGGGAGAAAGAGGAGTTCTCTTCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGTAGAGGT
GGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGA
AGGAGAAGGACGAGATGCTCCAAGCGCTTGAAGCGAAGGATGAAGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCAAAGGAGCATCTCAGCAATGGAGTCCTATTG
GAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCA
GATCGATCTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACT
CTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTAAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCACCTTGGGGCACCAATGGCTGTCCTCCACGTGTCCAGGGTATTCTTTCCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCCCGACCTGGCAGAGAAGTTC
ATTTGATTTGCTTCGGACACGTGGCGACTCCCTATTCGTGAGAAAACATAACCGTTGCGGTGGATTTATCATCGGAATATTCAAATATTTCGACGCTTCGGATCTCCGGG
AGGATCCTAGCCGCTCGTTGATTACACGTGTACTGTGGGGAAATTTTCCGACAAGCTATAAATACCCACAACCCTTCAGGTCATACCTTACGTTTCCTGAATTCTTGGAG
TTCGATCTAAAGGTAGCTCGAACCCTTGGATACCTGAGCACTACCTCGGATCCCTTCGTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGC
TGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGG
CTCCAGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAACTGTTGGACGTAGACCAG
CTTCTCGCGTGCTTCGAAGCAAAAAGGATATCTAAGAAGCCTGATCGGTTCTATATGTGCGCAAGGAAAGGCACAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGG
ATGGGTGAGAAAGTGGTTCTACGCTTCTGGGGAATGGCTTGTAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGAC
CAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTCGAG
TTCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGTGAGTAACGTGAAATGCAAGTCCAA
GGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGAAAGATCCAGCCCCAATGATCGAGCTGGAGTCTTCTG
GGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAAGCGATGGACGCCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGAAAGAAG
AAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGGGCGGGACGTCCGACGTGAT
GACACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTG
ATCCAGGGTCCGTTCTGCAAAGGACCATCGACTACGTCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTAGATGGGAGGGAAGTTCTG
GCAGCGAGGGAGAAAGAGGAGTTCTCTTCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGTAGAGGT
GGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGA
AGGAGAAGGACGAGATGCTCCAAGCGCTTGAAGCGAAGGATGAAGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCAAAGGAGCATCTCAGCAATGGAGTCCTATTG
GAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCA
GATCGATCTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACT
CTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTAAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MVTLGHQWLSSTCPGYSFPQTLAPSLSGPIPTWQRSSFDLLRTRGDSLFVRKHNRCGGFIIGIFKYFDASDLREDPSRSLITRVLWGNFPTSYKYPQPFRSYLTFPEFLE
FDLKVARTLGYLSTTSDPFVAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ
LLACFEAKRISKKPDRFYMCARKGTGGIVKGPTSIKGWVRKWFYASGEWLVKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLE
FGLLDYNPAVRPIESSRPNSELAMVCGFVSNVKCKSKGRAHALEAAQSSKPATPAVVGPASKDPAPMIELESSGGPSREKRPRDQTEAMDASPLGEEVREEVPLKRRRKK
KKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYVAEAFVASIQSALAVKAELDGREVL
AAREKEEFSSALEAASSTMKDELLKAHSEVEILKVEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDEMLQALEAKDEELKHATAELETAKEHLSNGVLL
EESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTKEGAPQAGS