CuGenDBv2

Moc08g18330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g18330
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr8:13874221..13877351
RNA-Seq Expression: Moc08g18330
Synteny: Moc08g18330
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
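
The genome location field above uses a `chromosome:start..end` layout. A minimal sketch of how such a string could be split into its parts and turned into a span length follows. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end = parse_location("chr8:13874221..13877351")
    print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.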


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | e value: 1.7e-111 | % identity: 87.15
Query:  MCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDELVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPI
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDE          V+I+PVPEL QASFDTLKYY E FPRGRKVGTLVTD+LLLESGLLDYNPAVRPI
Subjt:  MCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDELVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPI

Query:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMREEVPPKRRRKKKKT
        ESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDP PVIELESS GPSREKRPRDQTEAVD  P GEE+REEVP KRRRKKKKT
Subjt:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMREEVPPKRRRKKKKT

Query:  TSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQ
        TSPLEVGARGVLPAS+A RVDDPEARM GT DVT RFRVEPSSS VRDQ
Subjt:  TSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e value: 6.0e-125 | % identity: 87.23
Query:  MFEYGLRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVR
        MFEYGLRLP HPFVQEFLFRTGLAPAQVA NGWGVIFALAILFWLRA DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDE       VP+    LVSI+PVPEL QASFDTLKYY ERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMREEVPP
         VKRKSKGRAHALEAAQSSKP TPAVVGPASEDP PVIELESSGGPSREKRPRDQTEAVDA    +    +VPP
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMREEVPP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | e value: 1.7e-103 | % identity: 74.05
Query:  GTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASKFVSDPVSVLQRTIDYAAEVRLVSPFLFNLPNSCKGRAGWEGSS--GSEGERGVLCCLGGCF
        G   + A+ R+EPSSS VRDQVSRISAASLDRCLRRASKFVS P SVLQRTIDYAAE      F+ ++ ++   +A  +G     +  +      L    
Subjt:  GTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASKFVSDPVSVLQRTIDYAAEVRLVSPFLFNLPNSCKGRAGWEGSS--GSEGERGVLCCLGGCF

Query:  PTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKDEMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQ
         TMKDELLKAHSEVETLKAEVE++AELLKKEEDRR+AQLRAAH ITRGLE+EKFQLLKEKD+MLQALEAK++EL+ ATAELET KERLSNGVLLEE+FRQ
Subjt:  PTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKDEMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQ

Query:  HPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGTPQAG
        HPDFDGFAKDFSDAGFKFLMKGIASDM DLQIDLSGLK++YAE+WASGPGGT GPQALVD+YVRDLDSDYSD EEDQVG+TQEG    G
Subjt:  HPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGTPQAG

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 4.9e-175 | % identity: 90.7
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSL RGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYA
        LRLP HPFVQEFLFRTGLAPAQVA NGWGVIFALAILFWLRA DSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYA
Subjt:  LRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDE       VP+    LVSI+PVPEL QASFDTLKYY ERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDP  VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 1.3e-167 | % identity: 63.79
Query:  MCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDE       VP+    LVSI+ +PEL QA+FDTLK+Y + FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  PTPVIEL+ SGG S EKR R+++EA+D  P   E+R 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMRE

Query:  EVPPKRRRKKKKTTSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASKFVSDPVSVLQRTIDYAAEVR
        E P +RRRKKKKT+S  E GARG LP S+A  VDDPEARM GTS+V  RF +EPSSS V+DQVSRISA  LDR LRRASKFVSDP SVLQRTID  AE  
Subjt:  EVPPKRRRKKKKTTSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASKFVSDPVSVLQRTIDYAAEVR

Query:  LVSPFLFNLPNSCKGRAGWEGSSGSEGERGVLCCLGGCFPTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKD
        + S  L  +  +     G E  +  E E            T+K ELLKA  EV+ L+AEV+ K +LLKKE ++ KA LRAAH IT+GLEKEKFQLLKEKD
Subjt:  LVSPFLFNLPNSCKGRAGWEGSSGSEGERGVLCCLGGCFPTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKD

Query:  EMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDK
        ++ Q LE K+  + R T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL+GLKKKY+E+WASGP GT  PQ+LVDK
Subjt:  EMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDK

Query:  YVRDLDSDYSDLEED--------QVGTTQEGTP
        YVR+LDSDYSD+EE+        +VGTTQE  P
Subjt:  YVRDLDSDYSDLEED--------QVGTTQEGTP

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 | e value: 8.3e-112 | % identity: 87.15
Query:  MCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDELVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPI
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDE          V+I+PVPEL QASFDTLKYY E FPRGRKVGTLVTD+LLLESGLLDYNPAVRPI
Subjt:  MCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDELVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPI

Query:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMREEVPPKRRRKKKKT
        ESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDP PVIELESS GPSREKRPRDQTEAVD  P GEE+REEVP KRRRKKKKT
Subjt:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMREEVPPKRRRKKKKT

Query:  TSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQ
        TSPLEVGARGVLPAS+A RVDDPEARM GT DVT RFRVEPSSS VRDQ
Subjt:  TSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 | e value: 2.9e-125 | % identity: 87.23
Query:  MFEYGLRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVR
        MFEYGLRLP HPFVQEFLFRTGLAPAQVA NGWGVIFALAILFWLRA DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDE       VP+    LVSI+PVPEL QASFDTLKYY ERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMREEVPP
         VKRKSKGRAHALEAAQSSKP TPAVVGPASEDP PVIELESSGGPSREKRPRDQTEAVDA    +    +VPP
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMREEVPP

A0A6J1D971 uncharacterized protein LOC111018538 | e value: 8.3e-104 | % identity: 74.05
Query:  GTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASKFVSDPVSVLQRTIDYAAEVRLVSPFLFNLPNSCKGRAGWEGSS--GSEGERGVLCCLGGCF
        G   + A+ R+EPSSS VRDQVSRISAASLDRCLRRASKFVS P SVLQRTIDYAAE      F+ ++ ++   +A  +G     +  +      L    
Subjt:  GTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASKFVSDPVSVLQRTIDYAAEVRLVSPFLFNLPNSCKGRAGWEGSS--GSEGERGVLCCLGGCF

Query:  PTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKDEMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQ
         TMKDELLKAHSEVETLKAEVE++AELLKKEEDRR+AQLRAAH ITRGLE+EKFQLLKEKD+MLQALEAK++EL+ ATAELET KERLSNGVLLEE+FRQ
Subjt:  PTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKDEMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQ

Query:  HPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGTPQAG
        HPDFDGFAKDFSDAGFKFLMKGIASDM DLQIDLSGLK++YAE+WASGPGGT GPQALVD+YVRDLDSDYSD EEDQVG+TQEG    G
Subjt:  HPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGTPQAG

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 2.4e-175 | % identity: 90.7
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSL RGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYA
        LRLP HPFVQEFLFRTGLAPAQVA NGWGVIFALAILFWLRA DSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYA
Subjt:  LRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDE       VP+    LVSI+PVPEL QASFDTLKYY ERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDP  VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 6.2e-168 | % identity: 63.79
Query:  MCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDE       VP+    LVSI+ +PEL QA+FDTLK+Y + FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDE-----LVVPSLTFPLVSIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  PTPVIEL+ SGG S EKR R+++EA+D  P   E+R 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEMRE

Query:  EVPPKRRRKKKKTTSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASKFVSDPVSVLQRTIDYAAEVR
        E P +RRRKKKKT+S  E GARG LP S+A  VDDPEARM GTS+V  RF +EPSSS V+DQVSRISA  LDR LRRASKFVSDP SVLQRTID  AE  
Subjt:  EVPPKRRRKKKKTTSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASKFVSDPVSVLQRTIDYAAEVR

Query:  LVSPFLFNLPNSCKGRAGWEGSSGSEGERGVLCCLGGCFPTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKD
        + S  L  +  +     G E  +  E E            T+K ELLKA  EV+ L+AEV+ K +LLKKE ++ KA LRAAH IT+GLEKEKFQLLKEKD
Subjt:  LVSPFLFNLPNSCKGRAGWEGSSGSEGERGVLCCLGGCFPTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKD

Query:  EMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDK
        ++ Q LE K+  + R T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL+GLKKKY+E+WASGP GT  PQ+LVDK
Subjt:  EMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDK

Query:  YVRDLDSDYSDLEED--------QVGTTQEGTP
        YVR+LDSDYSD+EE+        +VGTTQE  P
Subjt:  YVRDLDSDYSDLEED--------QVGTTQEGTP

SwissProt top hits (columns: e value, % identity, alignment)
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic | e value: 4.5e-06 | % identity: 26.86
Query:  ILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAK-
        + LR+P   ERAD+PP G+ TLY + F YG  L LP    V E++    +A +Q+       +  + I    R+++SE    + +  L    E +R+ K 
Subjt:  ILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAK-

Query:  KPGRFYMCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDELVVPSLT------FPLVSIQPVPELMQASFDTL
        +  R+Y+   KG   I   P+  + +   +F+ + E    ++L+   LT        L  ++P+P+   ++F  L
Subjt:  KPGRFYMCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDELVVPSLT------FPLVSIQPVPELMQASFDTL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G32010.1 myosin heavy chain-related | e value: 9.3e-07 | % identity: 21.58
Query:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPFHPFVQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P   F+ 
Subjt:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPFHPFVQ

Query:  EFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYA
         F     +A +Q+       + ++     L+   +     L V+ +       ++  K G+ Y+ + +G   +  GP+  + W+  +FYA
Subjt:  EFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYA

AT2G15420.1 myosin heavy chain-related | e value: 4.3e-04 | % identity: 23.8
Query:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  P   F+ E+  R  +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIA

Query:  KKPGRFYMCARKGADGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDELVVPSLTFP---LVSIQPVPELMQASFDTLKYYNERFPRGRK
        + PG +Y  A K    IV G  S I GW R++F+                 +W    E V     FP   L +I  + EL    + T  +   R  R R 
Subjt:  KKPGRFYMCARKGADGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDELVVPSLTFP---LVSIQPVPELMQASFDTLKYYNERFPRGRK

Query:  VGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASED---PTPVIE---LESSGG--PSREK
        +G ++           +    +  +E S   +E  +      N   +S GR  A E+A        +   P +ED      V+    L S GG  PS+++
Subjt:  VGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASED---PTPVIE---LESSGG--PSREK

Query:  RPRDQTEAVDALPSGEEMREEVPPKRRRKKKKTTSPLEVGARGVLP--ASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRA
          RD           E+   +VP   RR         E   RG +    SY Y+  D     + TS                D VSRI  AS D      
Subjt:  RPRDQTEAVDALPSGEEMREEVPPKRRRKKKKTTSPLEVGARGVLP--ASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRA

Query:  SKFVSDPV--SVLQRTIDYAAEVRLVSPFLFNLPNSCKGRAGWEGSSGSEGERGVLCCL----------GGCFPTMKDELLKAHSEVETLKAEVETKAEL
         +FV   +     Q+T         +S FL      CK +      +    E   L  L                   EL +  S +++   E   + E 
Subjt:  SKFVSDPV--SVLQRTIDYAAEVRLVSPFLFNLPNSCKGRAGWEGSSGSEGERGVLCCL----------GGCFPTMKDELLKAHSEVETLKAEVETKAEL

Query:  LKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKDEMLQALEAKEEELKRATA-------ELETVKERLSNGV-LLEESFRQHPDFDGFAKDFSDAGFKFL
        L K      A+LR +      +E    +  K   ++   L+  E+ +K+  A       ELE  +  L NGV  LE +     D D F +  + A    L
Subjt:  LKKEEDRRKAQLRAAHTITRGLEKEKFQLLKEKDEMLQALEAKEEELKRATA-------ELETVKERLSNGV-LLEESFRQHPDFDGFAKDFSDAGFKFL

Query:  MKGIA
        + GI+
Subjt:  MKGIA

AT3G42060.1 myosin heavy chain-related | e value: 4.3e-04 | % identity: 28.57
Query:  SRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAE
        SR    + G        PE +   IPE  +R  + PEG++ L+   F E GL  P   F+  +  R  +A +Q++         L IL       +EE  
Subjt:  SRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPFHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAE

Query:  LLDVDQLLACFEAKRIAKKPGRFYMCA--RKGADGIVKGPTS-IKGWVRKWFYA
        ++D+D L     +  I  K  R  +CA  R+G   I  G TS ++ W + +F+A
Subjt:  LLDVDQLLACFEAKRIAKKPGRFYMCA--RKGADGIVKGPTS-IKGWVRKWFYA

AT5G38190.1 INVOLVED IN: biological_process unknown | e value: 6.0e-06 | % identity: 22.35
Query:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPFHPFVQEFLFRTGLAPA
        R++DD  E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P   F+  F     +A +
Subjt:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPFHPFVQEFLFRTGLAPA

Query:  QVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYA
        Q+       + ++     L+   +     L V+ +       ++  K G+ Y+ + +G   +   P+  + W+  +FYA
Subjt:  QVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYA
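
Each alignment block above pairs a Query line with a match line in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how identity and positive counts could be tallied from one such pair follows. The function name `alignment_stats` is hypothetical, and the result covers only the segment passed in, so it need not equal the whole-alignment % identity reported for a hit.

```python
def alignment_stats(query: str, midline: str) -> dict[str, float]:
    """Count identities and positives in one BLAST-style alignment segment.

    Assumes the leading 'Query:' label and padding have already been removed,
    so that both strings start at the same alignment column.
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    # A letter in the match line means the two residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


if __name__ == "__main__":
    # Last segment of the first GenBank hit, copied from the alignment above.
    query = "TSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQ"
    match = "TSPLEVGARGVLPAS+A RVDDPEARM GT DVT RFRVEPSSS VRDQ"
    print(alignment_stats(query, match))
```

Summing `identities` and `length` over all segments of a hit before dividing would give a figure closer to a whole-alignment identity, though the exact definition used by the page is not stated here.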


Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAAGATTTAGCTCGTAGGCTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTTGGATCCCTTTGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCTTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCTCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGCTACGAGCTTGGGATAGTGAAGA
GGCCGAGCTGTTAGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCAAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTACATGTGCGCAAGGAAAGGCGCAGACGGTATAG
TTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTTTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTTGGTCGTTCCTTCTTTGACGTTCCCACTAGTT
TCAATCCAACCAGTCCCCGAACTTATGCAAGCCTCCTTCGACACGCTGAAATATTACAATGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACGAGCT
GCTGCTTGAGTCCGGGCTGCTAGACTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGC
GCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCCGCTGTGGTAGGGCCAGCCTCGGAAGATCCAACCCCAGTGATCGAACTG
GAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGAGATCAGACCGAGGCGGTGGACGCCTTGCCTTCGGGCGAGGAGATGAGGGAGGAAGTCCCTCCGAAGCGAAG
GAGGAAGAAGAAGAAGACAACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTACGCATATCGGGTGGACGATCCTGAGGCCAGGATGGACGGGACGT
CTGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTAGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAA
TTTGTGAGCGACCCAGTGTCCGTTCTGCAGAGGACCATCGACTACGCTGCTGAGGTAAGACTAGTGTCTCCGTTTTTGTTTAATTTACCTAACAGCTGTAAAGGCCGAGC
TGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCCCACCATGAAGGATGAGCTACTGAAGGCTCACTCTGAGGTGG
AAACTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCTCATACTATCACCAGGGGCTTGGAGAAG
GAGAAGTTCCAACTCCTCAAGGAGAAGGACGAGATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCGTGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCT
CAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCCT
CCGACATGTCTGACCTTCAGATCGATCTCAGTGGTTTGAAGAAGAAGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCTCTGGCCCCCAAGCGTTGGTGGATAAG
TATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCACTCCTCAGGCAGGGCAGAGCTGCAAGGTCTATGAGCCTTG
GCTCTGCTCTCCATTTAATGAAGAAGCTTTCATTTGTTTTTACTTTGGTGTCGGCAACATCTTTCCTTTCTTTGCTTTTTCTTTGAACTGCGGCCAATGTCACCTCGCAC
CTCTTACCTTTGAGGTTCAGAGGCTTGACCATTTTGAACTTTTCACATCGTCCCATTACCTTGAAGGTTTGAATTTTAAGTTCATCAGTGGTTTTGGCATCGCACCTCGT
ACCCTTAGATCCATTGAATACCCTTTTGGCATTTCAAGGATAATAACGCTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCCAATACGC
ACGTCCCAGGTCGGATTATGCCCTTGACCTCAAACGGGCCCTCCCAGGTCGGATCAAGGGCACCCACATGGGTTTTGACCCTCCTTAA
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAAGATTTAGCTCGTAGGCTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTTGGATCCCTTTGTAGGGGATTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCTTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCTCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGCTACGAGCTTGGGATAGTGAAGA
GGCCGAGCTGTTAGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCAAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTACATGTGCGCAAGGAAAGGCGCAGACGGTATAG
TTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTTTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTTGGTCGTTCCTTCTTTGACGTTCCCACTAGTT
TCAATCCAACCAGTCCCCGAACTTATGCAAGCCTCCTTCGACACGCTGAAATATTACAATGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACGAGCT
GCTGCTTGAGTCCGGGCTGCTAGACTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGC
GCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCCGCTGTGGTAGGGCCAGCCTCGGAAGATCCAACCCCAGTGATCGAACTG
GAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGAGATCAGACCGAGGCGGTGGACGCCTTGCCTTCGGGCGAGGAGATGAGGGAGGAAGTCCCTCCGAAGCGAAG
GAGGAAGAAGAAGAAGACAACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTACGCATATCGGGTGGACGATCCTGAGGCCAGGATGGACGGGACGT
CTGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTAGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAA
TTTGTGAGCGACCCAGTGTCCGTTCTGCAGAGGACCATCGACTACGCTGCTGAGGTAAGACTAGTGTCTCCGTTTTTGTTTAATTTACCTAACAGCTGTAAAGGCCGAGC
TGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCCCACCATGAAGGATGAGCTACTGAAGGCTCACTCTGAGGTGG
AAACTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCTCATACTATCACCAGGGGCTTGGAGAAG
GAGAAGTTCCAACTCCTCAAGGAGAAGGACGAGATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCGTGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCT
CAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCCT
CCGACATGTCTGACCTTCAGATCGATCTCAGTGGTTTGAAGAAGAAGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCTCTGGCCCCCAAGCGTTGGTGGATAAG
TATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCACTCCTCAGGCAGGGCAGAGCTGCAAGGTCTATGAGCCTTG
GCTCTGCTCTCCATTTAATGAAGAAGCTTTCATTTGTTTTTACTTTGGTGTCGGCAACATCTTTCCTTTCTTTGCTTTTTCTTTGAACTGCGGCCAATGTCACCTCGCAC
CTCTTACCTTTGAGGTTCAGAGGCTTGACCATTTTGAACTTTTCACATCGTCCCATTACCTTGAAGGTTTGAATTTTAAGTTCATCAGTGGTTTTGGCATCGCACCTCGT
ACCCTTAGATCCATTGAATACCCTTTTGGCATTTCAAGGATAATAACGCTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCCAATACGC
ACGTCCCAGGTCGGATTATGCCCTTGACCTCAAACGGGCCCTCCCAGGTCGGATCAAGGGCACCCACATGGGTTTTGACCCTCCTTAA
Protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLCRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPFHPFVQ
EFLFRTGLAPAQVASNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGADGIVKGPTSIKGWVRKWFYASGEWLAKDELVVPSLTFPLV
SIQPVPELMQASFDTLKYYNERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPTPVIEL
ESSGGPSREKRPRDQTEAVDALPSGEEMREEVPPKRRRKKKKTTSPLEVGARGVLPASYAYRVDDPEARMDGTSDVTARFRVEPSSSRVRDQVSRISAASLDRCLRRASK
FVSDPVSVLQRTIDYAAEVRLVSPFLFNLPNSCKGRAGWEGSSGSEGERGVLCCLGGCFPTMKDELLKAHSEVETLKAEVETKAELLKKEEDRRKAQLRAAHTITRGLEK
EKFQLLKEKDEMLQALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLSGLKKKYAEQWASGPGGTSGPQALVDK
YVRDLDSDYSDLEEDQVGTTQEGTPQAGQSCKVYEPWLCSPFNEEAFICFYFGVGNIFPFFAFSLNCGQCHLAPLTFEVQRLDHFELFTSSHYLEGLNFKFISGFGIAPR
TLRSIEYPFGISRIITLQVLRVPRVREDVSFQIGQYARPRSDYALDLKRALPGRIKGTHMGFDPP
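
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. A minimal sketch of such a translation follows, assuming the standard genetic code (the page does not name the translation table) and using a hypothetical function name `translate`. It can be used to spot-check that the listed CDS and protein agree.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# ASSUMPTION: the page does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    sequence = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE.get(sequence[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the CDS sequence listed above.
    print(translate("ATGTCGTCCTCTTTTAGCAGC"))
```

Passing the full CDS text, line breaks included, and comparing the result with the protein sequence lines joined together is one way to check the record for internal consistency.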