CuGenDBv2

Moc03g14570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g14570
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr3:9811665..9818991
RNA-Seq Expression: Moc03g14570
Synteny: Moc03g14570
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
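
The genome location field packs the chromosome and both coordinates into one string. A minimal sketch of splitting it and computing the locus span follows; the helper name `parse_location` is hypothetical, and the span assumes both coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:9811665..9818991")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus is longer than the CDS listed further down, which would be consistent with the gene containing introns.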


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | e value: 1.9e-116 | % identity: 88.19
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIES RPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAV GP SEDP PVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGA GVLPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e value: 1.3e-138 | % identity: 93.41
Query:  MFEYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE  RPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQSSKP TPAV GP SEDP PVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 6.9e-180 | % identity: 92.39
Query:  MSSSFSSNLGSDEDLACRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG----------LPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLA RLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG          LPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLACRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG----------LPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIES RPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAV GP SEDP  VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia] | e value: 4.6e-107 | % identity: 74.68
Query:  MVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGACG
        MVCGFAS+VKRKSKGRAHA EAAQSSKPATPAVAGP SEDP PVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKT SPLEVGACG
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGACG

Query:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
        VLPASFADRVDDPEARMGGTSDVTARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKER
        EKEEFS                                                                        ALEAKDKELEHATAELETAKER
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKER

Query:  LSNGVLLEESFR
        LSNGVLLEESFR
Subjt:  LSNGVLLEESFR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 3.4e-179 | % identity: 72.48
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+ RPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V        +GP+S  PTPVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GA G LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LR AHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIACDMPDLQIDLSGLKKE
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA DMP LQIDL+GLKK+
Subjt:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIACDMPDLQIDLSGLKKE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 | e value: 9.0e-117 | % identity: 88.19
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIES RPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAV GP SEDP PVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGA GVLPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 | e value: 6.4e-139 | % identity: 93.41
Query:  MFEYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE  RPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQSSKP TPAV GP SEDP PVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 3.4e-180 | % identity: 92.39
Query:  MSSSFSSNLGSDEDLACRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG----------LPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLA RLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG          LPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLACRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG----------LPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLA AQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIES RPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAV GP SEDP  VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC111025606 | e value: 2.2e-107 | % identity: 74.68
Query:  MVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGACG
        MVCGFAS+VKRKSKGRAHA EAAQSSKPATPAVAGP SEDP PVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKT SPLEVGACG
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGACG

Query:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
        VLPASFADRVDDPEARMGGTSDVTARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKER
        EKEEFS                                                                        ALEAKDKELEHATAELETAKER
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKER

Query:  LSNGVLLEESFR
        LSNGVLLEESFR
Subjt:  LSNGVLLEESFR

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 1.7e-179 | % identity: 72.48
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+ RPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V        +GP+S  PTPVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPTSEDPTPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GA G LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LR AHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIACDMPDLQIDLSGLKKE
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA DMP LQIDL+GLKK+
Subjt:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIACDMPDLQIDLSGLKKE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G15420.1 myosin heavy chain-related | e value: 1.6e-04 | % identity: 24.79
Query:  PEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYM
        P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A++Q+          LAIL       +E    +D D         R+ + PG +Y 
Subjt:  PEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYM

Query:  CARKGAGGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVT
         A K    IV G  S I GW R++F+                 +W    E      D P  F  L +I  + EL    + T  + + R  R R +G ++ 
Subjt:  CARKGAGGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVT

Query:  DELLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSED---PTPVIE---LESSGG--PSREKRPRDQT
            L   + ++      +  + P+  +        N   +S GR  A E+A        +   P +ED      V+    L S GG  PS+++  RD  
Subjt:  DELLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSED---PTPVIE---LESSGG--PSREKRPRDQT

Query:  EAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTS--DVTARFRVEPSSSGVRDQVSRISAASLDRCLRR------A
                 E+   +VP K  R++      ++ G       S+  +  D       TS  D+ +R R      G  D  S     S+DR + R      A
Subjt:  EAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTS--DVTARFRVEPSSSGVRDQVSRISAASLDRCLRR------A

Query:  SKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLR
         K     G+  + +     +A V++ + A    AE +  + LA     + E SA LE  SS + +++    S V+    E   + E L K      A+LR
Subjt:  SKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLR

Query:  PAHAITRGLEKEKFQLLKEKDDMLQALE---AKDKELEHAT-AELETAKERLSNGV-LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA
         +       E++K     +    LQ LE    K   +  AT  ELE  +  L NGV  LE +     D D F +  + A    L+ GI+
Subjt:  PAHAITRGLEKEKFQLLKEKDDMLQALE---AKDKELEHAT-AELETAKERLSNGV-LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA

AT4G03830.1 Protein of unknown function, DUF601 | e value: 3.5e-04 | % identity: 27.2
Query:  PEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYM
        PE  +  + P  G    +   F E GL  PL   + +F+   G+AL Q+ PN    I +L  L        E+  LL +  LL  +  K+  +  G F++
Subjt:  PEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYM

Query:  CARKGAGGIVKGPTSIKGWVRKWFY
          RKG       P   + W + +F+
Subjt:  CARKGAGGIVKGPTSIKGWVRKWFY


Sequences
CDS sequence
ATGGCAGCGTTGGAATCTGGGAGCAGTGCGGCAACGACAACCTTCAAATTTGGGAGTAGTGCGACGACGACAGTGTTCAGATCTTACCGGAGCAGTGCGGTGGTGGCAGC
GTTCGGATCTAGCCAGAATAGTGCGTCGGCGGGAGCTTTCGGATATGGGAACACAAAGGTGGCGGCACCGTTTGGATCTTGCTGTAACATTGCGACGACGGCAGCTTTCG
GATCTGAAACTACGAAGGCGACGACAACGTTTAGATACGGCCGAAGTAATGCGGAGGCGGCGACTTTCGTATCCTCGAACACGAAGGAAGCTAAGATTGCAACTCGAACT
CGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGGTATTCTCT
TCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGATTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACC
GTTGCGGTAGATCTATCGTCGGAATATTTAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTTGGTCT
CTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTTGTAGGTTAGAGTCCGAGCTCGAGGAGAT
AGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCTACCTCGGGTCAGGGTTTAGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTA
GGGGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTC
CAAGAATTTCTCTTCCGGACTGGGTTGGCTCTGGCTCAAGTGGCCCCCAATGGGTGGGGCGTCATTTTCGCCTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGA
GGAGGCCGAGTTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTA
TAGTTAAAGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACT
AGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCTGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAAC
CCTGGTGACTGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTATAACCCCGCAGTTCGTCCCATTGAATCCTTAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGAT
TTGCAAGCAACGTGAAGCGCAAGTCCAAAGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTACCTCGGAAGATCCA
ACCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAACGCCCCAGGGATCAGACCGAGGCGGTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGA
AGTCCCTCTGAAGCGAAGGAGGAAGAAAAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTTGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGG
CCAGGATGGGCGGGACGTCCGATGTGACAGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGC
CTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAA
GGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACT
CTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGACCTGCCCATGCTATCACCAGGGGC
TTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCACTTGAAGCGAAAGATAAGGAGCTGGAGCATGCGACCGCCGAGCTGGAGACGGCGAA
GGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGG
GCATTGCTTGCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTT
GGTGGATAA
mRNA sequence
ATGGCAGCGTTGGAATCTGGGAGCAGTGCGGCAACGACAACCTTCAAATTTGGGAGTAGTGCGACGACGACAGTGTTCAGATCTTACCGGAGCAGTGCGGTGGTGGCAGC
GTTCGGATCTAGCCAGAATAGTGCGTCGGCGGGAGCTTTCGGATATGGGAACACAAAGGTGGCGGCACCGTTTGGATCTTGCTGTAACATTGCGACGACGGCAGCTTTCG
GATCTGAAACTACGAAGGCGACGACAACGTTTAGATACGGCCGAAGTAATGCGGAGGCGGCGACTTTCGTATCCTCGAACACGAAGGAAGCTAAGATTGCAACTCGAACT
CGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGGTATTCTCT
TCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGATTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACC
GTTGCGGTAGATCTATCGTCGGAATATTTAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTTGGTCT
CTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTTGTAGGTTAGAGTCCGAGCTCGAGGAGAT
AGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCTACCTCGGGTCAGGGTTTAGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTA
GGGGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTC
CAAGAATTTCTCTTCCGGACTGGGTTGGCTCTGGCTCAAGTGGCCCCCAATGGGTGGGGCGTCATTTTCGCCTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGA
GGAGGCCGAGTTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTA
TAGTTAAAGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACT
AGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCTGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAAC
CCTGGTGACTGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTATAACCCCGCAGTTCGTCCCATTGAATCCTTAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGAT
TTGCAAGCAACGTGAAGCGCAAGTCCAAAGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTACCTCGGAAGATCCA
ACCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAACGCCCCAGGGATCAGACCGAGGCGGTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGA
AGTCCCTCTGAAGCGAAGGAGGAAGAAAAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTTGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGG
CCAGGATGGGCGGGACGTCCGATGTGACAGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGC
CTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAA
GGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACT
CTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGACCTGCCCATGCTATCACCAGGGGC
TTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCACTTGAAGCGAAAGATAAGGAGCTGGAGCATGCGACCGCCGAGCTGGAGACGGCGAA
GGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGG
GCATTGCTTGCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTT
GGTGGATAA
Protein sequence
MAALESGSSAATTTFKFGSSATTTVFRSYRSSAVVAAFGSSQNSASAGAFGYGNTKVAAPFGSCCNIATTAAFGSETTKATTTFRYGRSNAEAATFVSSNTKEAKIATRT
RPPDRSEYLGGPAQKGEHSDDQVSIGRIPSLVRGYSLPQTLAPSLSGPISTWQRSSFDLLWTRGDFLFVGKYNRCGRSIVGIFKYSDASDLREDPSRSLITRLEPLVGWS
LPSLSLSNVVAMSSSFSSNLGSDEDLACRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFV
QEFLFRTGLALAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPT
RFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPTSEDP
TPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRC
LRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRPAHAITRG
LEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIACDMPDLQIDLSGLKKEVCRAVGVWAWRHPWPPSV
GG
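
The protein sequence above should be the translation of the CDS sequence. A minimal sketch for checking that relationship follows, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the `translate` helper is hypothetical, and only the first 21 and last 9 bases of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the CDS listed in this record.
cds_start = "ATGGCAGCGTTGGAATCTGGG"
cds_end = "GGTGGATAA"
print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS string to `translate` and comparing the result with the full protein sequence extends the same check to the whole record.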