; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g20280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g20280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-binding HORMA family protein
Genome locationchr4:14754090..14757609
RNA-Seq ExpressionMoc04g20280
SyntenyMoc04g20280
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]9.7e-11788.58Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDT KYYKEHFP+GRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP

Query:  AVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR
        AV PIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TP VVGPASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR
Subjt:  AVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGA   LP SFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.2e-13892.67Show/hide
Query:  MFKYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MF+YGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFKYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT KYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAV PIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALEAAQSSKP TP VVGPASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]6.5e-10596.91Show/hide
Query:  MFKYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MF+YGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFKYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT KYYKEHFP+GRKVGTLVTDKLLLESGLLDYNPAV PIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.0e-18693.24Show/hide
Query:  MSSSFNSDLGSDEDLARKLESELEEIENFRFSDDGEDSDVSTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLGIPEEGERADNPPEGWVTLYFKMFKYG
        MSSS +S+L  + DLAR+LES+LEEIEN R SDDGEDSD STSGQGLEYPSRIPEHYLGSLRRGFAIPENILL +PEEGERADNPPEGWVTLYFKMF+YG
Subjt:  MSSSFNSDLGSDEDLARKLESELEEIENFRFSDDGEDSDVSTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLGIPEEGERADNPPEGWVTLYFKMFKYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT KYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAV PIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATP VVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]5.2e-17164.04Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDT K+YK+HFP+ RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP

Query:  AVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
         V  IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  AVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GA   LPTS AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASVQSALAVKAELDGNEALAAREKEEFSAALDAASSTMKNELLKAHSEVEILKAE--------------------------------------------
        +AS+  A+ VKAELDG EALAA+E+E   AAL+AA +T+K ELLKA  EV+IL+AE                                            
Subjt:  VASVQSALAVKAELDGNEALAAREKEEFSAALDAASSTMKNELLKAHSEVEILKAE--------------------------------------------

Query:  -ALEAKEEELKHATAELEMVKERLSNVVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPNGTHGTQALVDKYVR
          LE K+  +   T EL+ +KERL+N  LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGPNGT   Q+LVDKYVR
Subjt:  -ALEAKEEELKHATAELEMVKERLSNVVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPNGTHGTQALVDKYVR

Query:  DLDSDYSDLEED-----QVLRVPRVREDVSFQIG
        +LDSDYSD+EE+     +   V   +E+V  Q G
Subjt:  DLDSDYSDLEED-----QVLRVPRVREDVSFQIG

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092984.7e-11788.58Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDT KYYKEHFP+GRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP

Query:  AVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR
        AV PIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TP VVGPASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR
Subjt:  AVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGA   LP SFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138265.7e-13992.67Show/hide
Query:  MFKYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MF+YGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFKYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT KYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAV PIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALEAAQSSKP TP VVGPASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

A0A6J1DWF1 uncharacterized protein LOC1110251083.2e-10596.91Show/hide
Query:  MFKYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MF+YGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFKYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT KYYKEHFP+GRKVGTLVTDKLLLESGLLDYNPAV PIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255029.6e-18793.24Show/hide
Query:  MSSSFNSDLGSDEDLARKLESELEEIENFRFSDDGEDSDVSTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLGIPEEGERADNPPEGWVTLYFKMFKYG
        MSSS +S+L  + DLAR+LES+LEEIEN R SDDGEDSD STSGQGLEYPSRIPEHYLGSLRRGFAIPENILL +PEEGERADNPPEGWVTLYFKMF+YG
Subjt:  MSSSFNSDLGSDEDLARKLESELEEIENFRFSDDGEDSDVSTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLGIPEEGERADNPPEGWVTLYFKMFKYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDT KYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAV PIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATP VVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256652.5e-17164.04Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDT K+YK+HFP+ RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNP

Query:  AVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
         V  IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  AVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GA   LPTS AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASVQSALAVKAELDGNEALAAREKEEFSAALDAASSTMKNELLKAHSEVEILKAE--------------------------------------------
        +AS+  A+ VKAELDG EALAA+E+E   AAL+AA +T+K ELLKA  EV+IL+AE                                            
Subjt:  VASVQSALAVKAELDGNEALAAREKEEFSAALDAASSTMKNELLKAHSEVEILKAE--------------------------------------------

Query:  -ALEAKEEELKHATAELEMVKERLSNVVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPNGTHGTQALVDKYVR
          LE K+  +   T EL+ +KERL+N  LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGPNGT   Q+LVDKYVR
Subjt:  -ALEAKEEELKHATAELEMVKERLSNVVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPNGTHGTQALVDKYVR

Query:  DLDSDYSDLEED-----QVLRVPRVREDVSFQIG
        +LDSDYSD+EE+     +   V   +E+V  Q G
Subjt:  DLDSDYSDLEED-----QVLRVPRVREDVSFQIG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42060.1 myosin heavy chain-related1.4e-0426.47Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLGIPEEGERADNPPEGWVTLYFKMF-KYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE
        SR    + G        PE +   IPE  +R  + PEG++ L+   F + GL  PL  F+  +  R  +A +Q++         L IL       +EE  
Subjt:  SRIPEHYLGSLRRGFAIPENILLGIPEEGERADNPPEGWVTLYFKMF-KYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE

Query:  LLDVDQLLACFEAKRIAKKPGRFYMCARKGAG-GIVKGPTS-IKGWVRKWFYASGEWLAKDESGRSFFDV
        ++D+D L     +  I  K  R  +CA    G  I  G TS ++ W + +F+A    ++ D++  S  ++
Subjt:  LLDVDQLLACFEAKRIAKKPGRFYMCARKGAG-GIVKGPTS-IKGWVRKWFYASGEWLAKDESGRSFFDV

AT4G32200.1 DNA-binding HORMA family protein9.7e-0625.88Show/hide
Query:  IPEEGERADNPPEGWVTLYFKMFKY-GLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE---AKRIAKKPG
        IP E ER  NPP G+V +Y   F+   L  P+   +  FL R  +A +Q+ P      F  A+ F      +EE  L++V     CFE     +  + PG
Subjt:  IPEEGERADNPPEGWVTLYFKMFKY-GLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE---AKRIAKKPG

Query:  RFYMCARKGAGGIVKGP--TSIKGWVRKWFY--ASGEWLAKDESGRS--FFDVPTRFGNLVSIRPVPE-------LTQASFDTFKYYKEHFPKGRK---V
         +++   +     + GP  ++ K W   +FY     E   +  SGR   + + P R+      RP P+       + +A F   +   +   + R    +
Subjt:  RFYMCARKGAGGIVKGP--TSIKGWVRKWFY--ASGEWLAKDESGRS--FFDVPTRFGNLVSIRPVPE-------LTQASFDTFKYYKEHFPKGRK---V

Query:  GTLVTDKLLLESGLLDYNPAVCPIESSR
        G +    +   +GLL+  P   PIE  R
Subjt:  GTLVTDKLLLESGLLDYNPAVCPIESSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCATGAGTCATCTGGGAGCACCAATAGGGGTCCTCCACGTGTCCAGGGTATTCTCTTCTCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCA
GAGAAGTTCATTCGACTTGCTTTGGATGCGTGGCGACTTCCTATTCGTGGGAAAATATAACTGTTGCGGTGGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGG
ATTTTCGGGAGGATCCTAGCCGCTCGTTGATTATACATCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCAAACGTAGTTGCCATGTCGTCCTCTTTT
AACAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAAGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGTCTCCAC
CTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTGGGATTCCGGAGGAGG
GGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTAAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGA
ACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATCTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTATTGGA
CGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCT
CCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTA
TCAATCCGACCAGTCCCCGAGCTCACGCAAGCCTCCTTCGACACGTTCAAATATTACAAGGAGCATTTTCCGAAGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCT
GCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTTGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGC
GCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGTTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTG
GAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCACCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAG
GAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTTGTAGGGCCCTGCCTACGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGT
CCGACGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAA
TTTGTAAGTGACCCGGGGTCCGTGCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCGTTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAA
TGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGATGCTGCCTCTTCCACCATGAAGAATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGA
AGGCTGAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCACGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAATGTAGTCCTATTGGAGGAATCGTTCAGG
CAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGG
TCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAACGGCACCCATGGCACCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACC
TCGAAGAGGATCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCCAATATGTACGTCCCAGGTCGGACTATTCCCTTGACCTCAAACGGCCC
CTCCCAGGTCGGGTCAAGGGCACCCACATGGGTTTGGACCCTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTCATGAGTCATCTGGGAGCACCAATAGGGGTCCTCCACGTGTCCAGGGTATTCTCTTCTCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCA
GAGAAGTTCATTCGACTTGCTTTGGATGCGTGGCGACTTCCTATTCGTGGGAAAATATAACTGTTGCGGTGGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGG
ATTTTCGGGAGGATCCTAGCCGCTCGTTGATTATACATCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCAAACGTAGTTGCCATGTCGTCCTCTTTT
AACAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAAGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGTCTCCAC
CTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTGGGATTCCGGAGGAGG
GGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTAAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGA
ACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATCTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTATTGGA
CGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCT
CCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTA
TCAATCCGACCAGTCCCCGAGCTCACGCAAGCCTCCTTCGACACGTTCAAATATTACAAGGAGCATTTTCCGAAGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCT
GCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTTGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGC
GCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGTTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTG
GAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCACCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAG
GAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTTGTAGGGCCCTGCCTACGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGT
CCGACGTGACAGCACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAA
TTTGTAAGTGACCCGGGGTCCGTGCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCGTTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAA
TGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGATGCTGCCTCTTCCACCATGAAGAATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGA
AGGCTGAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCACGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAATGTAGTCCTATTGGAGGAATCGTTCAGG
CAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGG
TCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAACGGCACCCATGGCACCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACC
TCGAAGAGGATCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCCAATATGTACGTCCCAGGTCGGACTATTCCCTTGACCTCAAACGGCCC
CTCCCAGGTCGGGTCAAGGGCACCCACATGGGTTTGGACCCTCCTTAA
Protein sequenceShow/hide protein sequence
MTHESSGSTNRGPPRVQGILFSQTLAPSLSGPISTWQRSSFDLLWMRGDFLFVGKYNCCGGFIVGIFKYSDASDFREDPSRSLIIHLEPLVGRSLPSLSLSNVVAMSSSF
NSDLGSDEDLARKLESELEEIENFRFSDDGEDSDVSTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLGIPEEGERADNPPEGWVTLYFKMFKYGLRLPLHPFVQEFLFR
TGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLV
SIRPVPELTQASFDTFKYYKEHFPKGRKVGTLVTDKLLLESGLLDYNPAVCPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIEL
ESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGACRALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASK
FVSDPGSVLQRTIDYAAEAFVASVQSALAVKAELDGNEALAAREKEEFSAALDAASSTMKNELLKAHSEVEILKAEALEAKEEELKHATAELEMVKERLSNVVLLEESFR
QHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPNGTHGTQALVDKYVRDLDSDYSDLEEDQVLRVPRVREDVSFQIGQYVRPRSDYSLDLKRP
LPGRVKGTHMGLDPP