CuGenDBv2

Cucsat.G17987 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G17987
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Protein of unknown function (DUF789)
Genome location: ctg3345:2792207..2795377
RNA-Seq Expression: Cucsat.G17987
Synteny: Cucsat.G17987
Gene Ontology terms:
GO:0009808 - lignin metabolic process (biological process)
GO:0032259 - methylation (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0016710 - trans-cinnamate 4-monooxygenase activity (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domains: IPR008507 - Protein of unknown function DUF789
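
The genome location above is given as contig:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the string can be split and the genomic span computed as follows. The function name parse_location is hypothetical and is not part of any CuGenDB tool.

```python
import re

def parse_location(location: str):
    """Split a 'contig:start..end' string into its parts.

    Assumes 1-based, inclusive coordinates, so the span length is
    end - start + 1. This convention is an assumption, not something
    the gene page states.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end, end - start + 1

contig, start, end, span = parse_location("ctg3345:2792207..2795377")
print(contig, start, end, span)  # ctg3345 2792207 2795377 3171
```

Under that assumption the gene spans 3171 bp on ctg3345.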


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139934.1 uncharacterized protein LOC101222318 isoform X1 [Cucumis sativus] | e value: 1.45e-238 | identity: 100%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
        WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL

Query:  RVQHHDFNYFTAIRRG
        RVQHHDFNYFTAIRRG
Subjt:  RVQHHDFNYFTAIRRG

XP_008448279.1 PREDICTED: uncharacterized protein LOC103490514 [Cucumis melo] | e value: 1.89e-234 | identity: 98.42%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDK SMQSNLGCFLHCTTPVVNSQFL KSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        +GFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESP+HLSDRLGYLYFQYFERSTPYGRVPLMDKI+GLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
        WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL

Query:  RVQHHDFNYFTAIRRG
        RVQHHDFNYFTAIRRG
Subjt:  RVQHHDFNYFTAIRRG
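
The % identity column can be related to the middle line of each alignment block, where a letter marks an identical residue and a space or '+' marks a mismatch. The sketch below assumes the usual BLAST convention that percent identity is the number of identical positions divided by the number of aligned columns. The helper name percent_identity is hypothetical. For the Cucumis melo hit above, the middle lines contain five non-letter positions over 316 aligned columns, so 311/316 gives the reported 98.42.

```python
from typing import Optional

def percent_identity(midline: str, aligned_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style alignment midline.

    `midline` is the concatenated middle line of the alignment with the
    leading indentation column already removed. Letters count as
    identities; spaces and '+' count as non-identical columns.
    Pass `aligned_length` explicitly if trailing spaces may have been
    stripped from the midline.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / aligned_length, 2)

# Toy midline: 8 identical residues out of 10 aligned columns.
print(percent_identity("MILDK SMQ+"))  # 80.0

# Same arithmetic as the Cucumis melo hit: 311 identities over 316 columns.
print(percent_identity("A" * 311 + " " * 2 + "+" * 3))  # 98.42
```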

XP_022140683.1 uncharacterized protein LOC111011286 [Momordica charantia] | e value: 9.27e-226 | identity: 95.58%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MIL K SMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLW+CYDEWSAYGAGVPIAVN+GETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        N FR+ECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGG LEQ+SPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFES-GEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQ
        WMAV+WYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFES GE+KRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQER++SLLSVADSWLKQ
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFES-GEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQ

Query:  LRVQHHDFNYFTAIRRG
        LRVQHHDFNYFT IRRG
Subjt:  LRVQHHDFNYFTAIRRG

XP_031743931.1 uncharacterized protein LOC101222318 isoform X2 [Cucumis sativus] | e value: 9.79e-231 | identity: 98.1%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        NGF      SETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
        WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL

Query:  RVQHHDFNYFTAIRRG
        RVQHHDFNYFTAIRRG
Subjt:  RVQHHDFNYFTAIRRG

XP_038900400.1 uncharacterized protein LOC120087632 [Benincasa hispida] | e value: 1.49e-231 | identity: 96.84%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDK SMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTL DLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
        WMAV+WYPIYHIPMGRTIKDLSTCFL+YHTLSSSFQDMDVEDEFESGE+KRKEGE ISL AFGLATYKMQGNVWISGNYGRDQER++SLLSVADSWLKQL
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL

Query:  RVQHHDFNYFTAIRRG
        RVQHHDFNYFT IRRG
Subjt:  RVQHHDFNYFTAIRRG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KG73 Uncharacterized protein | e value: 7.00e-239 | identity: 100%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
        WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL

Query:  RVQHHDFNYFTAIRRG
        RVQHHDFNYFTAIRRG
Subjt:  RVQHHDFNYFTAIRRG

A0A1S3BIR2 uncharacterized protein LOC103490514 | e value: 9.16e-235 | identity: 98.42%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDK SMQSNLGCFLHCTTPVVNSQFL KSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        +GFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESP+HLSDRLGYLYFQYFERSTPYGRVPLMDKI+GLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
        WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL

Query:  RVQHHDFNYFTAIRRG
        RVQHHDFNYFTAIRRG
Subjt:  RVQHHDFNYFTAIRRG

A0A5A7V865 DUF789 domain-containing protein | e value: 9.16e-235 | identity: 98.42%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDK SMQSNLGCFLHCTTPVVNSQFL KSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        +GFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESP+HLSDRLGYLYFQYFERSTPYGRVPLMDKI+GLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
        WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQL

Query:  RVQHHDFNYFTAIRRG
        RVQHHDFNYFTAIRRG
Subjt:  RVQHHDFNYFTAIRRG

A0A6J1CIJ3 uncharacterized protein LOC111011286 | e value: 4.49e-226 | identity: 95.58%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MIL K SMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLW+CYDEWSAYGAGVPIAVN+GETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
        N FR+ECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGG LEQ+SPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
Subjt:  NGFRDECGDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFES-GEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQ
        WMAV+WYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFES GE+KRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQER++SLLSVADSWLKQ
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFES-GEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQ

Query:  LRVQHHDFNYFTAIRRG
        LRVQHHDFNYFT IRRG
Subjt:  LRVQHHDFNYFTAIRRG

A0A6J1FWT0 uncharacterized protein LOC111448145 | e value: 2.29e-222 | identity: 94.04%
Query:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
        MILDK SMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV
Subjt:  MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTV

Query:  NGFRDECG-DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPA
        NGFRDECG DSE RDSFSDS SDESESEK WRWDGSSSEEGG LEQ+SPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARR+PGLMTLRSVDLSPA
Subjt:  NGFRDECG-DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPA

Query:  SWMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESG-EKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLK
        SWMAV+WYPIYHIPMGRTIKDLSTCFL+YHTLSSSFQDMDVEDEFESG E+KRKEGEG++L AFGLATYKMQGNVWISGNYGRDQER++SLLSVADSWLK
Subjt:  SWMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESG-EKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLK

Query:  QLRVQHHDFNYFTA-IRRG
        QLRVQHHDFNYFT  IRRG
Subjt:  QLRVQHHDFNYFTA-IRRG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G03610.1 Protein of unknown function (DUF789) | e value: 5.3e-120 | identity: 71.38%
Query:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDEC-
        +SNL  FLHC TP+V  Q LPK+EIR LNRLWHPWER+KVE+F L DLW+CYDEWSAYGA VPI V NGE+LVQYYVPYLSAIQIFTS+S++   R+E  
Subjt:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDEC-

Query:  -GDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSW
         G+ E RD FSDS SDE           S SEEG  LE  + LH SDRLGYLY QYFERS PY RVPLMDKIN LA+RYPGLM+LRSVDLSPASWM+V+W
Subjt:  -GDSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSW

Query:  YPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQLRVQHHD
        YPIYHIPMGRTIKDLSTCFL+YHTLSSSFQDM+ E+     E+ R+EGE I+L  FG+ATYKMQG+VW+S ++  DQERL SL SVADSWLKQLRVQHHD
Subjt:  YPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQLRVQHHD

Query:  FNYF
        FNYF
Subjt:  FNYF

AT1G73210.1 Protein of unknown function (DUF789) | e value: 3.6e-76 | identity: 50.63%
Query:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSN-STVNGFRDEC
        +SNL  FL   TP   S  LP+ +            +E++EYF LGDLW+CYDE SAYG G  + +NNGET++QYYVPYLSAIQI T+  + ++  ++E 
Subjt:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSN-STVNGFRDEC

Query:  GDSETRDSFSDSCSDE-------SESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS
         +SE+ + +SDS S++       ++S K W    + SE+  F    SPL L DRLG L F+Y ER  P+ R+PL DKIN L  +YPGLMTLRSVD+SPAS
Subjt:  GDSETRDSFSDSCSDE-------SESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPAS

Query:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEG------ISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVAD
        WMAV+WYPIYHIP  R  KDL+T FL+YHTLSSSFQD  VE +  +  ++ +  E       + L  FG+ TYKMQG++W  G  G DQ+RL+ L S AD
Subjt:  WMAVSWYPIYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEG------ISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVAD

Query:  SWLKQLRVQHHDFNYF
        SWLKQL V HHD+N+F
Subjt:  SWLKQLRVQHHDFNYF

AT4G03420.1 Protein of unknown function (DUF789) | e value: 2.1e-129 | identity: 74.59%
Query:  SNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDECGD
        SNL  FLHCTTPVV  Q L K+EIR+LNR+WHPWER+KVE+F L DLW+CYDEWSAYGAGVPI ++NGE+LVQYYVPYLSAIQIFTS S++   RD+  D
Subjt:  SNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDECGD

Query:  SETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSWYPI
         E+RDSFSDS SDESES+KL R    +S+EG  LE ++ LH +DRLGYLY QYFERS PY RVPLMDKIN LA+RYPGLM+LRSVDLSPASWMAV+WYPI
Subjt:  SETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSWYPI

Query:  YHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWIS-GNYGRDQERLMSLLSVADSWLKQLRVQHHDFN
        YHIPMGRTIKDLSTCFL+YHTLSSSFQDM+ E+     E+ RKEGEG++L  FGLATYKMQGNVW+S  + G+DQER++SLLSVADSWLKQLRVQHHDFN
Subjt:  YHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWIS-GNYGRDQERLMSLLSVADSWLKQLRVQHHDFN

Query:  YFT
        YF+
Subjt:  YFT

AT4G28150.1 Protein of unknown function (DUF789) | e value: 3.9e-107 | identity: 64.69%
Query:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDECG
        +SNL  FL CTTP+V +  LPK++I+NLN LW+P E + VEYF LGD W+C+DEWSAYGAGVPI    GETLVQYYVPYLSAIQIFTS+S +N  R+E  
Subjt:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDECG

Query:  DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSWYP
         +E+ DS S+SCS+E      WRW+G SS E GF  QE PL   DRLGY Y QYFER TPY RVPLMDKI  L  RY GL +LRSVDLSPASWMAV+WYP
Subjt:  DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSWYP

Query:  IYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQLRVQHHDFN
        IYHIPM R+IKDLSTCFL+YHTLSSSFQD+          K+ +E E IS++AFG+ATYKMQG +W       D +RL+  LSVADSWLKQLRV HHDF 
Subjt:  IYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQLRVQHHDFN

Query:  YFT
        YFT
Subjt:  YFT

AT4G28150.2 Protein of unknown function (DUF789) | e value: 2.8e-105 | identity: 64.69%
Query:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDECG
        +SNL  FL CTTP+V +  LPK  I+NLN LW+P E + VEYF LGD W+C+DEWSAYGAGVPI    GETLVQYYVPYLSAIQIFTS+S +N  R+E  
Subjt:  QSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDECG

Query:  DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSWYP
         +E+ DS S+SCS+E      WRW+G SS E GF  QE PL   DRLGY Y QYFER TPY RVPLMDKI  L  RY GL +LRSVDLSPASWMAV+WYP
Subjt:  DSETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSWYP

Query:  IYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQLRVQHHDFN
        IYHIPM R+IKDLSTCFL+YHTLSSSFQD+          K+ +E E IS++AFG+ATYKMQG +W       D +RL+  LSVADSWLKQLRV HHDF 
Subjt:  IYHIPMGRTIKDLSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQLRVQHHDFN

Query:  YFT
        YFT
Subjt:  YFT


Sequences
CDS sequence
ATGATTCTTGACAAAGTGTCAATGCAATCCAATTTGGGTTGTTTTCTTCATTGCACAACGCCAGTTGTCAACTCCCAATTCTTGCCCAAGAGCGAGATTAGGAATCTTAA
TCGTTTATGGCATCCATGGGAGAGAGAGAAGGTTGAATATTTCACCCTTGGCGATCTCTGGAATTGTTACGACGAATGGAGCGCTTACGGTGCCGGTGTTCCAATCGCCG
TCAACAACGGCGAGACCCTTGTTCAATATTATGTTCCTTATCTCTCTGCAATCCAAATCTTCACAAGCAATTCCACCGTCAATGGTTTCAGGGATGAATGCGGTGACAGT
GAAACAAGGGATTCATTCAGTGATTCATGTAGCGATGAGAGTGAAAGTGAAAAATTATGGAGATGGGACGGAAGTTCATCGGAAGAAGGAGGATTCTTAGAACAAGAAAG
CCCTCTGCACCTCAGCGACAGATTGGGGTACCTTTACTTTCAATATTTCGAGAGATCAACTCCATATGGAAGAGTTCCATTAATGGATAAGATCAATGGATTAGCTCGAA
GATACCCTGGGTTGATGACATTGAGAAGCGTTGATCTTTCTCCAGCTAGTTGGATGGCTGTTTCGTGGTACCCAATATACCACATTCCAATGGGGAGAACAATAAAGGAT
TTATCAACATGCTTCTTGAGTTACCATACACTATCATCATCATTTCAAGATATGGATGTGGAGGATGAATTTGAAAGTGGAGAAAAGAAGAGAAAAGAAGGGGAAGGGAT
ATCGCTGGCAGCATTTGGTTTAGCCACATACAAGATGCAAGGAAACGTGTGGATTTCTGGTAATTATGGGAGGGACCAAGAAAGATTGATGTCTCTATTGAGCGTGGCTG
ATTCTTGGCTAAAGCAACTCAGGGTCCAACACCACGATTTCAACTACTTTACTGCCATTCGTCGTGGCTAA
mRNA sequence
ATGATTCTTGACAAAGTGTCAATGCAATCCAATTTGGGTTGTTTTCTTCATTGCACAACGCCAGTTGTCAACTCCCAATTCTTGCCCAAGAGCGAGATTAGGAATCTTAA
TCGTTTATGGCATCCATGGGAGAGAGAGAAGGTTGAATATTTCACCCTTGGCGATCTCTGGAATTGTTACGACGAATGGAGCGCTTACGGTGCCGGTGTTCCAATCGCCG
TCAACAACGGCGAGACCCTTGTTCAATATTATGTTCCTTATCTCTCTGCAATCCAAATCTTCACAAGCAATTCCACCGTCAATGGTTTCAGGGATGAATGCGGTGACAGT
GAAACAAGGGATTCATTCAGTGATTCATGTAGCGATGAGAGTGAAAGTGAAAAATTATGGAGATGGGACGGAAGTTCATCGGAAGAAGGAGGATTCTTAGAACAAGAAAG
CCCTCTGCACCTCAGCGACAGATTGGGGTACCTTTACTTTCAATATTTCGAGAGATCAACTCCATATGGAAGAGTTCCATTAATGGATAAGATCAATGGATTAGCTCGAA
GATACCCTGGGTTGATGACATTGAGAAGCGTTGATCTTTCTCCAGCTAGTTGGATGGCTGTTTCGTGGTACCCAATATACCACATTCCAATGGGGAGAACAATAAAGGAT
TTATCAACATGCTTCTTGAGTTACCATACACTATCATCATCATTTCAAGATATGGATGTGGAGGATGAATTTGAAAGTGGAGAAAAGAAGAGAAAAGAAGGGGAAGGGAT
ATCGCTGGCAGCATTTGGTTTAGCCACATACAAGATGCAAGGAAACGTGTGGATTTCTGGTAATTATGGGAGGGACCAAGAAAGATTGATGTCTCTATTGAGCGTGGCTG
ATTCTTGGCTAAAGCAACTCAGGGTCCAACACCACGATTTCAACTACTTTACTGCCATTCGTCGTGGCTAA
Protein sequence
MILDKVSMQSNLGCFLHCTTPVVNSQFLPKSEIRNLNRLWHPWEREKVEYFTLGDLWNCYDEWSAYGAGVPIAVNNGETLVQYYVPYLSAIQIFTSNSTVNGFRDECGDS
ETRDSFSDSCSDESESEKLWRWDGSSSEEGGFLEQESPLHLSDRLGYLYFQYFERSTPYGRVPLMDKINGLARRYPGLMTLRSVDLSPASWMAVSWYPIYHIPMGRTIKD
LSTCFLSYHTLSSSFQDMDVEDEFESGEKKRKEGEGISLAAFGLATYKMQGNVWISGNYGRDQERLMSLLSVADSWLKQLRVQHHDFNYFTAIRRG
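
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (the page does not name the table used) and a CDS that starts in frame. The names CODON_TABLE and translate are hypothetical helpers, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at
# each position. Assumption: the standard nuclear table applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six codons of the CDS above.
print(translate("ATGATTCTTGACAAAGTG"))  # MILDKV

# Final four codons of the CDS above, ending in the TAA stop codon.
print(translate("CGTCGTGGCTAA"))  # RRG*
```

Both fragments agree with the start (MILDKV) and end (RRG) of the listed protein sequence. Passing the full CDS, with its line breaks, to translate would allow the same check across the whole sequence.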