CuGenDBv2

CmoCh00G000050 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh00G000050
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function, DUF547
Genome location: Cmo_Chr00:838679..842850
RNA-Seq Expression: CmoCh00G000050
Synteny: CmoCh00G000050
Gene Ontology terms: NA
InterPro domains: IPR006869 - Domain of unknown function DUF547
                  IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology
GenBank top hits | e value | % identity | Alignment
KAG7019314.1 hypothetical protein SDJN02_18274, partial [Cucurbita argyrosperma subsp. argyrosperma] | 7.3e-166 | 98.72
Query:  ENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTERVVNF
        ENKKSGRSKEMGDENGGIMIN+RRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTERVVNF
Subjt:  ENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTERVVNF

Query:  QQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHER
        QQNLYEEAAYVSSQRNVKNFVNSSD+TRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHER
Subjt:  QQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHER

Query:  AQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQKLA
        AQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNE VNTVPLIHRLKYLLGKLASVNLEGLNQ QKLA
Subjt:  AQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQKLA

Query:  FWINTYNSCIMN
        FWINTYNSCIMN
Subjt:  FWINTYNSCIMN

XP_022931736.1 uncharacterized protein LOC111437898 isoform X1 [Cucurbita moschata] | 5.3e-209 | 99.74
Query:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
        MANRVLSVRKMNARLRSDNQFHIQKPSIHD QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
Subjt:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL

Query:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
        PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
Subjt:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS

Query:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
        KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
Subjt:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM

Query:  VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS
        VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS
Subjt:  VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS

XP_022931737.1 uncharacterized protein LOC111437898 isoform X2 [Cucurbita moschata] | 3.4e-163 | 99.68
Query:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
        MANRVLSVRKMNARLRSDNQFHIQKPSIHD QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
Subjt:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL

Query:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
        PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
Subjt:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS

Query:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
        KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
Subjt:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM

Query:  VNTVPLIHRL
        VNTVPLIHRL
Subjt:  VNTVPLIHRL

XP_022973199.1 uncharacterized protein LOC111471700 isoform X1 [Cucurbita maxima] | 1.1e-201 | 96.64
Query:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
        MANRVLSVRKMNARLR D QF IQKPSIHD QENKKSGRSKEMGDENGGIMI++RRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAF+RPLGALPRL
Subjt:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL

Query:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
        PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSD+TRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKK+TKIISPS
Subjt:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS

Query:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
        KKTKMPTEHELAEKSLEIL LQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTV+ANPIYHNE 
Subjt:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM

Query:  VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS
        VNTVPLIHRLKYLLGKLASVNLEGLNQ QKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS
Subjt:  VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS

XP_023521283.1 uncharacterized protein LOC111785033 isoform X2 [Cucurbita pepo subsp. pepo] | 9.5e-158 | 96.14
Query:  MANRVLSVRKMNARLRSDNQFHIQKPSIHDQENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLP
        MANRVLSVRKMNARLR D QF IQKPSIHDQENKKSGRS EMGDENGGIM N+RRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLP
Subjt:  MANRVLSVRKMNARLRSDNQFHIQKPSIHDQENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLP

Query:  PYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSK
        PYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVN+SD+TRNAKQTSWKSNSPSKENQ ASCYVKDKPSPEKKATKIISPSK
Subjt:  PYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSK

Query:  KTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEMV
        KTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTV+ANPIY N+MV
Subjt:  KTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEMV

Query:  NTVPLIHRLKY
        NTVPLIHRLKY
Subjt:  NTVPLIHRLKY

TrEMBL top hits | e value | % identity | Alignment
A0A6J1D606 uncharacterized protein LOC111017954 isoform X2 | 1.0e-144 | 66.45
Query:  MNARLRSDNQFHIQKPSIHDQENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLEL
        MNAR+R + QF I+K SIHD+E  K+  SKEMGDE  GI  N+RRLNREKKMALLQDVDKLKKKLRHEENVHRAL+RAFTRPLGALPRLPPYLPPSTLEL
Subjt:  MNARLRSDNQFHIQKPSIHDQENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLEL

Query:  LAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFV--------------------------------------------NSSDETR-------
        LAEVAVLEEEVV L+ERVVNF+Q+LY+EA +VSSQRNV+NFV                                            N SD+T        
Subjt:  LAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFV--------------------------------------------NSSDETR-------

Query:  NAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKC
        NA+QTSWKSNSPSKENQF S YVKDKPSPEKK TKI+SPS+KT  PT+HE+AEKS + LKLQLGSRL+D+ERA+ESS GASD++ESKTS NEISE IVKC
Subjt:  NAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKC

Query:  LCSIFVQEGTPRDKCI------------------------------KGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQK
        LCSIFV+  T  DKC+                              KGNDI S R LF VEAN I+ NEM NT+P IHRLKYLLGKLASV+LEGLNQ QK
Subjt:  LCSIFVQEGTPRDKCI------------------------------KGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQK

Query:  LAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL
        LAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIV+GG ILNAMTIE FILRL
Subjt:  LAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL

A0A6J1EUH6 uncharacterized protein LOC111437898 isoform X1 | 2.6e-209 | 99.74
Query:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
        MANRVLSVRKMNARLRSDNQFHIQKPSIHD QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
Subjt:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL

Query:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
        PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
Subjt:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS

Query:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
        KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
Subjt:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM

Query:  VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS
        VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS
Subjt:  VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS

A0A6J1EUJ1 uncharacterized protein LOC111437898 isoform X2 | 1.6e-163 | 99.68
Query:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
        MANRVLSVRKMNARLRSDNQFHIQKPSIHD QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
Subjt:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL

Query:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
        PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
Subjt:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS

Query:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
        KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
Subjt:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM

Query:  VNTVPLIHRL
        VNTVPLIHRL
Subjt:  VNTVPLIHRL

A0A6J1IAR9 uncharacterized protein LOC111471700 isoform X2 | 3.9e-157 | 96.13
Query:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
        MANRVLSVRKMNARLR D QF IQKPSIHD QENKKSGRSKEMGDENGGIMI++RRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAF+RPLGALPRL
Subjt:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL

Query:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
        PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSD+TRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKK+TKIISPS
Subjt:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS

Query:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
        KKTKMPTEHELAEKSLEIL LQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTV+ANPIYHNE 
Subjt:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM

Query:  VNTVPLIHRL
        VNTVPLIHRL
Subjt:  VNTVPLIHRL

A0A6J1IDV4 uncharacterized protein LOC111471700 isoform X1 | 5.2e-202 | 96.64
Query:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL
        MANRVLSVRKMNARLR D QF IQKPSIHD QENKKSGRSKEMGDENGGIMI++RRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAF+RPLGALPRL
Subjt:  MANRVLSVRKMNARLRSDNQFHIQKPSIHD-QENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRL

Query:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS
        PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSD+TRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKK+TKIISPS
Subjt:  PPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPS

Query:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM
        KKTKMPTEHELAEKSLEIL LQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTV+ANPIYHNE 
Subjt:  KKTKMPTEHELAEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEM

Query:  VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS
        VNTVPLIHRLKYLLGKLASVNLEGLNQ QKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS
Subjt:  VNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT4G37080.1 Protein of unknown function, DUF547 | 3.8e-80 | 45.72
Query:  SIHDQENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTE
        S  D++ K   +       +G  ++N+RR N+EKKM LLQDVDKLK+KLR EENVHRAL+RAFTRPLGALPRLP YLP  TLELLAEVAVLEEEVV L E
Subjt:  SIHDQENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTE

Query:  RVVNFQQNLYEEAAYVSSQRNVK----NFVN---------------------------------------------SSDETRN---------AKQTSWKS
        +VVNF+Q LY+EA Y+SS+RN++    N +N                                             SSD+T N          KQ S KS
Subjt:  RVVNFQQNLYEEAAYVSSQRNVK----NFVN---------------------------------------------SSDETRN---------AKQTSWKS

Query:  NSPS-----------KENQFASCYVKD---KPSPEKKATKIISPSKKTKMPTEHE-LAEKSLEILKLQLGSRLMDHERAQESSCGASD---NVESKTSAN
        N  S           KENQ +S   KD   K SPEKK  + ++  KK K   + E  A+K  E  KLQL  RL D ++AQES  G+S     ++S   AN
Subjt:  NSPS-----------KENQFASCYVKD---KPSPEKKATKIISPSKKTKMPTEHE-LAEKSLEILKLQLGSRLMDHERAQESSCGASD---NVESKTSAN

Query:  EISEDIVKCLCSIFVQEGTPRDKCI---------KGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCI
         +SED++KCL +I ++  + +D  +         +  ++ + +   +V+ + +     +N   LIHRLK+LL KL+ VNL+GL+  QKLAFWINTYNSC+
Subjt:  EISEDIVKCLCSIFVQEGTPRDKCI---------KGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQKLAFWINTYNSCI

Query:  MNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL
        MNA LEHGIP TPE VVALMQKA I++GG  LNA+TIE FILRL
Subjt:  MNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL

AT4G37080.2 Protein of unknown function, DUF547 | 5.9e-81 | 45.37
Query:  IQKPSIHDQENKKSGRSKEMGDENGGI------MINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAV
        ++ PS HD  + +  + K     NG +      ++N+RR N+EKKM LLQDVDKLK+KLR EENVHRAL+RAFTRPLGALPRLP YLP  TLELLAEVAV
Subjt:  IQKPSIHDQENKKSGRSKEMGDENGGI------MINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAV

Query:  LEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVK----NFVN---------------------------------------------SSDETRN-------
        LEEEVV L E+VVNF+Q LY+EA Y+SS+RN++    N +N                                             SSD+T N       
Subjt:  LEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVK----NFVN---------------------------------------------SSDETRN-------

Query:  --AKQTSWKSNSPS-----------KENQFASCYVKD---KPSPEKKATKIISPSKKTKMPTEHE-LAEKSLEILKLQLGSRLMDHERAQESSCGASD--
           KQ S KSN  S           KENQ +S   KD   K SPEKK  + ++  KK K   + E  A+K  E  KLQL  RL D ++AQES  G+S   
Subjt:  --AKQTSWKSNSPS-----------KENQFASCYVKD---KPSPEKKATKIISPSKKTKMPTEHE-LAEKSLEILKLQLGSRLMDHERAQESSCGASD--

Query:  -NVESKTSANEISEDIVKCLCSIFVQEGTPRDKCI---------KGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQKLA
          ++S   AN +SED++KCL +I ++  + +D  +         +  ++ + +   +V+ + +     +N   LIHRLK+LL KL+ VNL+GL+  QKLA
Subjt:  -NVESKTSANEISEDIVKCLCSIFVQEGTPRDKCI---------KGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQKLA

Query:  FWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL
        FWINTYNSC+MNA LEHGIP TPE VVALMQKA I++GG  LNA+TIE FILRL
Subjt:  FWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL

AT4G37080.3 Protein of unknown function, DUF547 | 5.9e-81 | 45.37
Query:  IQKPSIHDQENKKSGRSKEMGDENGGI------MINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAV
        ++ PS HD  + +  + K     NG +      ++N+RR N+EKKM LLQDVDKLK+KLR EENVHRAL+RAFTRPLGALPRLP YLP  TLELLAEVAV
Subjt:  IQKPSIHDQENKKSGRSKEMGDENGGI------MINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAV

Query:  LEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVK----NFVN---------------------------------------------SSDETRN-------
        LEEEVV L E+VVNF+Q LY+EA Y+SS+RN++    N +N                                             SSD+T N       
Subjt:  LEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVK----NFVN---------------------------------------------SSDETRN-------

Query:  --AKQTSWKSNSPS-----------KENQFASCYVKD---KPSPEKKATKIISPSKKTKMPTEHE-LAEKSLEILKLQLGSRLMDHERAQESSCGASD--
           KQ S KSN  S           KENQ +S   KD   K SPEKK  + ++  KK K   + E  A+K  E  KLQL  RL D ++AQES  G+S   
Subjt:  --AKQTSWKSNSPS-----------KENQFASCYVKD---KPSPEKKATKIISPSKKTKMPTEHE-LAEKSLEILKLQLGSRLMDHERAQESSCGASD--

Query:  -NVESKTSANEISEDIVKCLCSIFVQEGTPRDKCI---------KGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQKLA
          ++S   AN +SED++KCL +I ++  + +D  +         +  ++ + +   +V+ + +     +N   LIHRLK+LL KL+ VNL+GL+  QKLA
Subjt:  -NVESKTSANEISEDIVKCLCSIFVQEGTPRDKCI---------KGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGKLASVNLEGLNQHQKLA

Query:  FWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL
        FWINTYNSC+MNA LEHGIP TPE VVALMQKA I++GG  LNA+TIE FILRL
Subjt:  FWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL

AT5G42690.1 Protein of unknown function, DUF547 | 6.1e-62 | 44.41
Query:  KEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEA
        K  G ENG  ++N++ LNREK + L +DV+KL+KKLR EEN+HRA++RAF+RPLGALPRLPP+LPPS LELLAEVAVLEEE+V L E +V+ +Q LY+EA
Subjt:  KEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEA

Query:  AYVSSQRNVKNFVNS-------SDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHERA
         + SS  +++N   S         ++++A  ++ +S SP      +    +     +  AT I +P KKT +   H    KSLE  KL+  S       A
Subjt:  AYVSSQRNVKNFVNS-------SDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHERA

Query:  QESSCGASDNVESKTSANEISEDIVKCLCSIF----------VQEGTPRDK------------CIKGNDIVSSRCLFTVEANPIYHNE-MVNTVPLIHRL
        + SS G  D        N+ISED+VKCL +IF          V +    DK              +  DI   +    VE   +  N    +++ LI +L
Subjt:  QESSCGASDNVESKTSANEISEDIVKCLCSIF----------VQEGTPRDK------------CIKGNDIVSSRCLFTVEANPIYHNE-MVNTVPLIHRL

Query:  KYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL
        K LLG+L+ VN++ LNQ +KLAFWIN YNSC+MN  LEHGIPE+P+ +V LMQKA I +GG  LNA+TIE FILRL
Subjt:  KYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL

AT5G42690.2 Protein of unknown function, DUF547 | 4.2e-63 | 44.41
Query:  KEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEA
        K  G ENG  ++N++ LNREK + L +DV+KL+KKLR EEN+HRA++RAF+RPLGALPRLPP+LPPS LELLAEVAVLEEE+V L E +V+ +Q LY+EA
Subjt:  KEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLTERVVNFQQNLYEEA

Query:  AYVSSQRNVKNFVNS-------SDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHERA
         + SS  +++N   S         ++++A  ++ +S SP      +    +     +  AT I +P KKT +   H    KSLE  KL+  S       A
Subjt:  AYVSSQRNVKNFVNS-------SDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHELAEKSLEILKLQLGSRLMDHERA

Query:  QESSCGASDNVESKTSANEISEDIVKCLCSIF----------VQEGTPRDK------------CIKGNDIVSSRCLFTVEANPIYHNE-MVNTVPLIHRL
        + SS G  D        N+ISED+VKCL +IF          V +    DK              +  DI   +    VE   +  N    +++ LI +L
Subjt:  QESSCGASDNVESKTSANEISEDIVKCLCSIF----------VQEGTPRDK------------CIKGNDIVSSRCLFTVEANPIYHNE-MVNTVPLIHRL

Query:  KYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL
        K LLG+L+ VN++ LNQ +KLAFWIN YNSC+MN  LEHGIPE+P+ +V LMQKA I +GG  LNA+TIE FILRL
Subjt:  KYLLGKLASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRL


Sequences
CDS sequence
ATGGCCAATCGAGTTCTGAGTGTGAGAAAAATGAATGCCAGACTTCGATCCGACAATCAATTCCATATACAGAAACCGTCAATTCATGATCAAGAGAATAAGAAA
TCAGGAAGGAGCAAAGAGATGGGAGATGAAAATGGTGGAATTATGATCAATAAACGAAGATTAAACAGAGAAAAGAAAATGGCATTGCTACAAGATGTTGATAAG
CTGAAGAAGAAGCTGAGGCATGAGGAAAATGTTCACAGAGCTTTGAAGAGAGCTTTCACTAGACCTTTAGGAGCCTTGCCTCGTCTTCCTCCTTATCTCCCTCCA
TCTACACTTGAGCTTCTAGCTGAAGTAGCTGTTCTTGAAGAGGAAGTTGTCTGGCTTACAGAACGAGTTGTGAATTTTCAACAAAATCTTTATGAAGAAGCTGCC
TATGTTTCCTCACAGCGGAATGTCAAAAATTTTGTCAATAGCTCTGATGAAACAAGGAATGCCAAGCAAACCTCTTGGAAATCCAATTCACCTTCAAAGGAAAAC
CAGTTTGCTTCTTGTTATGTGAAAGATAAACCTTCCCCAGAAAAGAAAGCCACAAAAATTATCAGCCCATCCAAAAAGACAAAGATGCCAACTGAACATGAACTT
GCAGAGAAGAGCTTAGAAATTTTGAAGTTGCAGCTTGGGTCCAGATTAATGGATCATGAAAGAGCACAAGAGAGTTCTTGTGGTGCATCAGATAATGTAGAATCT
AAGACATCAGCTAACGAAATTTCTGAGGATATTGTGAAGTGTTTATGTTCCATTTTTGTTCAAGAGGGCACTCCGAGAGACAAATGTATCAAAGGAAATGATATT
GTTTCCAGTCGATGTCTCTTTACGGTCGAAGCGAACCCGATTTATCACAACGAAATGGTTAACACAGTTCCCCTAATTCACAGGCTAAAATACCTACTTGGAAAG
CTTGCCTCTGTGAACTTAGAGGGTCTTAACCAGCACCAGAAGCTTGCCTTTTGGATTAACACCTATAATTCTTGCATAATGAATGCACTTTTGGAGCATGGAATA
CCAGAGACTCCAGAAAGGGTTGTAGCTCTAATGCAAAAGGCCGAAATAGTCATTGGGGGATGCATACTCAATGCAATGACAATTGAGCAATTCATCTTGCGACTA
TCTTAA
mRNA sequence
GAAATCAGTGAGCGAGTGAGTGAGTGGTTTCTGCGAGCGAGAGAAGAAATCAGAGCACGCGACTGTTACTGAATACTCTCCTTCTTTCTCTCTCTCTTTCCATCC
CAATTTCTGTCTCTTCCCGTGAAAGAAGAAGAACCGGAAGAACAATAAGCTCCAATGGCCAATCGAGTTCTGAGTGTGAGAAAAATGAATGCCAGACTTCGATCC
GACAATCAATTCCATATACAGAAACCGTCAATTCATGATCAAGAGAATAAGAAATCAGGAAGGAGCAAAGAGATGGGAGATGAAAATGGTGGAATTATGATCAAT
AAACGAAGATTAAACAGAGAAAAGAAAATGGCATTGCTACAAGATGTTGATAAGCTGAAGAAGAAGCTGAGGCATGAGGAAAATGTTCACAGAGCTTTGAAGAGA
GCTTTCACTAGACCTTTAGGAGCCTTGCCTCGTCTTCCTCCTTATCTCCCTCCATCTACACTTGAGCTTCTAGCTGAAGTAGCTGTTCTTGAAGAGGAAGTTGTC
TGGCTTACAGAACGAGTTGTGAATTTTCAACAAAATCTTTATGAAGAAGCTGCCTATGTTTCCTCACAGCGGAATGTCAAAAATTTTGTCAATAGCTCTGATGAA
ACAAGGAATGCCAAGCAAACCTCTTGGAAATCCAATTCACCTTCAAAGGAAAACCAGTTTGCTTCTTGTTATGTGAAAGATAAACCTTCCCCAGAAAAGAAAGCC
ACAAAAATTATCAGCCCATCCAAAAAGACAAAGATGCCAACTGAACATGAACTTGCAGAGAAGAGCTTAGAAATTTTGAAGTTGCAGCTTGGGTCCAGATTAATG
GATCATGAAAGAGCACAAGAGAGTTCTTGTGGTGCATCAGATAATGTAGAATCTAAGACATCAGCTAACGAAATTTCTGAGGATATTGTGAAGTGTTTATGTTCC
ATTTTTGTTCAAGAGGGCACTCCGAGAGACAAATGTATCAAAGGAAATGATATTGTTTCCAGTCGATGTCTCTTTACGGTCGAAGCGAACCCGATTTATCACAAC
GAAATGGTTAACACAGTTCCCCTAATTCACAGGCTAAAATACCTACTTGGAAAGCTTGCCTCTGTGAACTTAGAGGGTCTTAACCAGCACCAGAAGCTTGCCTTT
TGGATTAACACCTATAATTCTTGCATAATGAATGCACTTTTGGAGCATGGAATACCAGAGACTCCAGAAAGGGTTGTAGCTCTAATGCAAAAGGCCGAAATAGTC
ATTGGGGGATGCATACTCAATGCAATGACAATTGAGCAATTCATCTTGCGACTATCTTAACACCTGAAGTTTATGCATTCGAAGGCTGTCGTCGAAAGTTACAAA
GCTGGTCGTCTCCTCCGGTAAGTTCGCAATTTAATGGTACATCCGAATAACCAGCTCTGAAGACGACTATTTTCCGTTCATTGAATGCATTGTATGAATGAAAGG
AAGGTTGAGAGTATACATAGGAAGTCATGTGGAGGAAGAGTCAGAAGAATTTGTTTCAAGTCATTCCTAAGATTTGCTTTTAAACAAGTTGGATTCATGAAGTTG
AAAATGCTGTTCAGAGTGTATAGATAGATACATGTGAATGAAGGCTGGTCTGTGGACTCAATAAATCTTCTTTGAAGATTTTTTTTTGTCTAGTGTCTTGCACAG
ACATGAGTAGAGATGTGTACTTTAACTTGAGAGCTAGAGTGCTGATTCTCGAGTAGGCTTGTCTTGTTCATCTTTTTTTTTTTTGTATATTGGATCAATGATCTT
GTAGAGGAATGGAT
Protein sequence
MANRVLSVRKMNARLRSDNQFHIQKPSIHDQENKKSGRSKEMGDENGGIMINKRRLNREKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPP
STLELLAEVAVLEEEVVWLTERVVNFQQNLYEEAAYVSSQRNVKNFVNSSDETRNAKQTSWKSNSPSKENQFASCYVKDKPSPEKKATKIISPSKKTKMPTEHEL
AEKSLEILKLQLGSRLMDHERAQESSCGASDNVESKTSANEISEDIVKCLCSIFVQEGTPRDKCIKGNDIVSSRCLFTVEANPIYHNEMVNTVPLIHRLKYLLGK
LASVNLEGLNQHQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVIGGCILNAMTIEQFILRLS