; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003635 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003635
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function, DUF547
Genome locationscaffold963:353019..356149
RNA-Seq ExpressionMS003635
SyntenyMS003635
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_007047200.1 PREDICTED: uncharacterized protein LOC18611093 isoform X1 [Theobroma cacao]1.5e-20265.16Show/hide
Query:  KQEKM-KTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERV
        K+EKM K++G + +G  K    +NRRR NRE+KMALLQDVDKLK+KLRHEENVHRALERAFTRPLGALPRLPPYLPP TLELLAEVAVLEEEVV L E+V
Subjt:  KQEKM-KTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERV

Query:  VNFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSP
        VNFRQ LYQEAV+ SS+RNVEN    IE     S KH RSKS S           + QP LARS+SSRK+L     +++ G   +R  N +Q S K NS 
Subjt:  VNFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSP

Query:  S-----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVE
        S     KENQ  +   KDK SPEKK TK+V+P ++ PTKHE A K  DALK QL  RL+D ERA+ES  G+SDD   E+ ++PN+ISE  V+CLCSIFV 
Subjt:  S-----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVE

Query:  VSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWI
        +ST  D+ VES     ++++++Y  + E+E  DPY ICS+SK  DIG Y++L  +EAN++ L+   N L  IHRLK+LLGKL SVNL+GL+ QQKLAFWI
Subjt:  VSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWI

Query:  NTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVY
        NTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++L+A+TIEHFILRLP+HLKF CSKA K+DEMKAR++FGLEWSEPLVT+AL CGSWSSPAVRVY
Subjt:  NTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVY

Query:  TGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
        T S+VEDELE AKR YLQAAV ISR  NKL++PKLLDWYLLDFAKDLESL+DWVCLQL++ELR EAVKCLER+G++P+ + VQV+PYDFSFRLL  +
Subjt:  TGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

XP_017980873.1 PREDICTED: uncharacterized protein LOC18611093 isoform X2 [Theobroma cacao]4.3e-20264.77Show/hide
Query:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV
        K++  K++G + +G  K    +NRRR NRE+KMALLQDVDKLK+KLRHEENVHRALERAFTRPLGALPRLPPYLPP TLELLAEVAVLEEEVV L E+VV
Subjt:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV

Query:  NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPS
        NFRQ LYQEAV+ SS+RNVEN    IE     S KH RSKS S           + QP LARS+SSRK+L     +++ G   +R  N +Q S K NS S
Subjt:  NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPS

Query:  -----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVEV
             KENQ  +   KDK SPEKK TK+V+P ++ PTKHE A K  DALK QL  RL+D ERA+ES  G+SDD   E+ ++PN+ISE  V+CLCSIFV +
Subjt:  -----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVEV

Query:  STSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWIN
        ST  D+ VES     ++++++Y  + E+E  DPY ICS+SK  DIG Y++L  +EAN++ L+   N L  IHRLK+LLGKL SVNL+GL+ QQKLAFWIN
Subjt:  STSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWIN

Query:  TYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYT
        TYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++L+A+TIEHFILRLP+HLKF CSKA K+DEMKAR++FGLEWSEPLVT+AL CGSWSSPAVRVYT
Subjt:  TYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYT

Query:  GSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
         S+VEDELE AKR YLQAAV ISR  NKL++PKLLDWYLLDFAKDLESL+DWVCLQL++ELR EAVKCLER+G++P+ + VQV+PYDFSFRLL  +
Subjt:  GSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

XP_017980875.1 PREDICTED: uncharacterized protein LOC18611093 isoform X3 [Theobroma cacao]4.3e-20265.14Show/hide
Query:  KTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVNFRQD
        K++G + +G  K    +NRRR NRE+KMALLQDVDKLK+KLRHEENVHRALERAFTRPLGALPRLPPYLPP TLELLAEVAVLEEEVV L E+VVNFRQ 
Subjt:  KTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVNFRQD

Query:  LYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPS-----
        LYQEAV+ SS+RNVEN    IE     S KH RSKS S           + QP LARS+SSRK+L     +++ G   +R  N +Q S K NS S     
Subjt:  LYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPS-----

Query:  KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVEVSTSSD
        KENQ  +   KDK SPEKK TK+V+P ++ PTKHE A K  DALK QL  RL+D ERA+ES  G+SDD   E+ ++PN+ISE  V+CLCSIFV +ST  D
Subjt:  KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVEVSTSSD

Query:  KCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSC
        + VES     ++++++Y  + E+E  DPY ICS+SK  DIG Y++L  +EAN++ L+   N L  IHRLK+LLGKL SVNL+GL+ QQKLAFWINTYNSC
Subjt:  KCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSC

Query:  IMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVE
        +MNA+LEHGIPETPE VV LMQKA IVVGG++L+A+TIEHFILRLP+HLKF CSKA K+DEMKAR++FGLEWSEPLVT+AL CGSWSSPAVRVYT S+VE
Subjt:  IMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVE

Query:  DELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
        DELE AKR YLQAAV ISR  NKL++PKLLDWYLLDFAKDLESL+DWVCLQL++ELR EAVKCLER+G++P+ + VQV+PYDFSFRLL  +
Subjt:  DELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

XP_022149544.1 uncharacterized protein LOC111017954 isoform X1 [Momordica charantia]0.0e+0098.11Show/hide
Query:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV
        KQEKMKTEGSKEMGDEKVGIKSNRRR+NREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVV LSERVV
Subjt:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV

Query:  NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFG
        NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINA+QTSWKSNSPSKENQFG
Subjt:  NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFG

Query:  SYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTP
        SYY KDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRL+DNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQ  
Subjt:  SYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTP

Query:  TPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEH
        TP+ASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASV+LEGLNQQQKLAFWINTYNSCIMNALLEH
Subjt:  TPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEH

Query:  GIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKR
        GIPETPERVVALMQKAEIVVGGYIL+AMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKR
Subjt:  GIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKR

Query:  SYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
        SYLQAAVGISRRGNK+MLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
Subjt:  SYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

XP_022149545.1 uncharacterized protein LOC111017954 isoform X2 [Momordica charantia]0.0e+0097.94Show/hide
Query:  QEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVN
        +EKMKTEGSKEMGDEKVGIKSNRRR+NREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVV LSERVVN
Subjt:  QEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVN

Query:  FRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGS
        FRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINA+QTSWKSNSPSKENQFGS
Subjt:  FRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGS

Query:  YYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPT
        YY KDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRL+DNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQ  T
Subjt:  YYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPT

Query:  PRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHG
        P+ASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASV+LEGLNQQQKLAFWINTYNSCIMNALLEHG
Subjt:  PRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHG

Query:  IPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRS
        IPETPERVVALMQKAEIVVGGYIL+AMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRS
Subjt:  IPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRS

Query:  YLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
        YLQAAVGISRRGNK+MLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
Subjt:  YLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

TrEMBL top hitse value%identityAlignment
A0A061DN03 Uncharacterized protein isoform 17.1e-20365.16Show/hide
Query:  KQEKM-KTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERV
        K+EKM K++G + +G  K    +NRRR NRE+KMALLQDVDKLK+KLRHEENVHRALERAFTRPLGALPRLPPYLPP TLELLAEVAVLEEEVV L E+V
Subjt:  KQEKM-KTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERV

Query:  VNFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSP
        VNFRQ LYQEAV+ SS+RNVEN    IE     S KH RSKS S           + QP LARS+SSRK+L     +++ G   +R  N +Q S K NS 
Subjt:  VNFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSP

Query:  S-----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVE
        S     KENQ  +   KDK SPEKK TK+V+P ++ PTKHE A K  DALK QL  RL+D ERA+ES  G+SDD   E+ ++PN+ISE  V+CLCSIFV 
Subjt:  S-----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVE

Query:  VSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWI
        +ST  D+ VES     ++++++Y  + E+E  DPY ICS+SK  DIG Y++L  +EAN++ L+   N L  IHRLK+LLGKL SVNL+GL+ QQKLAFWI
Subjt:  VSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWI

Query:  NTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVY
        NTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++L+A+TIEHFILRLP+HLKF CSKA K+DEMKAR++FGLEWSEPLVT+AL CGSWSSPAVRVY
Subjt:  NTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVY

Query:  TGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
        T S+VEDELE AKR YLQAAV ISR  NKL++PKLLDWYLLDFAKDLESL+DWVCLQL++ELR EAVKCLER+G++P+ + VQV+PYDFSFRLL  +
Subjt:  TGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

A0A6J0ZJR8 uncharacterized protein LOC110409952 isoform X13.5e-20265.16Show/hide
Query:  KQEKM-KTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERV
        K+EKM K++GS+ +G  K    +NRRR NRE+KMALLQDVDKLK+KLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVV L E+V
Subjt:  KQEKM-KTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERV

Query:  VNFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSP
        VNFRQ LYQEAV+ SS+RNVEN    IE     S KH RSKS S           + QP LARS+SSRK+L   +  D+ G   +R  N +Q S K NS 
Subjt:  VNFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSP

Query:  S-----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVE
        S     KENQ  +   KDK SPEKK  K+V+P+++ PTKHE A+K  DALK QL  RL+D ERA+ES  G+SDD   E+ ++PN+ISE  V+CLCSIFV 
Subjt:  S-----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVE

Query:  VSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWI
        +STS DK +ES     +  +++   + E+E  DPY ICS+SK  DIG Y++L  +EAN++ L+ + N L  IHRLK+LLGKLASVNL+GL+ QQKLAFWI
Subjt:  VSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWI

Query:  NTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVY
        NTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++L+A+TIEHFILRLP+HLKF CSKA K DEMKAR++FGLEWSEPLVTFAL CGSWSSPAVRVY
Subjt:  NTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVY

Query:  TGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
        T S+VEDELE AKR YLQAAV ISR  NKL++PK+LDWYLLDFAK+LESL+DWVCLQL++E R EAVKCL+R+G++P+ + VQV+PYDFSFRLL  +
Subjt:  TGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

A0A6J0ZJZ9 uncharacterized protein LOC110409952 isoform X21.0e-20164.77Show/hide
Query:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV
        K++  K++GS+ +G  K    +NRRR NRE+KMALLQDVDKLK+KLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVV L E+VV
Subjt:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV

Query:  NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPS
        NFRQ LYQEAV+ SS+RNVEN    IE     S KH RSKS S           + QP LARS+SSRK+L   +  D+ G   +R  N +Q S K NS S
Subjt:  NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSAC------RSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPS

Query:  -----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVEV
             KENQ  +   KDK SPEKK  K+V+P+++ PTKHE A+K  DALK QL  RL+D ERA+ES  G+SDD   E+ ++PN+ISE  V+CLCSIFV +
Subjt:  -----KENQFGSYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD--LESKTSPNEISEGIVKCLCSIFVEV

Query:  STSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWIN
        STS DK +ES     +  +++   + E+E  DPY ICS+SK  DIG Y++L  +EAN++ L+ + N L  IHRLK+LLGKLASVNL+GL+ QQKLAFWIN
Subjt:  STSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWIN

Query:  TYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYT
        TYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++L+A+TIEHFILRLP+HLKF CSKA K DEMKAR++FGLEWSEPLVTFAL CGSWSSPAVRVYT
Subjt:  TYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYT

Query:  GSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
         S+VEDELE AKR YLQAAV ISR  NKL++PK+LDWYLLDFAK+LESL+DWVCLQL++E R EAVKCL+R+G++P+ + VQV+PYDFSFRLL  +
Subjt:  GSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

A0A6J1D606 uncharacterized protein LOC111017954 isoform X20.0e+0097.94Show/hide
Query:  QEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVN
        +EKMKTEGSKEMGDEKVGIKSNRRR+NREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVV LSERVVN
Subjt:  QEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVN

Query:  FRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGS
        FRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINA+QTSWKSNSPSKENQFGS
Subjt:  FRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGS

Query:  YYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPT
        YY KDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRL+DNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQ  T
Subjt:  YYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPT

Query:  PRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHG
        P+ASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASV+LEGLNQQQKLAFWINTYNSCIMNALLEHG
Subjt:  PRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHG

Query:  IPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRS
        IPETPERVVALMQKAEIVVGGYIL+AMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRS
Subjt:  IPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRS

Query:  YLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
        YLQAAVGISRRGNK+MLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
Subjt:  YLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

A0A6J1D7B9 uncharacterized protein LOC111017954 isoform X10.0e+0098.11Show/hide
Query:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV
        KQEKMKTEGSKEMGDEKVGIKSNRRR+NREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVV LSERVV
Subjt:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV

Query:  NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFG
        NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINA+QTSWKSNSPSKENQFG
Subjt:  NFRQDLYQEAVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFG

Query:  SYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTP
        SYY KDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRL+DNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQ  
Subjt:  SYYAKDKPSPEKKTTKIVSPSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTP

Query:  TPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEH
        TP+ASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASV+LEGLNQQQKLAFWINTYNSCIMNALLEH
Subjt:  TPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEH

Query:  GIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKR
        GIPETPERVVALMQKAEIVVGGYIL+AMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKR
Subjt:  GIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKR

Query:  SYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
        SYLQAAVGISRRGNK+MLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK
Subjt:  SYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G37080.1 Protein of unknown function, DUF5471.6e-16756.03Show/hide
Query:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV
        +++KM+++G+  + +    +  NRRR N+EKKM LLQDVDKLK+KLR EENVHRALERAFTRPLGALPRLP YLP  TLELLAEVAVLEEEVV L E+VV
Subjt:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV

Query:  NFRQDLYQEAVFVSSQRNVE---NFVLGIESISTESLKHGRSKSFSPKS-------ACRSQPPLARSISSRKMLFS-HNVSDQTGNYSARLINAKQTSWK
        NFRQ LYQEAV++SS+RN+E   N  L  E+    S KH RSKS S            + Q  L+RSISSRK+  S   V+D++G    R+++ KQ S K
Subjt:  NFRQDLYQEAVFVSSQRNVE---NFVLGIESISTESLKHGRSKSFSPKS-------ACRSQPPLARSISSRKMLFS-HNVSDQTGNYSARLINAKQTSWK

Query:  SNSPS-----------KENQFGSYYAKD---KPSPEKKTTK-IVSPSRKTP--TKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD---LESKTSP
        SN  S           KENQ  S  +KD   K SPEKK  + + S  +K P       A+K  ++ KLQL  RL D ++A+ES  G+S +   L+S    
Subjt:  SNSPS-----------KENQFGSYYAKD---KPSPEKKTTK-IVSPSRKTP--TKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD---LESKTSP

Query:  NEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLA
        N +SE ++KCL +I + +S+S D                        +LDPYN CSE +  ++G+Y+H  +V+ +S+ L    N    IHRLK+LL KL+
Subjt:  NEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLA

Query:  SVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLV
         VNL+GL+ QQKLAFWINTYNSC+MNA LEHGIP TPE VVALMQKA I+VGG+ L+A+TIEHFILRLPYHLKF C K    +EM+A   FGLEWSEPLV
Subjt:  SVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLV

Query:  TFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQ
        TFAL CGSWSSPAVRVYT +NVE+ELE AKR YLQA+VGIS + NKLMLPK+LDWYLLDFAKDLESL+DWVCLQL D+LR+EA KC+ER+ ++ + E VQ
Subjt:  TFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQ

Query:  VVPYDFSFRLLFNK
        VVPYDFSFRLL ++
Subjt:  VVPYDFSFRLLFNK

AT4G37080.2 Protein of unknown function, DUF5471.6e-16756.03Show/hide
Query:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV
        +++KM+++G+  + +    +  NRRR N+EKKM LLQDVDKLK+KLR EENVHRALERAFTRPLGALPRLP YLP  TLELLAEVAVLEEEVV L E+VV
Subjt:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV

Query:  NFRQDLYQEAVFVSSQRNVE---NFVLGIESISTESLKHGRSKSFSPKS-------ACRSQPPLARSISSRKMLFS-HNVSDQTGNYSARLINAKQTSWK
        NFRQ LYQEAV++SS+RN+E   N  L  E+    S KH RSKS S            + Q  L+RSISSRK+  S   V+D++G    R+++ KQ S K
Subjt:  NFRQDLYQEAVFVSSQRNVE---NFVLGIESISTESLKHGRSKSFSPKS-------ACRSQPPLARSISSRKMLFS-HNVSDQTGNYSARLINAKQTSWK

Query:  SNSPS-----------KENQFGSYYAKD---KPSPEKKTTK-IVSPSRKTP--TKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD---LESKTSP
        SN  S           KENQ  S  +KD   K SPEKK  + + S  +K P       A+K  ++ KLQL  RL D ++A+ES  G+S +   L+S    
Subjt:  SNSPS-----------KENQFGSYYAKD---KPSPEKKTTK-IVSPSRKTP--TKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD---LESKTSP

Query:  NEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLA
        N +SE ++KCL +I + +S+S D                        +LDPYN CSE +  ++G+Y+H  +V+ +S+ L    N    IHRLK+LL KL+
Subjt:  NEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLA

Query:  SVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLV
         VNL+GL+ QQKLAFWINTYNSC+MNA LEHGIP TPE VVALMQKA I+VGG+ L+A+TIEHFILRLPYHLKF C K    +EM+A   FGLEWSEPLV
Subjt:  SVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLV

Query:  TFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQ
        TFAL CGSWSSPAVRVYT +NVE+ELE AKR YLQA+VGIS + NKLMLPK+LDWYLLDFAKDLESL+DWVCLQL D+LR+EA KC+ER+ ++ + E VQ
Subjt:  TFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQ

Query:  VVPYDFSFRLLFNK
        VVPYDFSFRLL ++
Subjt:  VVPYDFSFRLLFNK

AT4G37080.3 Protein of unknown function, DUF5471.6e-16756.03Show/hide
Query:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV
        +++KM+++G+  + +    +  NRRR N+EKKM LLQDVDKLK+KLR EENVHRALERAFTRPLGALPRLP YLP  TLELLAEVAVLEEEVV L E+VV
Subjt:  KQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVV

Query:  NFRQDLYQEAVFVSSQRNVE---NFVLGIESISTESLKHGRSKSFSPKS-------ACRSQPPLARSISSRKMLFS-HNVSDQTGNYSARLINAKQTSWK
        NFRQ LYQEAV++SS+RN+E   N  L  E+    S KH RSKS S            + Q  L+RSISSRK+  S   V+D++G    R+++ KQ S K
Subjt:  NFRQDLYQEAVFVSSQRNVE---NFVLGIESISTESLKHGRSKSFSPKS-------ACRSQPPLARSISSRKMLFS-HNVSDQTGNYSARLINAKQTSWK

Query:  SNSPS-----------KENQFGSYYAKD---KPSPEKKTTK-IVSPSRKTP--TKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD---LESKTSP
        SN  S           KENQ  S  +KD   K SPEKK  + + S  +K P       A+K  ++ KLQL  RL D ++A+ES  G+S +   L+S    
Subjt:  SNSPS-----------KENQFGSYYAKD---KPSPEKKTTK-IVSPSRKTP--TKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDD---LESKTSP

Query:  NEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLA
        N +SE ++KCL +I + +S+S D                        +LDPYN CSE +  ++G+Y+H  +V+ +S+ L    N    IHRLK+LL KL+
Subjt:  NEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKGNDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLA

Query:  SVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLV
         VNL+GL+ QQKLAFWINTYNSC+MNA LEHGIP TPE VVALMQKA I+VGG+ L+A+TIEHFILRLPYHLKF C K    +EM+A   FGLEWSEPLV
Subjt:  SVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLV

Query:  TFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQ
        TFAL CGSWSSPAVRVYT +NVE+ELE AKR YLQA+VGIS + NKLMLPK+LDWYLLDFAKDLESL+DWVCLQL D+LR+EA KC+ER+ ++ + E VQ
Subjt:  TFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLERRGRQPVEEFVQ

Query:  VVPYDFSFRLLFNK
        VVPYDFSFRLL ++
Subjt:  VVPYDFSFRLLFNK

AT5G42690.1 Protein of unknown function, DUF5471.2e-12548.67Show/hide
Query:  NRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENF
        NR+ +NREK + L +DV+KL+KKLR EEN+HRA+ERAF+RPLGALPRLPP+LPPS LELLAEVAVLEEE+V L E +V+ RQ+LYQEAVF SS  ++EN 
Subjt:  NRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENF

Query:  VLGIESISTESLKHGRSKSFSPKSACR-SQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGSYYAKDKPSPEKKTTKIVSPS
               S    KH ++KS S  ++ R S+ PL+R+  S  +                               KEN+  +             T I +P 
Subjt:  VLGIESISTESLKHGRSKSFSPKSACR-SQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGSYYAKDKPSPEKKTTKIVSPS

Query:  RKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPY
        +KT   H    KS +A KL+  S       AE SS G  D+      PN+ISE +VKCL +IF+ +S+     ++    T    +D  T+       DPY
Subjt:  RKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPY

Query:  NICSESKGNDIGSYRHLFAVEANSIHLNE-MANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVV
         ICS  +  DIG Y++   VE  S++ N   +++L  I +LK LLG+L+ VN++ LNQQ+KLAFWIN YNSC+MN  LEHGIPE+P+ +V LMQKA I V
Subjt:  NICSESKGNDIGSYRHLFAVEANSIHLNE-MANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVV

Query:  GGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPK
        GG+ L+A+TIEHFILRLP+H K++  K  K +EM  R  FGLE SEPLVTFAL CGSWSSPAVRVYT S VE+ELE AKR YL+A+VGIS    K+ +PK
Subjt:  GGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPK

Query:  LLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLER-RGRQPVEEFVQVVPYDFSFRLLFN
        L+DWY  DFAKD+ESL+DW+ LQL  EL K+A+ C+E+   + P    V ++PYDF+FR LF+
Subjt:  LLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLER-RGRQPVEEFVQVVPYDFSFRLLFN

AT5G42690.2 Protein of unknown function, DUF5478.2e-12748.67Show/hide
Query:  NRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENF
        NR+ +NREK + L +DV+KL+KKLR EEN+HRA+ERAF+RPLGALPRLPP+LPPS LELLAEVAVLEEE+V L E +V+ RQ+LYQEAVF SS  ++EN 
Subjt:  NRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENF

Query:  VLGIESISTESLKHGRSKSFSPKSACR-SQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGSYYAKDKPSPEKKTTKIVSPS
               S    KH ++KS S  ++ R S+ PL+R+  S  +                               KEN+  +             T I +P 
Subjt:  VLGIESISTESLKHGRSKSFSPKSACR-SQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGSYYAKDKPSPEKKTTKIVSPS

Query:  RKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPY
        +KT   H    KS +A KL+  S       AE SS G  D+      PN+ISE +VKCL +IF+ +S+     ++    T    +D  T+       DPY
Subjt:  RKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPY

Query:  NICSESKGNDIGSYRHLFAVEANSIHLNE-MANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVV
         ICS  +  DIG Y++   VE  S++ N   +++L  I +LK LLG+L+ VN++ LNQQ+KLAFWIN YNSC+MN  LEHGIPE+P+ +V LMQKA I V
Subjt:  NICSESKGNDIGSYRHLFAVEANSIHLNE-MANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVV

Query:  GGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPK
        GG+ L+A+TIEHFILRLP+H K++  K  K +EM  R  FGLE SEPLVTFAL CGSWSSPAVRVYT S VE+ELE AKR YL+A+VGIS    K+ +PK
Subjt:  GGYILSAMTIEHFILRLPYHLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPK

Query:  LLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLER-RGRQPVEEFVQVVPYDFSFRLLFN
        L+DWY  DFAKD+ESL+DW+ LQL  EL K+A+ C+E+   + P    V ++PYDF+FR LF+
Subjt:  LLDWYLLDFAKDLESLVDWVCLQLSDELRKEAVKCLER-RGRQPVEEFVQVVPYDFSFRLLFN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAAAAGCAGGAGAAAATGAAGACAGAAGGAAGCAAAGAGATGGGTGATGAAAAAGTTGGGATTAAGAGCAATAGAAGAAGAGTAAACAGAGAAAAGAAAATGGCATTGCT
GCAAGATGTGGATAAGTTAAAGAAGAAGCTGAGGCATGAAGAAAATGTTCACAGAGCTTTGGAGAGAGCTTTTACTAGACCTTTAGGAGCTCTGCCTCGTCTTCCTCCTT
ATCTCCCTCCATCTACATTGGAGCTTCTAGCTGAAGTAGCTGTTCTGGAAGAGGAAGTTGTCTGGCTTTCCGAGCGAGTTGTGAATTTTCGGCAAGACCTCTACCAAGAA
GCTGTCTTCGTGTCCTCACAGCGGAATGTAGAAAATTTCGTCCTTGGCATTGAGAGTATCTCAACCGAAAGTTTAAAGCATGGTCGATCGAAATCCTTTTCACCAAAATC
GGCATGTCGGTCTCAACCACCACTTGCAAGAAGTATTTCGAGTAGGAAGATGTTATTTAGTCACAATGTCTCTGATCAAACAGGAAACTATTCTGCCAGACTCATAAATG
CCAAGCAAACCTCCTGGAAATCAAATTCACCTTCAAAGGAGAATCAGTTTGGTTCTTATTATGCGAAAGATAAACCTTCTCCAGAAAAGAAAACCACTAAAATTGTCAGC
CCATCAAGGAAGACTCCAACTAAACATGAAGTTGCAGAGAAGAGTTTTGATGCTTTGAAGTTGCAGCTTGGGTCCAGGTTAATTGATAATGAAAGAGCAGAAGAGAGTTC
TTTTGGTGCGTCAGATGATCTCGAGTCCAAAACATCCCCTAATGAAATTTCCGAGGGTATTGTGAAGTGCTTGTGCTCCATTTTTGTTGAAGTGAGCACTTCAAGCGACA
AATGTGTCGAATCACAAACTCCTACTCCTCGAGCATCATCGGACGCCTACACGAGCAATGTAGAAGCAGAGCTTCTGGATCCATACAATATATGTTCAGAATCCAAAGGA
AATGATATCGGTTCCTATCGACATCTCTTTGCAGTCGAGGCAAATTCAATTCATCTCAATGAAATGGCTAACACACTTCCCCGTATCCACAGACTGAAATACCTACTTGG
AAAGCTTGCCTCTGTGAACTTAGAGGGTCTTAATCAGCAGCAGAAGCTGGCCTTTTGGATTAACACCTATAATTCTTGCATAATGAATGCACTTTTGGAGCATGGAATAC
CTGAGACTCCAGAAAGAGTTGTAGCTCTAATGCAAAAGGCCGAGATTGTCGTCGGGGGGTACATTCTCAGTGCAATGACAATCGAGCATTTCATTTTGCGACTGCCTTAC
CATCTGAAATTTATGTGTTCAAAGGCCATCAAAAGTGATGAGATGAAAGCGCGAGATGTGTTTGGATTAGAGTGGTCTGAACCATTGGTTACGTTTGCTCTTTGCTGCGG
AAGTTGGTCGTCTCCTGCGGTGAGAGTCTACACGGGAAGTAATGTCGAAGACGAGTTAGAAGAGGCAAAGAGAAGCTACTTACAAGCAGCAGTTGGAATATCAAGGAGGG
GGAACAAGCTAATGCTTCCAAAACTATTGGATTGGTATTTACTTGATTTTGCAAAGGATTTGGAGTCATTGGTGGATTGGGTTTGCTTACAACTGTCTGATGAGCTTAGA
AAAGAAGCTGTTAAATGCCTTGAAAGGAGGGGAAGGCAGCCTGTTGAAGAGTTTGTGCAAGTGGTTCCTTATGATTTCAGTTTTAGATTGCTTTTCAACAAA
mRNA sequenceShow/hide mRNA sequence
CAAAAGCAGGAGAAAATGAAGACAGAAGGAAGCAAAGAGATGGGTGATGAAAAAGTTGGGATTAAGAGCAATAGAAGAAGAGTAAACAGAGAAAAGAAAATGGCATTGCT
GCAAGATGTGGATAAGTTAAAGAAGAAGCTGAGGCATGAAGAAAATGTTCACAGAGCTTTGGAGAGAGCTTTTACTAGACCTTTAGGAGCTCTGCCTCGTCTTCCTCCTT
ATCTCCCTCCATCTACATTGGAGCTTCTAGCTGAAGTAGCTGTTCTGGAAGAGGAAGTTGTCTGGCTTTCCGAGCGAGTTGTGAATTTTCGGCAAGACCTCTACCAAGAA
GCTGTCTTCGTGTCCTCACAGCGGAATGTAGAAAATTTCGTCCTTGGCATTGAGAGTATCTCAACCGAAAGTTTAAAGCATGGTCGATCGAAATCCTTTTCACCAAAATC
GGCATGTCGGTCTCAACCACCACTTGCAAGAAGTATTTCGAGTAGGAAGATGTTATTTAGTCACAATGTCTCTGATCAAACAGGAAACTATTCTGCCAGACTCATAAATG
CCAAGCAAACCTCCTGGAAATCAAATTCACCTTCAAAGGAGAATCAGTTTGGTTCTTATTATGCGAAAGATAAACCTTCTCCAGAAAAGAAAACCACTAAAATTGTCAGC
CCATCAAGGAAGACTCCAACTAAACATGAAGTTGCAGAGAAGAGTTTTGATGCTTTGAAGTTGCAGCTTGGGTCCAGGTTAATTGATAATGAAAGAGCAGAAGAGAGTTC
TTTTGGTGCGTCAGATGATCTCGAGTCCAAAACATCCCCTAATGAAATTTCCGAGGGTATTGTGAAGTGCTTGTGCTCCATTTTTGTTGAAGTGAGCACTTCAAGCGACA
AATGTGTCGAATCACAAACTCCTACTCCTCGAGCATCATCGGACGCCTACACGAGCAATGTAGAAGCAGAGCTTCTGGATCCATACAATATATGTTCAGAATCCAAAGGA
AATGATATCGGTTCCTATCGACATCTCTTTGCAGTCGAGGCAAATTCAATTCATCTCAATGAAATGGCTAACACACTTCCCCGTATCCACAGACTGAAATACCTACTTGG
AAAGCTTGCCTCTGTGAACTTAGAGGGTCTTAATCAGCAGCAGAAGCTGGCCTTTTGGATTAACACCTATAATTCTTGCATAATGAATGCACTTTTGGAGCATGGAATAC
CTGAGACTCCAGAAAGAGTTGTAGCTCTAATGCAAAAGGCCGAGATTGTCGTCGGGGGGTACATTCTCAGTGCAATGACAATCGAGCATTTCATTTTGCGACTGCCTTAC
CATCTGAAATTTATGTGTTCAAAGGCCATCAAAAGTGATGAGATGAAAGCGCGAGATGTGTTTGGATTAGAGTGGTCTGAACCATTGGTTACGTTTGCTCTTTGCTGCGG
AAGTTGGTCGTCTCCTGCGGTGAGAGTCTACACGGGAAGTAATGTCGAAGACGAGTTAGAAGAGGCAAAGAGAAGCTACTTACAAGCAGCAGTTGGAATATCAAGGAGGG
GGAACAAGCTAATGCTTCCAAAACTATTGGATTGGTATTTACTTGATTTTGCAAAGGATTTGGAGTCATTGGTGGATTGGGTTTGCTTACAACTGTCTGATGAGCTTAGA
AAAGAAGCTGTTAAATGCCTTGAAAGGAGGGGAAGGCAGCCTGTTGAAGAGTTTGTGCAAGTGGTTCCTTATGATTTCAGTTTTAGATTGCTTTTCAACAAA
Protein sequenceShow/hide protein sequence
QKQEKMKTEGSKEMGDEKVGIKSNRRRVNREKKMALLQDVDKLKKKLRHEENVHRALERAFTRPLGALPRLPPYLPPSTLELLAEVAVLEEEVVWLSERVVNFRQDLYQE
AVFVSSQRNVENFVLGIESISTESLKHGRSKSFSPKSACRSQPPLARSISSRKMLFSHNVSDQTGNYSARLINAKQTSWKSNSPSKENQFGSYYAKDKPSPEKKTTKIVS
PSRKTPTKHEVAEKSFDALKLQLGSRLIDNERAEESSFGASDDLESKTSPNEISEGIVKCLCSIFVEVSTSSDKCVESQTPTPRASSDAYTSNVEAELLDPYNICSESKG
NDIGSYRHLFAVEANSIHLNEMANTLPRIHRLKYLLGKLASVNLEGLNQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAEIVVGGYILSAMTIEHFILRLPY
HLKFMCSKAIKSDEMKARDVFGLEWSEPLVTFALCCGSWSSPAVRVYTGSNVEDELEEAKRSYLQAAVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLSDELR
KEAVKCLERRGRQPVEEFVQVVPYDFSFRLLFNK