CuGenDBv2

Tan0020721 (gene) of Snake gourd v1 genome

Gene ID: Tan0020721
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function, DUF547
Genome location: LG05:58220031..58223976
RNA-Seq Expression: Tan0020721
Synteny: Tan0020721
Gene Ontology terms: NA
InterPro domains: IPR006869 - Domain of unknown function DUF547
                  IPR025757 - Ternary complex factor MIP1, leucine-zipper
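The genome location in this record has the form sequence:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the span of the locus can be computed as below. The helper name parse_location is hypothetical and is not part of CuGenDB.

```python
import re


def parse_location(loc: str):
    """Split a 'SEQ:START..END' string into its parts and the span length.

    Assumption: START and END are 1-based, inclusive coordinates, so the
    span length is END - START + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seq_id = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return seq_id, start, end, end - start + 1


# Location string copied from the record above.
seq_id, start, end, span = parse_location("LG05:58220031..58223976")
print(seq_id, start, end, span)
```

Under that assumption the locus spans 3946 bp on LG05. This is the genomic span, which may include introns, so it need not equal the CDS length.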


Homology
GenBank top hits | e value | % identity | Alignment
XP_017980873.1 PREDICTED: uncharacterized protein LOC18611093 isoform X2 [Theobroma cacao] | e value: 3.0e-198 | % identity: 63.02
Query:  MNARVRPNVQFLVEKASIHD--KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPST
        MN RVR   Q +    S HD  K++  K  G + +   K     NRRR N E+KMALLQDVDKLK+KLRHEENVHRAL+RAFTRPLGALPRLPPYLPP T
Subjt:  MNARVRPNVQFLVEKASIHD--KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPST

Query:  LELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQ
        LELLAEVA+LEEEVV L E+VVNFRQ LYQEAV+ SS+RNVEN   ++E   +RS +H +SKSL++N++SS T   +PQP +A S+SSRKLL      ++
Subjt:  LELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQ

Query:  TGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLE
         G   +R  N +Q S K NS S     KENQ  ++ +VKDK SPEKK TK+++  K  + PTKHE+A K LDALK QL  RL+D ER QES  G+ D+  
Subjt:  TGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLE

Query:  SKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLK
        S+A  +PNKISED V+CLCSIFV++ST +++ VE      ++++N+   S E+E  DPY +C++SK  DIG Y +L  +EAN++ L+   N L LIHRLK
Subjt:  SKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLK

Query:  YLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFG
        +LLGKL SV L+GL  QQKLAFWINTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++LNAITIEHFILRLP+HLKF CSKA K+DE MKA ++FG
Subjt:  YLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFG

Query:  LEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGM
        LEWSEPLVT+AL CGSWSSP+VRVYT SHVE+ELE AKR YLQA+V ISR  NKL++PKLLDWYLLDFAKDLESL+DWVCLQL +E+R EAVKCLER+G 
Subjt:  LEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGM

Query:  QPLEEFVQVIPYDFSFRLLLNK
        +PL + VQV+PYDFSFRLLL +
Subjt:  QPLEEFVQVIPYDFSFRLLLNK

XP_021275153.1 uncharacterized protein LOC110409952 isoform X1 [Herrania umbratica] | e value: 6.1e-199 | % identity: 63.08
Query:  MNARVRPNVQFLVEKASIHD---KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPS
        MN RVR  +Q +    S HD   K++  K  GS+ +   K     NRRR N E+KMALLQDVDKLK+KLRHEENVHRAL+RAFTRPLGALPRLPPYLPPS
Subjt:  MNARVRPNVQFLVEKASIHD---KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPS

Query:  TLELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPD
        TLELLAEVA+LEEEVV L E+VVNFRQ LYQEAV+ SS+RNVEN   ++E   +RS +H +SKSL++N++SS T   +PQP +A S+SSRKLL      D
Subjt:  TLELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPD

Query:  QTGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNL
        + G   +R  N +Q S K NS S     KENQ  ++ +VKDK SPEKK  K+++  K  + PTKHE+A+K LDALK QL  RL+D ER QES  G+ D+ 
Subjt:  QTGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNL

Query:  ESKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRL
         S+A  +PNKISED V+CLCSIFV++ST ++K +E      +  +N+   S E+E  DPY +C++SK  DIG Y +L  +EAN++ L+ + N L LIHRL
Subjt:  ESKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRL

Query:  KYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLF
        K+LLGKLASV L+GL  QQKLAFWINTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++LNAITIEHFILRLP+HLKF CSKA K DE MKA ++F
Subjt:  KYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLF

Query:  GLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRG
        GLEWSEPLVTFAL CGSWSSP+VRVYT SHVE+ELE AKR YLQA+V ISR  NKL++PK+LDWYLLDFAK+LESL+DWVCLQL +E R EAVKCL+R+G
Subjt:  GLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRG

Query:  MQPLEEFVQVIPYDFSFRLLLNK
         +PL + VQV+PYDFSFRLLL +
Subjt:  MQPLEEFVQVIPYDFSFRLLLNK

XP_021275156.1 uncharacterized protein LOC110409952 isoform X2 [Herrania umbratica] | e value: 4.7e-199 | % identity: 63.18
Query:  MNARVRPNVQFLVEKASIHD--KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPST
        MN RVR  +Q +    S HD  K++  K  GS+ +   K     NRRR N E+KMALLQDVDKLK+KLRHEENVHRAL+RAFTRPLGALPRLPPYLPPST
Subjt:  MNARVRPNVQFLVEKASIHD--KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPST

Query:  LELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQ
        LELLAEVA+LEEEVV L E+VVNFRQ LYQEAV+ SS+RNVEN   ++E   +RS +H +SKSL++N++SS T   +PQP +A S+SSRKLL      D+
Subjt:  LELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQ

Query:  TGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLE
         G   +R  N +Q S K NS S     KENQ  ++ +VKDK SPEKK  K+++  K  + PTKHE+A+K LDALK QL  RL+D ER QES  G+ D+  
Subjt:  TGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLE

Query:  SKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLK
        S+A  +PNKISED V+CLCSIFV++ST ++K +E      +  +N+   S E+E  DPY +C++SK  DIG Y +L  +EAN++ L+ + N L LIHRLK
Subjt:  SKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLK

Query:  YLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFG
        +LLGKLASV L+GL  QQKLAFWINTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++LNAITIEHFILRLP+HLKF CSKA K DE MKA ++FG
Subjt:  YLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFG

Query:  LEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGM
        LEWSEPLVTFAL CGSWSSP+VRVYT SHVE+ELE AKR YLQA+V ISR  NKL++PK+LDWYLLDFAK+LESL+DWVCLQL +E R EAVKCL+R+G 
Subjt:  LEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGM

Query:  QPLEEFVQVIPYDFSFRLLLNK
        +PL + VQV+PYDFSFRLLL +
Subjt:  QPLEEFVQVIPYDFSFRLLLNK

XP_022149544.1 uncharacterized protein LOC111017954 isoform X1 [Momordica charantia] | e value: 9.8e-282 | % identity: 83.69
Query:  MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLE
        MNARVRPN+QF +EKASIHDKQEK K  GSKEM DEK GIK NRRRLN EKKMALLQDVDKLKKKLRHEENVHRAL+RAFTRPLGALPRLPPYLPPSTLE
Subjt:  MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLE

Query:  LLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTG
        LLAEVA+LEEEVV LSERVVNFRQDLYQEAVFVSSQRNVENFV  +ESIS  SL+HG+SKS       SP    R QP +A SISSRK+L SH + DQTG
Subjt:  LLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTG

Query:  NYSARLLNAKQNSWKSNSPSKENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNK
        NYSARL+NA+Q SWKSNSPSKENQF  SY VKDKPSPEKK TKI+S     KTPTKHE AEKS DALKLQLGSRL+D ER +ESSFGA D+LESK SPN+
Subjt:  NYSARLLNAKQNSWKSNSPSKENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNK

Query:  ISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKL
        ISE IVKCLCSIFV+VST  +KCVE QTP+ASS+  TS+VEAE LDPY++C+ESKGNDIG Y HLFAVEANSIHLNEMANTLP IHRLKYLLGKLASV L
Subjt:  ISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKL

Query:  EGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFA
        EGL+QQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKA+IVVGGYILNA+TIEHFILRLPYHLKFMCSKA+KSDE MKA D+FGLEWSEPLVTFA
Subjt:  EGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFA

Query:  LCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIP
        LCCGSWSSP+VRVYTGS+VE+ELEEAKRSYLQA+VGISRRGNK+MLPKLLDWYLLDFAKDLESLVDWVCLQL DE+RKEAVKCLERRG QP+EEFVQV+P
Subjt:  LCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIP

Query:  YDFSFRLLLNKLD
        YDFSFRLL NKLD
Subjt:  YDFSFRLLLNKLD

XP_022149545.1 uncharacterized protein LOC111017954 isoform X2 [Momordica charantia] | e value: 9.2e-280 | % identity: 83.52
Query:  MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLE
        MNARVRPN+QF +EKASIHDK EK K  GSKEM DEK GIK NRRRLN EKKMALLQDVDKLKKKLRHEENVHRAL+RAFTRPLGALPRLPPYLPPSTLE
Subjt:  MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLE

Query:  LLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTG
        LLAEVA+LEEEVV LSERVVNFRQDLYQEAVFVSSQRNVENFV  +ESIS  SL+HG+SKS       SP    R QP +A SISSRK+L SH + DQTG
Subjt:  LLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTG

Query:  NYSARLLNAKQNSWKSNSPSKENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNK
        NYSARL+NA+Q SWKSNSPSKENQF  SY VKDKPSPEKK TKI+S     KTPTKHE AEKS DALKLQLGSRL+D ER +ESSFGA D+LESK SPN+
Subjt:  NYSARLLNAKQNSWKSNSPSKENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNK

Query:  ISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKL
        ISE IVKCLCSIFV+VST  +KCVE QTP+ASS+  TS+VEAE LDPY++C+ESKGNDIG Y HLFAVEANSIHLNEMANTLP IHRLKYLLGKLASV L
Subjt:  ISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKL

Query:  EGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFA
        EGL+QQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKA+IVVGGYILNA+TIEHFILRLPYHLKFMCSKA+KSDE MKA D+FGLEWSEPLVTFA
Subjt:  EGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFA

Query:  LCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIP
        LCCGSWSSP+VRVYTGS+VE+ELEEAKRSYLQA+VGISRRGNK+MLPKLLDWYLLDFAKDLESLVDWVCLQL DE+RKEAVKCLERRG QP+EEFVQV+P
Subjt:  LCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIP

Query:  YDFSFRLLLNKLD
        YDFSFRLL NKLD
Subjt:  YDFSFRLLLNKLD

TrEMBL top hits | e value | % identity | Alignment
A0A061DN03 Uncharacterized protein isoform 1 | e value: 1.9e-198 | % identity: 62.92
Query:  MNARVRPNVQFLVEKASIHD---KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPS
        MN RVR   Q +    S HD   K++  K  G + +   K     NRRR N E+KMALLQDVDKLK+KLRHEENVHRAL+RAFTRPLGALPRLPPYLPP 
Subjt:  MNARVRPNVQFLVEKASIHD---KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPS

Query:  TLELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPD
        TLELLAEVA+LEEEVV L E+VVNFRQ LYQEAV+ SS+RNVEN   ++E   +RS +H +SKSL++N++SS T   +PQP +A S+SSRKLL      +
Subjt:  TLELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPD

Query:  QTGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNL
        + G   +R  N +Q S K NS S     KENQ  ++ +VKDK SPEKK TK+++  K  + PTKHE+A K LDALK QL  RL+D ER QES  G+ D+ 
Subjt:  QTGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNL

Query:  ESKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRL
         S+A  +PNKISED V+CLCSIFV++ST +++ VE      ++++N+   S E+E  DPY +C++SK  DIG Y +L  +EAN++ L+   N L LIHRL
Subjt:  ESKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRL

Query:  KYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLF
        K+LLGKL SV L+GL  QQKLAFWINTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++LNAITIEHFILRLP+HLKF CSKA K+DE MKA ++F
Subjt:  KYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLF

Query:  GLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRG
        GLEWSEPLVT+AL CGSWSSP+VRVYT SHVE+ELE AKR YLQA+V ISR  NKL++PKLLDWYLLDFAKDLESL+DWVCLQL +E+R EAVKCLER+G
Subjt:  GLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRG

Query:  MQPLEEFVQVIPYDFSFRLLLNK
         +PL + VQV+PYDFSFRLLL +
Subjt:  MQPLEEFVQVIPYDFSFRLLLNK

A0A6J0ZJR8 uncharacterized protein LOC110409952 isoform X1 | e value: 2.9e-199 | % identity: 63.08
Query:  MNARVRPNVQFLVEKASIHD---KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPS
        MN RVR  +Q +    S HD   K++  K  GS+ +   K     NRRR N E+KMALLQDVDKLK+KLRHEENVHRAL+RAFTRPLGALPRLPPYLPPS
Subjt:  MNARVRPNVQFLVEKASIHD---KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPS

Query:  TLELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPD
        TLELLAEVA+LEEEVV L E+VVNFRQ LYQEAV+ SS+RNVEN   ++E   +RS +H +SKSL++N++SS T   +PQP +A S+SSRKLL      D
Subjt:  TLELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPD

Query:  QTGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNL
        + G   +R  N +Q S K NS S     KENQ  ++ +VKDK SPEKK  K+++  K  + PTKHE+A+K LDALK QL  RL+D ER QES  G+ D+ 
Subjt:  QTGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNL

Query:  ESKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRL
         S+A  +PNKISED V+CLCSIFV++ST ++K +E      +  +N+   S E+E  DPY +C++SK  DIG Y +L  +EAN++ L+ + N L LIHRL
Subjt:  ESKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRL

Query:  KYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLF
        K+LLGKLASV L+GL  QQKLAFWINTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++LNAITIEHFILRLP+HLKF CSKA K DE MKA ++F
Subjt:  KYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLF

Query:  GLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRG
        GLEWSEPLVTFAL CGSWSSP+VRVYT SHVE+ELE AKR YLQA+V ISR  NKL++PK+LDWYLLDFAK+LESL+DWVCLQL +E R EAVKCL+R+G
Subjt:  GLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRG

Query:  MQPLEEFVQVIPYDFSFRLLLNK
         +PL + VQV+PYDFSFRLLL +
Subjt:  MQPLEEFVQVIPYDFSFRLLLNK

A0A6J0ZJZ9 uncharacterized protein LOC110409952 isoform X2 | e value: 2.3e-199 | % identity: 63.18
Query:  MNARVRPNVQFLVEKASIHD--KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPST
        MN RVR  +Q +    S HD  K++  K  GS+ +   K     NRRR N E+KMALLQDVDKLK+KLRHEENVHRAL+RAFTRPLGALPRLPPYLPPST
Subjt:  MNARVRPNVQFLVEKASIHD--KQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPST

Query:  LELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQ
        LELLAEVA+LEEEVV L E+VVNFRQ LYQEAV+ SS+RNVEN   ++E   +RS +H +SKSL++N++SS T   +PQP +A S+SSRKLL      D+
Subjt:  LELLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQ

Query:  TGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLE
         G   +R  N +Q S K NS S     KENQ  ++ +VKDK SPEKK  K+++  K  + PTKHE+A+K LDALK QL  RL+D ER QES  G+ D+  
Subjt:  TGNYSARLLNAKQNSWKSNSPS-----KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLE

Query:  SKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLK
        S+A  +PNKISED V+CLCSIFV++ST ++K +E      +  +N+   S E+E  DPY +C++SK  DIG Y +L  +EAN++ L+ + N L LIHRLK
Subjt:  SKA--SPNKISEDIVKCLCSIFVQVSTPREKCVE--FQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLK

Query:  YLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFG
        +LLGKLASV L+GL  QQKLAFWINTYNSC+MNA+LEHGIPETPE VV LMQKA IVVGG++LNAITIEHFILRLP+HLKF CSKA K DE MKA ++FG
Subjt:  YLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFG

Query:  LEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGM
        LEWSEPLVTFAL CGSWSSP+VRVYT SHVE+ELE AKR YLQA+V ISR  NKL++PK+LDWYLLDFAK+LESL+DWVCLQL +E R EAVKCL+R+G 
Subjt:  LEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGM

Query:  QPLEEFVQVIPYDFSFRLLLNK
        +PL + VQV+PYDFSFRLLL +
Subjt:  QPLEEFVQVIPYDFSFRLLLNK

A0A6J1D606 uncharacterized protein LOC111017954 isoform X2 | e value: 4.4e-280 | % identity: 83.52
Query:  MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLE
        MNARVRPN+QF +EKASIHDK EK K  GSKEM DEK GIK NRRRLN EKKMALLQDVDKLKKKLRHEENVHRAL+RAFTRPLGALPRLPPYLPPSTLE
Subjt:  MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLE

Query:  LLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTG
        LLAEVA+LEEEVV LSERVVNFRQDLYQEAVFVSSQRNVENFV  +ESIS  SL+HG+SKS       SP    R QP +A SISSRK+L SH + DQTG
Subjt:  LLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTG

Query:  NYSARLLNAKQNSWKSNSPSKENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNK
        NYSARL+NA+Q SWKSNSPSKENQF  SY VKDKPSPEKK TKI+S     KTPTKHE AEKS DALKLQLGSRL+D ER +ESSFGA D+LESK SPN+
Subjt:  NYSARLLNAKQNSWKSNSPSKENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNK

Query:  ISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKL
        ISE IVKCLCSIFV+VST  +KCVE QTP+ASS+  TS+VEAE LDPY++C+ESKGNDIG Y HLFAVEANSIHLNEMANTLP IHRLKYLLGKLASV L
Subjt:  ISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKL

Query:  EGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFA
        EGL+QQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKA+IVVGGYILNA+TIEHFILRLPYHLKFMCSKA+KSDE MKA D+FGLEWSEPLVTFA
Subjt:  EGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFA

Query:  LCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIP
        LCCGSWSSP+VRVYTGS+VE+ELEEAKRSYLQA+VGISRRGNK+MLPKLLDWYLLDFAKDLESLVDWVCLQL DE+RKEAVKCLERRG QP+EEFVQV+P
Subjt:  LCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIP

Query:  YDFSFRLLLNKLD
        YDFSFRLL NKLD
Subjt:  YDFSFRLLLNKLD

A0A6J1D7B9 uncharacterized protein LOC111017954 isoform X1 | e value: 4.7e-282 | % identity: 83.69
Query:  MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLE
        MNARVRPN+QF +EKASIHDKQEK K  GSKEM DEK GIK NRRRLN EKKMALLQDVDKLKKKLRHEENVHRAL+RAFTRPLGALPRLPPYLPPSTLE
Subjt:  MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLE

Query:  LLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTG
        LLAEVA+LEEEVV LSERVVNFRQDLYQEAVFVSSQRNVENFV  +ESIS  SL+HG+SKS       SP    R QP +A SISSRK+L SH + DQTG
Subjt:  LLAEVAILEEEVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTG

Query:  NYSARLLNAKQNSWKSNSPSKENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNK
        NYSARL+NA+Q SWKSNSPSKENQF  SY VKDKPSPEKK TKI+S     KTPTKHE AEKS DALKLQLGSRL+D ER +ESSFGA D+LESK SPN+
Subjt:  NYSARLLNAKQNSWKSNSPSKENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNK

Query:  ISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKL
        ISE IVKCLCSIFV+VST  +KCVE QTP+ASS+  TS+VEAE LDPY++C+ESKGNDIG Y HLFAVEANSIHLNEMANTLP IHRLKYLLGKLASV L
Subjt:  ISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKL

Query:  EGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFA
        EGL+QQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKA+IVVGGYILNA+TIEHFILRLPYHLKFMCSKA+KSDE MKA D+FGLEWSEPLVTFA
Subjt:  EGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFA

Query:  LCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIP
        LCCGSWSSP+VRVYTGS+VE+ELEEAKRSYLQA+VGISRRGNK+MLPKLLDWYLLDFAKDLESLVDWVCLQL DE+RKEAVKCLERRG QP+EEFVQV+P
Subjt:  LCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIP

Query:  YDFSFRLLLNKLD
        YDFSFRLL NKLD
Subjt:  YDFSFRLLLNKLD

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT4G37080.1 Protein of unknown function, DUF547 | e value: 1.7e-170 | % identity: 56.49
Query:  SIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLS
        S  D+++K +  G+  +A+    + +NRRR N EKKM LLQDVDKLK+KLR EENVHRAL+RAFTRPLGALPRLP YLP  TLELLAEVA+LEEEVV L 
Subjt:  SIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLS

Query:  ERVVNFRQDLYQEAVFVSSQRNVE--NFVNAVESISMRSLQHGQSKSLTLNDLSSP-TPTYRPQPLVAGSISSRKLLSSHK-IPDQTGNYSARLLNAKQN
        E+VVNFRQ LYQEAV++SS+RN+E  N  +  E+  +RS +H +SKS++ ++ +S  TPT + Q  ++ SISSRKL SS + + D++G    R+++ KQ 
Subjt:  ERVVNFRQDLYQEAVFVSSQRNVE--NFVNAVESISMRSLQHGQSKSLTLNDLSSP-TPTYRPQPLVAGSISSRKLLSSHK-IPDQTGNYSARLLNAKQN

Query:  SWKSNSPS-----------KENQFCSSYS--VKDKPSPEKKATKIISSFKKNKTPTKHE-AAEKSLDALKLQLGSRLMDYEREQESSFGALD---NLESK
        S KSN  S           KENQ  S+ S   K+K SPEKK  + ++S KK K   K E AA+K  ++ KLQL  RL D ++ QES  G+      L+S 
Subjt:  SWKSNSPS-----------KENQFCSSYS--VKDKPSPEKKATKIISSFKKNKTPTKHE-AAEKSLDALKLQLGSRLMDYEREQESSFGALD---NLESK

Query:  ASPNKISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKL
           N++SED++KCL +I +++S+ ++                       LDPY+ C+E +  ++G Y H  +V+ +S+ L    N   LIHRLK+LL KL
Subjt:  ASPNKISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKL

Query:  ASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEP
        + V L+GL  QQKLAFWINTYNSC+MNA LEHGIP TPE VVALMQKA I+VGG+ LNAITIEHFILRLPYHLKF C K    +E M+AH  FGLEWSEP
Subjt:  ASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEP

Query:  LVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEF
        LVTFAL CGSWSSP+VRVYT ++VEEELE AKR YLQASVGIS + NKLMLPK+LDWYLLDFAKDLESL+DWVCLQLPD++R+EA KC+ER+  + L E 
Subjt:  LVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEF

Query:  VQVIPYDFSFRLLLNK
        VQV+PYDFSFRLLL++
Subjt:  VQVIPYDFSFRLLLNK

AT4G37080.2 Protein of unknown function, DUF547 | e value: 3.7e-170 | % identity: 56.61
Query:  DKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLSERV
        D+++K +  G+  +A+    + +NRRR N EKKM LLQDVDKLK+KLR EENVHRAL+RAFTRPLGALPRLP YLP  TLELLAEVA+LEEEVV L E+V
Subjt:  DKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLSERV

Query:  VNFRQDLYQEAVFVSSQRNVE--NFVNAVESISMRSLQHGQSKSLTLNDLSSP-TPTYRPQPLVAGSISSRKLLSSHK-IPDQTGNYSARLLNAKQNSWK
        VNFRQ LYQEAV++SS+RN+E  N  +  E+  +RS +H +SKS++ ++ +S  TPT + Q  ++ SISSRKL SS + + D++G    R+++ KQ S K
Subjt:  VNFRQDLYQEAVFVSSQRNVE--NFVNAVESISMRSLQHGQSKSLTLNDLSSP-TPTYRPQPLVAGSISSRKLLSSHK-IPDQTGNYSARLLNAKQNSWK

Query:  SNSPS-----------KENQFCSSYS--VKDKPSPEKKATKIISSFKKNKTPTKHE-AAEKSLDALKLQLGSRLMDYEREQESSFGALD---NLESKASP
        SN  S           KENQ  S+ S   K+K SPEKK  + ++S KK K   K E AA+K  ++ KLQL  RL D ++ QES  G+      L+S    
Subjt:  SNSPS-----------KENQFCSSYS--VKDKPSPEKKATKIISSFKKNKTPTKHE-AAEKSLDALKLQLGSRLMDYEREQESSFGALD---NLESKASP

Query:  NKISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASV
        N++SED++KCL +I +++S+ ++                       LDPY+ C+E +  ++G Y H  +V+ +S+ L    N   LIHRLK+LL KL+ V
Subjt:  NKISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASV

Query:  KLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVT
         L+GL  QQKLAFWINTYNSC+MNA LEHGIP TPE VVALMQKA I+VGG+ LNAITIEHFILRLPYHLKF C K    +E M+AH  FGLEWSEPLVT
Subjt:  KLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVT

Query:  FALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQV
        FAL CGSWSSP+VRVYT ++VEEELE AKR YLQASVGIS + NKLMLPK+LDWYLLDFAKDLESL+DWVCLQLPD++R+EA KC+ER+  + L E VQV
Subjt:  FALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQV

Query:  IPYDFSFRLLLNK
        +PYDFSFRLLL++
Subjt:  IPYDFSFRLLLNK

AT4G37080.3 Protein of unknown function, DUF547 | e value: 3.7e-170 | % identity: 56.61
Query:  DKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLSERV
        D+++K +  G+  +A+    + +NRRR N EKKM LLQDVDKLK+KLR EENVHRAL+RAFTRPLGALPRLP YLP  TLELLAEVA+LEEEVV L E+V
Subjt:  DKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLSERV

Query:  VNFRQDLYQEAVFVSSQRNVE--NFVNAVESISMRSLQHGQSKSLTLNDLSSP-TPTYRPQPLVAGSISSRKLLSSHK-IPDQTGNYSARLLNAKQNSWK
        VNFRQ LYQEAV++SS+RN+E  N  +  E+  +RS +H +SKS++ ++ +S  TPT + Q  ++ SISSRKL SS + + D++G    R+++ KQ S K
Subjt:  VNFRQDLYQEAVFVSSQRNVE--NFVNAVESISMRSLQHGQSKSLTLNDLSSP-TPTYRPQPLVAGSISSRKLLSSHK-IPDQTGNYSARLLNAKQNSWK

Query:  SNSPS-----------KENQFCSSYS--VKDKPSPEKKATKIISSFKKNKTPTKHE-AAEKSLDALKLQLGSRLMDYEREQESSFGALD---NLESKASP
        SN  S           KENQ  S+ S   K+K SPEKK  + ++S KK K   K E AA+K  ++ KLQL  RL D ++ QES  G+      L+S    
Subjt:  SNSPS-----------KENQFCSSYS--VKDKPSPEKKATKIISSFKKNKTPTKHE-AAEKSLDALKLQLGSRLMDYEREQESSFGALD---NLESKASP

Query:  NKISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASV
        N++SED++KCL +I +++S+ ++                       LDPY+ C+E +  ++G Y H  +V+ +S+ L    N   LIHRLK+LL KL+ V
Subjt:  NKISEDIVKCLCSIFVQVSTPREKCVEFQTPRASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASV

Query:  KLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVT
         L+GL  QQKLAFWINTYNSC+MNA LEHGIP TPE VVALMQKA I+VGG+ LNAITIEHFILRLPYHLKF C K    +E M+AH  FGLEWSEPLVT
Subjt:  KLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVT

Query:  FALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQV
        FAL CGSWSSP+VRVYT ++VEEELE AKR YLQASVGIS + NKLMLPK+LDWYLLDFAKDLESL+DWVCLQLPD++R+EA KC+ER+  + L E VQV
Subjt:  FALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQV

Query:  IPYDFSFRLLLNK
        +PYDFSFRLLL++
Subjt:  IPYDFSFRLLLNK

AT5G42690.1 Protein of unknown function, DUF547 | e value: 5.6e-118 | % identity: 46.56
Query:  KEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLSERVVNFRQDLYQEA
        K++   ++G+ +NR+ LN EK + L +DV+KL+KKLR EEN+HRA++RAF+RPLGALPRLPP+LPPS LELLAEVA+LEEE+V L E +V+ RQ+LYQEA
Subjt:  KEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLSERVVNFRQDLYQEA

Query:  VFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTGNYSARLLNAKQNSWKSNSPSKENQFCSSYS
        VF SS  ++EN        S    +H Q+KS                                    ++ + SAR         +S SP        S  
Subjt:  VFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTGNYSARLLNAKQNSWKSNSPSKENQFCSSYS

Query:  VKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNKISEDIVKCLCSIFVQVSTPREKCVEFQTPR
         K K + +  AT I +  K  KT   H    KSL+A KL+   R      E+ S  G          PNKISED+VKCL +IF+++S+ +   V      
Subjt:  VKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNKISEDIVKCLCSIFVQVSTPREKCVEFQTPR

Query:  ASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNE-MANTLPLIHRLKYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGI
          + +  +  +    DPY +C+  +  DIG Y +   VE  S++ N   +++L LI +LK LLG+L+ V ++ L+QQ+KLAFWIN YNSC+MN  LEHGI
Subjt:  ASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNE-MANTLPLIHRLKYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGI

Query:  PETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRS
        PE+P+ +V LMQKA I VGG+ LNAITIEHFILRLP+H K++  K  K +E M     FGLE SEPLVTFAL CGSWSSP+VRVYT S VEEELE AKR 
Subjt:  PETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRS

Query:  YLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLER-RGMQPLEEFVQVIPYDFSFRLLLN
        YL+ASVGIS    K+ +PKL+DWY  DFAKD+ESL+DW+ LQLP E+ K+A+ C+E+     P    V +IPYDF+FR L +
Subjt:  YLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLER-RGMQPLEEFVQVIPYDFSFRLLLN

AT5G42690.2 Protein of unknown function, DUF547 | e value: 1.5e-118 | % identity: 46.74
Query:  KEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLSERVVNFRQDLYQEA
        K++   ++G+ +NR+ LN EK + L +DV+KL+KKLR EEN+HRA++RAF+RPLGALPRLPP+LPPS LELLAEVA+LEEE+V L E +V+ RQ+LYQEA
Subjt:  KEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEEEVVWLSERVVNFRQDLYQEA

Query:  VFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTGNYSARLLNAKQNSWKSNSPSKENQFCSSYS
        VF SS  ++EN        S    +H Q+KS                                    ++ + SAR         +S SP        S  
Subjt:  VFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTGNYSARLLNAKQNSWKSNSPSKENQFCSSYS

Query:  VKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNKISEDIVKCLCSIFVQVSTPREKCVEFQTPR
         K K + +  AT I +  K  KT   H    KSL+A KL+  S        + SS G  D       PNKISED+VKCL +IF+++S+ +   V      
Subjt:  VKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNKISEDIVKCLCSIFVQVSTPREKCVEFQTPR

Query:  ASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNE-MANTLPLIHRLKYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGI
          + +  +  +    DPY +C+  +  DIG Y +   VE  S++ N   +++L LI +LK LLG+L+ V ++ L+QQ+KLAFWIN YNSC+MN  LEHGI
Subjt:  ASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNE-MANTLPLIHRLKYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGI

Query:  PETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRS
        PE+P+ +V LMQKA I VGG+ LNAITIEHFILRLP+H K++  K  K +E M     FGLE SEPLVTFAL CGSWSSP+VRVYT S VEEELE AKR 
Subjt:  PETPERVVALMQKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRS

Query:  YLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLER-RGMQPLEEFVQVIPYDFSFRLLLN
        YL+ASVGIS    K+ +PKL+DWY  DFAKD+ESL+DW+ LQLP E+ K+A+ C+E+     P    V +IPYDF+FR L +
Subjt:  YLQASVGISRRGNKLMLPKLLDWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLER-RGMQPLEEFVQVIPYDFSFRLLLN
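The middle line of each alignment block can be tallied programmatically. The sketch below assumes the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `midline_stats` is hypothetical, and the example fragment covers only the first 21 columns of one block, so it does not reproduce the %identity reported for the whole alignment.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Count identities, positives and columns in a BLAST-style midline.

    Assumes the caller has already removed the left-hand label padding,
    so every character of `midline` corresponds to one alignment column.
    """
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitutions
    return identities, positives, len(midline)


# Start of the midline from the block beginning "PETPERVVALMQKAKIVVGGY..."
identities, positives, columns = midline_stats("PE+P+ +V LMQKA I VGG+")
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

Summing these counts over every block of a hit, and dividing identities by the total number of columns, should approximate the %identity column, provided the midlines are copied with their spacing intact.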


Sequences
CDS sequence
ATGAATGCCAGAGTTCGACCCAACGTTCAATTTCTGGTAGAGAAAGCGTCAATTCATGATAAACAGGAGAAAGGGAAGCGAGGGGGGAGCAAAGAGATGGCTGATGAGAA
AGATGGAATCAAAATCAATAGACGCAGATTAAACACAGAAAAGAAAATGGCATTGCTACAAGATGTTGATAAGCTAAAGAAGAAGCTGAGGCATGAGGAGAATGTTCATA
GAGCTTTGAAGAGAGCTTTCACTAGGCCTTTAGGAGCTTTGCCTCGTCTTCCTCCTTATCTTCCTCCATCTACACTTGAGCTTCTAGCTGAAGTAGCTATCCTTGAAGAG
GAAGTTGTTTGGCTTTCCGAACGAGTTGTGAATTTTCGACAAGATCTTTACCAAGAAGCTGTCTTTGTTTCCTCACAGCGGAATGTCGAAAATTTTGTCAATGCTGTTGA
GAGTATCTCAATGAGAAGTTTACAGCACGGTCAATCGAAATCTTTGACTTTGAATGATCTCAGTTCTCCAACACCAACATATCGGCCTCAACCGCTGGTTGCCGGAAGCA
TTTCGAGTAGAAAGCTATTATCTAGTCACAAAATCCCTGATCAAACAGGGAATTATTCTGCTAGGCTCTTGAATGCCAAGCAAAACTCTTGGAAATCTAATTCACCTTCA
AAGGAGAATCAGTTTTGTTCTTCTTATTCTGTGAAAGATAAACCTTCTCCAGAAAAGAAAGCCACTAAAATTATTAGCTCATTCAAGAAGAATAAGACACCAACCAAACA
TGAAGCTGCAGAGAAGAGTTTAGATGCTTTGAAGTTGCAGCTTGGATCCAGGTTAATGGATTATGAAAGAGAACAAGAGAGTTCTTTTGGTGCACTGGATAATCTAGAAT
CTAAGGCATCGCCTAACAAAATTTCCGAGGATATTGTGAAGTGCTTATGTTCCATTTTTGTTCAAGTGAGCACTCCGAGAGAGAAGTGTGTCGAATTTCAAACTCCTCGA
GCTTCGTCTAACACCTGCACGAGCAGTGTAGAAGCAGAGCATCTGGATCCATACCATGTATGCGCAGAATCCAAAGGAAATGATATTGGTTTCTATCTGCATCTGTTTGC
AGTCGAAGCGAATTCAATTCATCTCAATGAAATGGCTAACACACTTCCCCTAATTCACAGGCTAAAATATCTACTCGGAAAGCTTGCCTCTGTGAAGTTAGAGGGTCTGC
ACCAGCAGCAGAAACTGGCCTTCTGGATTAACACCTATAATTCTTGCATAATGAATGCACTTTTGGAGCATGGAATACCTGAGACTCCAGAAAGGGTTGTAGCTCTAATG
CAAAAGGCCAAAATCGTCGTTGGGGGATACATTCTCAATGCAATAACAATAGAGCATTTCATTTTGAGACTGCCTTACCACCTGAAATTTATGTGTTCGAAGGCTGTTAA
AAGTGACGAGATGATGAAAGCACATGATTTGTTTGGATTAGAGTGGTCTGAACCATTGGTTACATTTGCTCTTTGTTGCGGAAGCTGGTCGTCTCCTTCGGTGAGAGTGT
ACACAGGAAGTCATGTGGAGGAAGAGTTAGAAGAGGCCAAGAGAAGCTACTTGCAAGCTTCAGTTGGAATATCAAGAAGAGGAAACAAACTAATGCTTCCAAAACTATTG
GATTGGTATTTACTTGACTTTGCAAAGGATTTGGAGTCATTGGTGGATTGGGTTTGCTTACAACTACCTGATGAGATTAGAAAAGAAGCTGTTAAATGCCTTGAAAGAAG
GGGCATGCAGCCTCTTGAAGAGTTTGTTCAAGTGATTCCTTATGATTTCAGTTTTAGATTGCTTTTAAACAAATTGGATTCATGA
mRNA sequence
AAGAAATCAGAGCAGACGACAATTAATGAGGAGAGCTGTTGGCCCTTGTGCCGTGTCTCGCAATATGCTTAATGCTAACTCAACCGCATCAATTCCATATGGAAATGGAA
TAGTCTCTCTCTCTCTCTCTCATCTCCTGAAAGAATAACAAGAGCAAGAATCTCTACACTCTGAACTTTCGTGCTCGCTCTCTTTGATTCTTTGTGTTTGAGTGTGAGAA
AAATGAATGCCAGAGTTCGACCCAACGTTCAATTTCTGGTAGAGAAAGCGTCAATTCATGATAAACAGGAGAAAGGGAAGCGAGGGGGGAGCAAAGAGATGGCTGATGAG
AAAGATGGAATCAAAATCAATAGACGCAGATTAAACACAGAAAAGAAAATGGCATTGCTACAAGATGTTGATAAGCTAAAGAAGAAGCTGAGGCATGAGGAGAATGTTCA
TAGAGCTTTGAAGAGAGCTTTCACTAGGCCTTTAGGAGCTTTGCCTCGTCTTCCTCCTTATCTTCCTCCATCTACACTTGAGCTTCTAGCTGAAGTAGCTATCCTTGAAG
AGGAAGTTGTTTGGCTTTCCGAACGAGTTGTGAATTTTCGACAAGATCTTTACCAAGAAGCTGTCTTTGTTTCCTCACAGCGGAATGTCGAAAATTTTGTCAATGCTGTT
GAGAGTATCTCAATGAGAAGTTTACAGCACGGTCAATCGAAATCTTTGACTTTGAATGATCTCAGTTCTCCAACACCAACATATCGGCCTCAACCGCTGGTTGCCGGAAG
CATTTCGAGTAGAAAGCTATTATCTAGTCACAAAATCCCTGATCAAACAGGGAATTATTCTGCTAGGCTCTTGAATGCCAAGCAAAACTCTTGGAAATCTAATTCACCTT
CAAAGGAGAATCAGTTTTGTTCTTCTTATTCTGTGAAAGATAAACCTTCTCCAGAAAAGAAAGCCACTAAAATTATTAGCTCATTCAAGAAGAATAAGACACCAACCAAA
CATGAAGCTGCAGAGAAGAGTTTAGATGCTTTGAAGTTGCAGCTTGGATCCAGGTTAATGGATTATGAAAGAGAACAAGAGAGTTCTTTTGGTGCACTGGATAATCTAGA
ATCTAAGGCATCGCCTAACAAAATTTCCGAGGATATTGTGAAGTGCTTATGTTCCATTTTTGTTCAAGTGAGCACTCCGAGAGAGAAGTGTGTCGAATTTCAAACTCCTC
GAGCTTCGTCTAACACCTGCACGAGCAGTGTAGAAGCAGAGCATCTGGATCCATACCATGTATGCGCAGAATCCAAAGGAAATGATATTGGTTTCTATCTGCATCTGTTT
GCAGTCGAAGCGAATTCAATTCATCTCAATGAAATGGCTAACACACTTCCCCTAATTCACAGGCTAAAATATCTACTCGGAAAGCTTGCCTCTGTGAAGTTAGAGGGTCT
GCACCAGCAGCAGAAACTGGCCTTCTGGATTAACACCTATAATTCTTGCATAATGAATGCACTTTTGGAGCATGGAATACCTGAGACTCCAGAAAGGGTTGTAGCTCTAA
TGCAAAAGGCCAAAATCGTCGTTGGGGGATACATTCTCAATGCAATAACAATAGAGCATTTCATTTTGAGACTGCCTTACCACCTGAAATTTATGTGTTCGAAGGCTGTT
AAAAGTGACGAGATGATGAAAGCACATGATTTGTTTGGATTAGAGTGGTCTGAACCATTGGTTACATTTGCTCTTTGTTGCGGAAGCTGGTCGTCTCCTTCGGTGAGAGT
GTACACAGGAAGTCATGTGGAGGAAGAGTTAGAAGAGGCCAAGAGAAGCTACTTGCAAGCTTCAGTTGGAATATCAAGAAGAGGAAACAAACTAATGCTTCCAAAACTAT
TGGATTGGTATTTACTTGACTTTGCAAAGGATTTGGAGTCATTGGTGGATTGGGTTTGCTTACAACTACCTGATGAGATTAGAAAAGAAGCTGTTAAATGCCTTGAAAGA
AGGGGCATGCAGCCTCTTGAAGAGTTTGTTCAAGTGATTCCTTATGATTTCAGTTTTAGATTGCTTTTAAACAAATTGGATTCATGAAGTATTAAAATACAGTTCAAAGT
ATATTGAGGTATATGTCTATGTAGGCTTGCAATTGCAATAATAAAATGATGTTGGGTGTGACTTTAA
Protein sequence
MNARVRPNVQFLVEKASIHDKQEKGKRGGSKEMADEKDGIKINRRRLNTEKKMALLQDVDKLKKKLRHEENVHRALKRAFTRPLGALPRLPPYLPPSTLELLAEVAILEE
EVVWLSERVVNFRQDLYQEAVFVSSQRNVENFVNAVESISMRSLQHGQSKSLTLNDLSSPTPTYRPQPLVAGSISSRKLLSSHKIPDQTGNYSARLLNAKQNSWKSNSPS
KENQFCSSYSVKDKPSPEKKATKIISSFKKNKTPTKHEAAEKSLDALKLQLGSRLMDYEREQESSFGALDNLESKASPNKISEDIVKCLCSIFVQVSTPREKCVEFQTPR
ASSNTCTSSVEAEHLDPYHVCAESKGNDIGFYLHLFAVEANSIHLNEMANTLPLIHRLKYLLGKLASVKLEGLHQQQKLAFWINTYNSCIMNALLEHGIPETPERVVALM
QKAKIVVGGYILNAITIEHFILRLPYHLKFMCSKAVKSDEMMKAHDLFGLEWSEPLVTFALCCGSWSSPSVRVYTGSHVEEELEEAKRSYLQASVGISRRGNKLMLPKLL
DWYLLDFAKDLESLVDWVCLQLPDEIRKEAVKCLERRGMQPLEEFVQVIPYDFSFRLLLNKLDS
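
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1), the snippet below translates a CDS string codon by codon and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order
# with the first base varying slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above
print(translate("ATGAATGCCAGAGTTCGACCCAACGTTCAA"))
print(translate("AAATTGGATTCATGA"))
```

Pasting the full CDS, line breaks included, into `translate` should yield the listed protein sequence if the record is internally consistent; ambiguous bases such as `N` are not handled by this sketch and would raise a `KeyError`.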