; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G005840 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G005840
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function, DUF547
Genome locationchr10:8477531..8482082
RNA-Seq ExpressionLsi10G005840
SyntenyLsi10G005840
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148063.1 uncharacterized protein LOC101212736 isoform X1 [Cucumis sativus]2.2e-30585.49Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDVKTG+SRSRRILLSLL A+GSS+DKKA++DDRL+NLLESSNTVK++MDQV+EVE+KKNSSPKI +HNSLKQEIIQLEKRLQDQFKLRS LEKTLG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HGVF CNESDKISMPKSAVELI+EIA LE+EVVHLEQYLLSLYRKAFDGQSSS SPSAKD+KSKLPS  +GR ME+PL         S FPSAC SL+NP
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        RKDYSDIGRDEKLLV++Y RSQSSLTTV+AAS DKVSTSVESLD TLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDMIKCIS+
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        IYSKL EPS  N GLSSP SSLSS SAFSPGEQSAMCSPGFRNNSSFDVRLDNPFL+EGLKEFSGPYSTMIEISWICGDPQKLCHVK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDL KL YEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL
        LRLLLPSRTKFK GDERQ YIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGV K QK+LLPKIIESF KDSGLCS GLMEMIL
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL

Query:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM
        KSLPESLRKS+K S LGNPRKNVEWIPP+YTFRYLISKEL+
Subjt:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM

XP_008457797.1 PREDICTED: uncharacterized protein LOC103497400 isoform X1 [Cucumis melo]2.4e-30786.43Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDVKTG+SRSRRILLSLLYA+GSS+DKKA++DDRL+NLLESSNTVK+DMD+VKEVE+KKNSS KI VH+SLKQEIIQLEKRLQDQFKLRSALEKTLG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HGVFPCNESDKISMPKSAVELI+EIA LE+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKD+K+KLPSTP GR MEAPLP        SAFPSAC SL+NP
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        R+DYSDIGRDEKLLV +Y RSQSSLTTVNA S  KVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDMIKCIS+
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        IYSKLAEPS  N GLSSP SSLSSVSAFSPGEQSAMCSPGFRNNSSFDV LDNPFLVEGLKEFSGPYSTMIEISWI GDPQKL HVK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDLR L YEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL
        LRLLLPSRTKFK GDERQ YI+DRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGV K QKILLPKIIESF KDSGLCS GLMEMIL
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL

Query:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM
        KSLPESLRKS+K S LGNPRK VEWIPP+YTFRYLISKEL+
Subjt:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM

XP_022158699.1 uncharacterized protein LOC111025163 isoform X1 [Momordica charantia]3.7e-30084.6Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDV+TGH RS+RILLSLLYAY SSLDKKA++DDRL N LES N V+MDMD+VKEVE+ KN S K+EVHNSLKQEIIQLEKRLQDQFKLRSALEK LG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HG+FPC++SDK+SMPKSA+ELI EIATLELEVVHLEQYLLSLYR+AFDGQSSSVSPSA D+KSKLPSTPRG +ME PLP        SA PS CQSLEN 
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        RK+ S+IGRDEKL+VSN+HRS+SSLTTVNAAS +K+STSVESLDRTL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        I+ KLAEPSL+N+GLSSPTSSLSSVSAFSPGEQ AMCSPG RNNSSFDVRLDNPFLVEGLK+FSGPYSTMIEISWICGDPQKLC+VK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDL KLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQK-ILLPKIIESFAKDSGLCSSGLMEMI
        LRLLLPSRTKFKTGDER+AYII+RPEPLLHFALCSGSHSDPAVRVYT KRVFQELESAKDEYIRATFGV K +K ILLPK IESFAKDSGLCSSGLMEMI
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQK-ILLPKIIESFAKDSGLCSSGLMEMI

Query:  LKSLPESLRKSIKHS-KLGNPRKNVEWIPPSYTFRYLISKELM
        L SLPESLRKS+K S +LGNPRKNVEWIPPSYTFRYLISKEL+
Subjt:  LKSLPESLRKSIKHS-KLGNPRKNVEWIPPSYTFRYLISKELM

XP_038900882.1 uncharacterized protein LOC120087938 isoform X1 [Benincasa hispida]0.0e+0090.02Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDVKTGHSRSRRILLSLLYAYGSS+DKK IRDDRLDNLLESSNTVKMDMD VKEVE+KKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HGVF CNESDK+SMPKSAVELI+EIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKD+K KLP TPRGRAMEAP P        SAFPSACQSLENP
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        RK+YSDIGRDEKLLVSNYHRSQSSLTTVNAAS DK+STSVESLDRTLR CHSQPVSMMEYAQNVS NIISLAEHLGTRISDH+PETPNRLSEDMIKCIS+
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        IYSKLAEPSL N+GLSSPTSSLSSVSAFSPGEQ AMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWIC DPQKLCHVK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQS ILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL
        LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC GSHSDPAVRVYTPKRVFQELE+AKDEYIRATFGVGK QKILLPKIIESFAKD+GLCSSGLMEMIL
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL

Query:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM
        KSLPESLRKS+K S+LGNPRKNVEWIP SYTFRYLISKELM
Subjt:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM

XP_038900883.1 uncharacterized protein LOC120087938 isoform X2 [Benincasa hispida]0.0e+0088.3Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDVKTGHSRSR           SS+DKK IRDDRLDNLLESSNTVKMDMD VKEVE+KKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HGVF CNESDK+SMPKSAVELI+EIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKD+K KLP TPRGRAMEAP P        SAFPSACQSLENP
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        RK+YSDIGRDEKLLVSNYHRSQSSLTTVNAAS DK+STSVESLDRTLR CHSQPVSMMEYAQNVS NIISLAEHLGTRISDH+PETPNRLSEDMIKCIS+
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        IYSKLAEPSL N+GLSSPTSSLSSVSAFSPGEQ AMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWIC DPQKLCHVK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQS ILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL
        LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC GSHSDPAVRVYTPKRVFQELE+AKDEYIRATFGVGK QKILLPKIIESFAKD+GLCSSGLMEMIL
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL

Query:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM
        KSLPESLRKS+K S+LGNPRKNVEWIP SYTFRYLISKELM
Subjt:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM

TrEMBL top hitse value%identityAlignment
A0A0A0LMQ5 Uncharacterized protein1.1e-30585.49Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDVKTG+SRSRRILLSLL A+GSS+DKKA++DDRL+NLLESSNTVK++MDQV+EVE+KKNSSPKI +HNSLKQEIIQLEKRLQDQFKLRS LEKTLG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HGVF CNESDKISMPKSAVELI+EIA LE+EVVHLEQYLLSLYRKAFDGQSSS SPSAKD+KSKLPS  +GR ME+PL         S FPSAC SL+NP
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        RKDYSDIGRDEKLLV++Y RSQSSLTTV+AAS DKVSTSVESLD TLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDMIKCIS+
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        IYSKL EPS  N GLSSP SSLSS SAFSPGEQSAMCSPGFRNNSSFDVRLDNPFL+EGLKEFSGPYSTMIEISWICGDPQKLCHVK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDL KL YEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL
        LRLLLPSRTKFK GDERQ YIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGV K QK+LLPKIIESF KDSGLCS GLMEMIL
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL

Query:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM
        KSLPESLRKS+K S LGNPRKNVEWIPP+YTFRYLISKEL+
Subjt:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM

A0A1S3C5X2 uncharacterized protein LOC103497400 isoform X11.2e-30786.43Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDVKTG+SRSRRILLSLLYA+GSS+DKKA++DDRL+NLLESSNTVK+DMD+VKEVE+KKNSS KI VH+SLKQEIIQLEKRLQDQFKLRSALEKTLG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HGVFPCNESDKISMPKSAVELI+EIA LE+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKD+K+KLPSTP GR MEAPLP        SAFPSAC SL+NP
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        R+DYSDIGRDEKLLV +Y RSQSSLTTVNA S  KVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDMIKCIS+
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        IYSKLAEPS  N GLSSP SSLSSVSAFSPGEQSAMCSPGFRNNSSFDV LDNPFLVEGLKEFSGPYSTMIEISWI GDPQKL HVK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDLR L YEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL
        LRLLLPSRTKFK GDERQ YI+DRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGV K QKILLPKIIESF KDSGLCS GLMEMIL
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL

Query:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM
        KSLPESLRKS+K S LGNPRK VEWIPP+YTFRYLISKEL+
Subjt:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM

A0A5A7TUI2 Uncharacterized protein5.2e-30086.28Show/hide
Query:  RILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLGHGVFPCNESDKISM
        RILLSLLYA+GSS+DKKA++DDRL+NLLESSNTVK+DMD+VKEVE+KKNSS KI VH+SLKQEIIQLEKRLQDQFKLRSALEKTLGHGVFPCNESDKISM
Subjt:  RILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLGHGVFPCNESDKISM

Query:  PKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENPRKDYSDIGRDEKLL
        PKSAVELI+EIA LE+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKD+K+KLPSTP GR MEAPLP        SAFPSAC SL+NPR+DYSDIGRDEKLL
Subjt:  PKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENPRKDYSDIGRDEKLL

Query:  VSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISSIYSKLAEPSLSNYG
        V +Y RSQSSLTTVNA S  KVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDMIKCIS+IYSKLAEPS  N G
Subjt:  VSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISSIYSKLAEPSLSNYG

Query:  LSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIFRHQHNKSFPWRFSV
        LSSP SSLSSVSAFSPGEQSAMCSPGFRNNSSFDV LDNPFLVEGLKEFSGPYSTMIEISWI GDPQKL HVK+LLENFR                    
Subjt:  LSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIFRHQHNKSFPWRFSV

Query:  IVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQWLRLLLPSRTKFKTG
           LLISRLEEVDLR L YEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMP PRQWLRLLLPSRTKFK G
Subjt:  IVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQWLRLLLPSRTKFKTG

Query:  DERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSIKHS
        DERQ YI+DRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGV K QKILLPKIIESF KDSGLCS GLMEMILKSLPESLRKS+K S
Subjt:  DERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSIKHS

Query:  KLGNPRKNVEWIPPSYTFRYLISKELM
         LGNPRK VEWIPP+YTFRYLISKEL+
Subjt:  KLGNPRKNVEWIPPSYTFRYLISKELM

A0A5D3BMC4 Uncharacterized protein1.2e-30786.43Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDVKTG+SRSRRILLSLLYA+GSS+DKKA++DDRL+NLLESSNTVK+DMD+VKEVE+KKNSS KI VH+SLKQEIIQLEKRLQDQFKLRSALEKTLG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HGVFPCNESDKISMPKSAVELI+EIA LE+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKD+K+KLPSTP GR MEAPLP        SAFPSAC SL+NP
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        R+DYSDIGRDEKLLV +Y RSQSSLTTVNA S  KVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDMIKCIS+
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        IYSKLAEPS  N GLSSP SSLSSVSAFSPGEQSAMCSPGFRNNSSFDV LDNPFLVEGLKEFSGPYSTMIEISWI GDPQKL HVK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDLR L YEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL
        LRLLLPSRTKFK GDERQ YI+DRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGV K QKILLPKIIESF KDSGLCS GLMEMIL
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMIL

Query:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM
        KSLPESLRKS+K S LGNPRK VEWIPP+YTFRYLISKEL+
Subjt:  KSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELM

A0A6J1E058 uncharacterized protein LOC111025163 isoform X11.8e-30084.6Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG
        MLGVDV+TGH RS+RILLSLLYAY SSLDKKA++DDRL N LES N V+MDMD+VKEVE+ KN S K+EVHNSLKQEIIQLEKRLQDQFKLRSALEK LG
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLG

Query:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP
        HG+FPC++SDK+SMPKSA+ELI EIATLELEVVHLEQYLLSLYR+AFDGQSSSVSPSA D+KSKLPSTPRG +ME PLP        SA PS CQSLEN 
Subjt:  HGVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLP--------SAFPSACQSLENP

Query:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
        RK+ S+IGRDEKL+VSN+HRS+SSLTTVNAAS +K+STSVESLDRTL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS
Subjt:  RKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISS

Query:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF
        I+ KLAEPSL+N+GLSSPTSSLSSVSAFSPGEQ AMCSPG RNNSSFDVRLDNPFLVEGLK+FSGPYSTMIEISWICGDPQKLC+VK+LLENFR      
Subjt:  IYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIF

Query:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW
                         LLISRLEEVDL KLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMP PRQW
Subjt:  RHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQW

Query:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQK-ILLPKIIESFAKDSGLCSSGLMEMI
        LRLLLPSRTKFKTGDER+AYII+RPEPLLHFALCSGSHSDPAVRVYT KRVFQELESAKDEYIRATFGV K +K ILLPK IESFAKDSGLCSSGLMEMI
Subjt:  LRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQK-ILLPKIIESFAKDSGLCSSGLMEMI

Query:  LKSLPESLRKSIKHS-KLGNPRKNVEWIPPSYTFRYLISKELM
        L SLPESLRKS+K S +LGNPRKNVEWIPPSYTFRYLISKEL+
Subjt:  LKSLPESLRKSIKHS-KLGNPRKNVEWIPPSYTFRYLISKELM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23700.1 Protein of unknown function, DUF5475.0e-13844.52Show/hide
Query:  DKKAIRDDR-LDNLLESSNTVKMDMDQV--KEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLGHGVFPCNESDKISMPKSAVELIREI
        +KK + D+  +D+ L++S  +K+D+ +   K  E KK+ SP ++  +SLKQEI +LEKRLQ+QF +R ALEK LG+   P  +    S PK   ELI+EI
Subjt:  DKKAIRDDR-LDNLLESSNTVKMDMDQV--KEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLGHGVFPCNESDKISMPKSAVELIREI

Query:  ATLELEVVHLEQYLLSLYRKAFDGQSSSVS-PSAKDKKSKLP-STPRG-----------------------RAMEAPLPS----------AFPSACQSLE
        A LELEV HLEQYLLSLYRKAFD Q+SSVS P++K + S  P ST RG                       R +E  L S          A    C S +
Subjt:  ATLELEVVHLEQYLLSLYRKAFDGQSSSVS-PSAKDKKSKLP-STPRG-----------------------RAMEAPLPS----------AFPSACQSLE

Query:  NPRKDYSDIGRDEKLLVS---------------------NYHRSQSSLTT-------------VNAASFDKVSTSVES-------------------LDR
        N  K+ S  GR     VS                     ++++  S + +                  FD+ S+ ++S                   +D 
Subjt:  NPRKDYSDIGRDEKLLVS---------------------NYHRSQSSLTT-------------VNAASFDKVSTSVES-------------------LDR

Query:  TLR------------------------ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISSIYSKLAEPSLSNYGLSSPTSS
         +R                        ACHSQP+S+ EY QN  SN  SLAEH+GTRISDH+  TPN+LSE+MIKC S+IYSKLA+P   N+G SSP+SS
Subjt:  TLR------------------------ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISSIYSKLAEPSLSNYGLSSPTSS

Query:  LSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIFRHQHNKSFPWRFSVIVQLLIS
         SS S FSP +Q  M SP FR NSSFD +           EFSGPYS+MIE+S I  + ++           R   ++     N++F         LL+ 
Subjt:  LSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIFRHQHNKSFPWRFSVIVQLLIS

Query:  RLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQWLRLLLPSRTKFKTGDERQAYI
        +LE VD RKL ++EKLAFWIN+HN+LVMHT+LA G+PQNN KR  LL K AY IGG  +S++ IQS IL  +MP P QWL+LLL  + KF+TGDE Q Y 
Subjt:  RLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQWLRLLLPSRTKFKTGDERQAYI

Query:  IDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSIKHSKLGNPRK
        ++  EPLL+FALCSG+HSDPA+RV+TPK ++QELE+AK+EYIRATFGV K QK++LPKIIESF+KDSGL  + LMEMI + LPE+++K+IK    G  RK
Subjt:  IDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSIKHSKLGNPRK

Query:  N-VEWIPPSYTFRYLISKELM
        + VEW P ++ FRYLI++EL+
Subjt:  N-VEWIPPSYTFRYLISKELM

AT5G66600.1 Protein of unknown function, DUF5476.7e-16753.17Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLL-ESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTL
        ML + +   H RS+           S  +KK +  D+  N   E+S  +K+DM +  E    K++        SLKQEI  LE RLQDQFK+R ALEK L
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLL-ESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTL

Query:  GH---GVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLP--STPRGR---AMEAPLPSAFPSACQSL--
        G+     +   E++ I+MPK A +LI+++A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ ++KK K P  +TPR R   + +   PS        L  
Subjt:  GH---GVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLP--STPRGR---AMEAPLPSAFPSACQSL--

Query:  --ENPRKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMI
          +N  K       D   +  ++ RS S      +A   + ++  +S  +  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE M+
Subjt:  --ENPRKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMI

Query:  KCISSIYSKLAE-PSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFR
        KC+S IY KLAE PS+ + GLSSP SSLSS SAFSP +Q    SPGF N+SSFDVRLDN F VEG K+FSGPYS+++E+  I  D +K   V++LL+NF+
Subjt:  KCISSIYSKLAE-PSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFR

Query:  SAYVIFRHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRM
        S                       LISRLEEVD RKLK+EEKLAFWIN+HN+LVMH +LAYG+PQNNVKR  LLLK+AYNIGGHTIS + IQS ILGC+M
Subjt:  SAYVIFRHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRM

Query:  PSPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSG
          P QWLRLL  SR KFK GDER AY ID PEPLLHFAL SGSHSDPAVRVYTPKR+ QELE++K+EYIR    + K Q+ILLPK++E+FAKDSGLC +G
Subjt:  PSPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSG

Query:  LMEMILKSLPESLRKSIK--HSKLGNPRKNVEWIPPSYTFRYLISKE
        L EM+ +S+PES RK +K   S    PRK ++WIP S+TFRYLI +E
Subjt:  LMEMILKSLPESLRKSIK--HSKLGNPRKNVEWIPPSYTFRYLISKE

AT5G66600.2 Protein of unknown function, DUF5472.6e-16654.6Show/hide
Query:  DKKAIRDDRLDNLL-ESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLGH---GVFPCNESDKISMPKSAVELIRE
        +KK +  D+  N   E+S  +K+DM +  E    K++        SLKQEI  LE RLQDQFK+R ALEK LG+     +   E++ I+MPK A +LI++
Subjt:  DKKAIRDDRLDNLL-ESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLGH---GVFPCNESDKISMPKSAVELIRE

Query:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLP--STPRGR---AMEAPLPSAFPSACQSL----ENPRKDYSDIGRDEKLLVSNYHRSQS
        +A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ ++KK K P  +TPR R   + +   PS        L    +N  K       D   +  ++ RS S
Subjt:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLP--STPRGR---AMEAPLPSAFPSACQSL----ENPRKDYSDIGRDEKLLVSNYHRSQS

Query:  SLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISSIYSKLAE-PSLSNYGLSSPTSSL
              +A   + ++  +S  +  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE M+KC+S IY KLAE PS+ + GLSSP SSL
Subjt:  SLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISSIYSKLAE-PSLSNYGLSSPTSSL

Query:  SSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIFRHQHNKSFPWRFSVIVQLLISR
        SS SAFSP +Q    SPGF N+SSFDVRLDN F VEG K+FSGPYS+++E+  I  D +K   V++LL+NF+S                       LISR
Subjt:  SSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIFRHQHNKSFPWRFSVIVQLLISR

Query:  LEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQWLRLLLPSRTKFKTGDERQAYII
        LEEVD RKLK+EEKLAFWIN+HN+LVMH +LAYG+PQNNVKR  LLLK+AYNIGGHTIS + IQS ILGC+M  P QWLRLL  SR KFK GDER AY I
Subjt:  LEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQWLRLLLPSRTKFKTGDERQAYII

Query:  DRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSIK--HSKLGNPR
        D PEPLLHFAL SGSHSDPAVRVYTPKR+ QELE++K+EYIR    + K Q+ILLPK++E+FAKDSGLC +GL EM+ +S+PES RK +K   S    PR
Subjt:  DRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSIK--HSKLGNPR

Query:  KNVEWIPPSYTFRYLISKE
        K ++WIP S+TFRYLI +E
Subjt:  KNVEWIPPSYTFRYLISKE

AT5G66600.3 Protein of unknown function, DUF5476.7e-16753.17Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLL-ESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTL
        ML + +   H RS+           S  +KK +  D+  N   E+S  +K+DM +  E    K++        SLKQEI  LE RLQDQFK+R ALEK L
Subjt:  MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLL-ESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTL

Query:  GH---GVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLP--STPRGR---AMEAPLPSAFPSACQSL--
        G+     +   E++ I+MPK A +LI+++A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ ++KK K P  +TPR R   + +   PS        L  
Subjt:  GH---GVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLP--STPRGR---AMEAPLPSAFPSACQSL--

Query:  --ENPRKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMI
          +N  K       D   +  ++ RS S      +A   + ++  +S  +  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE M+
Subjt:  --ENPRKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMI

Query:  KCISSIYSKLAE-PSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFR
        KC+S IY KLAE PS+ + GLSSP SSLSS SAFSP +Q    SPGF N+SSFDVRLDN F VEG K+FSGPYS+++E+  I  D +K   V++LL+NF+
Subjt:  KCISSIYSKLAE-PSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFR

Query:  SAYVIFRHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRM
        S                       LISRLEEVD RKLK+EEKLAFWIN+HN+LVMH +LAYG+PQNNVKR  LLLK+AYNIGGHTIS + IQS ILGC+M
Subjt:  SAYVIFRHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRM

Query:  PSPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSG
          P QWLRLL  SR KFK GDER AY ID PEPLLHFAL SGSHSDPAVRVYTPKR+ QELE++K+EYIR    + K Q+ILLPK++E+FAKDSGLC +G
Subjt:  PSPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSG

Query:  LMEMILKSLPESLRKSIK--HSKLGNPRKNVEWIPPSYTFRYLISKE
        L EM+ +S+PES RK +K   S    PRK ++WIP S+TFRYLI +E
Subjt:  LMEMILKSLPESLRKSIK--HSKLGNPRKNVEWIPPSYTFRYLISKE

AT5G66600.4 Protein of unknown function, DUF5472.3e-16752.83Show/hide
Query:  MLGVDVKTGHSRSRRILLSLLYA------YGSSLDKKAIRDDRLDNLL-ESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRS
        ML + +   H RS+R  +  ++         S  +KK +  D+  N   E+S  +K+DM +  E    K++        SLKQEI  LE RLQDQFK+R 
Subjt:  MLGVDVKTGHSRSRRILLSLLYA------YGSSLDKKAIRDDRLDNLL-ESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRS

Query:  ALEKTLGH---GVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLP--STPRGR---AMEAPLPSAFPSA
        ALEK LG+     +   E++ I+MPK A +LI+++A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ ++KK K P  +TPR R   + +   PS     
Subjt:  ALEKTLGH---GVFPCNESDKISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLP--STPRGR---AMEAPLPSAFPSA

Query:  CQSL----ENPRKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNR
           L    +N  K       D   +  ++ RS S      +A   + ++  +S  +  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+
Subjt:  CQSL----ENPRKDYSDIGRDEKLLVSNYHRSQSSLTTVNAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNR

Query:  LSEDMIKCISSIYSKLAE-PSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKN
        LSE M+KC+S IY KLAE PS+ + GLSSP SSLSS SAFSP +Q    SPGF N+SSFDVRLDN F VEG K+FSGPYS+++E+  I  D +K   V++
Subjt:  LSEDMIKCISSIYSKLAE-PSLSNYGLSSPTSSLSSVSAFSPGEQSAMCSPGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKN

Query:  LLENFRSAYVIFRHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSC
        LL+NF+S                       LISRLEEVD RKLK+EEKLAFWIN+HN+LVMH +LAYG+PQNNVKR  LLLK+AYNIGGHTIS + IQS 
Subjt:  LLENFRSAYVIFRHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSC

Query:  ILGCRMPSPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDS
        ILGC+M  P QWLRLL  SR KFK GDER AY ID PEPLLHFAL SGSHSDPAVRVYTPKR+ QELE++K+EYIR    + K Q+ILLPK++E+FAKDS
Subjt:  ILGCRMPSPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVGKGQKILLPKIIESFAKDS

Query:  GLCSSGLMEMILKSLPESLRKSIK--HSKLGNPRKNVEWIPPSYTFRYLISKE
        GLC +GL EM+ +S+PES RK +K   S    PRK ++WIP S+TFRYLI +E
Subjt:  GLCSSGLMEMILKSLPESLRKSIK--HSKLGNPRKNVEWIPPSYTFRYLISKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGGAGTGGATGTAAAAACAGGACACAGCCGCTCCAGAAGAATTCTTTTGTCTTTGCTCTACGCATATGGCAGCTCGCTCGACAAGAAAGCGATTCGAGATGATCG
ATTAGATAATTTGCTCGAATCTTCCAACACTGTAAAGATGGATATGGATCAGGTGAAAGAAGTTGAGAGTAAGAAAAACTCATCACCGAAGATTGAAGTGCATAATTCTT
TGAAGCAGGAGATTATTCAACTTGAGAAGCGATTGCAAGACCAGTTCAAGCTCCGCTCTGCATTAGAAAAAACATTGGGTCATGGGGTCTTTCCATGTAATGAATCAGAC
AAAATCTCAATGCCAAAGTCAGCCGTGGAATTAATCAGGGAGATTGCAACTTTAGAACTAGAAGTTGTGCATTTAGAACAATATCTTCTTTCTTTGTACCGAAAAGCATT
TGATGGACAATCATCCTCTGTATCCCCATCTGCTAAGGACAAGAAGTCAAAATTGCCTTCAACCCCTAGAGGCAGGGCAATGGAAGCTCCCCTACCTTCAGCATTCCCTT
CTGCTTGTCAGTCACTTGAAAATCCAAGGAAGGACTATAGTGATATTGGAAGAGATGAAAAACTATTGGTTTCCAATTATCATCGTTCTCAATCGTCACTGACGACAGTA
AATGCAGCTTCTTTTGATAAAGTTTCTACTTCAGTAGAATCATTGGATAGAACTCTGCGCGCATGTCATTCCCAACCGGTGTCCATGATGGAGTATGCACAAAATGTTTC
ATCCAATATAATCAGTCTCGCAGAGCATCTTGGAACTCGCATTTCTGACCACGTACCCGAGACTCCTAACCGGCTTTCAGAAGACATGATCAAGTGCATATCATCTATAT
ATAGCAAACTTGCAGAACCATCTTTGTCGAATTATGGACTTTCATCTCCAACATCTTCTTTGTCATCGGTTAGTGCATTTTCTCCTGGAGAGCAGAGTGCTATGTGCAGT
CCAGGGTTCAGAAACAACTCATCCTTCGACGTTCGATTGGACAATCCTTTTCTCGTAGAAGGTCTTAAAGAATTTAGTGGGCCATACAGTACCATGATTGAGATTTCATG
GATTTGTGGAGACCCTCAGAAGCTATGTCATGTCAAGAATTTGCTCGAAAATTTCAGGTCAGCCTATGTAATATTTAGGCATCAGCATAACAAATCTTTTCCATGGCGTT
TTTCTGTTATCGTACAGTTGCTTATCTCTCGTTTGGAGGAAGTCGATTTGAGGAAACTTAAGTACGAGGAAAAGTTGGCTTTCTGGATAAATATTCACAATTCGCTGGTG
ATGCATACGTATTTGGCTTATGGGGTACCCCAAAACAATGTCAAGAGAGCCTTTTTACTCTTGAAGTCTGCATATAATATAGGAGGCCATACAATTAGTGTAGACACAAT
TCAAAGTTGTATTCTTGGCTGTCGGATGCCTAGTCCTCGGCAGTGGCTTCGGCTCTTGCTTCCTTCAAGGACAAAATTCAAGACCGGAGATGAACGACAAGCATATATAA
TAGACCGTCCAGAACCCCTTTTACATTTCGCACTCTGTTCTGGGAGCCACTCCGATCCTGCGGTTCGTGTTTATACGCCTAAAAGAGTGTTTCAAGAACTGGAATCTGCA
AAGGACGAGTACATTCGAGCAACCTTCGGTGTAGGAAAGGGTCAAAAGATTCTTCTCCCAAAAATCATTGAATCATTTGCAAAGGACTCTGGTCTGTGTTCATCGGGCCT
AATGGAGATGATCCTCAAAAGCTTGCCTGAATCTCTGAGAAAGAGTATTAAACACTCGAAGCTCGGGAATCCTCGGAAGAATGTCGAGTGGATTCCTCCAAGTTACACTT
TCAGGTATCTGATTTCTAAGGAGCTGATGAACTAA
mRNA sequenceShow/hide mRNA sequence
GAGAGAGCCAAGTGTTTTGTAATCAAAATTTTAAAAGTTGTTTATATGGTGTGCTCTTTGATTTTCAAGCACCAATTATCATAGAAGTTGTTAAGTTGAATCATCCCGTA
CCTTTTCACTTCTGAGATTGACATTCATTTTTATGCAATATATATATGTAGAGAAAATGCAGAACCCATTACAGGTTGCCTGAAACCAGAGATATTTGCCCGGCGACTCT
CATTATTGAAGAAGAACATCGATTTCTCAACCCATTTCTGAGTTCTTGCACTAATCACCATACACATTCTAACATTTAATACTGTTCGACTGATTTTTCCCACTCGAAAC
AAACCCCCCATTTGAAATCTCTCTTGGAATCTCTAATTTGGCCACTTTCCCACTTCTCAATCGCTTCTCCAGTTGGCGATTTGCGCCGATTCCCGACTCTAATTGGACTT
TTCTTTCTTCATTCAAACCCTTCGACCTGTCCACTTCTCACCTCGCGAAAAAAGATGACCAAATTTTCTAAAAACACCCATTTCAGAAACTCCACCTTTTCTCTGAGCTG
ATGGGACCCCTTTTCTATGGAAGCTTGTTACTAGAAAAGTTTCCTTTTCTATGACCGTTTTTTCTTTGTGGGTCTGTAGCAGCCTTTCCGTAGCCTCTGTGCTGAGCCGC
AGATTGATCACTTTCTGATTTCTTCCATTTTAGCTCCCTTCATCTCTCTCCCTGTTTCATTACTTCTATTCTCTTGTTTGGGAGTTGATTCTTTTGTTTTTCGGTGTATT
TATAATTTTGAGTTGGTTTCACGGTTTTTGGATTCTGGTTAGAAGTGGGGTTAGCAAAAAGAAAAAAATGCTGGGAGTGGATGTAAAAACAGGACACAGCCGCTCCAGAA
GAATTCTTTTGTCTTTGCTCTACGCATATGGCAGCTCGCTCGACAAGAAAGCGATTCGAGATGATCGATTAGATAATTTGCTCGAATCTTCCAACACTGTAAAGATGGAT
ATGGATCAGGTGAAAGAAGTTGAGAGTAAGAAAAACTCATCACCGAAGATTGAAGTGCATAATTCTTTGAAGCAGGAGATTATTCAACTTGAGAAGCGATTGCAAGACCA
GTTCAAGCTCCGCTCTGCATTAGAAAAAACATTGGGTCATGGGGTCTTTCCATGTAATGAATCAGACAAAATCTCAATGCCAAAGTCAGCCGTGGAATTAATCAGGGAGA
TTGCAACTTTAGAACTAGAAGTTGTGCATTTAGAACAATATCTTCTTTCTTTGTACCGAAAAGCATTTGATGGACAATCATCCTCTGTATCCCCATCTGCTAAGGACAAG
AAGTCAAAATTGCCTTCAACCCCTAGAGGCAGGGCAATGGAAGCTCCCCTACCTTCAGCATTCCCTTCTGCTTGTCAGTCACTTGAAAATCCAAGGAAGGACTATAGTGA
TATTGGAAGAGATGAAAAACTATTGGTTTCCAATTATCATCGTTCTCAATCGTCACTGACGACAGTAAATGCAGCTTCTTTTGATAAAGTTTCTACTTCAGTAGAATCAT
TGGATAGAACTCTGCGCGCATGTCATTCCCAACCGGTGTCCATGATGGAGTATGCACAAAATGTTTCATCCAATATAATCAGTCTCGCAGAGCATCTTGGAACTCGCATT
TCTGACCACGTACCCGAGACTCCTAACCGGCTTTCAGAAGACATGATCAAGTGCATATCATCTATATATAGCAAACTTGCAGAACCATCTTTGTCGAATTATGGACTTTC
ATCTCCAACATCTTCTTTGTCATCGGTTAGTGCATTTTCTCCTGGAGAGCAGAGTGCTATGTGCAGTCCAGGGTTCAGAAACAACTCATCCTTCGACGTTCGATTGGACA
ATCCTTTTCTCGTAGAAGGTCTTAAAGAATTTAGTGGGCCATACAGTACCATGATTGAGATTTCATGGATTTGTGGAGACCCTCAGAAGCTATGTCATGTCAAGAATTTG
CTCGAAAATTTCAGGTCAGCCTATGTAATATTTAGGCATCAGCATAACAAATCTTTTCCATGGCGTTTTTCTGTTATCGTACAGTTGCTTATCTCTCGTTTGGAGGAAGT
CGATTTGAGGAAACTTAAGTACGAGGAAAAGTTGGCTTTCTGGATAAATATTCACAATTCGCTGGTGATGCATACGTATTTGGCTTATGGGGTACCCCAAAACAATGTCA
AGAGAGCCTTTTTACTCTTGAAGTCTGCATATAATATAGGAGGCCATACAATTAGTGTAGACACAATTCAAAGTTGTATTCTTGGCTGTCGGATGCCTAGTCCTCGGCAG
TGGCTTCGGCTCTTGCTTCCTTCAAGGACAAAATTCAAGACCGGAGATGAACGACAAGCATATATAATAGACCGTCCAGAACCCCTTTTACATTTCGCACTCTGTTCTGG
GAGCCACTCCGATCCTGCGGTTCGTGTTTATACGCCTAAAAGAGTGTTTCAAGAACTGGAATCTGCAAAGGACGAGTACATTCGAGCAACCTTCGGTGTAGGAAAGGGTC
AAAAGATTCTTCTCCCAAAAATCATTGAATCATTTGCAAAGGACTCTGGTCTGTGTTCATCGGGCCTAATGGAGATGATCCTCAAAAGCTTGCCTGAATCTCTGAGAAAG
AGTATTAAACACTCGAAGCTCGGGAATCCTCGGAAGAATGTCGAGTGGATTCCTCCAAGTTACACTTTCAGGTATCTGATTTCTAAGGAGCTGATGAACTAATTTGCTTG
CTTGCATTTTATTTCTTCTTTTTTGGAATCAACTCAGATGCTAAAATTTTTGTGCATTGAATTTGATTTCTCATTTGTACATAGAAGAAACCGTAAGCAAAGGCATCTGA
GAATACAAAAAAAATGGAACTGATGTATTTTGTAGTTGAAATGTTTATAGTATCTTCCCTTCAATTTGTTGAGCTTGAAA
Protein sequenceShow/hide protein sequence
MLGVDVKTGHSRSRRILLSLLYAYGSSLDKKAIRDDRLDNLLESSNTVKMDMDQVKEVESKKNSSPKIEVHNSLKQEIIQLEKRLQDQFKLRSALEKTLGHGVFPCNESD
KISMPKSAVELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDKKSKLPSTPRGRAMEAPLPSAFPSACQSLENPRKDYSDIGRDEKLLVSNYHRSQSSLTTV
NAASFDKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMIKCISSIYSKLAEPSLSNYGLSSPTSSLSSVSAFSPGEQSAMCS
PGFRNNSSFDVRLDNPFLVEGLKEFSGPYSTMIEISWICGDPQKLCHVKNLLENFRSAYVIFRHQHNKSFPWRFSVIVQLLISRLEEVDLRKLKYEEKLAFWINIHNSLV
MHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIQSCILGCRMPSPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESA
KDEYIRATFGVGKGQKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSIKHSKLGNPRKNVEWIPPSYTFRYLISKELMN