CuGenDBv2

CmoCh11G009840 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh11G009840
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: Cmo_Chr11:5220268..5227736
RNA-Seq Expression: CmoCh11G009840
Synteny: CmoCh11G009840
Gene Ontology terms: NA
InterPro domains: IPR004314 - Neprosin
                  IPR025521 - Neprosin activation peptide
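
The genome location above is written as a sequence ID followed by start and end coordinates separated by "..". A minimal sketch of reading that string, assuming (the page does not state this) that the coordinates are 1-based and inclusive; the function names parse_location and span_length are hypothetical and are not part of any CuGenDB tool:

```python
import re


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("Cmo_Chr11:5220268..5227736")
print(seqid, start, end, span_length(start, end))
```

Under that assumption, the locus spans 7,469 bp on Cmo_Chr11.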


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAG6588301.1 hypothetical protein SDJN03_16866, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.8e-218 | identity: 99.2%
Query:  MALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKKNERRAGSGA
        MALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKKNERRAGSGA
Subjt:  MALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKKNERRAGSGA

Query:  GGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQI
         GPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLD+QMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQI
Subjt:  GGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQI

Query:  WILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGD
        WILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGD
Subjt:  WILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGD

Query:  NTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE
        NTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDV+
Subjt:  NTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE

XP_022926346.1 uncharacterized protein LOC111433521 [Cucurbita moschata] | e-value: 1.5e-225 | identity: 99.74%
Query:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK
        MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK
Subjt:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK

Query:  KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQ
        KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQ
Subjt:  KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQ

Query:  MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK
        MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK
Subjt:  MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK

Query:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE
        LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDV+
Subjt:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE

XP_022974267.1 uncharacterized protein LOC111472904 [Cucurbita maxima] | e-value: 8.9e-218 | identity: 95.37%
Query:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK
        MGRT+GVSVSM+LFCVLL+VLQSFSLVCGLTYSY+HVSSLRFDRIQTHLDSINKPPLLTIQSPDGD IDCVHKRKQPALDHPLLK+HKIQRAPTGWPKMK
Subjt:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK

Query:  KN-----ERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVW
        KN     ER+AGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLD+QMDAPDVVSGNGHEHAIAYTRS GEMYGAKATINVW
Subjt:  KN-----ERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVW

Query:  EPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILI
        EPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPE YGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILI
Subjt:  EPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILI

Query:  WKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE
        WKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVD+DNNL DV+
Subjt:  WKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE

XP_023530832.1 uncharacterized protein LOC111793262 [Cucurbita pepo subsp. pepo] | e-value: 4.3e-220 | identity: 97.66%
Query:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK
        MGRTRGVSV+MALFCVLLVVLQSFSLVCGLTYSY+HVSSLRF RIQ HLDSINKPPLLTIQSPDGD IDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK
Subjt:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK

Query:  KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQ
        KNERRAGSGAGGPFQTWH NATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLD+QMDAPDVVSGNGHEHAIAYTRS GEMYGAKATINVWEPSIQ
Subjt:  KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQ

Query:  MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK
        MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK
Subjt:  MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK

Query:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE
        LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDV+
Subjt:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida] | e-value: 1.9e-191 | identity: 80.87%
Query:  MGRTRGVSVSMA-----------------LFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPL
        MG  RGVS+S++                 LF +L+++LQ F+LVCGL Y+YK VSSLR +RIQ HLDSINKPPLLTIQSPDGD IDCVHKRKQPALDHPL
Subjt:  MGRTRGVSVSMA-----------------LFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPL

Query:  LKNHKIQRAPTGWPKMKK-NERR-------AGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHA
        LKNHKIQR PT WPK KK NE R       AGSGAGG  QTW  N TRCPKG+IPVRRSTV DVLR+KSLFDFGKKKRPILLD+++DAPDVVSGNGHEHA
Subjt:  LKNHKIQRAPTGWPKMKK-NERR-------AGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHA

Query:  IAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIG
        IAYT S  EMYGAKATINVW+PSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN++IAIG
Subjt:  IAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIG

Query:  AAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVD
        AAISP+SS +G+QYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSR+NGQHTSTQMGSGHF +DGFGKASYFRNLEIVD
Subjt:  AAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVD

Query:  SDNNLSDVEILNV
        SDN+LS V+ +++
Subjt:  SDNNLSDVEILNV
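
The % identity column can be roughly cross-checked against the match row printed between each Query and Subjt row, where a letter marks an identical residue, "+" a conservative substitution, and a blank a mismatch or gap. A minimal sketch, assuming the usual BLAST convention that identity is the number of identical columns divided by the alignment length, gap columns included; the function names midline_stats and percent_identity are hypothetical and this is not the database's own code:

```python
def midline_stats(query_row, match_row):
    """Return (identical, positives, width) for one alignment block."""
    width = len(query_row)
    # Scraped match rows can lose trailing blanks, so pad to the query width.
    match_row = match_row.ljust(width)[:width]
    identical = sum(1 for ch in match_row if ch.isalpha())
    positives = identical + match_row.count("+")
    return identical, positives, width


def percent_identity(blocks):
    """Pool identical columns over all (query_row, match_row) blocks of a hit."""
    identical = total = 0
    for query_row, match_row in blocks:
        ident, _, width = midline_stats(query_row, match_row)
        identical += ident
        total += width
    if total == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / total


# Last block of the Benincasa hispida alignment, leading label stripped.
blocks = [("SDNNLSDVEILNV", "SDN+LS V+ +++")]
print(midline_stats(*blocks[0]))
```

For that final block alone this gives 6 identical and 11 positive columns out of 13; pooling every block of a hit should approximate the reported percentage.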

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0LXY9 Uncharacterized protein | e-value: 4.6e-188 | identity: 80.94%
Query:  RTRGVSVSMA---------LFCVLLVVLQSFSLVCGLTYSY-KHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRA
        +T GVS S++         +FC   V+ Q F+LVCGL Y+Y KH+SSLR DRIQ HLDSINKPPLLTIQSPDGD IDCVHKRKQPALDHPLLKNHKIQR 
Subjt:  RTRGVSVSMA---------LFCVLLVVLQSFSLVCGLTYSY-KHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRA

Query:  PTGWPKMK--------KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGE
        PT WPK K         +ERRAGSGA   FQTW  N TRCPKGT+PVRR+TVKDVLR+KSLFDFGKKKRPILLD+++DAPDVVSGNGHEHAIAYT S  E
Subjt:  PTGWPKMK--------KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGE

Query:  MYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSL
        MYGAKATINVW+PSI+MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNS+IAIGAAISP+SS+
Subjt:  MYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSL

Query:  TGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE
         G+QYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHF +DGF KASYFRNLEIVDSDN+LS V+
Subjt:  TGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE

Query:  ILNV
         +++
Subjt:  ILNV

A0A1S3B8R5 uncharacterized protein LOC103487273 | e-value: 1.2e-188 | identity: 81.23%
Query:  MGRTRGVSVSMA---------LFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQR
        MG   GVS S++         +FC   VV Q F+LVCGL Y+Y+ +SSLR DRIQ HLDSINKPPLLTIQSPDGD IDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGRTRGVSVSMA---------LFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQR

Query:  APTGWPKMK--------KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLG
         PT WPK K          ERRAGSGA   FQTW  N TRCPKGTIPVRR+TVKDVLR+KSLFDFGKK+RPILLD+++DAPDVVSGNGHEHAIAYT S  
Subjt:  APTGWPKMK--------KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLG

Query:  EMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSS
        EMYGAKATINVW+PSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNS+IAIGAAISP+SS
Subjt:  EMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSS

Query:  LTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDV
        ++G+QYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NGQHTSTQMGSGHF +DGF KASYFRNLEIVDSDN+LS V
Subjt:  LTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDV

Query:  EILNV
        + +++
Subjt:  EILNV

A0A6J1EEL6 uncharacterized protein LOC111433521 | e-value: 7.3e-226 | identity: 99.74%
Query:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK
        MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK
Subjt:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK

Query:  KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQ
        KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQ
Subjt:  KNERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQ

Query:  MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK
        MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK
Subjt:  MVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPK

Query:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE
        LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDV+
Subjt:  LGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE

A0A6J1I9T9 uncharacterized protein LOC111472904 | e-value: 4.3e-218 | identity: 95.37%
Query:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK
        MGRT+GVSVSM+LFCVLL+VLQSFSLVCGLTYSY+HVSSLRFDRIQTHLDSINKPPLLTIQSPDGD IDCVHKRKQPALDHPLLK+HKIQRAPTGWPKMK
Subjt:  MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMK

Query:  KN-----ERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVW
        KN     ER+AGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLD+QMDAPDVVSGNGHEHAIAYTRS GEMYGAKATINVW
Subjt:  KN-----ERRAGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVW

Query:  EPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILI
        EPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPE YGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILI
Subjt:  EPSIQMVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILI

Query:  WKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE
        WKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVD+DNNL DV+
Subjt:  WKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVE

A0A6J1JIQ1 uncharacterized protein LOC111485460 | e-value: 2.1e-188 | identity: 83.55%
Query:  FCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKK--------NERR
        F + L+++Q F LVCGL +S K VSSLR DRIQ HLD+INKPPLLTIQSPDGD IDCVHKRKQPALDHPLLKNHKIQR PT WPK KK         + R
Subjt:  FCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKK--------NERR

Query:  AGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEF
        AGSGAGGPFQTW  N TRCPKG+IPVRRSTV DVLRAKS+FD+GKKKRPILLD+Q+DAPDVVSGNGHEHAIAYT S  EMYGAKATINVW+PSIQ+VNEF
Subjt:  AGSGAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEF

Query:  SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWW
        SLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN++IAIGAAISPVSS +G+QYDITILIWKDPKLG+WW
Subjt:  SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWW

Query:  MGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVEILNV
        MGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSR+NGQHTSTQMGSGHF +DGFGKASYFRNLEIVDSDN+LSDV+ +++
Subjt:  MGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVEILNV

SwissProt top hits (columns: hit, e-value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G55360.1 Protein of Unknown Function (DUF239) | e-value: 4.2e-117 | identity: 57.26%
Query:  FCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPK-----MKKNERRAGS
        F V L +   FS    L+Y+ +   S +   ++ HL+ +NKP + +IQS DGD IDCV   KQPA DHP LK+HKIQ  P   P+      K +  ++  
Subjt:  FCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPK-----MKKNERRAGS

Query:  GAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLS
          G   Q WH    +C +GTIP+RR+   DVLRA S+  +GKKKR  +   +   PD+++ +GH+HAIAY     + YGAKATINVWEP IQ  NEFSLS
Subjt:  GAGGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLS

Query:  QIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGF
        QIW+L GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISPVS    +QYDI+ILIWKDPK G+WWM F
Subjt:  QIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGF

Query:  GDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNL
        G+  ++GYWP+ LF++L + A+M+EWGGEVVNS+S+GQHTSTQMGSG F  +GF KASYFRN+++VD  NNL
Subjt:  GDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNL

AT3G13510.1 Protein of Unknown Function (DUF239) | e-value: 2.9e-118 | identity: 56.61%
Query:  VSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKKNERRA
        V  +   F  L V+L   SL C    +  + SS +   ++ HL+ +NKPP+ TIQSPDGD IDC+   KQPA DHP LK+HKIQ  P+  P+   ++ + 
Subjt:  VSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKKNERRA

Query:  GSGAGGPF----QTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMV
         +   G      Q WH    +C +GTIP+RR+   DVLRA S+  +GKKK   +   +   PD+++ NGH+HAIAY     + YGAKAT+NVWEP IQ  
Subjt:  GSGAGGPF----QTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMV

Query:  NEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLG
        NEFSLSQIW+L GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISPVS    +QYDI+ILIWKDPK G
Subjt:  NEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLG

Query:  NWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNL
        +WWM FG+  ++GYWP+ LF++L + A+M+EWGGEVVNS+S G HT TQMGSGHF  +GF KASYFRN+++VD  NNL
Subjt:  NWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNL

AT5G18460.1 Protein of Unknown Function (DUF239) | e-value: 4.3e-162 | identity: 73.57%
Query:  TYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKKNE---RRAGSGAGGPFQTWHGNATRCPKG
        T  Y+ VSSLR  RIQ HL+ INK P+ TIQSPDGD IDCV KRKQPALDHPLLK+HKIQ+AP   PKMK  +   + A +   G +Q WH N TRCPKG
Subjt:  TYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKKNE---RRAGSGAGGPFQTWHGNATRCPKG

Query:  TIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGSDLNSIEA
        T+P+RR+T+ DVLRAKSLFDFGKK+R I LDQ+ + PD +  NGHEHAIAYT S  E+YGAKATINVW+P I+ VNEFSLSQIWILSGSF G DLNSIEA
Subjt:  TIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGSDLNSIEA

Query:  GWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLAD
        GWQVSPELYGD+RPRLFTYWTSD+YQATGCYNLLC+GF+QTN++IAIGAAISP+S+  GNQ+DITILIWKDPK+GNWWMG GD+TLVGYWPAELFTHLAD
Subjt:  GWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLAD

Query:  HATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNL---SDVEILNVNDQ
        HAT VEWGGEVVN+R++G+HT+TQMGSGHF ++GFGKASYFRNLE+VDSDN+L    DV+IL  N +
Subjt:  HATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNL---SDVEILNVNDQ

AT5G56530.1 Protein of Unknown Function (DUF239) | e-value: 7.0e-120 | identity: 58.58%
Query:  SLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKM----KKNERRAGSGAGGPFQTWHGN
        SL C    S   VS   F+ +  HL+ +NKP + +IQSPDGD IDCVH  KQPA DHP LK+HKIQ  P+  P+      K   +         Q WH N
Subjt:  SLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKM----KKNERRAGSGAGGPFQTWHGN

Query:  ATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGS
           C +GTIPVRR+  +DVLRA S+  +GKKK   +   +   PD+++ +GH+HAIAY    G+ YGAKATINVWEP +Q  NEFSLSQ+WIL GSF G 
Subjt:  ATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGS

Query:  DLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAE
        DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISPVS     QYDI+I IWKDPK G+WWM FGD  ++GYWP+ 
Subjt:  DLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAE

Query:  LFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVEILN
        LF++LAD A++VEWGGEVVN   +G HT+TQMGSG F ++GF KASYFRN+++VDS NNL + + LN
Subjt:  LFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVEILN

AT5G56530.2 Protein of Unknown Function (DUF239) | e-value: 7.0e-120 | identity: 58.58%
Query:  SLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKM----KKNERRAGSGAGGPFQTWHGN
        SL C    S   VS   F+ +  HL+ +NKP + +IQSPDGD IDCVH  KQPA DHP LK+HKIQ  P+  P+      K   +         Q WH N
Subjt:  SLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKM----KKNERRAGSGAGGPFQTWHGN

Query:  ATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGS
           C +GTIPVRR+  +DVLRA S+  +GKKK   +   +   PD+++ +GH+HAIAY    G+ YGAKATINVWEP +Q  NEFSLSQ+WIL GSF G 
Subjt:  ATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGS

Query:  DLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAE
        DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISPVS     QYDI+I IWKDPK G+WWM FGD  ++GYWP+ 
Subjt:  DLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAE

Query:  LFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVEILN
        LF++LAD A++VEWGGEVVN   +G HT+TQMGSG F ++GF KASYFRN+++VDS NNL + + LN
Subjt:  LFTHLADHATMVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVEILN


Sequences
CDS sequence
ATGGGAAGAACAAGAGGGGTTTCAGTTTCAATGGCATTGTTCTGTGTTCTTCTTGTTGTTCTTCAGAGCTTCTCTTTGGTGTGTGGCCTCACTTATAGCTATAAACATGT
TAGTAGCTTGAGATTTGACAGGATTCAAACCCATTTGGACTCCATTAACAAGCCTCCTCTTCTCACCATTCAGAGCCCAGATGGTGATACTATAGATTGTGTTCATAAAA
GAAAACAGCCAGCTCTGGATCATCCCCTCTTGAAGAACCACAAAATTCAGAGAGCACCAACAGGGTGGCCGAAAATGAAGAAGAATGAAAGGAGGGCAGGATCAGGTGCG
GGAGGTCCATTTCAAACTTGGCATGGGAACGCGACACGGTGTCCAAAGGGAACTATTCCGGTGCGGCGCAGTACAGTGAAGGATGTGTTAAGAGCCAAGTCTTTGTTTGA
CTTTGGGAAGAAGAAACGACCGATCCTGCTTGATCAACAAATGGACGCTCCTGATGTGGTCAGTGGGAATGGTCACGAGCATGCGATCGCATACACTAGATCGCTCGGGG
AGATGTATGGAGCGAAGGCGACAATAAACGTGTGGGAACCGTCCATCCAAATGGTCAACGAGTTCAGCCTCTCTCAGATATGGATCCTCTCTGGATCATTCGACGGCTCG
GATCTCAACAGTATAGAGGCTGGTTGGCAGGTTAGTCCGGAGCTTTATGGTGACAGCAGACCGAGATTGTTCACATATTGGACGAGTGATGCATATCAAGCAACTGGTTG
CTATAACCTTTTATGTGCTGGATTTGTACAAACTAACAGCAGAATCGCCATCGGAGCTGCTATTTCTCCGGTCTCTTCCCTCACCGGCAACCAATATGACATTACCATTC
TCATTTGGAAGGATCCAAAATTGGGAAACTGGTGGATGGGATTCGGTGACAACACACTGGTGGGTTACTGGCCGGCGGAGCTATTCACTCACCTGGCCGACCATGCCACC
ATGGTGGAGTGGGGCGGCGAGGTCGTCAACTCAAGGTCCAATGGCCAGCACACTTCCACCCAAATGGGCTCCGGCCACTTCGCCAACGATGGCTTCGGCAAAGCTAGCTA
CTTTCGGAACCTCGAGATCGTCGACTCTGATAATAACTTAAGTGACGTAGAAATCCTGAATGTCAATGATCAACAACATCGCTACTGCAACCATGACGACCCCATGCAAG
AAATGCCACAAGAGAAGTTCGTTCTTATTTCTAAATTCTGGGCTTGGTGA
mRNA sequence
TTCTATTTATCTGAATCCAAAGCAGGCAATCATGATCTTGATGCTGAGACCTCTTTAATGGATCTTCTTCTTCTTCTTCTTCTTCTTCGTTTGCAAACTAGCCCTCAGAG
AATTTGATATTTCGTTTTTCGCCTGTTTTTGGGGGAGGGATTATGGGAAGAACAAGAGGGGTTTCAGTTTCAATGGCATTGTTCTGTGTTCTTCTTGTTGTTCTTCAGAG
CTTCTCTTTGGTGTGTGGCCTCACTTATAGCTATAAACATGTTAGTAGCTTGAGATTTGACAGGATTCAAACCCATTTGGACTCCATTAACAAGCCTCCTCTTCTCACCA
TTCAGAGCCCAGATGGTGATACTATAGATTGTGTTCATAAAAGAAAACAGCCAGCTCTGGATCATCCCCTCTTGAAGAACCACAAAATTCAGAGAGCACCAACAGGGTGG
CCGAAAATGAAGAAGAATGAAAGGAGGGCAGGATCAGGTGCGGGAGGTCCATTTCAAACTTGGCATGGGAACGCGACACGGTGTCCAAAGGGAACTATTCCGGTGCGGCG
CAGTACAGTGAAGGATGTGTTAAGAGCCAAGTCTTTGTTTGACTTTGGGAAGAAGAAACGACCGATCCTGCTTGATCAACAAATGGACGCTCCTGATGTGGTCAGTGGGA
ATGGTCACGAGCATGCGATCGCATACACTAGATCGCTCGGGGAGATGTATGGAGCGAAGGCGACAATAAACGTGTGGGAACCGTCCATCCAAATGGTCAACGAGTTCAGC
CTCTCTCAGATATGGATCCTCTCTGGATCATTCGACGGCTCGGATCTCAACAGTATAGAGGCTGGTTGGCAGGTTAGTCCGGAGCTTTATGGTGACAGCAGACCGAGATT
GTTCACATATTGGACGAGTGATGCATATCAAGCAACTGGTTGCTATAACCTTTTATGTGCTGGATTTGTACAAACTAACAGCAGAATCGCCATCGGAGCTGCTATTTCTC
CGGTCTCTTCCCTCACCGGCAACCAATATGACATTACCATTCTCATTTGGAAGGATCCAAAATTGGGAAACTGGTGGATGGGATTCGGTGACAACACACTGGTGGGTTAC
TGGCCGGCGGAGCTATTCACTCACCTGGCCGACCATGCCACCATGGTGGAGTGGGGCGGCGAGGTCGTCAACTCAAGGTCCAATGGCCAGCACACTTCCACCCAAATGGG
CTCCGGCCACTTCGCCAACGATGGCTTCGGCAAAGCTAGCTACTTTCGGAACCTCGAGATCGTCGACTCTGATAATAACTTAAGTGACGTAGAAATCCTGAATGTCAATG
ATCAACAACATCGCTACTGCAACCATGACGACCCCATGCAAGAAATGCCACAAGAGAAGTTCGTTCTTATTTCTAAATTCTGGGCTTGGTGA
Protein sequence
MGRTRGVSVSMALFCVLLVVLQSFSLVCGLTYSYKHVSSLRFDRIQTHLDSINKPPLLTIQSPDGDTIDCVHKRKQPALDHPLLKNHKIQRAPTGWPKMKKNERRAGSGA
GGPFQTWHGNATRCPKGTIPVRRSTVKDVLRAKSLFDFGKKKRPILLDQQMDAPDVVSGNGHEHAIAYTRSLGEMYGAKATINVWEPSIQMVNEFSLSQIWILSGSFDGS
DLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSRIAIGAAISPVSSLTGNQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHAT
MVEWGGEVVNSRSNGQHTSTQMGSGHFANDGFGKASYFRNLEIVDSDNNLSDVEILNVNDQQHRYCNHDDPMQEMPQEKFVLISKFWAW