CuGenDBv2

Sgr020014 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020014
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: AT hook motif-containing protein
Genome location: tig00153446:916121..917578
RNA-Seq Expression: Sgr020014
Synteny: Sgr020014
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022150988.1 uncharacterized protein LOC111019016 isoform X1 [Momordica charantia] (e value: 2.4e-190; % identity: 75.56)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQA+QGIS+DNLVDVPLKRKRGRPRKYPKL+YDE+ LIA N GKKHLE I ISPGSGVNGNQSHP+ PIQ+ AD +LG+VVSGVIEAVFEAGYLLCVRV
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPDVQMIRRNTVPF TGNQTH    RSKN  VPSH+SS AKL F HTP HSN DA KDK+ISSIFA+ITPSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM
        RGNV+PVV QPAK TNGPSVANESF IQTT+VESSKGKEVL+GTF SNE APTNVT+GIESFPFQPQTSQQVLQDD S+EN+SHNQS VVEV DSEAK M
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM

Query:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT
        T+PSTPF+NLVTEVIKRIQAPSLSAE+QTENNK T K+SAKEC DSSEVAANIVDGPLMIEPLKAVQPL  S VSI K+LDDESRTGKMTELLQ      
Subjt:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT

Query:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI
                                                 VLQ+NM+QN EPWA+V NPGLMLK+D  EESKTE+GDE A NQ QI
Subjt:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI

XP_022150997.1 uncharacterized protein LOC111019016 isoform X2 [Momordica charantia] (e value: 1.3e-188; % identity: 74.95)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQA+QGIS+DNLVDVPLKRKRGRPRKYPKL+YDE+ LIA N GKKHLE I ISPGSGVNGNQSHP+ PIQ+ AD +LG+VVSGVIEAVFEAGYLLCVRV
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPDVQMIRRNTVPF TGNQTH    RSKN  VPSH+SS AKL F HTP HSN DA KDK+ISSIFA+ITPSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM
        RGNV+PVV QPAK TNGPSVANESF IQTT+VESSKGKEVL+GTF SNE APTNVT+GIESFPFQPQTSQQVLQDD S+EN+SHNQS VVEV DSEAK M
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM

Query:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT
        T+PSTPF+NLVTEVIKRIQAPSLSAE+QTENNK T K+SAKEC DSSEVAANIVDGPLMIEPLKAVQPL  S VSI K+LDDESRTGKMTEL        
Subjt:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT

Query:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI
                                                  LQ+NM+QN EPWA+V NPGLMLK+D  EESKTE+GDE A NQ QI
Subjt:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI

XP_022932145.1 uncharacterized protein LOC111438469 isoform X1 [Cucurbita moschata] (e value: 1.4e-169; % identity: 69.39)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQA+QGISADNLVD PLKRKRGRPRKYPKLSYDEN LI+ N GKKHLE I +SPGSGVNGNQS P+  IQN +D +LG+VVSGVIEAVFEAGYLLCVRV
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPD+QMIRRN VPFATGNQ+ G NP S N  VPSHESS A L FR++PPHSN DA K+KS+SSI A+I PSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQ-VLQDDVSVENSSHNQSLVVEVRDSEAKL
        RGNVVPVV  PAKLTNGP   +E+F +QT D+ESSKGKEVLIG+F SNE AP +VTVGIESF FQPQTSQQ VLQDDVSVEN+SHN+SLV+EV DSE K 
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQ-VLQDDVSVENSSHNQSLVVEVRDSEAKL

Query:  MTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIV-DGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDT
        M LPSTPF++LVTEVIKRIQAP LSAEMQTENNK T  ISAKE   SSEV AN++ DG LMIEPLKAVQPLH SS  I K+LDDESRTGKMTELLQ    
Subjt:  MTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIV-DGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDT

Query:  LTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGD-EGAGNQKQI
                                                   VLQ+NM+Q  +PW  V +PGLMLK++   ES+ EIGD E AGNQKQ+
Subjt:  LTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGD-EGAGNQKQI

XP_038905587.1 uncharacterized protein LOC120091567 isoform X1 [Benincasa hispida] (e value: 1.7e-175; % identity: 71.05)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQ +QGIS DNLVDVPLKRKRGRPRKYPKL+YDEN  I  N GK+HLE I ISPGSG NG+QSHP+  IQ+  D +LG+VVSGVIEAVFEAGYLLCVR 
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPDVQMIRRN +P ATGNQ  G NPRSKN  +PSHESS  KL F++TPPHSN+DALKD SISSI A+ITPSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM
        RGNVVPVV QPAKLTNGPSV  E+  IQT D+ESSKGKEVL+GTF SNE APT+VTVGIESFPFQPQTSQQVL DDV VENS  NQSLVVEV DS  K M
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM

Query:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT
         LPSTPF++LVTEVIKRIQAPSL+ + QTE+NK    ISAKEC DSSEV ANI DG LMIEPLKAVQPLH SS  I K+LDDESRTGKMTELLQ      
Subjt:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT

Query:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI
                                                 VLQ+NM+Q  EPWA+VQNPGLMLK+DG  ESK EIGDE AGNQKQI
Subjt:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI

XP_038905591.1 uncharacterized protein LOC120091567 isoform X2 [Benincasa hispida] (e value: 9.1e-174; % identity: 70.43)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQ +QGIS DNLVDVPLKRKRGRPRKYPKL+YDEN  I  N GK+HLE I ISPGSG NG+QSHP+  IQ+  D +LG+VVSGVIEAVFEAGYLLCVR 
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPDVQMIRRN +P ATGNQ  G NPRSKN  +PSHESS  KL F++TPPHSN+DALKD SISSI A+ITPSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM
        RGNVVPVV QPAKLTNGPSV  E+  IQT D+ESSKGKEVL+GTF SNE APT+VTVGIESFPFQPQTSQQVL DDV VENS  NQSLVVEV DS  K M
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM

Query:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT
         LPSTPF++LVTEVIKRIQAPSL+ + QTE+NK    ISAKEC DSSEV ANI DG LMIEPLKAVQPLH SS  I K+LDDESRTGKMTEL        
Subjt:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT

Query:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI
                                                  LQ+NM+Q  EPWA+VQNPGLMLK+DG  ESK EIGDE AGNQKQI
Subjt:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DAY0 uncharacterized protein LOC111019016 isoform X1 (e value: 1.2e-190; % identity: 75.56)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQA+QGIS+DNLVDVPLKRKRGRPRKYPKL+YDE+ LIA N GKKHLE I ISPGSGVNGNQSHP+ PIQ+ AD +LG+VVSGVIEAVFEAGYLLCVRV
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPDVQMIRRNTVPF TGNQTH    RSKN  VPSH+SS AKL F HTP HSN DA KDK+ISSIFA+ITPSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM
        RGNV+PVV QPAK TNGPSVANESF IQTT+VESSKGKEVL+GTF SNE APTNVT+GIESFPFQPQTSQQVLQDD S+EN+SHNQS VVEV DSEAK M
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM

Query:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT
        T+PSTPF+NLVTEVIKRIQAPSLSAE+QTENNK T K+SAKEC DSSEVAANIVDGPLMIEPLKAVQPL  S VSI K+LDDESRTGKMTELLQ      
Subjt:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT

Query:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI
                                                 VLQ+NM+QN EPWA+V NPGLMLK+D  EESKTE+GDE A NQ QI
Subjt:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI

A0A6J1DCA8 uncharacterized protein LOC111019016 isoform X2 (e value: 6.3e-189; % identity: 74.95)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQA+QGIS+DNLVDVPLKRKRGRPRKYPKL+YDE+ LIA N GKKHLE I ISPGSGVNGNQSHP+ PIQ+ AD +LG+VVSGVIEAVFEAGYLLCVRV
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPDVQMIRRNTVPF TGNQTH    RSKN  VPSH+SS AKL F HTP HSN DA KDK+ISSIFA+ITPSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM
        RGNV+PVV QPAK TNGPSVANESF IQTT+VESSKGKEVL+GTF SNE APTNVT+GIESFPFQPQTSQQVLQDD S+EN+SHNQS VVEV DSEAK M
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLM

Query:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT
        T+PSTPF+NLVTEVIKRIQAPSLSAE+QTENNK T K+SAKEC DSSEVAANIVDGPLMIEPLKAVQPL  S VSI K+LDDESRTGKMTEL        
Subjt:  TLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLT

Query:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI
                                                  LQ+NM+QN EPWA+V NPGLMLK+D  EESKTE+GDE A NQ QI
Subjt:  RMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI

A0A6J1EVJ4 uncharacterized protein LOC111438469 isoform X1 (e value: 6.6e-170; % identity: 69.39)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQA+QGISADNLVD PLKRKRGRPRKYPKLSYDEN LI+ N GKKHLE I +SPGSGVNGNQS P+  IQN +D +LG+VVSGVIEAVFEAGYLLCVRV
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPD+QMIRRN VPFATGNQ+ G NP S N  VPSHESS A L FR++PPHSN DA K+KS+SSI A+I PSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQ-VLQDDVSVENSSHNQSLVVEVRDSEAKL
        RGNVVPVV  PAKLTNGP   +E+F +QT D+ESSKGKEVLIG+F SNE AP +VTVGIESF FQPQTSQQ VLQDDVSVEN+SHN+SLV+EV DSE K 
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQ-VLQDDVSVENSSHNQSLVVEVRDSEAKL

Query:  MTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIV-DGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDT
        M LPSTPF++LVTEVIKRIQAP LSAEMQTENNK T  ISAKE   SSEV AN++ DG LMIEPLKAVQPLH SS  I K+LDDESRTGKMTELLQ    
Subjt:  MTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIV-DGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDT

Query:  LTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGD-EGAGNQKQI
                                                   VLQ+NM+Q  +PW  V +PGLMLK++   ES+ EIGD E AGNQKQ+
Subjt:  LTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGD-EGAGNQKQI

A0A6J1F1E7 uncharacterized protein LOC111438469 isoform X2 (e value: 4.7e-168; % identity: 68.78)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQA+QGISADNLVD PLKRKRGRPRKYPKLSYDEN LI+ N GKKHLE I +SPGSGVNGNQS P+  IQN +D +LG+VVSGVIEAVFEAGYLLCVRV
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPD+QMIRRN VPFATGNQ+ G NP S N  VPSHESS A L FR++PPHSN DA K+KS+SSI A+I PSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQ-VLQDDVSVENSSHNQSLVVEVRDSEAKL
        RGNVVPVV  PAKLTNGP   +E+F +QT D+ESSKGKEVLIG+F SNE AP +VTVGIESF FQPQTSQQ VLQDDVSVEN+SHN+SLV+EV DSE K 
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQ-VLQDDVSVENSSHNQSLVVEVRDSEAKL

Query:  MTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIV-DGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDT
        M LPSTPF++LVTEVIKRIQAP LSAEMQTENNK T  ISAKE   SSEV AN++ DG LMIEPLKAVQPLH SS  I K+LDDESRTGKMTEL      
Subjt:  MTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIV-DGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDT

Query:  LTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGD-EGAGNQKQI
                                                    LQ+NM+Q  +PW  V +PGLMLK++   ES+ EIGD E AGNQKQ+
Subjt:  LTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGD-EGAGNQKQI

A0A6J1JAQ9 uncharacterized protein LOC111483279 isoform X1 (e value: 7.3e-169; % identity: 69.18)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV
        MSQA+QGISADNLVD PLKRKRGRPRKYPKLSYDEN LI+ N GKKHLE I ISPGSGVNGNQS P+  IQN +D +LG+VVSGVIEAVFEAGYLLCVRV
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRV

Query:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS
        GNSGITLRGVVFKPGHYVPV AENDVAPD+QMIRRN VPFATGNQ+ G NP S N  VPSHESS A L F+++PPHSN DA K+KS+SSI A+ITPSGSS
Subjt:  GNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKN--VPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSS

Query:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQ-VLQDDVSVENSSHNQSLVVEVRDSEAKL
        RGNVVPVV  PAKLTNGP   +E+F +QT D+ESSKGKEVLIG+F SNE AP +VTVGIESF FQPQTSQQ VLQD+VSVEN+SHN+SLV+EV DSE K 
Subjt:  RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQ-VLQDDVSVENSSHNQSLVVEVRDSEAKL

Query:  MTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIV-DGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDT
        M LPSTPF++LVTEVIKRIQAP+LSAEMQTENNK T  ISAKE   SSEV AN++ DG LMIEPLKAVQPLH SS  I K+LDDESRTGKMTELLQ    
Subjt:  MTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENNKATGKISAKECVDSSEVAANIV-DGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDT

Query:  LTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGD-EGAGNQKQI
                                                   VLQ+NM+Q  +PW  V +P LMLK++   ES+ EIGD E AGNQKQ+
Subjt:  LTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKVLQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGD-EGAGNQKQI

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G21895.1 DNA binding (e value: 2.3e-10; % identity: 38.28)
Query:  KRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYV
        KRKRGRPRK      DEN+                +P   +N                ++G+VV+GVIE  F+AGYLL V+V +S   LRG+VF  G   
Subjt:  KRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYV

Query:  PVLAENDVAPDVQMIRRNTVPFATGNQT
        P+  ENDVAP V+M  R  +     NQT
Subjt:  PVLAENDVAPDVQMIRRNTVPFATGNQT

AT5G52890.1 AT hook motif-containing protein (e value: 1.4e-10; % identity: 31.51)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADV-ILGRVVSGVIEAVFEAGYLLCVR
        M Q  QG S+     +  KRKRGRPR+                                  ++S    P+    D  ++GRVVSGV+E  FEAGY L V+
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADV-ILGRVVSGVIEAVFEAGYLLCVR

Query:  VGNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQ
        V ++   L+GVVF P    P+    D+ P  +M  RN +P  +  Q
Subjt:  VGNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQ

AT5G52890.2 AT hook motif-containing protein (e value: 3.4e-09; % identity: 24.69)
Query:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADV-ILGRVVSGVIEAVFEAGYLLCVR
        M Q  QG S+     +  KRKRGRPR+                                  ++S    P+    D  ++GRVVSGV+E  FEAGY L V+
Subjt:  MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADV-ILGRVVSGVIEAVFEAGYLLCVR

Query:  VGNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFAT-------------GNQTH--GYNPRSKNVPSHESSDAKLRFRHTPPHSNQDA-----
        V ++   L+GVVF P    P+    D+ P  +M  RN +P  +             GNQT   G  P++  + + ++  A         H  +DA     
Subjt:  VGNSGITLRGVVFKPGHYVPVLAENDVAPDVQMIRRNTVPFAT-------------GNQTH--GYNPRSKNVPSHESSDAKLRFRHTPPHSNQDA-----

Query:  ------------LKDKSISSIFAK-ITPSGSS-------RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESS--KGKEVLIGTFPSNELAPTNVTVGI
                    +KD + SS   K I P+G +         + VP        +   + +  +  +  T  + S  KG   L+  F + E + T  T   
Subjt:  ------------LKDKSISSIFAK-ITPSGSS-------RGNVVPVVPQPAKLTNGPSVANESFAIQTTDVESS--KGKEVLIGTFPSNELAPTNVTVGI

Query:  ESFP---FQPQTSQQVLQDD
         +     FQ QT +   +D+
Subjt:  ESFP---FQPQTSQQVLQDD

AT5G54930.1 AT hook motif-containing protein (e value: 2.0e-22; % identity: 33.81)
Query:  DVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKP
        D+  KRKRGRPRK  KL  +E++L                 G   + ++S   +  +N  + ++G+ +SGVIEA FEAG+LL V+VGNS   LRGVVFKP
Subjt:  DVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKP

Query:  GHYVPVLAENDVAPDVQMIRRNT-VPFATGNQTHGYNPRSKNVPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSSRGN--VVPVVPQPAK
        GH  PV  +NDVAPDV MIRRN+ V    G+   G   R +     E   + +R R   P                  I P+  +  N  +VPVV QPA 
Subjt:  GHYVPVLAENDVAPDVQMIRRNT-VPFATGNQTHGYNPRSKNVPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSSRGN--VVPVVPQPAK

Query:  LTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVE
        L NG     E   I  + +++  G          ++ +  +     E+   Q     QV     SVE  S  Q+L +E
Subjt:  LTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVE

AT5G54930.2 AT hook motif-containing protein (e value: 2.0e-22; % identity: 33.81)
Query:  DVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKP
        D+  KRKRGRPRK  KL  +E++L                 G   + ++S   +  +N  + ++G+ +SGVIEA FEAG+LL V+VGNS   LRGVVFKP
Subjt:  DVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKP

Query:  GHYVPVLAENDVAPDVQMIRRNT-VPFATGNQTHGYNPRSKNVPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSSRGN--VVPVVPQPAK
        GH  PV  +NDVAPDV MIRRN+ V    G+   G   R +     E   + +R R   P                  I P+  +  N  +VPVV QPA 
Subjt:  GHYVPVLAENDVAPDVQMIRRNT-VPFATGNQTHGYNPRSKNVPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSSRGN--VVPVVPQPAK

Query:  LTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVE
        L NG     E   I  + +++  G          ++ +  +     E+   Q     QV     SVE  S  Q+L +E
Subjt:  LTNGPSVANESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVE


Sequences
CDS sequence
ATGAGTCAGGCTGAACAAGGAATCAGCGCTGATAATTTAGTTGACGTTCCCTTGAAGCGAAAACGTGGCCGTCCAAGAAAATATCCGAAGTTAAGTTATGATGAGAATGC
TCTTATTGCAAATAATGGAGGTAAAAAACATTTGGAGACTATTCGTATATCACCTGGTTCTGGAGTGAATGGAAACCAATCACATCCATCTAATCCAATTCAAAATGCAG
CTGATGTGATACTGGGACGAGTTGTGTCTGGTGTCATTGAGGCAGTATTTGAAGCTGGGTATCTGCTGTGTGTTAGGGTTGGCAACTCTGGAATCACTTTGAGGGGTGTT
GTCTTTAAGCCTGGGCACTATGTCCCTGTTTTGGCAGAGAACGACGTGGCCCCAGATGTTCAAATGATTAGAAGAAATACGGTTCCTTTTGCTACAGGAAATCAAACCCA
TGGATATAACCCGCGGTCTAAAAATGTCCCATCCCATGAATCATCAGATGCCAAGCTCAGGTTTAGACATACACCTCCACATTCTAATCAGGATGCTTTGAAAGACAAAT
CTATATCGTCTATATTCGCAAAAATTACCCCTTCAGGAAGCTCAAGAGGTAATGTGGTCCCTGTTGTGCCTCAACCTGCTAAATTAACAAATGGACCCTCAGTTGCTAAT
GAATCATTTGCAATTCAAACAACTGATGTGGAATCCTCAAAAGGCAAGGAGGTTCTTATAGGTACTTTTCCATCAAATGAATTAGCTCCCACCAATGTGACAGTTGGGAT
TGAAAGCTTTCCTTTCCAACCCCAAACTAGCCAGCAGGTCTTACAAGATGATGTATCAGTAGAAAATAGTTCTCACAACCAATCCTTGGTAGTGGAAGTGCGTGATTCAG
AAGCTAAACTAATGACATTGCCTAGCACGCCTTTTCAGAATCTTGTGACTGAAGTGATCAAGAGAATTCAAGCCCCCTCTCTGTCAGCTGAGATGCAGACTGAGAATAAC
AAAGCAACTGGTAAGATATCAGCTAAAGAATGTGTAGATAGCTCAGAGGTTGCAGCTAACATAGTGGATGGACCTTTGATGATTGAGCCCCTAAAAGCAGTGCAACCCCT
TCATCACAGTTCAGTGTCCATTCGCAAATCTCTGGATGACGAGTCTAGAACTGGCAAAATGACTGAGCTGTTACAGGTAAATGATACTCTCACTAGAATGCATTTTATTC
GATCTTCAAGTATTCGGTTACATTACTATTACTTTTTCTATATGGGAAGAATTCATTTGGTTAATGACGCCACTGCCTGTGCTAATGAAGTAATGTCCTTTGACAAGGTT
TTGCAGCAAAACATGATTCAAAATTCAGAGCCATGGGCTAAAGTGCAGAACCCGGGTTTGATGCTGAAGGCAGATGGTCTGGAGGAATCAAAAACAGAGATTGGGGATGA
AGGAGCTGGCAACCAAAAGCAAATCTGA
mRNA sequence
ATGAGTCAGGCTGAACAAGGAATCAGCGCTGATAATTTAGTTGACGTTCCCTTGAAGCGAAAACGTGGCCGTCCAAGAAAATATCCGAAGTTAAGTTATGATGAGAATGC
TCTTATTGCAAATAATGGAGGTAAAAAACATTTGGAGACTATTCGTATATCACCTGGTTCTGGAGTGAATGGAAACCAATCACATCCATCTAATCCAATTCAAAATGCAG
CTGATGTGATACTGGGACGAGTTGTGTCTGGTGTCATTGAGGCAGTATTTGAAGCTGGGTATCTGCTGTGTGTTAGGGTTGGCAACTCTGGAATCACTTTGAGGGGTGTT
GTCTTTAAGCCTGGGCACTATGTCCCTGTTTTGGCAGAGAACGACGTGGCCCCAGATGTTCAAATGATTAGAAGAAATACGGTTCCTTTTGCTACAGGAAATCAAACCCA
TGGATATAACCCGCGGTCTAAAAATGTCCCATCCCATGAATCATCAGATGCCAAGCTCAGGTTTAGACATACACCTCCACATTCTAATCAGGATGCTTTGAAAGACAAAT
CTATATCGTCTATATTCGCAAAAATTACCCCTTCAGGAAGCTCAAGAGGTAATGTGGTCCCTGTTGTGCCTCAACCTGCTAAATTAACAAATGGACCCTCAGTTGCTAAT
GAATCATTTGCAATTCAAACAACTGATGTGGAATCCTCAAAAGGCAAGGAGGTTCTTATAGGTACTTTTCCATCAAATGAATTAGCTCCCACCAATGTGACAGTTGGGAT
TGAAAGCTTTCCTTTCCAACCCCAAACTAGCCAGCAGGTCTTACAAGATGATGTATCAGTAGAAAATAGTTCTCACAACCAATCCTTGGTAGTGGAAGTGCGTGATTCAG
AAGCTAAACTAATGACATTGCCTAGCACGCCTTTTCAGAATCTTGTGACTGAAGTGATCAAGAGAATTCAAGCCCCCTCTCTGTCAGCTGAGATGCAGACTGAGAATAAC
AAAGCAACTGGTAAGATATCAGCTAAAGAATGTGTAGATAGCTCAGAGGTTGCAGCTAACATAGTGGATGGACCTTTGATGATTGAGCCCCTAAAAGCAGTGCAACCCCT
TCATCACAGTTCAGTGTCCATTCGCAAATCTCTGGATGACGAGTCTAGAACTGGCAAAATGACTGAGCTGTTACAGGTAAATGATACTCTCACTAGAATGCATTTTATTC
GATCTTCAAGTATTCGGTTACATTACTATTACTTTTTCTATATGGGAAGAATTCATTTGGTTAATGACGCCACTGCCTGTGCTAATGAAGTAATGTCCTTTGACAAGGTT
TTGCAGCAAAACATGATTCAAAATTCAGAGCCATGGGCTAAAGTGCAGAACCCGGGTTTGATGCTGAAGGCAGATGGTCTGGAGGAATCAAAAACAGAGATTGGGGATGA
AGGAGCTGGCAACCAAAAGCAAATCTGA
Protein sequence
MSQAEQGISADNLVDVPLKRKRGRPRKYPKLSYDENALIANNGGKKHLETIRISPGSGVNGNQSHPSNPIQNAADVILGRVVSGVIEAVFEAGYLLCVRVGNSGITLRGV
VFKPGHYVPVLAENDVAPDVQMIRRNTVPFATGNQTHGYNPRSKNVPSHESSDAKLRFRHTPPHSNQDALKDKSISSIFAKITPSGSSRGNVVPVVPQPAKLTNGPSVAN
ESFAIQTTDVESSKGKEVLIGTFPSNELAPTNVTVGIESFPFQPQTSQQVLQDDVSVENSSHNQSLVVEVRDSEAKLMTLPSTPFQNLVTEVIKRIQAPSLSAEMQTENN
KATGKISAKECVDSSEVAANIVDGPLMIEPLKAVQPLHHSSVSIRKSLDDESRTGKMTELLQVNDTLTRMHFIRSSSIRLHYYYFFYMGRIHLVNDATACANEVMSFDKV
LQQNMIQNSEPWAKVQNPGLMLKADGLEESKTEIGDEGAGNQKQI