; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G018190 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G018190
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPAX-interacting protein 1
Genome locationchr01:18437318..18444854
RNA-Seq ExpressionLsi01G018190
SyntenyLsi01G018190
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR001357 - BRCT domain
IPR016181 - Acyl-CoA N-acyltransferase
IPR036420 - BRCT domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008449208.1 PREDICTED: uncharacterized protein LOC103491160 isoform X1 [Cucumis melo]1.3e-30380.83Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MAPRKKP RRSSI IVGKEG   DVNH E VGKG+GEKC SQGEYSFVLVNPNDFDS+SK+YLQ+VLQLYKRELPTMAYAANTGK+STFMEKC+SNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHK-----------------------GFGHILYVELRKRLQSVGIRTIFCWGDKESEG
        TLLLESKSEV+ GL+IAAITYQIVPADTQYAEIPLAAV  AYQHK                       GFGHILY+ELRKRLQSVGIRTIFCWGDKESEG
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHK-----------------------GFGHILYVELRKRLQSVGIRTIFCWGDKESEG

Query:  FWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQ
        FWSKQ FLSIAEVDTKGK RRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPK  SLLKPEA    P+A RI VA+QGC VSNATDQ T + LNFQ
Subjt:  FWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQ

Query:  PDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHL
        PDEFVTL PLGE+N+IQ P         QNQDAVHDSN PVSF+EIE+   ASIAELSNTIGNLDE +CSCS QSAKR+WEASLSSLKSKKVKGV+L H 
Subjt:  PDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHL

Query:  HFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGS
        H DS K+ VPKSNG +  SQACSLANSKHEIL+S+ PK P T+ YTQNFC+EF SVNVASEDL+  E+TLGK FKIMLMNIADEAKKTQLMKVIEELGGS
Subjt:  HFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGS

Query:  LTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKT
        LT+DGST THVITGKVRKTLNFCTALCSGAWIVS SWLKESYREGRFVDELP +LNDDDYTSKYRASLKT VLRAKA PG+LF GYDVCISA+AQPPPKT
Subjt:  LTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKT

Query:  LSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        LS+IVKSAGG+VIH L KV  VSKTIF+ACEEDVEEAL+AV+KGIWTFN EWLMTCIMRQE+DLEAPQFAESL
Subjt:  LSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

XP_008449209.1 PREDICTED: uncharacterized protein LOC103491160 isoform X2 [Cucumis melo]4.0e-30280.53Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MAPRKKP RRSSI  +GKEG   DVNH E VGKG+GEKC SQGEYSFVLVNPNDFDS+SK+YLQ+VLQLYKRELPTMAYAANTGK+STFMEKC+SNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHK-----------------------GFGHILYVELRKRLQSVGIRTIFCWGDKESEG
        TLLLESKSEV+ GL+IAAITYQIVPADTQYAEIPLAAV  AYQHK                       GFGHILY+ELRKRLQSVGIRTIFCWGDKESEG
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHK-----------------------GFGHILYVELRKRLQSVGIRTIFCWGDKESEG

Query:  FWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQ
        FWSKQ FLSIAEVDTKGK RRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPK  SLLKPEA    P+A RI VA+QGC VSNATDQ T + LNFQ
Subjt:  FWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQ

Query:  PDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHL
        PDEFVTL PLGE+N+IQ P         QNQDAVHDSN PVSF+EIE+   ASIAELSNTIGNLDE +CSCS QSAKR+WEASLSSLKSKKVKGV+L H 
Subjt:  PDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHL

Query:  HFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGS
        H DS K+ VPKSNG +  SQACSLANSKHEIL+S+ PK P T+ YTQNFC+EF SVNVASEDL+  E+TLGK FKIMLMNIADEAKKTQLMKVIEELGGS
Subjt:  HFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGS

Query:  LTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKT
        LT+DGST THVITGKVRKTLNFCTALCSGAWIVS SWLKESYREGRFVDELP +LNDDDYTSKYRASLKT VLRAKA PG+LF GYDVCISA+AQPPPKT
Subjt:  LTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKT

Query:  LSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        LS+IVKSAGG+VIH L KV  VSKTIF+ACEEDVEEAL+AV+KGIWTFN EWLMTCIMRQE+DLEAPQFAESL
Subjt:  LSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

XP_008449211.1 PREDICTED: uncharacterized protein LOC103491160 isoform X3 [Cucumis melo]1.4e-30783.69Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MAPRKKP RRSSI IVGKEG   DVNH E VGKG+GEKC SQGEYSFVLVNPNDFDS+SK+YLQ+VLQLYKRELPTMAYAANTGK+STFMEKC+SNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP
        TLLLESKSEV+ GL+IAAITYQIVPADTQYAEIPLAAV  AYQHKGFGHILY+ELRKRLQSVGIRTIFCWGDKESEGFWSKQ FLSIAEVDTKGK RRIP
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP

Query:  VRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGL
        VRADIRRALCFPGGSTLMISHIQGISMCSADLPK  SLLKPEA    P+A RI VA+QGC VSNATDQ T + LNFQPDEFVTL PLGE+N+IQ P    
Subjt:  VRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGL

Query:  LLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACS
             QNQDAVHDSN PVSF+EIE+   ASIAELSNTIGNLDE +CSCS QSAKR+WEASLSSLKSKKVKGV+L H H DS K+ VPKSNG +  SQACS
Subjt:  LLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACS

Query:  LANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFC
        LANSKHEIL+S+ PK P T+ YTQNFC+EF SVNVASEDL+  E+TLGK FKIMLMNIADEAKKTQLMKVIEELGGSLT+DGST THVITGKVRKTLNFC
Subjt:  LANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFC

Query:  TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVS
        TALCSGAWIVS SWLKESYREGRFVDELP +LNDDDYTSKYRASLKT VLRAKA PG+LF GYDVCISA+AQPPPKTLS+IVKSAGG+VIH L KV  VS
Subjt:  TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVS

Query:  KTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        KTIF+ACEEDVEEAL+AV+KGIWTFN EWLMTCIMRQE+DLEAPQFAESL
Subjt:  KTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

XP_038881370.1 uncharacterized protein LOC120072912 isoform X1 [Benincasa hispida]0.0e+0088.77Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MAP+KKP RRSSIPIVGKEG V DVNHPES GKGIGEKCHSQGEYSFVLVNPNDFDSHSK+YLQEVLQLYKRELPTM YAANTGK+STFM+KCVSNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP
        TLLLESKSEV+PGLIIAAITYQIVPADTQYAEIPLAAV SAYQHKGFGHILY+ELRKRLQSVGIRTIFCWGDKESEGFWSKQ FLSIAEVDTKGK+RRIP
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP

Query:  VRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGL
        VRADIRRALCFPGGSTLM+SHIQGISMCSADLPKS SLLKPEAPSS   A+RIGVANQGC VSNA DQQTF+ LNFQPD+FVTLAPLGE+NKIQ P    
Subjt:  VRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGL

Query:  LLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACS
             Q QDAVHD NSP SF+EIE+ R ASIAELSNTIGNLDE +CSCSTQSAKR+WEASLSSLKSKKVKGVHL H H DS K+FVP+SNG DT+SQACS
Subjt:  LLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACS

Query:  LANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFC
        LANSKHEILAS+S KNPSTSNYTQNFCKEF SVNVASE LD  E+TLGK FKIMLMNIADEAKKTQLMKVIEELGGSLT+DGST THVITGKVRKTLNFC
Subjt:  LANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFC

Query:  TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVS
        TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKA PG+LF GYDVCISA+AQPPPKTLSVIVKSAGGNVIH LGKV EVS
Subjt:  TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVS

Query:  KTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        KTIF+A EEDVEEALLAVKKGIWTFNSEWLMTC+MRQELDLEAPQFAESL
Subjt:  KTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

XP_038881371.1 uncharacterized protein LOC120072912 isoform X2 [Benincasa hispida]0.0e+0086.46Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MAP+KKP RRSSIPI                  GIGEKCHSQGEYSFVLVNPNDFDSHSK+YLQEVLQLYKRELPTM YAANTGK+STFM+KCVSNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP
        TLLLESKSEV+PGLIIAAITYQIVPADTQYAEIPLAAV SAYQHKGFGHILY+ELRKRLQSVGIRTIFCWGDKESEGFWSKQ FLSIAEVDTKGK+RRIP
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP

Query:  VRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGL
        VRADIRRALCFPGGSTLM+SHIQGISMCSADLPKS SLLKPEAPSS   A+RIGVANQGC VSNA DQQTF+ LNFQPD+FVTLAPLGE+NKIQ P    
Subjt:  VRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGL

Query:  LLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACS
             Q QDAVHD NSP SF+EIE+ R ASIAELSNTIGNLDE +CSCSTQSAKR+WEASLSSLKSKKVKGVHL H H DS K+FVP+SNG DT+SQACS
Subjt:  LLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACS

Query:  LANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFC
        LANSKHEILAS+S KNPSTSNYTQNFCKEF SVNVASE LD  E+TLGK FKIMLMNIADEAKKTQLMKVIEELGGSLT+DGST THVITGKVRKTLNFC
Subjt:  LANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFC

Query:  TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVS
        TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKA PG+LF GYDVCISA+AQPPPKTLSVIVKSAGGNVIH LGKV EVS
Subjt:  TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVS

Query:  KTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        KTIF+A EEDVEEALLAVKKGIWTFNSEWLMTC+MRQELDLEAPQFAESL
Subjt:  KTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

TrEMBL top hitse value%identityAlignment
A0A1S3BKX0 uncharacterized protein LOC103491160 isoform X36.9e-30883.69Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MAPRKKP RRSSI IVGKEG   DVNH E VGKG+GEKC SQGEYSFVLVNPNDFDS+SK+YLQ+VLQLYKRELPTMAYAANTGK+STFMEKC+SNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP
        TLLLESKSEV+ GL+IAAITYQIVPADTQYAEIPLAAV  AYQHKGFGHILY+ELRKRLQSVGIRTIFCWGDKESEGFWSKQ FLSIAEVDTKGK RRIP
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP

Query:  VRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGL
        VRADIRRALCFPGGSTLMISHIQGISMCSADLPK  SLLKPEA    P+A RI VA+QGC VSNATDQ T + LNFQPDEFVTL PLGE+N+IQ P    
Subjt:  VRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGL

Query:  LLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACS
             QNQDAVHDSN PVSF+EIE+   ASIAELSNTIGNLDE +CSCS QSAKR+WEASLSSLKSKKVKGV+L H H DS K+ VPKSNG +  SQACS
Subjt:  LLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACS

Query:  LANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFC
        LANSKHEIL+S+ PK P T+ YTQNFC+EF SVNVASEDL+  E+TLGK FKIMLMNIADEAKKTQLMKVIEELGGSLT+DGST THVITGKVRKTLNFC
Subjt:  LANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFC

Query:  TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVS
        TALCSGAWIVS SWLKESYREGRFVDELP +LNDDDYTSKYRASLKT VLRAKA PG+LF GYDVCISA+AQPPPKTLS+IVKSAGG+VIH L KV  VS
Subjt:  TALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVS

Query:  KTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        KTIF+ACEEDVEEAL+AV+KGIWTFN EWLMTCIMRQE+DLEAPQFAESL
Subjt:  KTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

A0A1S3BM51 uncharacterized protein LOC103491160 isoform X16.1e-30480.83Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MAPRKKP RRSSI IVGKEG   DVNH E VGKG+GEKC SQGEYSFVLVNPNDFDS+SK+YLQ+VLQLYKRELPTMAYAANTGK+STFMEKC+SNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHK-----------------------GFGHILYVELRKRLQSVGIRTIFCWGDKESEG
        TLLLESKSEV+ GL+IAAITYQIVPADTQYAEIPLAAV  AYQHK                       GFGHILY+ELRKRLQSVGIRTIFCWGDKESEG
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHK-----------------------GFGHILYVELRKRLQSVGIRTIFCWGDKESEG

Query:  FWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQ
        FWSKQ FLSIAEVDTKGK RRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPK  SLLKPEA    P+A RI VA+QGC VSNATDQ T + LNFQ
Subjt:  FWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQ

Query:  PDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHL
        PDEFVTL PLGE+N+IQ P         QNQDAVHDSN PVSF+EIE+   ASIAELSNTIGNLDE +CSCS QSAKR+WEASLSSLKSKKVKGV+L H 
Subjt:  PDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHL

Query:  HFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGS
        H DS K+ VPKSNG +  SQACSLANSKHEIL+S+ PK P T+ YTQNFC+EF SVNVASEDL+  E+TLGK FKIMLMNIADEAKKTQLMKVIEELGGS
Subjt:  HFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGS

Query:  LTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKT
        LT+DGST THVITGKVRKTLNFCTALCSGAWIVS SWLKESYREGRFVDELP +LNDDDYTSKYRASLKT VLRAKA PG+LF GYDVCISA+AQPPPKT
Subjt:  LTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKT

Query:  LSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        LS+IVKSAGG+VIH L KV  VSKTIF+ACEEDVEEAL+AV+KGIWTFN EWLMTCIMRQE+DLEAPQFAESL
Subjt:  LSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

A0A1S3BMF3 uncharacterized protein LOC103491160 isoform X22.0e-30280.53Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MAPRKKP RRSSI  +GKEG   DVNH E VGKG+GEKC SQGEYSFVLVNPNDFDS+SK+YLQ+VLQLYKRELPTMAYAANTGK+STFMEKC+SNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHK-----------------------GFGHILYVELRKRLQSVGIRTIFCWGDKESEG
        TLLLESKSEV+ GL+IAAITYQIVPADTQYAEIPLAAV  AYQHK                       GFGHILY+ELRKRLQSVGIRTIFCWGDKESEG
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHK-----------------------GFGHILYVELRKRLQSVGIRTIFCWGDKESEG

Query:  FWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQ
        FWSKQ FLSIAEVDTKGK RRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPK  SLLKPEA    P+A RI VA+QGC VSNATDQ T + LNFQ
Subjt:  FWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQ

Query:  PDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHL
        PDEFVTL PLGE+N+IQ P         QNQDAVHDSN PVSF+EIE+   ASIAELSNTIGNLDE +CSCS QSAKR+WEASLSSLKSKKVKGV+L H 
Subjt:  PDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHL

Query:  HFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGS
        H DS K+ VPKSNG +  SQACSLANSKHEIL+S+ PK P T+ YTQNFC+EF SVNVASEDL+  E+TLGK FKIMLMNIADEAKKTQLMKVIEELGGS
Subjt:  HFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGS

Query:  LTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKT
        LT+DGST THVITGKVRKTLNFCTALCSGAWIVS SWLKESYREGRFVDELP +LNDDDYTSKYRASLKT VLRAKA PG+LF GYDVCISA+AQPPPKT
Subjt:  LTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKT

Query:  LSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        LS+IVKSAGG+VIH L KV  VSKTIF+ACEEDVEEAL+AV+KGIWTFN EWLMTCIMRQE+DLEAPQFAESL
Subjt:  LSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

A0A6J1EGI9 uncharacterized protein LOC111432350 isoform X17.0e-29279.6Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MA  ++PARRS  PIVGKEG V D + PE    G+GEKCHSQGEYSFVLVNPNDFDS SK++LQ VL LYKRELP M YAANTGK+STFMEKCVSNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP
        TLLL+S SE + G IIAAITYQIVPADTQYAEIPLAAV SAYQHKGF  ILY+ELRKRLQSVGIRTIFCWGDKESEGFWSKQ FLSIAEVDTKGK+RRIP
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP

Query:  VRADIRRALCFPGGSTLMISHI-QGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPG
        VRADIRRALCFPGGSTLM+SHI QG S CSAD PKS +LLKP+ P+SY  AKR  VA+QGCKVSNA DQQT + L FQP+EF +L  LG +NKI      
Subjt:  VRADIRRALCFPGGSTLMISHI-QGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPG

Query:  LLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQAC
            D QNQD V D N PVSF+E+E+ + ASIAEL+  IGNLDE  CSCSTQ AKR WEASLSSLKSKKVKGVHL H H DS +SFVP+SNG DT SQAC
Subjt:  LLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQAC

Query:  SLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDF-GEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLN
        SLANSKHEILAS+ PKN STS YTQNFC+EF SV VASEDL   G  T GK F+IMLMNIADEAKKTQL+KVIEELGGSLTSDGST THVITGKVRKTLN
Subjt:  SLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLDF-GEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLN

Query:  FCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKE
        FCTALCSGAWIVSPSWLKESYREGRFVDE PYVLNDDDYTSKYRASLKTAVLRAKA PG+LF GYDVCIS++AQPPPKTLSVIVKSAGGNVI+GLGKV  
Subjt:  FCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKE

Query:  VSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        VS+TIF+ACEEDVEEAL+A+K+GIWTFNSEWLM+C+MRQELD+EAPQFAESL
Subjt:  VSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

A0A6J1HT38 uncharacterized protein LOC111465956 isoform X11.2e-29680.52Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC
        MA  ++P RRS  PIVGKEG V D N PE    G+GEKCHSQGEYSFVLVNPN+FDS SK+YLQ VL LYKRELP M YAANTGK+STFMEKCVSNGKYC
Subjt:  MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYC

Query:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP
        TLLL+S SE +PG IIAAITYQIVPADTQYAEIPLAAV SAYQHKGF  ILY+ELRKRLQSVGIRTIFCWGDKESEGFWSKQ FLSIAEVDTKGK+RRIP
Subjt:  TLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIP

Query:  VRADIRRALCFPGGSTLMISHI-QGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPG
        VRADIRRALCFPGGSTLM+SHI QG S CSAD PKS +LLKP+ P SY  AKR  VA+QGCKVSNA DQQT + LNFQP+EF +L PLG +NKI      
Subjt:  VRADIRRALCFPGGSTLMISHI-QGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPG

Query:  LLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQAC
            D QNQDAV D N PV F+E+E+R+ AS AEL+  IGNLDE  CSCSTQ AKR+WEASLSSLKSKKVKGVHL H   DS +SFVP+SNGCDT SQAC
Subjt:  LLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDE-NCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQAC

Query:  SLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDL-DFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLN
        SLANSKHEILASV PKNPSTS YTQNFC+E  SVNVASEDL   G  TLGK F+IMLMNIADEAKKTQL+KVIEELGGSLTSDGST THVITGKVRKTLN
Subjt:  SLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDL-DFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLN

Query:  FCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKE
        FCTALCSGAWIVSPSWLKESYREGRFVDE PY+LNDDDYTSKYRASLKTAVLRAKA PG+LF GYDVCISA+AQPPPKTLSVIVKSAGGNVI+GLGKV  
Subjt:  FCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKE

Query:  VSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        VS+TIF+ACEEDVEEAL+A+K+GIWTFNSEWLM+C+MRQELD+EAPQFAESL
Subjt:  VSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

SwissProt top hitse value%identityAlignment
A0JNA8 PAX-interacting protein 13.6e-1932.11Show/hide
Query:  QLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDV
        Q +K +  LGG +      CTH+I  KV +T+ F TA+     IV+P WL+E ++  +FVDE  Y+L D +    +  SL+ ++ RA A P  LF     
Subjt:  QLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDV

Query:  CISAYAQPPPKTLSVIVKSAGGNVIH---GLGKV------KEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQF
         I+    P   T+  IV+ AGG V+       K+      K +S+ + ++CE D+        +GI   N+E+++T ++ Q LD E+ +F
Subjt:  CISAYAQPPPKTLSVIVKSAGGNVIH---GLGKV------KEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQF

Q6NZQ4 PAX-interacting protein 15.2e-1831.58Show/hide
Query:  QLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDV
        Q +K +  LGG +      CTH+I  KV +T+ F TA+     IV+P WL+E ++   F+DE  Y+L D +    +  SL+ ++ RA   P  LF     
Subjt:  QLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDV

Query:  CISAYAQPPPKTLSVIVKSAGGNVI---HGLGKV------KEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQF
         I+    P   T+  IV+ AGG V+       K+      K +S+ I ++CE D+        +GI   N+E+++T ++ Q LD E+ +F
Subjt:  CISAYAQPPPKTLSVIVKSAGGNVI---HGLGKV------KEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQF

Q6ZW49 PAX-interacting protein 13.6e-1932.11Show/hide
Query:  QLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDV
        Q +K +  LGG +      CTH+I  KV +T+ F TA+     IV+P WL+E +R  +F+DE  Y+L D +    +  SL+ ++ RA   P  LF     
Subjt:  QLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDV

Query:  CISAYAQPPPKTLSVIVKSAGGNVIH---GLGKVKE------VSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQF
         I+    P   T+  IV+ AGG V+       K+ E      +S+ I ++CE D+        +GI   N+E+++T ++ Q LD E+ +F
Subjt:  CISAYAQPPPKTLSVIVKSAGGNVIH---GLGKVKE------VSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQF

Q767L8 Mediator of DNA damage checkpoint protein 13.1e-1828.57Show/hide
Query:  KVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCIS
        + +  LGGSL S  +  +H++T ++R+T+ F  AL  G  I+S  WL +S + G F+    YV+ D +    +  SL+ A+ RA+     L  GY++ ++
Subjt:  KVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCIS

Query:  AYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEA
           QPPP  +  I+   GG V+  + +  +  + + + C +D     +  + G+   + E+L+T +++QE   EA
Subjt:  AYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEA

Q7YR40 Mediator of DNA damage checkpoint protein 15.2e-1827.43Show/hide
Query:  KVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCIS
        + +  LGGSL    +  +H++T ++R+T+ F  AL  G  I+S  WL +S++ G F+    YV+ D +    +  SL+ A+ RA+     L  GY++ ++
Subjt:  KVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCIS

Query:  AYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEA
           QPPP  +  I+   GG  +  + +  +  + + + C +D     + ++ G+   + E+L+T +++QE   EA
Subjt:  AYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEA

Arabidopsis top hitse value%identityAlignment
AT1G04020.1 breast cancer associated RING 18.0e-0630.36Show/hide
Query:  THVIT-----GKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYD-VCISAYAQPPPKTLS
        THVI      G   +TL     + +G WI++ +W+K S +  + VDE P+ +  D  T   +   KTA LRA+     LF G        + +   + L 
Subjt:  THVIT-----GKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYD-VCISAYAQPPPKTLS

Query:  VIVKSAGGNVIH
         +VK AGG +++
Subjt:  VIVKSAGGNVIH

AT1G04020.2 breast cancer associated RING 18.0e-0630.36Show/hide
Query:  THVIT-----GKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYD-VCISAYAQPPPKTLS
        THVI      G   +TL     + +G WI++ +W+K S +  + VDE P+ +  D  T   +   KTA LRA+     LF G        + +   + L 
Subjt:  THVIT-----GKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYD-VCISAYAQPPPKTLS

Query:  VIVKSAGGNVIH
         +VK AGG +++
Subjt:  VIVKSAGGNVIH

AT2G41450.1 N-acetyltransferases;N-acetyltransferases5.7e-14543.35Show/hide
Query:  MAPRKKPARRSSIPIVGKEGRVTD--------------VNHPESVGKGIGEKCHSQGEY----SFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAAN
        MAP++    +SS+  +G      D              +  P +    I EK     +Y     F+L+NP D D  +K++LQEVL+LY +ELP M YA+N
Subjt:  MAPRKKPARRSSIPIVGKEGRVTD--------------VNHPESVGKGIGEKCHSQGEY----SFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAAN

Query:  TGKESTFMEKCVSNGKYCTLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQ
        TGK+S F+E+CVS GKYC+L+L+S    +   I+AAITYQIVPADTQYAEIPLAAV   +Q KGFG ++Y EL KRL SVGIRTI+CW DKESEGFW KQ
Subjt:  TGKESTFMEKCVSNGKYCTLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQ

Query:  SFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFV
         F+ +AEVD KGK++ + ++++IR+ALCFPGGSTLM+SH+    + + ++  S       +P S  +     V     K+  +  +  +           
Subjt:  SFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFV

Query:  TLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDENCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTK
               D    +  P   +T  +N + + D              QA+ A         D    CST   KR WEASLSSL+SK+++         ++  
Subjt:  TLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDENCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTK

Query:  SFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCK--EFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSD
        S + K++   + ++     NS    +         T +     CK  + E   +A+  +D      G+ ++I+LM+I DE K+  L +VI +LGG++T D
Subjt:  SFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCK--EFESVNVASEDLDFGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSD

Query:  GSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVI
        G+T TH++TGKVRKTLN CTALCSGAWIVSPSWLKES REGRF +E  ++L+D+DY  KY   LK+ VLRAKA P SL  GYD+C+    + P KT S I
Subjt:  GSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVI

Query:  VKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL
        +KSAGGNVI G+ KVKE SK I++ CEED   AL A KKGIWTF+SEW M C+M+Q+LDL+ PQF ESL
Subjt:  VKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL

AT2G41450.2 N-acetyltransferases;N-acetyltransferases1.3e-14445.95Show/hide
Query:  FVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYCTLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKG
        F+L+NP D D  +K++LQEVL+LY +ELP M YA+NTGK+S F+E+CVS GKYC+L+L+S    +   I+AAITYQIVPADTQYAEIPLAAV   +Q KG
Subjt:  FVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYCTLLLESKSEVEPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKG

Query:  FGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSS
        FG ++Y EL KRL SVGIRTI+CW DKESEGFW KQ F+ +AEVD KGK++ + ++++IR+ALCFPGGSTLM+SH+    + + ++  S       +P S
Subjt:  FGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMISHIQGISMCSADLPKSLSLLKPEAPSS

Query:  YPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDENCS
          +     V     K+  +  +  +                  D    +  P   +T  +N + + D              QA+ A         D    
Subjt:  YPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQASIAELSNTIGNLDENCS

Query:  CSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCK--EFESVNVASEDLDFGED
        CST   KR WEASLSSL+SK+++         ++  S + K++   + ++     NS    +         T +     CK  + E   +A+  +D    
Subjt:  CSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCK--EFESVNVASEDLDFGED

Query:  TLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASL
          G+ ++I+LM+I DE K+  L +VI +LGG++T DG+T TH++TGKVRKTLN CTALCSGAWIVSPSWLKES REGRF +E  ++L+D+DY  KY   L
Subjt:  TLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASL

Query:  KTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQ
        K+ VLRAKA P SL  GYD+C+    + P KT S I+KSAGGNVI G+ KVKE SK I++ CEED   AL A KKGIWTF+SEW M C+M+Q+LDL+ PQ
Subjt:  KTAVLRAKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQ

Query:  FAESL
        F ESL
Subjt:  FAESL

AT4G03130.1 BRCT domain-containing DNA repair protein4.2e-1530.16Show/hide
Query:  NIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACP
        N+ DE  K Q  K++  LG S  S  +  TH I  +  +T N   A+  G ++V+P WL+   +    +DE  Y+L D     K    L T++ RAK  P
Subjt:  NIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLRAKACP

Query:  GSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSK-----TIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELD
          L  G+ VCI+   +P    ++ +VK   G V+     +    +      + L+C+ED +  L  V +G   F SE L+  I+ Q+L+
Subjt:  GSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSK-----TIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCCAGAAAGAAACCCGCACGACGCTCTTCAATCCCCATCGTGGGGAAAGAGGGACGTGTTACTGATGTTAATCATCCAGAGAGCGTTGGAAAAGGTATTGGGGA
AAAGTGTCACTCTCAAGGGGAGTACTCGTTTGTGCTCGTTAATCCGAATGATTTTGATAGTCACAGTAAAACTTATCTTCAGGAAGTATTACAATTATACAAAAGGGAAT
TACCCACAATGGCCTATGCTGCAAATACTGGAAAGGAATCAACTTTTATGGAAAAATGTGTATCTAATGGGAAATATTGTACATTGCTCTTGGAATCCAAATCTGAGGTT
GAACCAGGATTGATCATAGCTGCAATTACCTACCAAATAGTCCCTGCTGACACGCAATATGCTGAGATTCCTCTTGCTGCTGTCTGTTCAGCTTACCAACACAAGGGTTT
TGGTCACATACTATACGTGGAACTTAGAAAAAGACTTCAAAGTGTTGGCATCCGTACAATATTCTGTTGGGGAGACAAGGAATCTGAAGGGTTTTGGTCCAAACAGAGCT
TTTTGTCCATAGCAGAAGTGGACACCAAGGGAAAATCTCGTAGAATACCCGTTAGAGCTGACATTCGTAGAGCATTATGCTTTCCTGGTGGTTCTACCCTCATGATTTCA
CACATTCAGGGAATTTCAATGTGTTCGGCAGACTTGCCAAAGTCATTATCTCTATTAAAGCCTGAAGCCCCTTCATCGTACCCACATGCCAAAAGAATTGGTGTTGCCAA
TCAAGGTTGCAAGGTTTCAAATGCAACTGATCAGCAGACCTTTAAAAAATTGAACTTCCAACCTGACGAGTTTGTAACTTTAGCACCCCTTGGAGAAGATAACAAAATTC
AGGTGCCGAAACCTGGTTTGTTGCTAACAGATCTTCAGAACCAGGATGCAGTGCATGACTCCAACAGTCCAGTTTCTTTTTCAGAAATAGAGAGCCGCAGGCAAGCTAGT
ATTGCAGAATTATCCAATACTATTGGTAATTTGGATGAGAATTGTTCTTGCTCTACACAGAGCGCAAAGAGAATTTGGGAAGCATCACTTTCTTCGCTGAAGTCAAAGAA
AGTAAAAGGAGTCCATCTGCACCATTTACATTTCGACTCTACCAAAAGTTTTGTTCCTAAAAGTAATGGATGTGATACCTACTCTCAAGCATGCTCATTAGCCAATTCAA
AGCATGAGATTTTAGCTAGTGTTTCTCCTAAAAATCCTTCAACCAGTAATTATACACAAAATTTCTGTAAAGAATTTGAAAGTGTTAATGTGGCATCAGAGGACCTCGAT
TTTGGAGAAGACACATTAGGAAAGCCGTTTAAAATTATGCTGATGAATATTGCAGATGAAGCTAAGAAAACTCAGCTCATGAAGGTGATTGAAGAGCTCGGTGGTTCTCT
CACCTCTGATGGGAGTACGTGCACCCATGTCATTACAGGAAAAGTGCGGAAAACTCTAAATTTCTGCACTGCTCTATGCTCAGGAGCCTGGATTGTCTCCCCTAGTTGGT
TAAAGGAAAGCTATCGGGAAGGCAGATTTGTTGACGAGTTGCCTTACGTACTGAATGATGATGACTACACGTCGAAGTACAGAGCCAGCCTAAAAACTGCAGTTCTCAGA
GCAAAAGCATGTCCTGGATCTTTATTTGCAGGGTATGATGTTTGCATATCAGCTTACGCTCAACCACCACCTAAAACTCTATCTGTGATAGTCAAGTCAGCCGGTGGAAA
TGTGATTCATGGGCTGGGAAAAGTAAAAGAAGTATCGAAAACAATCTTCTTGGCGTGCGAGGAAGATGTAGAGGAAGCCTTACTGGCTGTAAAAAAAGGGATATGGACTT
TTAACAGTGAATGGCTGATGACCTGTATTATGAGACAAGAACTAGATCTGGAGGCCCCTCAATTTGCTGAGTCCCTGTAA
mRNA sequenceShow/hide mRNA sequence
ACAAGAAGCTACGCGAAATCCCACAACAATGGCGCCCAGAAAGAAACCCGCACGACGCTCTTCAATCCCCATCGTGGGGAAAGAGGGACGTGTTACTGATGTTAATCATC
CAGAGAGCGTTGGAAAAGGTATTGGGGAAAAGTGTCACTCTCAAGGGGAGTACTCGTTTGTGCTCGTTAATCCGAATGATTTTGATAGTCACAGTAAAACTTATCTTCAG
GAAGTATTACAATTATACAAAAGGGAATTACCCACAATGGCCTATGCTGCAAATACTGGAAAGGAATCAACTTTTATGGAAAAATGTGTATCTAATGGGAAATATTGTAC
ATTGCTCTTGGAATCCAAATCTGAGGTTGAACCAGGATTGATCATAGCTGCAATTACCTACCAAATAGTCCCTGCTGACACGCAATATGCTGAGATTCCTCTTGCTGCTG
TCTGTTCAGCTTACCAACACAAGGGTTTTGGTCACATACTATACGTGGAACTTAGAAAAAGACTTCAAAGTGTTGGCATCCGTACAATATTCTGTTGGGGAGACAAGGAA
TCTGAAGGGTTTTGGTCCAAACAGAGCTTTTTGTCCATAGCAGAAGTGGACACCAAGGGAAAATCTCGTAGAATACCCGTTAGAGCTGACATTCGTAGAGCATTATGCTT
TCCTGGTGGTTCTACCCTCATGATTTCACACATTCAGGGAATTTCAATGTGTTCGGCAGACTTGCCAAAGTCATTATCTCTATTAAAGCCTGAAGCCCCTTCATCGTACC
CACATGCCAAAAGAATTGGTGTTGCCAATCAAGGTTGCAAGGTTTCAAATGCAACTGATCAGCAGACCTTTAAAAAATTGAACTTCCAACCTGACGAGTTTGTAACTTTA
GCACCCCTTGGAGAAGATAACAAAATTCAGGTGCCGAAACCTGGTTTGTTGCTAACAGATCTTCAGAACCAGGATGCAGTGCATGACTCCAACAGTCCAGTTTCTTTTTC
AGAAATAGAGAGCCGCAGGCAAGCTAGTATTGCAGAATTATCCAATACTATTGGTAATTTGGATGAGAATTGTTCTTGCTCTACACAGAGCGCAAAGAGAATTTGGGAAG
CATCACTTTCTTCGCTGAAGTCAAAGAAAGTAAAAGGAGTCCATCTGCACCATTTACATTTCGACTCTACCAAAAGTTTTGTTCCTAAAAGTAATGGATGTGATACCTAC
TCTCAAGCATGCTCATTAGCCAATTCAAAGCATGAGATTTTAGCTAGTGTTTCTCCTAAAAATCCTTCAACCAGTAATTATACACAAAATTTCTGTAAAGAATTTGAAAG
TGTTAATGTGGCATCAGAGGACCTCGATTTTGGAGAAGACACATTAGGAAAGCCGTTTAAAATTATGCTGATGAATATTGCAGATGAAGCTAAGAAAACTCAGCTCATGA
AGGTGATTGAAGAGCTCGGTGGTTCTCTCACCTCTGATGGGAGTACGTGCACCCATGTCATTACAGGAAAAGTGCGGAAAACTCTAAATTTCTGCACTGCTCTATGCTCA
GGAGCCTGGATTGTCTCCCCTAGTTGGTTAAAGGAAAGCTATCGGGAAGGCAGATTTGTTGACGAGTTGCCTTACGTACTGAATGATGATGACTACACGTCGAAGTACAG
AGCCAGCCTAAAAACTGCAGTTCTCAGAGCAAAAGCATGTCCTGGATCTTTATTTGCAGGGTATGATGTTTGCATATCAGCTTACGCTCAACCACCACCTAAAACTCTAT
CTGTGATAGTCAAGTCAGCCGGTGGAAATGTGATTCATGGGCTGGGAAAAGTAAAAGAAGTATCGAAAACAATCTTCTTGGCGTGCGAGGAAGATGTAGAGGAAGCCTTA
CTGGCTGTAAAAAAAGGGATATGGACTTTTAACAGTGAATGGCTGATGACCTGTATTATGAGACAAGAACTAGATCTGGAGGCCCCTCAATTTGCTGAGTCCCTGTAAGA
AACAAGGTCAGCTAACTTTCAGTTTTGAACAAGCAGCCATTTCGTCCTTGTAAATCACTCTTTTCTCACTGACTCTAATTTGGTACATAAACAAATTTTGATTGTTTCAT
CTTTATATTCATTAGTACATGAAGACAATAAGTTTGTTCAAGGAAGATTGCTGAAGTTTTAGCTAGTTTAGAGTAAGTTACTGAAAGGAGACAGTTGGATGTGTACTCAG
TAGAAAGTATTATACACTATGTTTTGGGAATATTATCCTATTTAGTAACTTTGGCACTTGTAAAATTGAGGAATG
Protein sequenceShow/hide protein sequence
MAPRKKPARRSSIPIVGKEGRVTDVNHPESVGKGIGEKCHSQGEYSFVLVNPNDFDSHSKTYLQEVLQLYKRELPTMAYAANTGKESTFMEKCVSNGKYCTLLLESKSEV
EPGLIIAAITYQIVPADTQYAEIPLAAVCSAYQHKGFGHILYVELRKRLQSVGIRTIFCWGDKESEGFWSKQSFLSIAEVDTKGKSRRIPVRADIRRALCFPGGSTLMIS
HIQGISMCSADLPKSLSLLKPEAPSSYPHAKRIGVANQGCKVSNATDQQTFKKLNFQPDEFVTLAPLGEDNKIQVPKPGLLLTDLQNQDAVHDSNSPVSFSEIESRRQAS
IAELSNTIGNLDENCSCSTQSAKRIWEASLSSLKSKKVKGVHLHHLHFDSTKSFVPKSNGCDTYSQACSLANSKHEILASVSPKNPSTSNYTQNFCKEFESVNVASEDLD
FGEDTLGKPFKIMLMNIADEAKKTQLMKVIEELGGSLTSDGSTCTHVITGKVRKTLNFCTALCSGAWIVSPSWLKESYREGRFVDELPYVLNDDDYTSKYRASLKTAVLR
AKACPGSLFAGYDVCISAYAQPPPKTLSVIVKSAGGNVIHGLGKVKEVSKTIFLACEEDVEEALLAVKKGIWTFNSEWLMTCIMRQELDLEAPQFAESL