; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G015540 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G015540
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptiontranscription factor SPT20 homolog
Genome locationchr11:23865064..23873608
RNA-Seq ExpressionLsi11G015540
SyntenyLsi11G015540
Gene Ontology termsNA
InterPro domainsIPR010820 - UBA-like domain DUF1421


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589219.1 hypothetical protein SDJN03_17784, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0077.72Show/hide
Query:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS
        MASGS GRPNSGSK FDFG++D+LCSYEDYGNQESSNG+H DLSVANSSKDFHKSRMSTVYPAA Y QPEDSIKQDV STVEN MKKYSDNILRFLEGIS
Subjt:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS

Query:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS
        SRLSQLEL+CYNLDKSVGEMRSDV+RDHEEEDLKLKSLEKHLQE                       VHRSVQIIRDKQELAETQKDLAKLHL+QKES S
Subjt:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS

Query:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQY-QQLADVSRLPSH
        SSHSHSN+ERASPVASD  KNENPSENH NQQLALALPHQ++ QQNP+TPPPPAALPQN+PQQQ+YYI S  LP+Q  +IQH Q QY QQL DVSRL   
Subjt:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQY-QQLADVSRLPSH

Query:  MTNPQLSQTPPPQQFNQYQQQWT----QQQQPPQQVQPP-QQQPSMQPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYG
           PQ SQTPPPQQFNQY QQWT    QQQQPPQ VQPP QQQPSMQPQIR  P+SVY SYSMNQPTSMPETL NSMPMQ +FSP+PQPGSSR+DTVPYG
Subjt:  MTNPQLSQTPPPQQFNQYQQQWT----QQQQPPQQVQPP-QQQPSMQPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYG

Query:  YVGSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQ
        Y GSG TV QQPPQVKNAFGP AGEGYLPSGPQ ALS+GG+YMMYDRESGR           PHH PQPQ         QQPHFNQ  YP ANASLQIPQ
Subjt:  YVGSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQ

Query:  HPSGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAIASII
          SGPHV+AR PSH H MR    NQ+HPYGEIVEKLVGMGFRSDH+ASVIHRMEESGQP+DFNAVLDGLSN GGPQR+                      
Subjt:  HPSGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAIASII

Query:  SLSSPKKSQPSFPYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAA-YGQADDS
        S  +      SF +S+R MASGSAGR NS+PK+FDFGSDDILCS+EDYGKQD SNGSH+DPVSVTNSSK        DFHK RMST FPAAA YGQ DDS
Subjt:  SLSSPKKSQPSFPYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAA-YGQADDS

Query:  LSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQ
        ++Q++IS VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRS++ARDHEE DSKLKSLEKH+QEVHRSVQIIRDKQELAETQKDLAKLQVSQ
Subjt:  LSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQ

Query:  KEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRA
        KEPSSSSHSQSNEERASSVASDPKKNEN SEIH QQLALALPHQIVPQQNPI P SA LP N+P QQQSYYIS +QL GQPPHIQHA GQYISPD QHRA
Subjt:  KEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRA

Query:  SQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYG
         QPQDVS  +NPQLSQ+PPQPFNQYQQQWAQ PSQQPQPPQQ SMQPQIRPPP+S Y  PYPPNQPSS+ ETLSS+    MSF SIP PGSSR D  PYG
Subjt:  SQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYG

Query:  YAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHL
        YAAASGGS+PQQPPQVKN YG ATGEGY+PPGQ    AYMMYDRESGRP       PHHP QQPHFNQSGYPPANAPHQ+ PQA + P VS+RNPSHSHL
Subjt:  YAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHL

Query:  IEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        IEKLVGMGFRGDHVASIIQRMED G+PVDFN VLDRLS+   PGPQRAW
Subjt:  IEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

KAG7011974.1 hypothetical protein SDJN02_26882, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0077.64Show/hide
Query:  EKRKKEIHPSLFLLSSSNRTPF----SAFLLRSMASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYG
        +K+KK I+ S+F   S NR+PF    S F LRSMASGS GRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSH DLSVANSSKDFHKSR+STVYPAA YG
Subjt:  EKRKKEIHPSLFLLSSSNRTPF----SAFLLRSMASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYG

Query:  QPEDSIKQDVTSTVENCMKKYSDNILRFLEGISSRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTL
        QPEDS+KQDV STVEN MKKYSDNILRFLEGISSRLSQLEL+ YNLDKSVGEMRSD++RDHEE DLKLKSLEKHLQE                       
Subjt:  QPEDSIKQDVTSTVENCMKKYSDNILRFLEGISSRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTL

Query:  VHRSVQIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYY
        VHRSVQIIRDKQELAETQKDLAKLHLLQKESS S HSHSN+ERASP A D KKNE PS+NH NQQLALALPHQIVPQQ+   PPPPAALP+NVPQQQ YY
Subjt:  VHRSVQIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYY

Query:  IPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHMTN--PQLSQT--PPPQQFNQYQQQWTQQQQPPQQVQPPQQQPSMQPQIRMPPTSVYSSYSMNQPTSM
                    IQH Q+Q+Q           MTN   QLSQT  PPPQQF+QYQQQW   QQPPQQ QPPQQ PSMQPQIR+PPTSVYSSYSMNQPTSM
Subjt:  IPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHMTN--PQLSQT--PPPQQFNQYQQQWTQQQQPPQQVQPPQQQPSMQPQIRMPPTSVYSSYSMNQPTSM

Query:  PETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVKNAF--GPQAGEGYLPSGPQSALSAGGAYMMYDRESGR----PPHHPPQPQQLP
        PET     PMQ+SFSPIPQPGSSR+DTV YGYVGS  T+ QQPPQVKNAF  GPQAGEGYLPSGPQS LS+GGAYM+YDRE+GR    PPHHPPQPQQ P
Subjt:  PETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVKNAF--GPQAGEGYLPSGPQSALSAGGAYMMYDRESGR----PPHHPPQPQQLP

Query:  HHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHPSGPHVIART-PSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDF
        HHPPQPQQ PHHP QPQQPHFNQSGYP AN  +QIPQHPSGPHV+AR  P+  HFMR    NQNHPYGEIV+KLVGMGFRSDH+ SVIHRMEESGQP+DF
Subjt:  HHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHPSGPHVIART-PSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDF

Query:  NAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAIASIISLSSPKKSQPSFPYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPV
        NAVLDGLSNSG                                                   GRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGSH++PV
Subjt:  NAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAIASIISLSSPKKSQPSFPYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPV

Query:  SVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKL
        SV NSSK        DFHKSRMSTVFP AAYGQ DDS++Q++I+ VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRS+LARDHEEA+SKL
Subjt:  SVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKL

Query:  KSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPIT-PPSAVLPQN
        KS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHSQ+NEER   V++DPKKNENPSEIHNQQLALALPHQIVPQQNPIT PPSA LPQN
Subjt:  KSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPIT-PPSAVLPQN

Query:  MPQQQQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPY
        +PQQQQSYYIS+SQLPG QP HIQHAQ QYIS DSQHRASQPQDVSQM+NPQLSQT PQPFNQYQQQWAQPPSQ  QPPQQ SMQPQIRPPP+SVYPSPY
Subjt:  MPQQQQSYYISASQLPG-QPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPY

Query:  -PPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQ----SGGAYMMYDRESGRPPHHSPQQ
         PPNQP+SM ETLSSSMPMQMSF SIPQPGSSR DA PYGYAAASGGSAPQQPPQVKNAYG ATGEGYMPPGQQ    SGGAYMMYDRESGRPPHH PQQ
Subjt:  -PPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQ----SGGAYMMYDRESGRPPHHSPQQ

Query:  P-HHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        P HHPSQQ HF+QSGYPPANAPHQVPPQAP+GPHVSARNPSHSHLIEKLVGMGFRGDHV +IIQRMEDSGQ VDFNAVLDRLSTP GPGPQRAW
Subjt:  P-HHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

KAG7022919.1 hypothetical protein SDJN02_16655, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0076.11Show/hide
Query:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS
        MASGS GRPNSGSK FDFG++D+LCSYEDYGNQESSNG+H DLSVANSSKDFHKSRMSTVYPAA Y QPEDSIKQDV STVEN MKKYSDNILRFLEGIS
Subjt:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS

Query:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS
        SRLSQLEL+CYNLDKSVGEMRSDV+RDHEEEDLKLKSLEKHLQE                       VHRSVQIIRDKQELAETQKDLAKLHL+QKES S
Subjt:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS

Query:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQY-QQLADVSRLPSH
        SSHSHSN+ERASPVASD  KNENPSENH NQQLALALPHQ++ QQNP+TPPPPAALPQNVPQQQ+YYI S  LP+Q  HIQH Q QY QQL DVSRL   
Subjt:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQY-QQLADVSRLPSH

Query:  MTNPQLSQTPPPQQFNQYQQQWT---QQQQPPQQVQPP-QQQPSMQPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGY
           PQ SQTPPPQQFNQY QQWT   QQQQPPQ VQPP QQQPSMQPQIR  P+SVY SYSMNQPTSMPETL NSMPMQ +FSP+PQPGSSR+DTVPYGY
Subjt:  MTNPQLSQTPPPQQFNQYQQQWT---QQQQPPQQVQPP-QQQPSMQPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGY

Query:  VGSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQH
         GSG TV QQPPQVKNAFGP AGEGYLPSGPQ ALS+GG+YMMYDRESGR           PHH PQPQ         QQPHFNQ  YP ANASLQIPQ 
Subjt:  VGSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQH

Query:  PSGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAIASIIS
         SGPHV+AR PSH H MR    NQ+HPYGEIVEKLVGMGFRSDH+ASVIHRMEESGQP+DFNAVLDGLSN GGPQR                        
Subjt:  PSGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAIASIIS

Query:  LSSPKKSQPSFPYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAA-YGQADDSL
                        S  S   GR NS+PK+FDFGSDDILCS+EDYGKQD SNGSH+DPVSVTNSSK        DFHK RMST FPAAA YGQ DDS+
Subjt:  LSSPKKSQPSFPYSLRSMASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAA-YGQADDSL

Query:  SQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQK
        +Q++IS VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRS++ARDHE              EVHRSVQIIRDKQELAETQKDLAKLQVSQK
Subjt:  SQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQK

Query:  EPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRAS
        EPSSSSHSQSNEERASSVASDPKKNEN SEIH QQLALALPHQIVPQQNPI P SA LP N+P QQQSYYIS +QL GQPPHIQHA GQYISPD QHRA 
Subjt:  EPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRAS

Query:  QPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGY
        QPQDVS  +NPQLSQ+PPQPFNQYQQQWAQ PSQQPQPPQQ SMQPQIRPPP+S Y  PYPPNQPSS+ ETLSS+    MSF SIP PGSSR D  PYGY
Subjt:  QPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGY

Query:  AAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLI
        AAASGGS+PQQPPQVKN YG ATGEGY+PPGQ    AYMMYDRESGRP       PHHP QQPHFNQSGYPPANAPHQ+ PQA + P VS+RNPSHSHLI
Subjt:  AAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLI

Query:  EKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRA
        EKLVGMGFRGDHVASIIQRMED G+PVDFN VLDRLS+   PGPQRA
Subjt:  EKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRA

XP_038888030.1 ataxin-2 homolog [Benincasa hispida]4.6e-27289.16Show/hide
Query:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS
        MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSH DLS+ANSSKDFHKSRMSTVYPAA YGQP+DSIKQDV STVEN MKKYSDNILRFLEGIS
Subjt:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS

Query:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS
        SRLSQLEL+CYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQE                       VHRSVQIIRDKQELAETQKDLAKLHLLQKESSS
Subjt:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS

Query:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHM
        SSHSHSNDERASPVASD KKNENPSENHNNQQLALALPHQIVP QNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQH Q QYQQL DVSRLPS M
Subjt:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHM

Query:  TNPQLSQTPPPQQFNQYQQQWTQQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSG
        TN QLSQT PPQQFNQYQQQWTQQQQPPQQVQPPQQQPS+ QPQIR PPTSVYSSYSMNQPTSMPET+SNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSG
Subjt:  TNPQLSQTPPPQQFNQYQQQWTQQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSG

Query:  VTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHPSGP
        VTVSQQPPQVKNAFGPQAGEGYLPSGPQSALS+GG+YMMYDRESGRPPHHPPQPQQLPHHPP          QPQQPHFNQSGYPSANA LQIPQH S P
Subjt:  VTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHPSGP

Query:  HVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQR
        HVIAR P+HPHFMR  NQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSG PQR
Subjt:  HVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQR

XP_038888365.1 ataxin-2 homolog [Benincasa hispida]5.3e-26893.8Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYG--QADDSLSQNLISTVENSMKKHS
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVS+TNS+K        DFHKSRMSTVFPAAAYG  QADDS+SQN+ISTVENSMKKHS
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYG--QADDSLSQNLISTVENSMKKHS

Query:  DNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERAS
        DNLLRFLEGISSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERAS
Subjt:  DNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERAS

Query:  SVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQT
        SVASDPKKNENPSEIHNQQLALALPHQIVPQQN IT PSA LPQNMPQQQQSYYIS+SQLPGQPPH+QHAQGQYISPDS +RASQPQDVSQMSNPQLSQT
Subjt:  SVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQT

Query:  PPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVK
        PPQPFNQY QQWAQPPSQQPQPPQQPSMQPQIRPPP SVYPS YPPNQP+SM ETLSSSMPM MSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVK
Subjt:  PPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVK

Query:  NAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASI
        NAYG ATGEGYMPPGQQSGGAYMMYDRESGRPPHH PQQPHHPSQQPHFNQSGYPPAN  HQVPPQAP+GPHVSARNPSHSHLIEKLVGMGFRGDHVASI
Subjt:  NAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASI

Query:  IQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        IQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
Subjt:  IQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

TrEMBL top hitse value%identityAlignment
A0A0A0K1T3 Structural constituent of cell wall6.1e-26286.61Show/hide
Query:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS
        MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSH DLSVANS+KDFHKSRMSTVYPAA YGQ EDSIKQDV STVEN MKKYSDNILRFLEGIS
Subjt:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS

Query:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS
        SRLSQLEL+CYNLDKSVGEMRSDVLRD EEEDLKLKSLEKHLQE                       VHRSVQIIRDKQELAETQKDLAKLHLLQKESSS
Subjt:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS

Query:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQ-PTHIQHTQTQYQQLADVSRLPSH
        S+HSHSNDERASPVASD KKNEN SEN NNQQLALALPHQIVP QNPITPPPPAALPQNVPQQQSYY+ SNQLPSQ PTHIQH QTQYQQL DVSRLPS 
Subjt:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQ-PTHIQHTQTQYQQLADVSRLPSH

Query:  MTNPQLSQTPPPQQFNQYQQQWT--QQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYV
        MTN QLSQTPPPQQFNQYQQQWT  QQQQPPQQVQPPQQQPSM QPQIR PPTSVY SYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYV
Subjt:  MTNPQLSQTPPPQQFNQYQQQWT--QQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYV

Query:  GSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHP
        GSGVTVSQQPPQVKNA+  QAGEGYLPSGPQSALS G +YMMYDRESGRP HHPPQPQQ P         PHHP QPQQ HFNQSGY  AN SLQI QHP
Subjt:  GSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHP

Query:  SGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQR
        SGPHV+AR P+HPH+MRNQN NQNHPYGEIVEKLVGMGFRSDHVAS+IHRMEESGQPVDFNAVLDGLSNSGGPQR
Subjt:  SGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQR

A0A1S3C082 transcription factor SPT20 homolog8.2e-26787.83Show/hide
Query:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS
        MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSH DLSVANSSKDFHKSRMSTVYPAA YGQ EDSIKQDV STVEN MKKYSDNILRFLEGIS
Subjt:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGIS

Query:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS
        SRLSQLEL+CYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQE                       VHRSVQIIRDKQELAETQKDLAKLHLLQKESSS
Subjt:  SRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSS

Query:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHM
        SSHSHSNDERASPVASD KKNENPSEN NNQQLALALPHQIVP QNPI PPPPAALPQNVPQQQSYY+ SNQLPSQPTHIQH QTQYQQL DVSRLPSHM
Subjt:  SSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHM

Query:  TNPQLSQTPPPQQFNQYQQQWT---QQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYV
        TN QLSQTPPPQQFNQYQQQWT   QQQQPPQQVQPPQQQPSM QPQIR PPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYV
Subjt:  TNPQLSQTPPPQQFNQYQQQWT---QQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYV

Query:  GSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHP
        GSGVTVSQQPPQVKNAF  QAGEGYLPSGPQSALS+G +YMMYDRESGRPPHHPPQPQQ PHHPP          QPQQ HFNQSGYP AN S+QI QHP
Subjt:  GSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHP

Query:  SGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQR
        SGPHV+AR P+HPH+MR  NQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQR
Subjt:  SGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQR

A0A1S3C1W2 arginine-glutamic acid dipeptide repeats protein-like7.0e-25891.13Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDN
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGS +DPVSVTN  K        DFHKSRMSTVFPAA YGQADD++SQN+ISTVENSMKKHSDN
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDN

Query:  LLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSV
        LLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSS+HSQSNEERASSV
Subjt:  LLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSV

Query:  ASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPP
        ASD KK ENPSEIHNQQLALALPHQIVPQQNPITPPSA LPQNMPQQQQSYYIS SQLPGQPPHIQHAQ QYIS DSQHRASQPQDVSQMSNPQLSQTPP
Subjt:  ASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPP

Query:  QPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNA
        QPFNQYQQQWAQPPSQQPQPPQQPSMQ QIRPPP SVYPS YPPNQP+SM ETL SSMPMQMSFPSIPQPGSSR+DAGPYGYA  SGGSAPQQPPQVKNA
Subjt:  QPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNA

Query:  YGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQ
        YG  TGEGYMPPGQQSGGAYMMYDRESGRP       PHHP QQ HFNQSGYP ANAPHQVPPQAP+GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQ
Subjt:  YGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQ

Query:  RMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        RMEDSGQPVDFNAVLDRLS+P+GPGPQRAW
Subjt:  RMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

A0A5A7SKA3 Transcription factor SPT20-like protein2.6e-26583.5Show/hide
Query:  PFSAFLLRSMASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSK----------------------------DFHKSRMSTVYPA
        P   FLLRSMASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSH DLSVANSSK                            DFHKSRMSTVYPA
Subjt:  PFSAFLLRSMASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSK----------------------------DFHKSRMSTVYPA

Query:  ATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGISSRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNS
        A YGQ EDSIKQDV STVEN MKKYSDNILRFLEGISSRLSQLEL+CYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQE                   
Subjt:  ATYGQPEDSIKQDVTSTVENCMKKYSDNILRFLEGISSRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNS

Query:  NSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQ
            VHRSVQIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERASPVASD KKNENPSEN NNQQLALALPHQIVP QNPI PPPPAALPQNVPQQ
Subjt:  NSTLVHRSVQIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQ

Query:  QSYYIPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHMTNPQLSQTPPPQQFNQYQQQWT---QQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQ
        QSYY+ SNQLPSQPTHIQH QTQYQQL DVSRLPSHMTN QLSQTPPPQQFNQYQQQWT   QQQQPPQQVQPPQQQPSM QPQIR PPTSVYSSYSMNQ
Subjt:  QSYYIPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHMTNPQLSQTPPPQQFNQYQQQWT---QQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQ

Query:  PTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHH
        PTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVKNAF  QAGEGYLPSGPQSALS+G +YMMYDRESGRPPHHPPQPQQ PHH
Subjt:  PTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHH

Query:  PPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHPSGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAV
        PP          QPQQ HFNQSGYP AN S+QI QHPSGPHV+AR P+HPH+MR  NQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAV
Subjt:  PPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHPSGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAV

Query:  LDGLSNSGGPQR
        LDGLSNSGGPQR
Subjt:  LDGLSNSGGPQR

A0A5D3C9N4 Transcription factor SPT20-like protein2.7e-26283.75Show/hide
Query:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSK----------------------------DFHKSRMSTVYPAATYGQPEDS
        MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSH DLSVANSSK                            DFHKSRMSTVYPAA YGQ EDS
Subjt:  MASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSK----------------------------DFHKSRMSTVYPAATYGQPEDS

Query:  IKQDVTSTVENCMKKYSDNILRFLEGISSRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSV
        IKQDV STVEN MKKYSDNILRFLEGISSRLSQLEL+CYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQE                       VHRSV
Subjt:  IKQDVTSTVENCMKKYSDNILRFLEGISSRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSV

Query:  QIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQ
        QIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERASPVASD KKNENPSEN NNQQLALALPHQIVP QNPI PPPPAALPQNVPQQQSYY+ SNQ
Subjt:  QIIRDKQELAETQKDLAKLHLLQKESSSSSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQ

Query:  LPSQPTHIQHTQTQYQQLADVSRLPSHMTNPQLSQTPPPQQFNQYQQQWT---QQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQPTSMPETLS
        LPSQPTHIQH QTQYQQL DVSRLPSHMTN QLSQTPPPQQFNQYQQQWT   QQQQPPQQVQPPQQQPSM QPQIR PPTSVYSSYSMNQPTSMPETLS
Subjt:  LPSQPTHIQHTQTQYQQLADVSRLPSHMTNPQLSQTPPPQQFNQYQQQWT---QQQQPPQQVQPPQQQPSM-QPQIRMPPTSVYSSYSMNQPTSMPETLS

Query:  NSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPH
        NSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVKNAF  QAGEGYLPSGPQSALS+G +YMMYDRESGRPPHHPPQPQQ PHHPP       
Subjt:  NSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVKNAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPH

Query:  HPSQPQQPHFNQSGYPSANASLQIPQHPSGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGG
           QPQQ HFNQSGYP AN S+QI QHPSGPHV+AR P+HPH+MR  NQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGG
Subjt:  HPSQPQQPHFNQSGYPSANASLQIPQHPSGPHVIARTPSHPHFMRNQNQNQNHPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGG

Query:  PQR
        PQR
Subjt:  PQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01560.1 Protein of unknown function (DUF1421)6.7e-1929.3Show/hide
Query:  MSTVFPAAAYGQADD--------SLSQNLIST------VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHV
        +ST  P   +G  D            QN+ +T      ++ +MKKH+D LL  +EG+S+RLSQLE   +NL+  V +++  +   H   D K++ L+  +
Subjt:  MSTVFPAAAYGQADD--------SLSQNLIST------VENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHV

Query:  QEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLAL-ALPHQIVPQQNPITPPSAVLPQNMPQQQQS
         EV   VQ+++DKQE+ E Q  L+K QV      S+ H++++     S+  DP   ++P+ +  QQ  L + P        P  PPS+ LP  +P Q   
Subjt:  QEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLAL-ALPHQIVPQQNPITPPSAVLPQNMPQQQQS

Query:  YYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYP-SPYPPNQPSS
           S+ Q P  PP               H    P +      PQ +QTP QP   YQ      P QQPQ PQQP       P     Y    YPPN P  
Subjt:  YYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYP-SPYPPNQPSS

Query:  MTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ
          +  + S P Q  F + PQP  S  D         +GG +            S    GY+       G+ M     S +PPH S             N 
Subjt:  MTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ

Query:  SGYPPANAPHQVPPQAP-----------SGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGP
        +GYP  +    +P   P           S P   +R P    +I+++  MGF  D V + ++++ ++GQ VD N VLD+L    G  P
Subjt:  SGYPPANAPHQVPPQAP-----------SGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGP

AT4G28300.1 Protein of unknown function (DUF1421)7.7e-10850.46Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDP-VSVTNSSKYCAYLYEQDFHKSRM--STVFPAAAYGQADDSLSQNLISTVENSMKKH
        MASGS+GR NS  K FDFGSDDILCS++DY  QD SNG H+DP ++ +NS+K        +FHK+RM  S+VFP ++Y   +DSLSQ++  TVE +MK +
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSHTDP-VSVTNSSKYCAYLYEQDFHKSRM--STVFPAAAYGQADDSLSQNLISTVENSMKKH

Query:  SDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERA
        +DN++RFLEG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQELA+TQK+LAKLQ+ QKE SSSSHSQ  E+R 
Subjt:  SDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERA

Query:  SSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQH--AQGQYISPDSQHRASQPQD-----VSQM
        ++   +PKK+EN S+ HNQQLALALPHQI PQ         V PQ  PQQ Q Y      +P  P  +Q+  A     +P SQ +A   Q          
Subjt:  SSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQH--AQGQYISPDSQHRASQPQD-----VSQM

Query:  SNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASG
        S+P  +QT  Q F QYQQ W         PP     QPQ RP  S  YP  SP PP NQP    E+L SSM MQ  +   PQ          YGY AA  
Subjt:  SNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASG

Query:  GSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIE
          AP  P Q K +Y   TG+GY+P G      Y     E GR   + P QP    QQ H+ Q     GY P   PHQ        P V      +  LIE
Subjt:  GSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIE

Query:  KLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
        KLV MGFRGDHV ++IQRME+SGQP+DFN +LDRLS  +  GP R W
Subjt:  KLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

AT4G28300.2 Protein of unknown function (DUF1421)5.2e-8849.58Show/hide
Query:  STVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE
        S+VFP ++Y   +DSLSQ++  TVE +MK ++DN++RFLEG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQE
Subjt:  STVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE

Query:  LAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQ
        LA+TQK+LAKLQ+ QKE SSSSHSQ  E+R ++   +PKK+EN S+ HNQQLALALPHQI PQ         V PQ  PQQ Q Y      +P  P  +Q
Subjt:  LAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSAVLPQNMPQQQQSYYISASQLPGQPPHIQ

Query:  H--AQGQYISPDSQHRASQPQD-----VSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSS
        +  A     +P SQ +A   Q          S+P  +QT  Q F QYQQ W         PP     QPQ RP  S  YP  SP PP NQP    E+L S
Subjt:  H--AQGQYISPDSQHRASQPQD-----VSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYP--SPYPP-NQPSSMTETLSS

Query:  SMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGY
        SM MQ  +   PQ          YGY AA    AP  P Q K +Y   TG+GY+P G      Y     E GR   + P QP    QQ H+ Q     GY
Subjt:  SMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQ----SGY

Query:  PPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW
         P   PHQ        P V      +  LIEKLV MGFRGDHV ++IQRME+SGQP+DFN +LDRLS  +  GP R W
Subjt:  PPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW

AT5G14540.1 Protein of unknown function (DUF1421)2.6e-2329.47Show/hide
Query:  NGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYGQAD-DSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELAR
        + S   PVS +++  Y             M ++ P+  + + D +S    +IS ++ +MK H+D LL  +EG+S+RL+QLE    +L+  V +++  +  
Subjt:  NGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYGQAD-DSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELAR

Query:  DHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITP
         H + D KL+ LE  + EV   VQ+++DKQE+ E Q  L+KLQ+S+      +HS   E  A   AS P+   + +         +L  Q +P Q  I P
Subjt:  DHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITP

Query:  PSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPS
        P++         Q      + QLP  P      Q  Y  P  Q   SQP    Q         PP P     Q   QPP QQPQ PQQ        PPP 
Subjt:  PSAVLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPS

Query:  SVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYG----SATGEGYMPPGQQSGGAYMMYDRESGRPP
          +PS Y P +P    ++   + P Q   PS P PGS+          +    +AP  PP + +  G    S    GY P      G    Y       P
Subjt:  SVYPSPYPPNQPSSMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYG----SATGEGYMPPGQQSGGAYMMYDRESGRPP

Query:  HH-----SPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRL
         H     S   P  P  +P     G P A+A         S    S        +I+K+V MGF  D V   ++ + ++GQ VD N VLD+L
Subjt:  HH-----SPQQPHHPSQQPHFNQSGYPPANAPHQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAAAGGAAAAAAGAAATTCATCCCTCTCTTTTTCTTTTATCTTCCTCGAATCGGACCCCATTTTCGGCTTTCCTACTGCGATCGATGGCGTCTGGTTCAACGGG
TCGACCTAATTCGGGCTCTAAAGGGTTTGATTTTGGTACCGATGATGTTCTTTGTTCCTATGAGGACTATGGTAACCAGGAATCTTCTAACGGAAGCCATGGCGATCTCT
CTGTTGCGAATTCTAGCAAGGATTTTCATAAAAGTAGAATGTCTACTGTATATCCTGCTGCTACCTATGGTCAACCAGAAGATTCCATCAAACAAGATGTCACTTCTACT
GTTGAGAACTGCATGAAAAAGTATTCTGATAATATTTTGCGTTTTCTAGAGGGAATAAGTTCACGCCTATCACAACTTGAACTGAGCTGCTACAACCTTGATAAATCTGT
TGGAGAAATGCGATCTGACGTACTTCGTGACCATGAAGAGGAAGATTTGAAGCTTAAATCTCTGGAGAAGCATCTACAAGAGACACAAGGCATGCACTACTCACAATCAT
GTAGAAATGTTAAATTAGTTTGCAATTCAAACTCAACACTTGTCCATAGGTCTGTGCAGATTATAAGAGACAAGCAAGAGCTTGCTGAGACTCAGAAAGACCTAGCCAAA
CTTCATCTTTTGCAGAAAGAATCATCTTCATCCAGCCATTCACATTCAAATGATGAGAGAGCTTCACCAGTTGCCTCTGATCTTAAGAAGAACGAAAATCCGTCTGAGAA
TCACAATAATCAGCAATTAGCTCTCGCCCTGCCACACCAGATTGTCCCACAGCAAAATCCTATTACACCACCCCCTCCAGCAGCTTTACCACAGAATGTGCCTCAACAGC
AATCTTATTACATCCCTTCAAACCAATTGCCAAGTCAACCAACCCATATCCAGCATACCCAGACCCAATATCAACAACTTGCAGATGTTTCTCGGTTGCCATCACATATG
ACTAATCCCCAGCTAAGTCAAACTCCACCACCACAACAATTCAATCAGTATCAACAACAATGGACGCAGCAGCAGCAGCCACCTCAGCAGGTACAACCACCACAACAGCA
GCCTTCTATGCAACCTCAGATCAGGATGCCGCCTACTTCAGTCTACTCATCCTATTCAATGAATCAACCGACTTCTATGCCAGAGACTCTGTCAAACAGCATGCCTATGC
AATTGTCATTTTCACCTATTCCTCAGCCGGGTTCAAGCCGCATTGACACTGTGCCATATGGATATGTTGGAAGTGGTGTTACTGTGTCCCAGCAACCTCCTCAAGTTAAA
AATGCTTTCGGACCACAAGCTGGAGAAGGTTACTTACCTTCTGGACCGCAGTCTGCACTTTCCGCCGGAGGTGCATATATGATGTATGACAGGGAAAGTGGAAGACCGCC
ACACCATCCTCCTCAACCTCAACAACTACCACACCATCCTCCTCAACCTCAACAACTACCACACCATCCTTCTCAACCTCAACAACCACACTTCAACCAAAGTGGATACC
CTTCAGCCAATGCATCTCTTCAGATTCCTCAGCATCCATCAGGCCCCCACGTTATCGCCAGGACTCCGAGCCATCCGCATTTTATGCGCAACCAAAACCAAAACCAAAAC
CACCCTTACGGCGAAATAGTTGAGAAACTGGTTGGCATGGGTTTCAGGAGTGACCATGTCGCTAGTGTAATTCATAGGATGGAGGAGAGCGGCCAACCTGTCGACTTCAA
TGCCGTTTTAGACGGGTTAAGTAATTCCGGAGGTCCTCAGCGGAGTCGCCTGTTTGTTTGTATGTTGAAGCTCATATCGGCCATGACCAGCTTAGCCATTGCCTCAATCA
TCTCTCTCTCTTCCCCCAAAAAATCTCAGCCTTCATTCCCATACTCACTGCGATCTATGGCGTCTGGTTCAGCAGGTCGCCCTAACTCCTCCCCCAAATCGTTTGATTTT
GGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACCCTTCAAACGGTAGCCATACTGATCCCGTTTCCGTTACCAATTCTAGCAAGTATTGTGCATA
TTTGTACGAGCAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGCAGCCTATGGTCAAGCAGATGATTCCCTTAGTCAAAATTTGATTTCCACTGTTGAGA
ACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTTCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAA
ATGCGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTACAAGAGGTCCACAGGTCTGTACAGATTATAAGAGACAAGCA
AGAACTTGCTGAGACTCAAAAGGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCGAGCCATTCGCAGTCAAATGAGGAGAGGGCTTCATCAGTTGCCT
CTGATCCTAAAAAGAATGAAAATCCATCTGAGATTCACAACCAGCAGTTAGCTTTGGCCTTGCCACATCAGATCGTCCCACAGCAAAATCCTATAACTCCCCCTTCAGCA
GTTTTGCCTCAGAATATGCCTCAACAACAGCAATCTTACTACATCTCTGCATCTCAATTACCTGGTCAACCACCCCATATCCAGCATGCTCAGGGCCAATATATCTCACC
TGATTCCCAGCACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAACTAAGTCAAACTCCACCACAACCATTCAATCAGTATCAACAACAATGGGCGC
AGCCACCATCTCAGCAGCCACAACCTCCTCAACAGCCTTCTATGCAACCTCAGATCAGACCACCCCCCAGTTCAGTCTACCCTTCTCCTTATCCACCAAATCAACCGTCT
TCTATGACCGAGACACTGTCAAGCAGCATGCCCATGCAAATGTCCTTTCCATCTATTCCTCAACCCGGCTCAAGCCGCATGGATGCAGGGCCTTATGGGTATGCTGCTGC
AAGTGGTGGTTCTGCTCCACAGCAGCCTCCTCAAGTGAAAAATGCTTATGGTTCAGCAACAGGTGAGGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGA
TGTATGATAGGGAAAGCGGAAGACCGCCACACCATTCGCCTCAACAACCACACCATCCGTCTCAACAACCGCACTTCAATCAAAGTGGATATCCTCCGGCCAATGCACCT
CATCAGGTTCCTCCTCAGGCTCCATCAGGCCCCCATGTTTCAGCCAGGAATCCAAGCCATTCACATCTAATCGAAAAACTGGTTGGCATGGGCTTCAGGGGTGACCATGT
TGCAAGTATAATCCAGAGAATGGAGGATAGTGGCCAACCTGTTGACTTCAACGCAGTTCTAGACAGGTTGAGTACTCCTACAGGTCCAGGTCCACAAAGAGCGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAAAGGAAAAAAGAAATTCATCCCTCTCTTTTTCTTTTATCTTCCTCGAATCGGACCCCATTTTCGGCTTTCCTACTGCGATCGATGGCGTCTGGTTCAACGGG
TCGACCTAATTCGGGCTCTAAAGGGTTTGATTTTGGTACCGATGATGTTCTTTGTTCCTATGAGGACTATGGTAACCAGGAATCTTCTAACGGAAGCCATGGCGATCTCT
CTGTTGCGAATTCTAGCAAGGATTTTCATAAAAGTAGAATGTCTACTGTATATCCTGCTGCTACCTATGGTCAACCAGAAGATTCCATCAAACAAGATGTCACTTCTACT
GTTGAGAACTGCATGAAAAAGTATTCTGATAATATTTTGCGTTTTCTAGAGGGAATAAGTTCACGCCTATCACAACTTGAACTGAGCTGCTACAACCTTGATAAATCTGT
TGGAGAAATGCGATCTGACGTACTTCGTGACCATGAAGAGGAAGATTTGAAGCTTAAATCTCTGGAGAAGCATCTACAAGAGACACAAGGCATGCACTACTCACAATCAT
GTAGAAATGTTAAATTAGTTTGCAATTCAAACTCAACACTTGTCCATAGGTCTGTGCAGATTATAAGAGACAAGCAAGAGCTTGCTGAGACTCAGAAAGACCTAGCCAAA
CTTCATCTTTTGCAGAAAGAATCATCTTCATCCAGCCATTCACATTCAAATGATGAGAGAGCTTCACCAGTTGCCTCTGATCTTAAGAAGAACGAAAATCCGTCTGAGAA
TCACAATAATCAGCAATTAGCTCTCGCCCTGCCACACCAGATTGTCCCACAGCAAAATCCTATTACACCACCCCCTCCAGCAGCTTTACCACAGAATGTGCCTCAACAGC
AATCTTATTACATCCCTTCAAACCAATTGCCAAGTCAACCAACCCATATCCAGCATACCCAGACCCAATATCAACAACTTGCAGATGTTTCTCGGTTGCCATCACATATG
ACTAATCCCCAGCTAAGTCAAACTCCACCACCACAACAATTCAATCAGTATCAACAACAATGGACGCAGCAGCAGCAGCCACCTCAGCAGGTACAACCACCACAACAGCA
GCCTTCTATGCAACCTCAGATCAGGATGCCGCCTACTTCAGTCTACTCATCCTATTCAATGAATCAACCGACTTCTATGCCAGAGACTCTGTCAAACAGCATGCCTATGC
AATTGTCATTTTCACCTATTCCTCAGCCGGGTTCAAGCCGCATTGACACTGTGCCATATGGATATGTTGGAAGTGGTGTTACTGTGTCCCAGCAACCTCCTCAAGTTAAA
AATGCTTTCGGACCACAAGCTGGAGAAGGTTACTTACCTTCTGGACCGCAGTCTGCACTTTCCGCCGGAGGTGCATATATGATGTATGACAGGGAAAGTGGAAGACCGCC
ACACCATCCTCCTCAACCTCAACAACTACCACACCATCCTCCTCAACCTCAACAACTACCACACCATCCTTCTCAACCTCAACAACCACACTTCAACCAAAGTGGATACC
CTTCAGCCAATGCATCTCTTCAGATTCCTCAGCATCCATCAGGCCCCCACGTTATCGCCAGGACTCCGAGCCATCCGCATTTTATGCGCAACCAAAACCAAAACCAAAAC
CACCCTTACGGCGAAATAGTTGAGAAACTGGTTGGCATGGGTTTCAGGAGTGACCATGTCGCTAGTGTAATTCATAGGATGGAGGAGAGCGGCCAACCTGTCGACTTCAA
TGCCGTTTTAGACGGGTTAAGTAATTCCGGAGGTCCTCAGCGGAGTCGCCTGTTTGTTTGTATGTTGAAGCTCATATCGGCCATGACCAGCTTAGCCATTGCCTCAATCA
TCTCTCTCTCTTCCCCCAAAAAATCTCAGCCTTCATTCCCATACTCACTGCGATCTATGGCGTCTGGTTCAGCAGGTCGCCCTAACTCCTCCCCCAAATCGTTTGATTTT
GGTTCTGATGATATCCTTTGCTCATTTGAAGACTACGGTAAACAGGACCCTTCAAACGGTAGCCATACTGATCCCGTTTCCGTTACCAATTCTAGCAAGTATTGTGCATA
TTTGTACGAGCAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGCTGCAGCCTATGGTCAAGCAGATGATTCCCTTAGTCAAAATTTGATTTCCACTGTTGAGA
ACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTTCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAA
ATGCGGTCTGAATTAGCCCGTGACCATGAAGAGGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTACAAGAGGTCCACAGGTCTGTACAGATTATAAGAGACAAGCA
AGAACTTGCTGAGACTCAAAAGGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCGTCGAGCCATTCGCAGTCAAATGAGGAGAGGGCTTCATCAGTTGCCT
CTGATCCTAAAAAGAATGAAAATCCATCTGAGATTCACAACCAGCAGTTAGCTTTGGCCTTGCCACATCAGATCGTCCCACAGCAAAATCCTATAACTCCCCCTTCAGCA
GTTTTGCCTCAGAATATGCCTCAACAACAGCAATCTTACTACATCTCTGCATCTCAATTACCTGGTCAACCACCCCATATCCAGCATGCTCAGGGCCAATATATCTCACC
TGATTCCCAGCACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCCCAACTAAGTCAAACTCCACCACAACCATTCAATCAGTATCAACAACAATGGGCGC
AGCCACCATCTCAGCAGCCACAACCTCCTCAACAGCCTTCTATGCAACCTCAGATCAGACCACCCCCCAGTTCAGTCTACCCTTCTCCTTATCCACCAAATCAACCGTCT
TCTATGACCGAGACACTGTCAAGCAGCATGCCCATGCAAATGTCCTTTCCATCTATTCCTCAACCCGGCTCAAGCCGCATGGATGCAGGGCCTTATGGGTATGCTGCTGC
AAGTGGTGGTTCTGCTCCACAGCAGCCTCCTCAAGTGAAAAATGCTTATGGTTCAGCAACAGGTGAGGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGA
TGTATGATAGGGAAAGCGGAAGACCGCCACACCATTCGCCTCAACAACCACACCATCCGTCTCAACAACCGCACTTCAATCAAAGTGGATATCCTCCGGCCAATGCACCT
CATCAGGTTCCTCCTCAGGCTCCATCAGGCCCCCATGTTTCAGCCAGGAATCCAAGCCATTCACATCTAATCGAAAAACTGGTTGGCATGGGCTTCAGGGGTGACCATGT
TGCAAGTATAATCCAGAGAATGGAGGATAGTGGCCAACCTGTTGACTTCAACGCAGTTCTAGACAGGTTGAGTACTCCTACAGGTCCAGGTCCACAAAGAGCGTGGTGAG
AGTAATCAAATCATCCCCTGTTTGCGGCCGATTCTGGCCATGACCAGCCTCATACATTGCGTCTTTTTAATGCATTGAATAAAACACTGGTTTATGATTTTATTGTCCTC
GTATATATATTGTCATGGTTTGTGAGATCTAAAACGTCGGCTGTATGATTTAAACCTTGTGTGAATATCTTCTTCCAAATGCCAATCCTTCCATATATTTCCGTTTTGAT
GTTTATTTGCACCTTTCAATGATTAATTAGGTTTGTAATTTGTTCTCTGAATCTTTATTTTTTTTGGTGGAGATTGAGGCCTTGGCTCCCAAAGTGAGGGTGAGGTCTTT
CTTTTAGTGTCATAGGGTGTGTAATGGTGCTGTCCATTGTCTCCTGACCAGGGATTTTGTGTATGTTTTTGTCGGTCTGTTTCTACCGCCG
Protein sequenceShow/hide protein sequence
MEKRKKEIHPSLFLLSSSNRTPFSAFLLRSMASGSTGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHGDLSVANSSKDFHKSRMSTVYPAATYGQPEDSIKQDVTST
VENCMKKYSDNILRFLEGISSRLSQLELSCYNLDKSVGEMRSDVLRDHEEEDLKLKSLEKHLQETQGMHYSQSCRNVKLVCNSNSTLVHRSVQIIRDKQELAETQKDLAK
LHLLQKESSSSSHSHSNDERASPVASDLKKNENPSENHNNQQLALALPHQIVPQQNPITPPPPAALPQNVPQQQSYYIPSNQLPSQPTHIQHTQTQYQQLADVSRLPSHM
TNPQLSQTPPPQQFNQYQQQWTQQQQPPQQVQPPQQQPSMQPQIRMPPTSVYSSYSMNQPTSMPETLSNSMPMQLSFSPIPQPGSSRIDTVPYGYVGSGVTVSQQPPQVK
NAFGPQAGEGYLPSGPQSALSAGGAYMMYDRESGRPPHHPPQPQQLPHHPPQPQQLPHHPSQPQQPHFNQSGYPSANASLQIPQHPSGPHVIARTPSHPHFMRNQNQNQN
HPYGEIVEKLVGMGFRSDHVASVIHRMEESGQPVDFNAVLDGLSNSGGPQRSRLFVCMLKLISAMTSLAIASIISLSSPKKSQPSFPYSLRSMASGSAGRPNSSPKSFDF
GSDDILCSFEDYGKQDPSNGSHTDPVSVTNSSKYCAYLYEQDFHKSRMSTVFPAAAYGQADDSLSQNLISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGE
MRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSSSHSQSNEERASSVASDPKKNENPSEIHNQQLALALPHQIVPQQNPITPPSA
VLPQNMPQQQQSYYISASQLPGQPPHIQHAQGQYISPDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQPQIRPPPSSVYPSPYPPNQPS
SMTETLSSSMPMQMSFPSIPQPGSSRMDAGPYGYAAASGGSAPQQPPQVKNAYGSATGEGYMPPGQQSGGAYMMYDRESGRPPHHSPQQPHHPSQQPHFNQSGYPPANAP
HQVPPQAPSGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSTPTGPGPQRAW