; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G04640 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G04640
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptiontranscription factor SPT20 homolog isoform X1
Genome locationChr7:3443567..3446918
RNA-Seq ExpressionCSPI07G04640
SyntenyCSPI07G04640
Gene Ontology termsNA
InterPro domainsIPR010820 - UBA-like domain DUF1421


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031573.1 arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo var. makuwa]5.7e-27597.28Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVS+ NPGKDFHK RMSTVFP +GY QADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTY PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSSSGPGPQRAW
        DRLSS SGPGPQRAW
Subjt:  DRLSSSSGPGPQRAW

XP_004136824.1 trithorax group protein osa [Cucumis sativus]2.0e-28399.61Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFP SGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSSSGPGPQRAW
        DRLSS SGPGPQRAW
Subjt:  DRLSSSSGPGPQRAW

XP_008455322.1 PREDICTED: arginine-glutamic acid dipeptide repeats protein-like [Cucumis melo]8.8e-27697.67Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSV NPGKDFHK RMSTVFP +GYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTY PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSSSGPGPQRAW
        DRLSS SGPGPQRAW
Subjt:  DRLSSSSGPGPQRAW

XP_022952329.1 class E vacuolar protein-sorting machinery protein hse1-like [Cucurbita moschata]2.5e-23886.39Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVSV N  KDFHK RMSTVFPG+ YGQ DD+I+Q+VI+ VENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS ++HSQ+NEER   V++DPKK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY
        N SEIHNQQLALALPHQIVPQQNPIT PPSAALPQN+PQQQQSYYIS SQLPG QP HIQHAQ+QYI SDSQHRASQPQDVSQM+NPQLSQT PQPFNQY
Subjt:  NSSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY

Query:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT
        QQQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS YPPPNQPTSMPETL SSMPMQMSF SIPQPGSSR DA PYGYAA SGGSAPQQPPQVKNAYGP T
Subjt:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT

Query:  GEGYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
        GEGYMPPGQQ    SGGAYMMYDRESGRP       PHHP QQ+HF+QSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
Subjt:  GEGYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR

Query:  MEDSGQPVDFNAVLDRLSSSSGPGPQRAW
        MEDSGQ VDFNAVLDRLS+ +GPGPQRAW
Subjt:  MEDSGQPVDFNAVLDRLSSSSGPGPQRAW

XP_038888365.1 ataxin-2 homolog [Benincasa hispida]2.6e-25190.48Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYG--QADDTISQNVISTVENSMKKHSDNLLRFLE
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGS +DPVS+ N  KDFHK RMSTVFP + YG  QADD+ISQNVISTVENSMKKHSDNLLRFLE
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYG--QADDTISQNVISTVENSMKKHSDNLLRFLE

Query:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKK
        GISSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS++HSQSNEERASSVASDPKK
Subjt:  GISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKK

Query:  KENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY
         EN SEIHNQQLALALPHQIVPQQN IT PSAALPQNMPQQQQSYYIS SQLPGQPPH+QHAQ QYI  DS +RASQPQDVSQMSNPQLSQTPPQPFNQY
Subjt:  KENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY

Query:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT
         QQWAQPPSQQPQPPQQPSMQ QIRPPPPSVYPSTY PPNQPTSMPETL SSMPM MSFPSIPQPGSSR+DAGPYGYAA SGGSAPQQPPQVKNAYGP T
Subjt:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT

Query:  GEGYMPPGQQSGGAYMMYDRESGRPPHHPP-------QQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDS
        GEGYMPPGQQSGGAYMMYDRESGRPPHHPP       QQ HFNQSGYP AN  HQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDS
Subjt:  GEGYMPPGQQSGGAYMMYDRESGRPPHHPP-------QQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDS

Query:  GQPVDFNAVLDRLSSSSGPGPQRAW
        GQPVDFNAVLDRLS+ +GPGPQRAW
Subjt:  GQPVDFNAVLDRLSSSSGPGPQRAW

TrEMBL top hitse value%identityAlignment
A0A0A0K720 DUF1421 domain-containing protein9.5e-28499.61Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFP SGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSSSGPGPQRAW
        DRLSS SGPGPQRAW
Subjt:  DRLSSSSGPGPQRAW

A0A1S3C1W2 arginine-glutamic acid dipeptide repeats protein-like4.3e-27697.67Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSV NPGKDFHK RMSTVFP +GYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTY PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSSSGPGPQRAW
        DRLSS SGPGPQRAW
Subjt:  DRLSSSSGPGPQRAW

A0A5D3C6G6 Arginine-glutamic acid dipeptide repeats protein-like2.8e-27597.28Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVS+ NPGKDFHK RMSTVFP +GY QADDTISQNVISTVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSS+NHSQSNEERASSVASD KKKE
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
        N SEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYI SDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQ

Query:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG
        QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTY PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYA GSGGSAPQQPPQVKNAYGPPTGEG
Subjt:  QWAQPPSQQPQPPQQPSMQQIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEG

Query:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
        YMPPGQQSGGAYMMYDRESGRPPHHPPQQ HFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL
Subjt:  YMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVL

Query:  DRLSSSSGPGPQRAW
        DRLSS SGPGPQRAW
Subjt:  DRLSSSSGPGPQRAW

A0A6J1GLD5 class E vacuolar protein-sorting machinery protein hse1-like1.2e-23886.39Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVSV N  KDFHK RMSTVFPG+ YGQ DD+I+Q+VI+ VENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS ++HSQ+NEER   V++DPKK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY
        N SEIHNQQLALALPHQIVPQQNPIT PPSAALPQN+PQQQQSYYIS SQLPG QP HIQHAQ+QYI SDSQHRASQPQDVSQM+NPQLSQT PQPFNQY
Subjt:  NSSEIHNQQLALALPHQIVPQQNPIT-PPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQY

Query:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT
        QQQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS YPPPNQPTSMPETL SSMPMQMSF SIPQPGSSR DA PYGYAA SGGSAPQQPPQVKNAYGP T
Subjt:  QQQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPT

Query:  GEGYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
        GEGYMPPGQQ    SGGAYMMYDRESGRP       PHHP QQ+HF+QSGYP ANAPHQVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR
Subjt:  GEGYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQR

Query:  MEDSGQPVDFNAVLDRLSSSSGPGPQRAW
        MEDSGQ VDFNAVLDRLS+ +GPGPQRAW
Subjt:  MEDSGQPVDFNAVLDRLSSSSGPGPQRAW

A0A6J1HZW1 ataxin-2 homolog1.5e-23685.98Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI
        MASGSAGRPNS+PKSFDFGSD+ILCSFEDY KQ+PSNGS SDPVSV N  KDFHK RMSTVFPG+ YGQ DD+I+Q+VI+TVENSMKKHSDNLLRFLEGI
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGI

Query:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE
        SSRLSQLELYCYNLDKSVGEMRS+LARDHEEADSKLKS+EKHVQEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS ++HSQ+NEER   V++DPKK E
Subjt:  SSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKE

Query:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQ
        N SEIHNQQLALALPHQIVPQQNP+TPPSAALPQN+PQQ QSYYIS SQLPG QP HIQHAQ+QYI SDS HRASQPQDVSQM+NPQLSQT PQPFNQYQ
Subjt:  NSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPG-QPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQ

Query:  QQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTG
        QQWAQPPSQ  QPPQQ SMQ QIRPPP SVYPS Y PPNQPTSMPETL SSMPMQMSF SIPQPGSSR DA PYGYAA SGGSAPQQPPQVKNAYGP TG
Subjt:  QQWAQPPSQQPQPPQQPSMQ-QIRPPPPSVYPSTYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTG

Query:  EGYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRM
        EGYMPPGQQ    SGGAYMMYDRESGRP       PHHP QQ+HFNQSGYP ANAP QVPPQAP GPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRM
Subjt:  EGYMPPGQQ----SGGAYMMYDRESGRP-------PHHPPQQTHFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRM

Query:  EDSGQPVDFNAVLDRLSSSSGPGPQRAW
        EDSGQ VDFNAVLDRLS+ +GPGPQRAW
Subjt:  EDSGQPVDFNAVLDRLSSSSGPGPQRAW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01560.1 Protein of unknown function (DUF1421)2.6e-2331.63Show/hide
Query:  NGSLSD--PVSVNNPGKDF---HKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEE
        N S SD  PVS  +P  +F        S + P  G    + TI   +I   + +MKKH+D LL  +EG+S+RLSQLE   +NL+  V +++  +   H  
Subjt:  NGSLSD--PVSVNNPGKDF---HKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEE

Query:  ADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAA
         D K++ L+  + EV   VQ+++DKQE+ E Q  L+K QVS +   +  HS   +  A S A  P ++   +       + A P Q         PPS+ 
Subjt:  ADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAA

Query:  LPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYPS
        LP  +P Q      S  Q P  PP          PS  Q   S P        PQ +QTP QP   YQ      P QQPQ PQQ       PPP S Y  
Subjt:  LPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYPS

Query:  TYPPPNQPTSMPETLPSSMPMQMSFPS-----IPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHP
           PP Q  S P   P   P   S PS      PQP  S  D        G+GG +    P            GY+       G+ M     S +PPH  
Subjt:  TYPPPNQPTSMPETLPSSMPMQMSFPS-----IPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHP

Query:  PQQTHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHS---HLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSSSGPGP
           T + Q  +  PL +A   V   +  G   S R+ S +    +I+++  MGF  D V + ++++ ++GQ VD N VLD+L +  G  P
Subjt:  PQQTHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHS---HLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSSSGPGP

AT4G28300.1 Protein of unknown function (DUF1421)1.3e-10749.53Show/hide
Query:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDP-VSVNNPGKDFHKGRM--STVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFL
        MASGS+GR NS  K FDFGSDDILCS++DY  QD SNG  SDP ++ +N  K+FHK RM  S+VFP S Y   +D++SQ++  TVE +MK ++DN++RFL
Subjt:  MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDP-VSVNNPGKDFHKGRM--STVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFL

Query:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPK
        EG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQELA+TQK+LAKLQ+ QKE SS++HSQ  E+R ++   +PK
Subjt:  EGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPK

Query:  KKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQM-------SNPQLSQT
        K EN+S+ HNQQLALALPHQI PQ           PQ  PQQ Q Y      +P  P  +Q+  +    S    +   P   SQ        S+P  +QT
Subjt:  KKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQM-------SNPQLSQT

Query:  PPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP--STYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQ
          Q F QYQQ W         PPQ     Q RP     YP  S  PP NQP    E+LPSSM MQ  +   PQ          YGY A     AP  P Q
Subjt:  PPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP--STYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQ

Query:  VKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQTHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHV
         K +Y P TG+GY+P G      Y     E GR  + PP      QQ H+ Q   G   +  PHQ        P V      +  LIEKLV MGFRGDHV
Subjt:  VKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQTHFNQ--SGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHV

Query:  ASIIQRMEDSGQPVDFNAVLDRLSSSSGPGPQRAW
         ++IQRME+SGQP+DFN +LDRLS  S  GP R W
Subjt:  ASIIQRMEDSGQPVDFNAVLDRLSSSSGPGPQRAW

AT4G28300.2 Protein of unknown function (DUF1421)1.5e-8747.89Show/hide
Query:  STVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE
        S+VFP S Y   +D++SQ++  TVE +MK ++DN++RFLEG+SSRLSQLELYCYNLDK++GEMRSEL   HE+AD KL+SL+KH+QEVHRSVQI+RDKQE
Subjt:  STVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQE

Query:  LAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQ
        LA+TQK+LAKLQ+ QKE SS++HSQ  E+R ++   +PKK EN+S+ HNQQLALALPHQI PQ           PQ  PQQ Q Y      +P  P  +Q
Subjt:  LAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQ

Query:  HAQSQYIPSDSQHRASQPQDVSQM-------SNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP--STYPPPNQPTSMPETLPSS
        +  +    S    +   P   SQ        S+P  +QT  Q F QYQQ W         PPQ     Q RP     YP  S  PP NQP    E+LPSS
Subjt:  HAQSQYIPSDSQHRASQPQDVSQM-------SNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP--STYPPPNQPTSMPETLPSS

Query:  MPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQTHFNQ--SGYPLAN
        M MQ  +   PQ          YGY A     AP  P Q K +Y P TG+GY+P G      Y     E GR  + PP      QQ H+ Q   G   + 
Subjt:  MPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPP------QQTHFNQ--SGYPLAN

Query:  APHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSSSGPGPQRAW
         PHQ        P V      +  LIEKLV MGFRGDHV ++IQRME+SGQP+DFN +LDRLS  S  GP R W
Subjt:  APHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSSSGPGPQRAW

AT5G14540.1 Protein of unknown function (DUF1421)1.6e-2531.22Show/hide
Query:  SDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQN-VISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL
        SDP  V+      + G M ++ P   + + D    ++ +IS ++ +MK H+D LL  +EG+S+RL+QLE    +L+  V +++  +   H + D KL+ L
Subjt:  SDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQN-VISTVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSELARDHEEADSKLKSL

Query:  EKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQ
        E  + EV   VQ+++DKQE+ E Q  L+KLQ+S+       HS   E  A   AS P+   +++         +L  Q +P Q  I PP++         
Subjt:  EKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVPQQNPITPPSAALPQNMPQQ

Query:  QQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRP----PPPSVYP-STYP
        Q        QLP  P      Q  Y P   Q   SQP    Q         PP P     Q   QPP QQPQ PQQP  Q   P    P    YP  +YP
Subjt:  QQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRP----PPPSVYP-STYP

Query:  --PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMP---PGQQSGGAYMMYDRESGRPPHHPPQQ
          PP QP S P   P S P Q  + + P P S     G    +    G +P+  P      GPP+  G  P   P  QSG         SG  P  P  +
Subjt:  --PPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMP---PGQQSGGAYMMYDRESGRPPHHPPQQ

Query:  THFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRL
              G P+A+A         +    S        +I+K+V MGF  D V   ++ + ++GQ VD N VLD+L
Subjt:  THFNQSGYPLANAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTGGTTCAGCAGGTCGCCCCAATTCCTCCCCTAAATCCTTTGATTTTGGTTCTGATGATATCCTTTGCTCTTTTGAAGACTACGGTAAACAGGACCCTTCAAA
CGGTAGCCTCAGCGATCCCGTTTCCGTTAACAATCCTGGCAAGGATTTTCACAAGGGTAGAATGTCTACAGTATTCCCTGGTTCTGGCTATGGTCAAGCAGATGATACCA
TTAGTCAAAATGTGATTTCCACGGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTTCTTGAGGGAATAAGTTCACGCCTATCACAACTTGAACTCTAT
TGCTACAACCTTGATAAATCCGTTGGAGAAATGCGGTCTGAATTAGCTCGTGACCATGAAGAAGCAGATTCAAAGCTTAAATCTCTTGAGAAGCATGTTCAAGAGGTCCA
CAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTCGCTGAGACTCAGAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCAACAAACCATTCTCAGT
CTAATGAGGAGAGGGCTTCTTCAGTTGCCTCGGATCCTAAAAAGAAGGAAAATTCATCTGAGATTCACAACCAGCAATTAGCTTTGGCTTTGCCACATCAGATCGTCCCT
CAGCAAAATCCTATAACACCCCCTTCAGCAGCTTTGCCTCAGAACATGCCTCAACAACAACAATCTTACTACATCTCTCAATCCCAATTGCCCGGTCAACCACCCCATAT
CCAGCATGCTCAGAGCCAATATATCCCATCTGATTCCCAACACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCACAGCTAAGTCAAACTCCACCACAGC
CATTCAATCAATATCAACAACAATGGGCGCAACCACCATCTCAGCAGCCACAACCTCCTCAACAGCCTTCTATGCAACAAATCAGACCGCCTCCCCCTTCAGTATACCCT
TCTACTTATCCACCACCAAATCAACCAACTTCTATGCCTGAGACACTGCCAAGCAGCATGCCCATGCAAATGTCTTTTCCATCTATTCCTCAACCTGGTTCAAGCCGTGT
GGATGCAGGGCCTTATGGGTATGCTGCCGGAAGTGGTGGTTCTGCTCCACAACAACCTCCTCAAGTGAAAAATGCTTATGGTCCACCAACAGGTGAGGGATATATGCCTC
CTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCGCCACACCATCCACCTCAACAAACGCACTTCAATCAAAGTGGATATCCTCTGGCC
AATGCACCTCATCAGGTTCCTCCTCAAGCACCAGCAGGCCCCCATGTTTCAGCTAGGAATCCAAGTCATTCACATCTAATCGAAAAATTAGTTGGCATGGGTTTCAGGGG
TGACCATGTTGCAAGTATAATCCAGAGAATGGAAGACAGTGGCCAACCTGTTGACTTCAACGCAGTTCTTGACAGGTTGAGTTCTTCTTCAGGTCCAGGTCCACAGAGAG
CTTGGTGA
mRNA sequenceShow/hide mRNA sequence
TTTTTAATTAATTTCCTTTTCCTAATGGAATAATATTAAACACATTATATGATATAATTTGTTAATTAAAAACCTTACACAAGTCTATATAAAAACAGTCTTGGAATTGA
AGGCATCCGAATACACGTTTTAGTCTTCAATCTCGATAATCGAAAAGCTCAATCATCTATTTTTTCCCCAAAAAATCTCAATCGTCATCTTCATCCCCATTCTCTCCGTT
ACTCACTGCGATCTATGGCGTCTGGTTCAGCAGGTCGCCCCAATTCCTCCCCTAAATCCTTTGATTTTGGTTCTGATGATATCCTTTGCTCTTTTGAAGACTACGGTAAA
CAGGACCCTTCAAACGGTAGCCTCAGCGATCCCGTTTCCGTTAACAATCCTGGCAAGGATTTTCACAAGGGTAGAATGTCTACAGTATTCCCTGGTTCTGGCTATGGTCA
AGCAGATGATACCATTAGTCAAAATGTGATTTCCACGGTTGAGAACAGCATGAAAAAGCATTCTGATAACCTTTTGCGTTTTCTTGAGGGAATAAGTTCACGCCTATCAC
AACTTGAACTCTATTGCTACAACCTTGATAAATCCGTTGGAGAAATGCGGTCTGAATTAGCTCGTGACCATGAAGAAGCAGATTCAAAGCTTAAATCTCTTGAGAAGCAT
GTTCAAGAGGTCCACAGGTCTGTACAGATTATAAGAGACAAGCAAGAACTCGCTGAGACTCAGAAAGACTTGGCTAAACTTCAGGTCTCGCAGAAAGAGCCATCTTCAAC
AAACCATTCTCAGTCTAATGAGGAGAGGGCTTCTTCAGTTGCCTCGGATCCTAAAAAGAAGGAAAATTCATCTGAGATTCACAACCAGCAATTAGCTTTGGCTTTGCCAC
ATCAGATCGTCCCTCAGCAAAATCCTATAACACCCCCTTCAGCAGCTTTGCCTCAGAACATGCCTCAACAACAACAATCTTACTACATCTCTCAATCCCAATTGCCCGGT
CAACCACCCCATATCCAGCATGCTCAGAGCCAATATATCCCATCTGATTCCCAACACCGGGCATCACAACCTCAAGATGTTTCACAGATGTCCAATCCACAGCTAAGTCA
AACTCCACCACAGCCATTCAATCAATATCAACAACAATGGGCGCAACCACCATCTCAGCAGCCACAACCTCCTCAACAGCCTTCTATGCAACAAATCAGACCGCCTCCCC
CTTCAGTATACCCTTCTACTTATCCACCACCAAATCAACCAACTTCTATGCCTGAGACACTGCCAAGCAGCATGCCCATGCAAATGTCTTTTCCATCTATTCCTCAACCT
GGTTCAAGCCGTGTGGATGCAGGGCCTTATGGGTATGCTGCCGGAAGTGGTGGTTCTGCTCCACAACAACCTCCTCAAGTGAAAAATGCTTATGGTCCACCAACAGGTGA
GGGATATATGCCTCCTGGACAACAATCTGGAGGAGCATATATGATGTATGATAGGGAAAGCGGAAGACCGCCACACCATCCACCTCAACAAACGCACTTCAATCAAAGTG
GATATCCTCTGGCCAATGCACCTCATCAGGTTCCTCCTCAAGCACCAGCAGGCCCCCATGTTTCAGCTAGGAATCCAAGTCATTCACATCTAATCGAAAAATTAGTTGGC
ATGGGTTTCAGGGGTGACCATGTTGCAAGTATAATCCAGAGAATGGAAGACAGTGGCCAACCTGTTGACTTCAACGCAGTTCTTGACAGGTTGAGTTCTTCTTCAGGTCC
AGGTCCACAGAGAGCTTGGTGAAGAGTAATTTAATCATCCCGTCTGCGGCCGATTGGCCATGACCAGCCTCATACATTGCGTCTTTTTAATGCATTGAATAAAATACTGG
TTTATGATTTAATTGTCCTCCGTATATATTTTGTTATGGTTTGTGAGATTTTAAAACGTCGGCTGTATGATTTAAACTTCGTGTGAATATCATCTTCTACTTCCAAATTC
CAATCTTTCTATATTCCCTTTTATGTTTATTTGCACCTTTT
Protein sequenceShow/hide protein sequence
MASGSAGRPNSSPKSFDFGSDDILCSFEDYGKQDPSNGSLSDPVSVNNPGKDFHKGRMSTVFPGSGYGQADDTISQNVISTVENSMKKHSDNLLRFLEGISSRLSQLELY
CYNLDKSVGEMRSELARDHEEADSKLKSLEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVSQKEPSSTNHSQSNEERASSVASDPKKKENSSEIHNQQLALALPHQIVP
QQNPITPPSAALPQNMPQQQQSYYISQSQLPGQPPHIQHAQSQYIPSDSQHRASQPQDVSQMSNPQLSQTPPQPFNQYQQQWAQPPSQQPQPPQQPSMQQIRPPPPSVYP
STYPPPNQPTSMPETLPSSMPMQMSFPSIPQPGSSRVDAGPYGYAAGSGGSAPQQPPQVKNAYGPPTGEGYMPPGQQSGGAYMMYDRESGRPPHHPPQQTHFNQSGYPLA
NAPHQVPPQAPAGPHVSARNPSHSHLIEKLVGMGFRGDHVASIIQRMEDSGQPVDFNAVLDRLSSSSGPGPQRAW