CuGenDBv2

Sgr018688 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018688
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein ALP1-like
Genome location: tig00153207:515911..525434
RNA-Seq Expression: Sgr018688
Synteny: Sgr018688
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR010369 - Protein SOSEKI; IPR027806 - Harbinger transposase-derived nuclease domain
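The genome location field above follows a `contig:start..end` pattern. As a minimal sketch, it could be split into its parts as shown below. The `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the span length assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'tig00153207:515911..525434'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("tig00153207:515911..525434")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the locus spans 9524 bp, which is longer than the 2184-nt CDS listed on this page.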


Homology
GenBank top hits
XP_004147700.1 protein ALP1-like [Cucumis sativus] (e-value 1.5e-191, identity 86.77%)
Query:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV
        MGPIRGFKRKKK EKKVDQNV A ASL SQ QP DWWD+FSQR TGPLSQSKN TKFESVFKISRKTFSYICSLVKE M+AKTS+FTDLNGKPLSLNDQV
Subjt:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV

Query:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN
        AVALRRL SGESLS+IGDSFG+NQSSVSQITWRFVEAMEEKGLHHLSWPS EEDMD++KSKFKKIRGLPNCCGV++TTHIMMTLPT+ESANG+WLDREKN
Subjt:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN

Query:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR
        CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK SQDGERLNGK MKLSESSELGEY+IGDS                      EFNKRHFATR
Subjt:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR

Query:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDM DEVQDEMPLSHHHD  YRQQSCEFVDNTASI REKLSM+LSGKLPP
Subjt:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

XP_008461643.1 PREDICTED: putative nuclease HARBI1 [Cucumis melo] (e-value 5.5e-191, identity 86.51%)
Query:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV
        MGPIRGFKRKKK EKKVDQNV A ASL SQ QP DWWD+FSQR TGPLSQSKN TKFESVFKISRKTFSYICSLVKE M+AKTS+FTDLNGKPLSLNDQV
Subjt:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV

Query:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN
        AVALRRL SGESLS+IGDSFG+NQSSVSQITWRFVEAMEEKGLHHLSWPS EEDMD++KSKFKKIRGLPNCCGVI+TTHIMMTLPT+ESANG+WLDREKN
Subjt:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN

Query:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR
        CSMILQVIVDPEMRFCDIITGWPGSLSD+LVLQSSGFFK SQDGERLNGK M+LSESSELGEY+IGDS                      EFNKRHFATR
Subjt:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR

Query:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDM DEVQDEMPLSHHHD  YRQQSCEFVDNTASI REKLSM+LSGKLPP
Subjt:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

XP_022138922.1 protein ALP1-like [Momordica charantia] (e-value 4.4e-196, identity 88.3%)
Query:  MGPIRGFKRKKKAEKKVDQNVL-AASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV
        MGPIRGFKRKKKAEKKVDQNVL AASL SQPQP DWWDDFSQR TGPLSQSKN TKFESVFKISRKTFSYICSLVKEAM+AKTSNFTDLNGKPLS+NDQV
Subjt:  MGPIRGFKRKKKAEKKVDQNVL-AASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV

Query:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN
        AVALRRLSSGESLS IGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPS EEDMDQ+KSKFKKI+GLPNCCGVI+TTHIMMTLPT ES NGVWLDREKN
Subjt:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN

Query:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR
        CSMILQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFK SQDGERLNGKNMKLSESSELGEY+IGDS                      EFNKRH+ATR
Subjt:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR

Query:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDM DEVQDEMPLSHHHDSGYRQQSC+FVDNTAS+ REKLSM+LSGKLPP
Subjt:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

XP_022941714.1 protein ALP1-like [Cucurbita moschata] (e-value 2.8e-187, identity 85.24%)
Query:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV
        MGPIRGFKRKK   KKVDQNVL  +SL SQPQP DWWD+FSQR TGPLS+SKN T FESVFKISRKTFSYI SLVKEAM+AKTSNFTDLNGKPLS+NDQV
Subjt:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV

Query:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN
        AVALRRLSSGESLS+IGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPS EE MD++KSKFKKI+GLPNCCGVI+TTHIMMTLPTTESA+GVWLDREKN
Subjt:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN

Query:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR
        CSM+LQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK SQDGERLNGK MKLSESSE+GEY+IGDS                      EFNKRHFATR
Subjt:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR

Query:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRI+LVCCLLHNIVIDM DEVQDEMPLSHHHD  YRQQSCEFVDNTAS+ REKLSM+LSGKLPP
Subjt:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

XP_038891834.1 protein ALP1-like [Benincasa hispida] (e-value 1.6e-214, identity 84.63%)
Query:  PKCLCPLFATLSCYYFICFESESFLHVNSIVSFDSVGGCWWVLQVSAHLCFFLEFRMGPIRGFKRKKKAEKKVDQNVL-AASLPSQPQPFDWWDDFSQRF
        PKC    FATL C+ FI FESESFLHVNSIV FDS+GG  WV  + A+ CFF +FRMGPIRGFKRKKK EKKVDQNV  AASL SQ QP DWWD+FSQR 
Subjt:  PKCLCPLFATLSCYYFICFESESFLHVNSIVSFDSVGGCWWVLQVSAHLCFFLEFRMGPIRGFKRKKKAEKKVDQNVL-AASLPSQPQPFDWWDDFSQRF

Query:  TGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLH
        TGPLSQSKN TKFESVFKISRKTFSYICSLVKE M+AKTSNFTDLNGKPLSLNDQVAVALRRL SGESLS+IG+SFGMNQSSVSQITWRFVEAMEEKGLH
Subjt:  TGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLH

Query:  HLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDG
        HLSWPS EEDMDQ+KSKFKKIRGLPNCCGVI+TTHIMMTLPTTESANG+WLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK SQD 
Subjt:  HLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDG

Query:  ERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVI
        ERLNGK MKLSESSELGEY+IGDS                      EFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVI
Subjt:  ERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVI

Query:  DMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        DM DEVQDEMPLSHHHD  YRQQSCEFVDNTASI REKLSM+LSGKLPP
Subjt:  DMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

TrEMBL top hits
A0A0A0KS64 DDE Tnp4 domain-containing protein (e-value 7.0e-192, identity 86.77%)
Query:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV
        MGPIRGFKRKKK EKKVDQNV A ASL SQ QP DWWD+FSQR TGPLSQSKN TKFESVFKISRKTFSYICSLVKE M+AKTS+FTDLNGKPLSLNDQV
Subjt:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV

Query:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN
        AVALRRL SGESLS+IGDSFG+NQSSVSQITWRFVEAMEEKGLHHLSWPS EEDMD++KSKFKKIRGLPNCCGV++TTHIMMTLPT+ESANG+WLDREKN
Subjt:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN

Query:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR
        CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK SQDGERLNGK MKLSESSELGEY+IGDS                      EFNKRHFATR
Subjt:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR

Query:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDM DEVQDEMPLSHHHD  YRQQSCEFVDNTASI REKLSM+LSGKLPP
Subjt:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

A0A1S3CEZ1 putative nuclease HARBI1 (e-value 2.7e-191, identity 86.51%)
Query:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV
        MGPIRGFKRKKK EKKVDQNV A ASL SQ QP DWWD+FSQR TGPLSQSKN TKFESVFKISRKTFSYICSLVKE M+AKTS+FTDLNGKPLSLNDQV
Subjt:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV

Query:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN
        AVALRRL SGESLS+IGDSFG+NQSSVSQITWRFVEAMEEKGLHHLSWPS EEDMD++KSKFKKIRGLPNCCGVI+TTHIMMTLPT+ESANG+WLDREKN
Subjt:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN

Query:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR
        CSMILQVIVDPEMRFCDIITGWPGSLSD+LVLQSSGFFK SQDGERLNGK M+LSESSELGEY+IGDS                      EFNKRHFATR
Subjt:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR

Query:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDM DEVQDEMPLSHHHD  YRQQSCEFVDNTASI REKLSM+LSGKLPP
Subjt:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

A0A6J1CCK2 protein ALP1-like (e-value 2.1e-196, identity 88.3%)
Query:  MGPIRGFKRKKKAEKKVDQNVL-AASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV
        MGPIRGFKRKKKAEKKVDQNVL AASL SQPQP DWWDDFSQR TGPLSQSKN TKFESVFKISRKTFSYICSLVKEAM+AKTSNFTDLNGKPLS+NDQV
Subjt:  MGPIRGFKRKKKAEKKVDQNVL-AASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV

Query:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN
        AVALRRLSSGESLS IGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPS EEDMDQ+KSKFKKI+GLPNCCGVI+TTHIMMTLPT ES NGVWLDREKN
Subjt:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN

Query:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR
        CSMILQVIVDPEMRFCDI+ GWPGSLSDALVLQSSGFFK SQDGERLNGKNMKLSESSELGEY+IGDS                      EFNKRH+ATR
Subjt:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR

Query:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDM DEVQDEMPLSHHHDSGYRQQSC+FVDNTAS+ REKLSM+LSGKLPP
Subjt:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

A0A6J1FP85 protein ALP1-like (e-value 1.4e-187, identity 85.24%)
Query:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV
        MGPIRGFKRKK   KKVDQNVL  +SL SQPQP DWWD+FSQR TGPLS+SKN T FESVFKISRKTFSYI SLVKEAM+AKTSNFTDLNGKPLS+NDQV
Subjt:  MGPIRGFKRKKKAEKKVDQNVLA-ASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLNDQV

Query:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN
        AVALRRLSSGESLS+IGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPS EE MD++KSKFKKI+GLPNCCGVI+TTHIMMTLPTTESA+GVWLDREKN
Subjt:  AVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKN

Query:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR
        CSM+LQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFK SQDGERLNGK MKLSESSE+GEY+IGDS                      EFNKRHFATR
Subjt:  CSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFATR

Query:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRI+LVCCLLHNIVIDM DEVQDEMPLSHHHD  YRQQSCEFVDNTAS+ REKLSM+LSGKLPP
Subjt:  LVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

A0A6J1K3E1 protein ALP1-like isoform X1 (e-value 4.0e-187, identity 85.06%)
Query:  MGPIRGFKRK--KKAEKKVDQNVL-AASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLND
        MGPIRGFKRK  KKA+KKV Q V  AASL  QPQP DWWD+FSQR TGPLSQSKN TKFESVFKISRKTFSYICSLVKEAM+AKTSNFTDLNGKPLSLND
Subjt:  MGPIRGFKRK--KKAEKKVDQNVL-AASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTSNFTDLNGKPLSLND

Query:  QVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDRE
        QVAVALRRL SGESLS+IGDSFGMNQSSVSQITWRFVEAMEEKG+ HLSWPS EEDMDQ+KSKFKKIRGLPNCCGVI+TTHIMMTLPTTESANGVWLDRE
Subjt:  QVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDRE

Query:  KNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFA
        KNCSMILQVIVDPEMRFCDIITGWPGSLSDALVL+SSGFFKRSQDGERLNGK MKLSESSELGEY+IGDS                      EFNKRHF+
Subjt:  KNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS----------------------EFNKRHFA

Query:  TRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
        TRLVAQRALTRLKEMWKIIKG+MWKPDKH+LPRIILVCCLLHNI+IDM DE+QDEMPLSHHHD  YRQQSC+FVDNTASI REKLSM+LS KL P
Subjt:  TRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP

SwissProt top hits
Q8GY65 Protein SOSEKI 4 (e-value 1.2e-31, identity 38.69%)
Query:  TRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWSSKRRYKNGFVWQDIS
        +R   + K   + +VPVVYYLSR+G+LDHPH +EVPLSS  +GL+L+                 DVI RL+ LRG G A +YSWSSKR YKNGFVW D+S
Subjt:  TRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWSSKRRYKNGFVWQDIS

Query:  DDDLIHPSQGREYILKGSDLLQEASSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISGESGT-----NASTQTC
        D+D I P  G+EY+LKGS +L                         +  +   NF     R N+SW+S++      VYKA       T     +ASTQT 
Subjt:  DDDLIHPSQGREYILKGSDLLQEASSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISGESGT-----NASTQTC

Query:  ERRRRR-------WTGSGAAEECGGEAYSNSGVGKSESLRSMDCDGPADL--RDQTTGRS-NRWKASTVLMQLI
        +RRRR+            + EE      S+S     ESL  M  DG   L   DQ   R+  + + S VLMQLI
Subjt:  ERRRRR-------WTGSGAAEECGGEAYSNSGVGKSESLRSMDCDGPADL--RDQTTGRS-NRWKASTVLMQLI

Q8GYT8 Protein SOSEKI 3 (e-value 4.2e-24, identity 38.93%)
Query:  SRRTSPSLERANSRTRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWSS
        SR  SP  ERA   T    +   + ++ V +VYYLS++ QL+HPH +EV +SSP +GL+LR                 DVI RL++LRG G A MYSWSS
Subjt:  SRRTSPSLERANSRTRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWSS

Query:  KRRYKNGFVWQDISDDDLIHPSQGREYILKGSDLLQEASS----------------------SFRSYETSSSFSESKISSETNTSS--TDSNFPVAVKRN
        KR Y+NGFVW D+S+DDLI P+ G EY+LKGS+L  E++S                      S RS + SSS S       TN  S   D   P A++  
Subjt:  KRRYKNGFVWQDISDDDLIHPSQGREYILKGSDLLQEASS----------------------SFRSYETSSSFSESKISSETNTSS--TDSNFPVAVKRN

Query:  NRSWNSLED----------LCRNVVYKAKISGESGTNASTQTCE
        + S  S +           L    VYK+    E   +ASTQT E
Subjt:  NRSWNSLED----------LCRNVVYKAKISGESGTNASTQTCE

Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 (e-value 2.6e-90, identity 45.45%)
Query:  KRKKKAEKKVDQNVLAASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAK-TSNFTDLNGKPLSLNDQVAVALRRL
        K KK A+ K  + V A  L  +    DWWD F  R + P   S     F+  F+ S+ TFSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL
Subjt:  KRKKKAEKKVDQNVLAASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAK-TSNFTDLNGKPLSLNDQVAVALRRL

Query:  SSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKNCSMILQV
        +SG+S  S+G +FG+ QS+VSQ+TWRF+EA+EE+  HHL WP  +  ++++KSKF+++ GLPNCCG IDTTHI+MTLP  ++++  W D+EKN SM LQ 
Subjt:  SSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKNCSMILQV

Query:  IVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIG--------------DSE--------FNKRHFATRLVAQRAL
        + D EMRF +++TGWPG ++ + +L+ SGFFK  ++ + L+G    LS+ +++ EYV+G              DS+        FN+RH   R VA  A 
Subjt:  IVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIG--------------DSE--------FNKRHFATRLVAQRAL

Query:  TRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGRE
         +LK  W+I+  VMW+PD+ +LP IILVCCLLHNI+ID GD +Q+++PLS HHDSGY  + C+    T  +G E
Subjt:  TRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGRE

Q9FJF5 Protein SOSEKI 5 (e-value 2.6e-34, identity 40.46%)
Query:  RRTSPSLERANSRTRTLTELKTEP--QEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWS
        RR+    + +  R R  +E + +P     VPVVYYL R+GQLDHPH +EV LSS   GL+L+                 DVI RL+ LRG+G A +YSWS
Subjt:  RRTSPSLERANSRTRTLTELKTEP--QEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWS

Query:  SKRRYKNGFVWQDISDDDLIHPSQGREYILKGSDLLQEA-SSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISG
        SKR YKNGFVW D+S+DD I P QG+EY+LKGS++L     S+ RS   +SSF + + S   + +S D    V  +R N+SW+S+ DL    VYKA  S 
Subjt:  SKRRYKNGFVWQDISDDDLIHPSQGREYILKGSDLLQEA-SSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISG

Query:  ESGT-----NASTQTCERRRRRWTGSGAAEECGGEA-YSNSGVGKS-------------ESLRSM-DCDGPADLRDQTTG---------RSNRWKASTVL
           T     +ASTQT +RRRRR       EE    A Y N     S             E+L ++   DG   LR   +           S R +AS VL
Subjt:  ESGT-----NASTQTCERRRRRWTGSGAAEECGGEA-YSNSGVGKS-------------ESLRSM-DCDGPADLRDQTTG---------RSNRWKASTVL

Query:  MQLI
        MQLI
Subjt:  MQLI

Q9M2U3 Protein ALP1-like (e-value 7.6e-127, identity 56.62%)
Query:  MGPIRGFKRKKKAEKKVDQNVLAASL-------------------PSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAK
        MGPI+  K+KK+AEKKVD+NVL A+                     S  Q  DWWD FS+R  G  +  K    FESVFKISRKTF YICSLVK    AK
Subjt:  MGPIRGFKRKKKAEKKVDQNVLAASL-------------------PSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAK

Query:  TSNFTDLNGKPLSLNDQVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMM
         +NF+D NG PLSLND+VAVALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ +HHLSWPS+   +D++KSKF+KI GLPNCCG ID THI+M
Subjt:  TSNFTDLNGKPLSLNDQVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMM

Query:  TLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS--------------
         LP  E +N VWLD EKN SM LQ +VDP+MRF D+I GWPGSL+D +VL++SGF+K  + G+RLNG+ + LSE +EL EY++GDS              
Subjt:  TLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS--------------

Query:  --------EFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREK
                EFNKRH      AQ AL++LK+ W+II GVMW PD++RLPRII VCCLLHNI+IDM D+  D+ PLS  HD  YRQ+SC+  D  +S+ R++
Subjt:  --------EFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREK

Query:  LSMFLSGK
        LS  L GK
Subjt:  LSMFLSGK

Arabidopsis top hits
AT3G46110.1 Domain of unknown function (DUF966) (e-value 8.6e-33, identity 38.69%)
Query:  TRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWSSKRRYKNGFVWQDIS
        +R   + K   + +VPVVYYLSR+G+LDHPH +EVPLSS  +GL+L+                 DVI RL+ LRG G A +YSWSSKR YKNGFVW D+S
Subjt:  TRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWSSKRRYKNGFVWQDIS

Query:  DDDLIHPSQGREYILKGSDLLQEASSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISGESGT-----NASTQTC
        D+D I P  G+EY+LKGS +L                         +  +   NF     R N+SW+S++      VYKA       T     +ASTQT 
Subjt:  DDDLIHPSQGREYILKGSDLLQEASSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISGESGT-----NASTQTC

Query:  ERRRRR-------WTGSGAAEECGGEAYSNSGVGKSESLRSMDCDGPADL--RDQTTGRS-NRWKASTVLMQLI
        +RRRR+            + EE      S+S     ESL  M  DG   L   DQ   R+  + + S VLMQLI
Subjt:  ERRRRR-------WTGSGAAEECGGEAYSNSGVGKSESLRSMDCDGPADL--RDQTTGRS-NRWKASTVLMQLI

AT3G46110.2 Domain of unknown function (DUF966) (e-value 8.6e-33, identity 38.69%)
Query:  TRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWSSKRRYKNGFVWQDIS
        +R   + K   + +VPVVYYLSR+G+LDHPH +EVPLSS  +GL+L+                 DVI RL+ LRG G A +YSWSSKR YKNGFVW D+S
Subjt:  TRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWSSKRRYKNGFVWQDIS

Query:  DDDLIHPSQGREYILKGSDLLQEASSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISGESGT-----NASTQTC
        D+D I P  G+EY+LKGS +L                         +  +   NF     R N+SW+S++      VYKA       T     +ASTQT 
Subjt:  DDDLIHPSQGREYILKGSDLLQEASSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISGESGT-----NASTQTC

Query:  ERRRRR-------WTGSGAAEECGGEAYSNSGVGKSESLRSMDCDGPADL--RDQTTGRS-NRWKASTVLMQLI
        +RRRR+            + EE      S+S     ESL  M  DG   L   DQ   R+  + + S VLMQLI
Subjt:  ERRRRR-------WTGSGAAEECGGEAYSNSGVGKSESLRSMDCDGPADL--RDQTTGRS-NRWKASTVLMQLI

AT3G55350.1 PIF / Ping-Pong family of plant transposases (e-value 5.4e-128, identity 56.62%)
Query:  MGPIRGFKRKKKAEKKVDQNVLAASL-------------------PSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAK
        MGPI+  K+KK+AEKKVD+NVL A+                     S  Q  DWWD FS+R  G  +  K    FESVFKISRKTF YICSLVK    AK
Subjt:  MGPIRGFKRKKKAEKKVDQNVLAASL-------------------PSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAK

Query:  TSNFTDLNGKPLSLNDQVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMM
         +NF+D NG PLSLND+VAVALRRL SGESLS IG++FGMNQS+VSQITWRFVE+MEE+ +HHLSWPS+   +D++KSKF+KI GLPNCCG ID THI+M
Subjt:  TSNFTDLNGKPLSLNDQVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMM

Query:  TLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS--------------
         LP  E +N VWLD EKN SM LQ +VDP+MRF D+I GWPGSL+D +VL++SGF+K  + G+RLNG+ + LSE +EL EY++GDS              
Subjt:  TLPTTESANGVWLDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDS--------------

Query:  --------EFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREK
                EFNKRH      AQ AL++LK+ W+II GVMW PD++RLPRII VCCLLHNI+IDM D+  D+ PLS  HD  YRQ+SC+  D  +S+ R++
Subjt:  --------EFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREK

Query:  LSMFLSGK
        LS  L GK
Subjt:  LSMFLSGK

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) (e-value 1.8e-91, identity 45.45%)
Query:  KRKKKAEKKVDQNVLAASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAK-TSNFTDLNGKPLSLNDQVAVALRRL
        K KK A+ K  + V A  L  +    DWWD F  R + P   S     F+  F+ S+ TFSYICSLV+E ++++  S   ++ G+ LS+  QVA+ALRRL
Subjt:  KRKKKAEKKVDQNVLAASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAK-TSNFTDLNGKPLSLNDQVAVALRRL

Query:  SSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKNCSMILQV
        +SG+S  S+G +FG+ QS+VSQ+TWRF+EA+EE+  HHL WP  +  ++++KSKF+++ GLPNCCG IDTTHI+MTLP  ++++  W D+EKN SM LQ 
Subjt:  SSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVWLDREKNCSMILQV

Query:  IVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIG--------------DSE--------FNKRHFATRLVAQRAL
        + D EMRF +++TGWPG ++ + +L+ SGFFK  ++ + L+G    LS+ +++ EYV+G              DS+        FN+RH   R VA  A 
Subjt:  IVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIG--------------DSE--------FNKRHFATRLVAQRAL

Query:  TRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGRE
         +LK  W+I+  VMW+PD+ +LP IILVCCLLHNI+ID GD +Q+++PLS HHDSGY  + C+    T  +G E
Subjt:  TRLKEMWKIIKGVMWKPDKHRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGRE

AT5G59790.1 Domain of unknown function (DUF966) (e-value 1.9e-35, identity 40.46%)
Query:  RRTSPSLERANSRTRTLTELKTEP--QEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWS
        RR+    + +  R R  +E + +P     VPVVYYL R+GQLDHPH +EV LSS   GL+L+                 DVI RL+ LRG+G A +YSWS
Subjt:  RRTSPSLERANSRTRTLTELKTEP--QEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGFAKMYSWS

Query:  SKRRYKNGFVWQDISDDDLIHPSQGREYILKGSDLLQEA-SSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISG
        SKR YKNGFVW D+S+DD I P QG+EY+LKGS++L     S+ RS   +SSF + + S   + +S D    V  +R N+SW+S+ DL    VYKA  S 
Subjt:  SKRRYKNGFVWQDISDDDLIHPSQGREYILKGSDLLQEA-SSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISG

Query:  ESGT-----NASTQTCERRRRRWTGSGAAEECGGEA-YSNSGVGKS-------------ESLRSM-DCDGPADLRDQTTG---------RSNRWKASTVL
           T     +ASTQT +RRRRR       EE    A Y N     S             E+L ++   DG   LR   +           S R +AS VL
Subjt:  ESGT-----NASTQTCERRRRRWTGSGAAEECGGEA-YSNSGVGKS-------------ESLRSM-DCDGPADLRDQTTG---------RSNRWKASTVL

Query:  MQLI
        MQLI
Subjt:  MQLI


Sequences
CDS sequence
ATGGCGGTAAATGTGGGAGAAAGAAGGGAGCCATTAACGTCAAGTGAATGGCGGAGCAGAAGAACGAGTCCTTCTCTTGAAAGAGCCAATTCCAGAACCAGAACCTTGAC
GGAATTGAAAACAGAACCTCAAGAAGTCGTCCCCGTTGTATACTACCTCTCCCGCCATGGCCAACTTGACCATCCTCATTTGCTCGAGGTTCCTCTTTCTTCTCCTCTCC
ACGGACTGTTTCTCAGAGGTAAACTGAAACGTTTTTCTCCGTTTTCTCTCCTTGGCTATGGTGAAGAAGATGTGATTAGAAGACTGGATATTCTCCGTGGAGAAGGCTTT
GCCAAAATGTATTCGTGGTCTTCGAAGCGGCGGTACAAAAATGGATTCGTGTGGCAGGACATATCAGACGACGACTTGATACATCCATCTCAAGGCCGTGAATACATTCT
CAAAGGATCAGATCTATTGCAAGAGGCGTCCTCGAGTTTCCGCTCTTACGAAACATCATCATCGTTTTCCGAGTCCAAAATTTCCTCGGAAACAAACACTTCGAGCACGG
ATTCGAACTTTCCCGTTGCGGTAAAGAGAAACAACCGATCGTGGAATTCGCTGGAAGACCTTTGCCGGAACGTAGTCTACAAGGCTAAAATCTCCGGGGAAAGCGGCACA
AACGCCTCGACGCAGACCTGCGAGAGGCGGCGGCGGAGATGGACGGGAAGCGGAGCCGCTGAGGAATGTGGCGGAGAGGCATATTCAAATTCTGGAGTTGGAAAATCGGA
GAGCTTGAGGTCTATGGATTGTGATGGACCTGCGGATTTGAGGGATCAGACGACTGGGAGGAGCAACAGGTGGAAGGCATCAACGGTTTTGATGCAGTTGATAAAGCACT
ACATCTCCTTCACTCGTGGTTTGCCAAAATGCTTGTGTCCGTTGTTTGCTACCCTTTCGTGTTATTATTTCATTTGCTTCGAATCTGAGAGTTTTCTTCATGTGAATTCT
ATAGTATCCTTCGATTCTGTTGGTGGGTGCTGGTGGGTCTTACAAGTTTCTGCTCATCTGTGCTTCTTTTTGGAGTTTCGAATGGGACCCATTAGAGGGTTCAAGAGGAA
GAAGAAGGCAGAGAAGAAGGTTGACCAAAATGTCTTGGCTGCTTCACTACCGTCTCAGCCCCAGCCCTTCGATTGGTGGGACGACTTCTCCCAGAGGTTTACTGGACCAT
TATCCCAGTCAAAGAATTCAACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCACTTGTTAAGGAAGCTATGTTGGCCAAAACTTCA
AATTTTACTGACTTAAATGGCAAGCCTTTGTCTCTAAATGACCAAGTCGCTGTTGCTCTTAGGCGGCTTAGCTCCGGTGAATCATTATCGAGTATTGGTGATTCATTTGG
AATGAATCAATCATCAGTTTCCCAAATAACTTGGCGTTTCGTGGAGGCGATGGAAGAGAAAGGACTCCACCATCTCTCGTGGCCTTCAAGAGAAGAAGATATGGATCAGG
TAAAGTCCAAGTTTAAGAAAATCAGAGGCCTTCCTAATTGTTGTGGTGTAATCGACACAACACACATTATGATGACTTTGCCAACAACAGAATCTGCAAACGGCGTCTGG
CTTGATCGTGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGACCCGGAAATGAGATTCTGTGACATCATTACAGGTTGGCCTGGAAGTCTGAGTGATGCTCTTGT
GCTCCAGAGCTCGGGATTTTTCAAACGTTCCCAAGATGGGGAGCGGTTAAACGGCAAGAATATGAAGCTCTCAGAAAGTTCAGAATTAGGAGAGTATGTGATAGGAGACT
CAGAGTTCAATAAGCGGCATTTCGCCACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGATAAA
CACAGGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCATAATATAGTGATCGATATGGGGGACGAGGTGCAAGACGAAATGCCCTTGTCTCATCATCACGATTCCGG
TTACCGACAACAAAGTTGTGAATTCGTCGACAATACTGCTTCCATTGGGAGGGAGAAGCTCTCCATGTTCTTATCTGGAAAATTGCCACCCTAA
mRNA sequence
ATGGCGGTAAATGTGGGAGAAAGAAGGGAGCCATTAACGTCAAGTGAATGGCGGAGCAGAAGAACGAGTCCTTCTCTTGAAAGAGCCAATTCCAGAACCAGAACCTTGAC
GGAATTGAAAACAGAACCTCAAGAAGTCGTCCCCGTTGTATACTACCTCTCCCGCCATGGCCAACTTGACCATCCTCATTTGCTCGAGGTTCCTCTTTCTTCTCCTCTCC
ACGGACTGTTTCTCAGAGGTAAACTGAAACGTTTTTCTCCGTTTTCTCTCCTTGGCTATGGTGAAGAAGATGTGATTAGAAGACTGGATATTCTCCGTGGAGAAGGCTTT
GCCAAAATGTATTCGTGGTCTTCGAAGCGGCGGTACAAAAATGGATTCGTGTGGCAGGACATATCAGACGACGACTTGATACATCCATCTCAAGGCCGTGAATACATTCT
CAAAGGATCAGATCTATTGCAAGAGGCGTCCTCGAGTTTCCGCTCTTACGAAACATCATCATCGTTTTCCGAGTCCAAAATTTCCTCGGAAACAAACACTTCGAGCACGG
ATTCGAACTTTCCCGTTGCGGTAAAGAGAAACAACCGATCGTGGAATTCGCTGGAAGACCTTTGCCGGAACGTAGTCTACAAGGCTAAAATCTCCGGGGAAAGCGGCACA
AACGCCTCGACGCAGACCTGCGAGAGGCGGCGGCGGAGATGGACGGGAAGCGGAGCCGCTGAGGAATGTGGCGGAGAGGCATATTCAAATTCTGGAGTTGGAAAATCGGA
GAGCTTGAGGTCTATGGATTGTGATGGACCTGCGGATTTGAGGGATCAGACGACTGGGAGGAGCAACAGGTGGAAGGCATCAACGGTTTTGATGCAGTTGATAAAGCACT
ACATCTCCTTCACTCGTGGTTTGCCAAAATGCTTGTGTCCGTTGTTTGCTACCCTTTCGTGTTATTATTTCATTTGCTTCGAATCTGAGAGTTTTCTTCATGTGAATTCT
ATAGTATCCTTCGATTCTGTTGGTGGGTGCTGGTGGGTCTTACAAGTTTCTGCTCATCTGTGCTTCTTTTTGGAGTTTCGAATGGGACCCATTAGAGGGTTCAAGAGGAA
GAAGAAGGCAGAGAAGAAGGTTGACCAAAATGTCTTGGCTGCTTCACTACCGTCTCAGCCCCAGCCCTTCGATTGGTGGGACGACTTCTCCCAGAGGTTTACTGGACCAT
TATCCCAGTCAAAGAATTCAACAAAATTTGAGTCAGTTTTCAAAATTTCAAGAAAGACATTCAGCTATATCTGTTCACTTGTTAAGGAAGCTATGTTGGCCAAAACTTCA
AATTTTACTGACTTAAATGGCAAGCCTTTGTCTCTAAATGACCAAGTCGCTGTTGCTCTTAGGCGGCTTAGCTCCGGTGAATCATTATCGAGTATTGGTGATTCATTTGG
AATGAATCAATCATCAGTTTCCCAAATAACTTGGCGTTTCGTGGAGGCGATGGAAGAGAAAGGACTCCACCATCTCTCGTGGCCTTCAAGAGAAGAAGATATGGATCAGG
TAAAGTCCAAGTTTAAGAAAATCAGAGGCCTTCCTAATTGTTGTGGTGTAATCGACACAACACACATTATGATGACTTTGCCAACAACAGAATCTGCAAACGGCGTCTGG
CTTGATCGTGAGAAAAACTGCAGCATGATCTTGCAAGTGATTGTAGACCCGGAAATGAGATTCTGTGACATCATTACAGGTTGGCCTGGAAGTCTGAGTGATGCTCTTGT
GCTCCAGAGCTCGGGATTTTTCAAACGTTCCCAAGATGGGGAGCGGTTAAACGGCAAGAATATGAAGCTCTCAGAAAGTTCAGAATTAGGAGAGTATGTGATAGGAGACT
CAGAGTTCAATAAGCGGCATTTCGCCACCAGGTTGGTGGCTCAAAGGGCATTGACAAGGTTGAAAGAGATGTGGAAGATCATTAAAGGGGTAATGTGGAAGCCTGATAAA
CACAGGCTACCAAGGATCATTCTTGTTTGCTGCTTACTTCATAATATAGTGATCGATATGGGGGACGAGGTGCAAGACGAAATGCCCTTGTCTCATCATCACGATTCCGG
TTACCGACAACAAAGTTGTGAATTCGTCGACAATACTGCTTCCATTGGGAGGGAGAAGCTCTCCATGTTCTTATCTGGAAAATTGCCACCCTAA
Protein sequence
MAVNVGERREPLTSSEWRSRRTSPSLERANSRTRTLTELKTEPQEVVPVVYYLSRHGQLDHPHLLEVPLSSPLHGLFLRGKLKRFSPFSLLGYGEEDVIRRLDILRGEGF
AKMYSWSSKRRYKNGFVWQDISDDDLIHPSQGREYILKGSDLLQEASSSFRSYETSSSFSESKISSETNTSSTDSNFPVAVKRNNRSWNSLEDLCRNVVYKAKISGESGT
NASTQTCERRRRRWTGSGAAEECGGEAYSNSGVGKSESLRSMDCDGPADLRDQTTGRSNRWKASTVLMQLIKHYISFTRGLPKCLCPLFATLSCYYFICFESESFLHVNS
IVSFDSVGGCWWVLQVSAHLCFFLEFRMGPIRGFKRKKKAEKKVDQNVLAASLPSQPQPFDWWDDFSQRFTGPLSQSKNSTKFESVFKISRKTFSYICSLVKEAMLAKTS
NFTDLNGKPLSLNDQVAVALRRLSSGESLSSIGDSFGMNQSSVSQITWRFVEAMEEKGLHHLSWPSREEDMDQVKSKFKKIRGLPNCCGVIDTTHIMMTLPTTESANGVW
LDREKNCSMILQVIVDPEMRFCDIITGWPGSLSDALVLQSSGFFKRSQDGERLNGKNMKLSESSELGEYVIGDSEFNKRHFATRLVAQRALTRLKEMWKIIKGVMWKPDK
HRLPRIILVCCLLHNIVIDMGDEVQDEMPLSHHHDSGYRQQSCEFVDNTASIGREKLSMFLSGKLPP
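
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is a minimal pure-Python translator. It assumes the standard genetic code, which is the usual choice for nuclear plant genes but is not stated on this page. The names `CODON_TABLE` and `translate` are illustrative, and only the first 27 nucleotides of the CDS are used as input.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 27 nucleotides of the CDS sequence listed on this page.
cds_start = "ATGGCGGTAAATGTGGGAGAAAGAAGG"
print(translate(cds_start))
```

Passing the full CDS (with its line breaks joined) to `translate` should give a protein that can be compared against the listed protein sequence. The lengths are consistent: the CDS has 2184 nucleotides, which is 727 codons plus a stop codon, and the protein has 727 residues.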