; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0025755 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0025755
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationchr12:1587555..1591688
RNA-Seq ExpressionPI0025755
SyntenyPI0025755
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99038.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]0.0e+0085.04Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVD VLNGQNE CL++FRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+VPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDI YHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPITAAKEAKP
        TQVKLVVATCAIHNYIRRENP+DWFFR  EQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEI +QLRDSIAAE+W           A      
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPITAAKEAKP

Query:  GNWLYLQEQPCLYQ-----ITNYILLYF--VSFLLLHSFGFHLKAPQSSLPP---------------------LALLYLLDSLTSSASL--RKEMLNPAN
         N ++L+ Q   +      +   +LL +   S  +L ++   ++   S+  P                      A+LYLLDSLTSSASL  RKEMLNPAN
Subjt:  GNWLYLQEQPCLYQ-----ITNYILLYF--VSFLLLHSFGFHLKAPQSSLPP---------------------LALLYLLDSLTSSASL--RKEMLNPAN

Query:  DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD
        DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AATV AGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD
Subjt:  DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD

Query:  GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV ADQQRNGRSLFADGRVLPPAQTEEDTSA GALCRFSVSLTGICSGGAG
Subjt:  GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

XP_004137507.1 putative nuclease HARBI1 isoform X1 [Cucumis sativus]4.6e-22196.17Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVD VLNGQNE CLD+FRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+VPPPEILEDPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDI Y SKEYPGGYHPQDAKELFNLRHSLLRNATERTF ALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI
        TQVKLVVATCAIHNYIRRENP+DWFFR  EQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEIT+QLRDSIAAE+WSDYINDISP+
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI

XP_016899554.1 PREDICTED: uncharacterized protein LOC103502878 [Cucumis melo]1.2e-22196.17Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVD VLNGQNE CL++FRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+VPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRN NGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDI YHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI
        TQVKLVVATCAIHNYIRRENP+DWFFR  EQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEI +QLRDSIAAE+WSDYINDISP+
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI

XP_038895429.1 putative nuclease HARBI1 isoform X1 [Benincasa hispida]1.5e-21994.64Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVD VLNGQNE CLD FRMDKH+FYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+VPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNK G LSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNAT+RTFGALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI
        TQVKLVVATCAIHNYIRRENP+DW FR  EQDHVPHMEDSLPQL+AEQLT +IETPIVD+AFETEELEIT+QLRD+IAAELWSDYINDISP+
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI

XP_038895430.1 putative nuclease HARBI1 isoform X2 [Benincasa hispida]1.5e-21994.64Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVD VLNGQNE CLD FRMDKH+FYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+VPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNK G LSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+APYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNAT+RTFGALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI
        TQVKLVVATCAIHNYIRRENP+DW FR  EQDHVPHMEDSLPQL+AEQLT +IETPIVD+AFETEELEIT+QLRD+IAAELWSDYINDISP+
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI

TrEMBL top hitse value%identityAlignment
A0A0A0LQI1 DDE Tnp4 domain-containing protein1.6e-21964.24Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVD VLNGQNE CLD+FRMDKH                                                
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
                                                                                                            
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
                   AL  RN      GKYYLVDQKYMNMPGFVAPYHDI Y SKEYPGGYHPQDAKELFNLRHSLLRNATERTF ALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI------TA
        TQVKLVVATCAIHNYIRRENP+DWFFR  EQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEIT+QLRDSIAAE+WSDYINDISP+      TA
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI------TA

Query:  AKEAKPG---------NWLYLQEQPCLYQITNYILLYFVSFLLLHSFGFHLKAPQSSLPPL-------------------ALLYLLDSLTSSASL--RKE
        AKEA PG         N ++L+ +   +         FVSFLLLHSFGFHLKAPQ+S+PP+                   A+LYLLDSLTSSASL  RKE
Subjt:  AKEAKPG---------NWLYLQEQPCLYQITNYILLYFVSFLLLHSFGFHLKAPQSSLPPL-------------------ALLYLLDSLTSSASL--RKE

Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AATV AGGGSRKSKKTKRKTT APALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTS A  LCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

A0A1S4DU98 uncharacterized protein LOC1035028785.8e-22296.17Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVD VLNGQNE CL++FRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+VPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRN NGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDI YHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI
        TQVKLVVATCAIHNYIRRENP+DWFFR  EQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEI +QLRDSIAAE+WSDYINDISP+
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI

A0A5D3BLI7 Putative nuclease HARBI10.0e+0085.04Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVD VLNGQNE CL++FRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+VPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDI YHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPITAAKEAKP
        TQVKLVVATCAIHNYIRRENP+DWFFR  EQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEI +QLRDSIAAE+W           A      
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPITAAKEAKP

Query:  GNWLYLQEQPCLYQ-----ITNYILLYF--VSFLLLHSFGFHLKAPQSSLPP---------------------LALLYLLDSLTSSASL--RKEMLNPAN
         N ++L+ Q   +      +   +LL +   S  +L ++   ++   S+  P                      A+LYLLDSLTSSASL  RKEMLNPAN
Subjt:  GNWLYLQEQPCLYQ-----ITNYILLYF--VSFLLLHSFGFHLKAPQSSLPP---------------------LALLYLLDSLTSSASL--RKEMLNPAN

Query:  DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD
        DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AATV AGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD
Subjt:  DLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD

Query:  GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV ADQQRNGRSLFADGRVLPPAQTEEDTSA GALCRFSVSLTGICSGGAG
Subjt:  GVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

A0A6J1EE58 putative nuclease HARBI1 isoform X22.2e-21392.35Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDG+YGKYVPREPSHNLV+NGAKFVD VLNGQNE CL+NFRMDKH+FYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+V PPEIL+DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNG LSQ VLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSA+DLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+APYHDIPY S+EY GGYHPQDAKELFNLRHSLLRNAT+RTFGALKVRFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI
        TQVKLVVATCAIHNYIRRENP+DW F+  EQDHV HMEDSLPQLEAEQLTA+IETP VD+AFETEELEIT+QLRD+IA ELWSDYINDISP+
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI

A0A6J1KJM0 putative nuclease HARBI19.4e-21292.09Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDG+YGKYVPREPSHNLVSNGAKFVD VLNGQNE CL+NFRMDKH+FYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGS+V PPEIL+DPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNG LSQ VLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ
        GSA+DLQVLNSALTRRNKLH+PEGKYYLVDQKYMNMPGF+APYHDIPY S+EY GGYHPQDAKELFNLRHSLLRNAT+RTFGALKVRFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI
        TQVKLVVATCAIHNYIRRENP+D  FR  EQDHV HMEDSLPQLEAEQLTA+IETP VD+AFETEE EIT+QLRD+IA ELWSDYINDISP+
Subjt:  TQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPI

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179503.5e-3852.2Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FR  S+       A + A+   +R++ + KR  + +      R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD A Y +A  +LE  V A   DQQ     R+LFADGRVLPPA  E    E T  A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein5.2e-3734.77Show/hide
Query:  YGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFN
        Y +Y  R P       G + +   L     +CL   RM    F  LC++LQ    L+ T  I IEE +A+F+ I GHN   R V   F  + ET+ R F 
Subjt:  YGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFN

Query:  NVLNAIMAISLDFFQPPGSSV---PPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQV
         VL A   ++ D+ + P        P  +  D R++PYF   VGA+DG H+ V V  D QG + N++   S  ++A C   + F Y+  G  GS  D  V
Subjt:  NVLNAIMAISLDFFQPPGSSV---PPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQV

Query:  LNSALTRRNKLHVPEG-KYYLVDQKYMNMPGFVAPYHD-----IPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTF
        L  A    ++  +P   KYYLVD  Y N  G +APY       + YH  ++  G  P++  ELFN  H+ LR+  ERTF
Subjt:  LNSALTRRNKLHVPEG-KYYLVDQKYMNMPGFVAPYHD-----IPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTF

AT3G17950.1 unknown protein2.5e-3952.2Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FR  S+       A + A+   +R++ + KR  + +      R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHAAATVAAGGGSRKSKKTKRKTTTAPALVADRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD A Y +A  +LE  V A   DQQ     R+LFADGRVLPPA  E    E T  A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

AT5G28950.1 unknown protein3.5e-2560.44Show/hide
Query:  PPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRR-NKLHVPE
        P +I E  R YPYFKDCVGAID  HI  MV   +   FRN+ G +SQ +LAAC+FD++F YVL+GWEGSA D +VLN ALTR  N+L VPE
Subjt:  PPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRR-NKLHVPE

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)4.5e-2537.06Show/hide
Query:  FHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYH-PQDAKELFNLRHSLLRNATERTFGALKVRFPI
        F YVL+GWEGSA D +VL+ AL           K+YLVD  + N   F+AP+  + YH +E+ G    P+   ELFNLRH  LRN  ER FG  K RF I
Subjt:  FHYVLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYH-PQDAKELFNLRHSLLRNATERTFGALKVRFPI

Query:  LLSAPPYPLQTQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYIN
          SAPP+  + Q  LV+   A+HN++R+E  +D        D V + E  +   E   +  N       +  + ++ E T   R S+A ++W D  N
Subjt:  LLSAPPYPLQTQVKLVVATCAIHNYIRRENPNDWFFRFSEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYIN

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.3e-13363.38Show/hide
Query:  EKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETI
        E+D      +P+E S   +S+G KFV  +LNG NE C +NFRMDK VFYKLCD+LQ +GLLRHTNRIKIE QLAIF+FIIGHNLRTRAVQELF YSGETI
Subjt:  EKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETI

Query:  SRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDL
        SRHFNNVLNA++AIS DFFQP  +S    + LE+    PYFKDCVG +D  HIPVMVGVDEQGPFRN NG L+Q VLAA SFDL+F+YVLAGWEGSASD 
Subjt:  SRHFNNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDL

Query:  QVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQTQVKLV
        QVLN+ALTRRNKL VP+GKYY+VD KY N+PGF+APYH +  +S+E        +AKE+FN RH LL  A  RTFGALK RFPILLSAPPYPLQTQVKLV
Subjt:  QVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQTQVKLV

Query:  VATCAIHNYIRRENPNDWFFRFSEQDHVPHM-EDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDIS
        +A CA+HNY+R E P+D  FR  E++ +    ED    LE EQ    +E    +  F  EE+E + +LRD IA+ELW+ Y+ ++S
Subjt:  VATCAIHNYIRRENPNDWFFRFSEQDHVPHM-EDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATGTTCCAAGAGAACCGAGTCATAATCTAGTGTCTAATGGTGCAAAATTTGTAGATGGAGTACTCAA
TGGACAAAATGAAAGTTGTTTAGACAATTTCCGCATGGACAAGCACGTATTTTATAAGTTGTGTGATATTTTGCAAGCCAAAGGCTTACTGCGTCATACAAACCGAATTA
AGATTGAAGAGCAACTAGCCATATTCATGTTTATTATTGGACACAATCTTAGGACACGAGCGGTTCAAGAGTTGTTCAGATATTCAGGAGAAACAATAAGTCGCCATTTT
AACAATGTATTGAATGCAATTATGGCAATCTCATTGGACTTCTTTCAACCTCCAGGATCCAGTGTTCCTCCTCCAGAAATTTTAGAAGATCCAAGATTCTATCCCTACTT
TAAGGATTGTGTGGGGGCAATTGATGGCATACACATCCCTGTGATGGTTGGTGTTGATGAGCAAGGACCTTTTCGTAATAAGAATGGGCAACTTTCTCAAATTGTTTTGG
CAGCATGCTCATTTGACCTCAAGTTCCACTACGTTCTAGCAGGATGGGAAGGATCGGCATCTGATTTGCAGGTTCTGAACTCAGCACTTACTAGGCGAAACAAACTACAT
GTTCCTGAAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTGTTGCCCCTTATCATGATATCCCCTATCATTCAAAGGAATATCCTGGTGGTTA
TCATCCGCAAGATGCCAAAGAGCTATTTAATCTACGACATTCGTTGTTGCGCAATGCAACTGAAAGAACTTTTGGAGCCCTAAAGGTGCGCTTCCCCATACTATTGTCAG
CTCCTCCTTACCCATTACAGACACAAGTTAAATTGGTCGTTGCGACATGTGCAATTCACAATTACATTCGAAGGGAGAACCCCAACGATTGGTTCTTTAGATTTTCTGAA
CAAGACCATGTTCCACATATGGAGGATTCATTGCCTCAATTGGAAGCAGAACAGCTGACAGCAAATATTGAAACTCCAATTGTGGACGTTGCTTTTGAGACAGAAGAACT
AGAAATTACAGCACAGTTGCGAGATAGTATTGCAGCTGAATTGTGGAGTGACTACATTAATGATATATCACCCATTACTGCTGCCAAGGAAGCAAAACCAGGCAACTGGC
TTTACTTGCAAGAGCAACCATGCCTTTATCAGATCACAAACTATATTCTTCTCTACTTTGTTTCTTTTTTACTGCTACATTCATTTGGCTTCCATCTGAAAGCACCCCAA
AGTTCCCTACCTCCTCTTGCGCTTTTGTATTTATTGGATTCTCTGACTTCCTCTGCTTCATTGCGAAAAGAGATGTTGAATCCGGCCAACGATCTGTTACCGCCGCCCTC
TTCTCCGACCAATTCATCCATTTCCTCCTCCGATCTCGACACTGAGTCTACGGGTTCGTTCTTCCATGACCGGAGCACGAGCTTAGGGACTCTAATGGGGGTCAGCTTCC
CGGCGATTACTTTCCGAGTCCCTTCCCAGAACAGAGATCAACACGCCGCCGCGACGGTTGCCGCAGGCGGAGGTTCTCGTAAGAGTAAGAAGACGAAGAGGAAAACGACG
ACGGCGCCGGCACTGGTTGCAGATCGGAAACGGCGGTGGTGGAGACTATGTAGGGATGACGGCGTCAAGCCGGCGTCTTTGGGCGAGTTTCTCGAAGTGGAACGGAGATT
TGGGGATGGTGCCTTCTACGGCAACGCGGTGGATCTGGAAGGCGTGGTTGCGGCGGATCAACAGAGGAATGGTCGGTCTTTGTTCGCCGATGGAAGAGTTCTTCCGCCGG
CGCAAACGGAGGAAGATACGTCGGCGGCCGGCGCTTTATGCCGATTTTCTGTGTCGCTTACCGGAATTTGTAGCGGTGGTGCCGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATGTTCCAAGAGAACCGAGTCATAATCTAGTGTCTAATGGTGCAAAATTTGTAGATGGAGTACTCAA
TGGACAAAATGAAAGTTGTTTAGACAATTTCCGCATGGACAAGCACGTATTTTATAAGTTGTGTGATATTTTGCAAGCCAAAGGCTTACTGCGTCATACAAACCGAATTA
AGATTGAAGAGCAACTAGCCATATTCATGTTTATTATTGGACACAATCTTAGGACACGAGCGGTTCAAGAGTTGTTCAGATATTCAGGAGAAACAATAAGTCGCCATTTT
AACAATGTATTGAATGCAATTATGGCAATCTCATTGGACTTCTTTCAACCTCCAGGATCCAGTGTTCCTCCTCCAGAAATTTTAGAAGATCCAAGATTCTATCCCTACTT
TAAGGATTGTGTGGGGGCAATTGATGGCATACACATCCCTGTGATGGTTGGTGTTGATGAGCAAGGACCTTTTCGTAATAAGAATGGGCAACTTTCTCAAATTGTTTTGG
CAGCATGCTCATTTGACCTCAAGTTCCACTACGTTCTAGCAGGATGGGAAGGATCGGCATCTGATTTGCAGGTTCTGAACTCAGCACTTACTAGGCGAAACAAACTACAT
GTTCCTGAAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTGTTGCCCCTTATCATGATATCCCCTATCATTCAAAGGAATATCCTGGTGGTTA
TCATCCGCAAGATGCCAAAGAGCTATTTAATCTACGACATTCGTTGTTGCGCAATGCAACTGAAAGAACTTTTGGAGCCCTAAAGGTGCGCTTCCCCATACTATTGTCAG
CTCCTCCTTACCCATTACAGACACAAGTTAAATTGGTCGTTGCGACATGTGCAATTCACAATTACATTCGAAGGGAGAACCCCAACGATTGGTTCTTTAGATTTTCTGAA
CAAGACCATGTTCCACATATGGAGGATTCATTGCCTCAATTGGAAGCAGAACAGCTGACAGCAAATATTGAAACTCCAATTGTGGACGTTGCTTTTGAGACAGAAGAACT
AGAAATTACAGCACAGTTGCGAGATAGTATTGCAGCTGAATTGTGGAGTGACTACATTAATGATATATCACCCATTACTGCTGCCAAGGAAGCAAAACCAGGCAACTGGC
TTTACTTGCAAGAGCAACCATGCCTTTATCAGATCACAAACTATATTCTTCTCTACTTTGTTTCTTTTTTACTGCTACATTCATTTGGCTTCCATCTGAAAGCACCCCAA
AGTTCCCTACCTCCTCTTGCGCTTTTGTATTTATTGGATTCTCTGACTTCCTCTGCTTCATTGCGAAAAGAGATGTTGAATCCGGCCAACGATCTGTTACCGCCGCCCTC
TTCTCCGACCAATTCATCCATTTCCTCCTCCGATCTCGACACTGAGTCTACGGGTTCGTTCTTCCATGACCGGAGCACGAGCTTAGGGACTCTAATGGGGGTCAGCTTCC
CGGCGATTACTTTCCGAGTCCCTTCCCAGAACAGAGATCAACACGCCGCCGCGACGGTTGCCGCAGGCGGAGGTTCTCGTAAGAGTAAGAAGACGAAGAGGAAAACGACG
ACGGCGCCGGCACTGGTTGCAGATCGGAAACGGCGGTGGTGGAGACTATGTAGGGATGACGGCGTCAAGCCGGCGTCTTTGGGCGAGTTTCTCGAAGTGGAACGGAGATT
TGGGGATGGTGCCTTCTACGGCAACGCGGTGGATCTGGAAGGCGTGGTTGCGGCGGATCAACAGAGGAATGGTCGGTCTTTGTTCGCCGATGGAAGAGTTCTTCCGCCGG
CGCAAACGGAGGAAGATACGTCGGCGGCCGGCGCTTTATGCCGATTTTCTGTGTCGCTTACCGGAATTTGTAGCGGTGGTGCCGGCTAA
Protein sequenceShow/hide protein sequence
MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDGVLNGQNESCLDNFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHF
NNVLNAIMAISLDFFQPPGSSVPPPEILEDPRFYPYFKDCVGAIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRRNKLH
VPEGKYYLVDQKYMNMPGFVAPYHDIPYHSKEYPGGYHPQDAKELFNLRHSLLRNATERTFGALKVRFPILLSAPPYPLQTQVKLVVATCAIHNYIRRENPNDWFFRFSE
QDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITAQLRDSIAAELWSDYINDISPITAAKEAKPGNWLYLQEQPCLYQITNYILLYFVSFLLLHSFGFHLKAPQ
SSLPPLALLYLLDSLTSSASLRKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATVAAGGGSRKSKKTKRKTT
TAPALVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG