; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G07270 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G07270
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationChr1:4579655..4584227
RNA-Seq ExpressionCSPI01G07270
SyntenyCSPI01G07270
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99038.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]0.0e+0086.94Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL+HFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITY SKEYPGGYHPQDAKELFNLRHSLLRNATERTF ALKARFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPMKVQFSRTA
        TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEI SQLRDSIAAEIW                  
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPMKVQFSRTA

Query:  AKEALPGKATGVTCKSNHVFLRSRAVFFSTFVSFL--------------LLHSFGFHLKAPQNSIPPVVCFRSVDNVCHGG--CGGVSFAVLYLLDSLTS
               KATGVTCKSNHVFLRS+A  F    + +              +L ++   ++   ++  P +  +      +    CGGVSFAVLYLLDSLTS
Subjt:  AKEALPGKATGVTCKSNHVFLRSRAVFFSTFVSFL--------------LLHSFGFHLKAPQNSIPPVVCFRSVDNVCHGG--CGGVSFAVLYLLDSLTS

Query:  SASLRERKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTTMAPAL
        SASLRERKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTT APAL
Subjt:  SASLRERKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTTMAPAL

Query:  VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAATLCRFSVSLTGICSGGAG
        VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV ADQQRNGRSLFADGRVLPPAQTEEDTSA   LCRFSVSLTGICSGGAG
Subjt:  VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAATLCRFSVSLTGICSGGAG

XP_004137507.1 putative nuclease HARBI1 isoform X1 [Cucumis sativus]4.4e-230100Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM
        TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM

XP_016899554.1 PREDICTED: uncharacterized protein LOC103502878 [Cucumis melo]1.7e-22698.47Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL+HFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRN NGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITY SKEYPGGYHPQDAKELFNLRHSLLRNATERTF ALKARFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM
        TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEI SQLRDSIAAEIWSDYINDISPM
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM

XP_038895429.1 putative nuclease HARBI1 isoform X1 [Benincasa hispida]2.5e-22595.48Show/hide
Query:  MEFKFTMESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRA
        +EFKFTMESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLD+FRMDKH+FYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRA
Subjt:  MEFKFTMESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRA

Query:  VQELFRYSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHY
        VQELFRYSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRNK G LSQIVLAACSFDLKFHY
Subjt:  VQELFRYSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHY

Query:  VLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSA
        VLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+APYHDI Y SKEYPGGYHPQDAKELFNLRHSLLRNAT+RTF ALKARFPILLSA
Subjt:  VLAGWEGSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSA

Query:  PPYPLQTQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM
        PPYPLQTQVKLVVATCAIHNYIRRENPDDW FRLYEQDHVPHMEDSLPQL+AEQLT +IETPIVD+AFETEELEITSQLRD+IAAE+WSDYINDISPM
Subjt:  PPYPLQTQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM

XP_038895430.1 putative nuclease HARBI1 isoform X2 [Benincasa hispida]5.7e-22295.66Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLD+FRMDKH+FYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRNK G LSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+APYHDI Y SKEYPGGYHPQDAKELFNLRHSLLRNAT+RTF ALKARFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM
        TQVKLVVATCAIHNYIRRENPDDW FRLYEQDHVPHMEDSLPQL+AEQLT +IETPIVD+AFETEELEITSQLRD+IAAE+WSDYINDISPM
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM

TrEMBL top hitse value%identityAlignment
A0A0A0LQI1 DDE Tnp4 domain-containing protein8.2e-27575.33Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKH                                                
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
                                                                                                            
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
                   AL  RN      GKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPMKVQFSRTA
        TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPMKVQFSRTA
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPMKVQFSRTA

Query:  AKEALPGKATGVTCKSNHVFLRSRAVFFSTFVSFLLLHSFGFHLKAPQNSIPPVVCFRSVDNVCHGGCGGVSFAVLYLLDSLTSSASLRERKEMLNPAND
        AKEALPGKATGVTCKSNHVFLRSRAVFFSTFVSFLLLHSFGFHLKAPQNSIPPVVCFRSVDNVCHGGCGGVSFAVLYLLDSLTSSASLRERKEMLNPAND
Subjt:  AKEALPGKATGVTCKSNHVFLRSRAVFFSTFVSFLLLHSFGFHLKAPQNSIPPVVCFRSVDNVCHGGCGGVSFAVLYLLDSLTSSASLRERKEMLNPAND

Query:  LLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDDG
        LLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDDG
Subjt:  LLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDDG

Query:  VKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAATLCRFSVSLTGICSGGAG
        VKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTS AATLCRFSVSLTGICSGGAG
Subjt:  VKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAATLCRFSVSLTGICSGGAG

A0A1S4DU98 uncharacterized protein LOC1035028788.3e-22798.47Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL+HFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRN NGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITY SKEYPGGYHPQDAKELFNLRHSLLRNATERTF ALKARFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM
        TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEI SQLRDSIAAEIWSDYINDISPM
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM

A0A5D3BLI7 Putative nuclease HARBI10.0e+0086.94Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL+HFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
        GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITY SKEYPGGYHPQDAKELFNLRHSLLRNATERTF ALKARFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPMKVQFSRTA
        TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEI SQLRDSIAAEIW                  
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPMKVQFSRTA

Query:  AKEALPGKATGVTCKSNHVFLRSRAVFFSTFVSFL--------------LLHSFGFHLKAPQNSIPPVVCFRSVDNVCHGG--CGGVSFAVLYLLDSLTS
               KATGVTCKSNHVFLRS+A  F    + +              +L ++   ++   ++  P +  +      +    CGGVSFAVLYLLDSLTS
Subjt:  AKEALPGKATGVTCKSNHVFLRSRAVFFSTFVSFL--------------LLHSFGFHLKAPQNSIPPVVCFRSVDNVCHGG--CGGVSFAVLYLLDSLTS

Query:  SASLRERKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTTMAPAL
        SASLRERKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTT APAL
Subjt:  SASLRERKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTTMAPAL

Query:  VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAATLCRFSVSLTGICSGGAG
        VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV ADQQRNGRSLFADGRVLPPAQTEEDTSA   LCRFSVSLTGICSGGAG
Subjt:  VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQTEEDTSAAATLCRFSVSLTGICSGGAG

A0A6J1EE58 putative nuclease HARBI1 isoform X22.5e-21593.11Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDG+YGKYVPREPSHNLV+NGAKFVDEVLNGQNERCL++FRMDKH+FYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGSNV PPEIL+DPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRNKNG LSQ VLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
        GSA+DLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGF+APYHDI YQS+EY GGYHPQDAKELFNLRHSLLRNAT+RTF ALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM
        TQVKLVVATCAIHNYIRRENPDDW F+LYEQDHV HMEDSLPQLEAEQLTA+IETP VD+AFETEELEITSQLRD+IA E+WSDYINDISPM
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM

A0A6J1KJM0 putative nuclease HARBI11.1e-21392.86Show/hide
Query:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDG+YGKYVPREPSHNLVSNGAKFVDEVLNGQNERCL++FRMDKH+FYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQPPGSNV PPEIL+DPRFYPYFKDCVG IDGIHIPVMVGVDEQGPFRNKNG LSQ VLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWE

Query:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ
        GSA+DLQVLNSALTRRNKLH+PEGKYYLVDQKYMNMPGF+APYHDI YQS+EY GGYHPQDAKELFNLRHSLLRNAT+RTF ALK RFPILLSAPPYPLQ
Subjt:  GSASDLQVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM
        TQVKLVVATCAIHNYIRRENPDD  FRLYEQDHV HMEDSLPQLEAEQLTA+IETP VD+AFETEE EITSQLRD+IA E+WSDYINDISPM
Subjt:  TQVKLVVATCAIHNYIRRENPDDWFFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPM

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179502.8e-3851.71Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FR  S+       A +  +   +R++ + KR  + +      R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAATLCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD A Y +A  +LE  V A   DQQ     R+LFADGRVLPPA  E    E T  A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAATLCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein7.9e-3633.92Show/hide
Query:  YGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFN
        Y +Y  R P       G + +   L      CL   RM    F  LC++LQ    L+ T  I IEE +A+F+ I GHN   R V   F  + ET+ R F 
Subjt:  YGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFN

Query:  NVLNAIMAISLDFFQPPGSNV---PPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQV
         VL A   ++ D+ + P        P  +  D R++PYF   VG +DG H+ V V  D QG + N++   S  ++A C   + F Y+  G  GS  D  V
Subjt:  NVLNAIMAISLDFFQPPGSNV---PPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQV

Query:  LNSALTRRNKLHVPEG-KYYLVDQKYMNMPGFVAPYHD-----ITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALK
        L  A    ++  +P   KYYLVD  Y N  G +APY       + Y   ++  G  P++  ELFN  H+ LR+  ERTF   K
Subjt:  LNSALTRRNKLHVPEG-KYYLVDQKYMNMPGFVAPYHD-----ITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALK

AT3G17950.1 unknown protein2.0e-3951.71Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FR  S+       A +  +   +R++ + KR  + +      R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAATLCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD A Y +A  +LE  V A   DQQ     R+LFADGRVLPPA  E    E T  A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAATLCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

AT3G17950.2 unknown protein6.9e-2444.77Show/hide
Query:  MGVSFPA---ITFRVPSQNR-DQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDD-------GV-------KPASLGEFLEVERRFGDG
        MG SF A   + FR  S+       A +  +   +R++ + KR  + +      R+R+WWR CRDD       G+       K +SLGE+LEVERRFGD 
Subjt:  MGVSFPA---ITFRVPSQNR-DQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDD-------GV-------KPASLGEFLEVERRFGDG

Query:  AFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAATLCRFSVSLTGICSGGAG
        A Y +A  +LE  V A   DQQ     R+LFADGRVLPPA  E    E T  A +LCRF VSLTGICSGG G
Subjt:  AFYGNA-VDLEGVVAA---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAATLCRFSVSLTGICSGGAG

AT5G28950.1 unknown protein1.4e-2459.34Show/hide
Query:  PPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRR-NKLHVPE
        P +I E  R YPYFKDCVG ID  HI  MV   +   FRN+ G +SQ +LAAC+FD++F YVL+GWEGSA D +VLN ALTR  N+L VPE
Subjt:  PPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQVLNSALTRR-NKLHVPE

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.4e-13362.86Show/hide
Query:  EKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETI
        E+D      +P+E S   +S+G KFV ++LNG NE+C ++FRMDK VFYKLCD+LQ +GLLRHTNRIKIE QLAIF+FIIGHNLRTRAVQELF YSGETI
Subjt:  EKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETI

Query:  SRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDL
        SRHFNNVLNA++AIS DFFQP  ++    + LE+    PYFKDCVGV+D  HIPVMVGVDEQGPFRN NG L+Q VLAA SFDL+F+YVLAGWEGSASD 
Subjt:  SRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDL

Query:  QVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQTQVKLV
        QVLN+ALTRRNKL VP+GKYY+VD KY N+PGF+APYH ++  S+E        +AKE+FN RH LL  A  RTF ALK RFPILLSAPPYPLQTQVKLV
Subjt:  QVLNSALTRRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQTQVKLV

Query:  VATCAIHNYIRRENPDDWFFRLYEQDHVPHM-EDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDIS
        +A CA+HNY+R E PDD  FR++E++ +    ED    LE EQ    +E    +  F  EE+E + +LRD IA+E+W+ Y+ ++S
Subjt:  VATCAIHNYIRRENPDDWFFRLYEQDHVPHM-EDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTTAAATTTACTATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATGTTCCAAGAGAACCGAGTCATAATCTAGTGTCTAATGGTGCAAAATT
TGTAGATGAAGTACTCAATGGACAAAATGAACGTTGTTTAGATCATTTCCGCATGGACAAGCACGTATTCTATAAGTTGTGTGATATTTTGCAAGCCAAAGGCTTACTGC
GTCATACAAACCGGATTAAGATTGAAGAGCAACTAGCCATATTCATGTTTATTATTGGTCACAATCTTAGGACACGAGCAGTTCAAGAGTTATTCAGATATTCAGGAGAA
ACAATAAGCCGCCATTTTAACAATGTATTGAATGCAATTATGGCAATATCATTGGACTTCTTTCAACCTCCAGGATCCAATGTTCCTCCTCCAGAAATTTTAGAAGATCC
AAGATTCTATCCCTACTTTAAGGATTGTGTGGGGGTAATTGATGGCATACACATACCTGTGATGGTTGGTGTTGATGAGCAAGGACCTTTTCGTAATAAGAATGGACAAC
TCTCTCAAATTGTTTTGGCAGCATGCTCATTTGACCTCAAGTTCCATTATGTTCTAGCAGGATGGGAAGGATCGGCATCCGATTTGCAGGTTCTGAATTCGGCACTTACT
AGGAGAAACAAACTACATGTTCCTGAAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTGTTGCCCCCTATCATGATATCACCTATCAATCAAA
GGAATATCCTGGTGGTTATCATCCGCAAGATGCCAAAGAGCTATTTAATCTACGACATTCGTTGTTGCGCAATGCAACCGAAAGAACTTTTGAAGCTCTAAAGGCGCGCT
TCCCCATACTATTGTCAGCTCCTCCTTACCCGTTACAGACACAAGTTAAATTGGTCGTTGCGACATGTGCGATTCACAATTACATTCGAAGGGAGAACCCCGATGATTGG
TTCTTTAGATTATATGAACAAGACCATGTTCCACATATGGAGGACTCATTGCCTCAATTGGAAGCAGAACAGCTGACAGCAAATATTGAAACTCCAATTGTGGACGTTGC
TTTTGAGACAGAAGAATTAGAAATTACATCACAGCTGCGAGATAGTATTGCAGCTGAAATATGGAGTGACTACATTAATGATATATCACCCATGAAAGTCCAATTCTCGA
GAACTGCTGCTAAGGAAGCACTACCAGGAAAGGCAACTGGCGTTACTTGCAAGAGCAACCATGTCTTTCTCAGATCACGAGCTGTATTCTTCTCTACTTTTGTTTCTTTT
CTACTGCTACATTCATTTGGCTTCCATCTGAAAGCACCCCAAAATTCCATACCTCCTGTAGTTTGCTTTCGATCTGTTGACAATGTGTGTCATGGTGGTTGTGGGGGAGT
GAGCTTTGCGGTTTTGTATTTATTGGATTCTCTGACTTCCTCTGCTTCATTGAGAGAGCGAAAAGAGATGTTGAATCCGGCCAACGATCTGTTACCGCCGCCGTCTTCTC
CCACCAATTCATCCATTTCCTCCTCCGATCTCGACACTGAGTCTACGGGTTCGTTCTTCCATGACCGGAGTACGAGCTTAGGGACTCTAATGGGGGTCAGCTTCCCGGCG
ATTACTTTCCGAGTCCCTTCCCAGAACAGAGATCAACACACCGCCGCAACGGTCACCGCAGGCGGAGGTTCTCGTAAGAGTAAGAAGACAAAGAGGAAAACGACGATGGC
ACCGGCGCTGGTTGCAGATCGGAAACGGCGGTGGTGGAGACTATGTAGGGATGACGGCGTCAAGCCGGCGTCTTTAGGCGAGTTTCTTGAAGTGGAACGGAGATTTGGGG
ATGGTGCCTTCTACGGCAACGCGGTGGATCTGGAAGGCGTGGTTGCGGCGGATCAACAGAGGAATGGTCGTTCTTTATTCGCCGATGGAAGAGTTCTTCCACCCGCGCAA
ACGGAGGAAGACACGTCGGCGGCCGCCACTTTATGCCGATTTTCTGTATCACTGACCGGAATTTGTAGCGGCGGTGCCGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTTAAATTTACTATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATGTTCCAAGAGAACCGAGTCATAATCTAGTGTCTAATGGTGCAAAATT
TGTAGATGAAGTACTCAATGGACAAAATGAACGTTGTTTAGATCATTTCCGCATGGACAAGCACGTATTCTATAAGTTGTGTGATATTTTGCAAGCCAAAGGCTTACTGC
GTCATACAAACCGGATTAAGATTGAAGAGCAACTAGCCATATTCATGTTTATTATTGGTCACAATCTTAGGACACGAGCAGTTCAAGAGTTATTCAGATATTCAGGAGAA
ACAATAAGCCGCCATTTTAACAATGTATTGAATGCAATTATGGCAATATCATTGGACTTCTTTCAACCTCCAGGATCCAATGTTCCTCCTCCAGAAATTTTAGAAGATCC
AAGATTCTATCCCTACTTTAAGGATTGTGTGGGGGTAATTGATGGCATACACATACCTGTGATGGTTGGTGTTGATGAGCAAGGACCTTTTCGTAATAAGAATGGACAAC
TCTCTCAAATTGTTTTGGCAGCATGCTCATTTGACCTCAAGTTCCATTATGTTCTAGCAGGATGGGAAGGATCGGCATCCGATTTGCAGGTTCTGAATTCGGCACTTACT
AGGAGAAACAAACTACATGTTCCTGAAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTGTTGCCCCCTATCATGATATCACCTATCAATCAAA
GGAATATCCTGGTGGTTATCATCCGCAAGATGCCAAAGAGCTATTTAATCTACGACATTCGTTGTTGCGCAATGCAACCGAAAGAACTTTTGAAGCTCTAAAGGCGCGCT
TCCCCATACTATTGTCAGCTCCTCCTTACCCGTTACAGACACAAGTTAAATTGGTCGTTGCGACATGTGCGATTCACAATTACATTCGAAGGGAGAACCCCGATGATTGG
TTCTTTAGATTATATGAACAAGACCATGTTCCACATATGGAGGACTCATTGCCTCAATTGGAAGCAGAACAGCTGACAGCAAATATTGAAACTCCAATTGTGGACGTTGC
TTTTGAGACAGAAGAATTAGAAATTACATCACAGCTGCGAGATAGTATTGCAGCTGAAATATGGAGTGACTACATTAATGATATATCACCCATGAAAGTCCAATTCTCGA
GAACTGCTGCTAAGGAAGCACTACCAGGAAAGGCAACTGGCGTTACTTGCAAGAGCAACCATGTCTTTCTCAGATCACGAGCTGTATTCTTCTCTACTTTTGTTTCTTTT
CTACTGCTACATTCATTTGGCTTCCATCTGAAAGCACCCCAAAATTCCATACCTCCTGTAGTTTGCTTTCGATCTGTTGACAATGTGTGTCATGGTGGTTGTGGGGGAGT
GAGCTTTGCGGTTTTGTATTTATTGGATTCTCTGACTTCCTCTGCTTCATTGAGAGAGCGAAAAGAGATGTTGAATCCGGCCAACGATCTGTTACCGCCGCCGTCTTCTC
CCACCAATTCATCCATTTCCTCCTCCGATCTCGACACTGAGTCTACGGGTTCGTTCTTCCATGACCGGAGTACGAGCTTAGGGACTCTAATGGGGGTCAGCTTCCCGGCG
ATTACTTTCCGAGTCCCTTCCCAGAACAGAGATCAACACACCGCCGCAACGGTCACCGCAGGCGGAGGTTCTCGTAAGAGTAAGAAGACAAAGAGGAAAACGACGATGGC
ACCGGCGCTGGTTGCAGATCGGAAACGGCGGTGGTGGAGACTATGTAGGGATGACGGCGTCAAGCCGGCGTCTTTAGGCGAGTTTCTTGAAGTGGAACGGAGATTTGGGG
ATGGTGCCTTCTACGGCAACGCGGTGGATCTGGAAGGCGTGGTTGCGGCGGATCAACAGAGGAATGGTCGTTCTTTATTCGCCGATGGAAGAGTTCTTCCACCCGCGCAA
ACGGAGGAAGACACGTCGGCGGCCGCCACTTTATGCCGATTTTCTGTATCACTGACCGGAATTTGTAGCGGCGGTGCCGGTTAA
Protein sequenceShow/hide protein sequence
MEFKFTMESSDDEKDGTYGKYVPREPSHNLVSNGAKFVDEVLNGQNERCLDHFRMDKHVFYKLCDILQAKGLLRHTNRIKIEEQLAIFMFIIGHNLRTRAVQELFRYSGE
TISRHFNNVLNAIMAISLDFFQPPGSNVPPPEILEDPRFYPYFKDCVGVIDGIHIPVMVGVDEQGPFRNKNGQLSQIVLAACSFDLKFHYVLAGWEGSASDLQVLNSALT
RRNKLHVPEGKYYLVDQKYMNMPGFVAPYHDITYQSKEYPGGYHPQDAKELFNLRHSLLRNATERTFEALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRRENPDDW
FFRLYEQDHVPHMEDSLPQLEAEQLTANIETPIVDVAFETEELEITSQLRDSIAAEIWSDYINDISPMKVQFSRTAAKEALPGKATGVTCKSNHVFLRSRAVFFSTFVSF
LLLHSFGFHLKAPQNSIPPVVCFRSVDNVCHGGCGGVSFAVLYLLDSLTSSASLRERKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA
ITFRVPSQNRDQHTAATVTAGGGSRKSKKTKRKTTMAPALVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVAADQQRNGRSLFADGRVLPPAQ
TEEDTSAAATLCRFSVSLTGICSGGAG