; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017400 (gene) of Chayote v1 genome

Gene IDSed0017400
OrganismSechium edule (Chayote v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationLG08:39267990..39273187
RNA-Seq ExpressionSed0017400
SyntenySed0017400
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019906.1 putative nuclease HARBI1 [Cucurbita argyrosperma subsp. argyrosperma]5.2e-20989.51Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGSYGKY+PREP+HN+VSNGAKFVDEVLNG NERCL+NFRMDKHIFYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG
        YSGETISRHFNNVLNAIMAISLDFFQP GS+VPPEIL+DPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRNK+GLLSQNVLAACSFDLKFHYVLAGWEG
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG

Query:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT
        SA+D QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGFIAPYHD+PY S+E+  GYHPQDAKELFNLRHSLLRNAT+RTFGALK RFPIL+ APPYPLQT
Subjt:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT

Query:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI
        QVKLVVATCAIHNYIRRENPDDWLF+LYEQD V+HME SLP +E EQ+  HIETP V IAFETEELEITSQLRD IAT LWSDYIN ISP+
Subjt:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI

TYJ99038.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.7e-28476.26Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDG+YGKY+PREP+HN+VSNGAKFVDEVLNG NERCL++FRMDKH+FYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDV-PPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQP GS+V PPEILEDPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRNK+G LSQ VLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDV-PPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE

Query:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ
        GSASD QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGF+APYHD+ Y+SKE+P GYHPQDAKELFNLRHSLLRNAT+RTFGALKARFPIL+ APPYPLQ
Subjt:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPIEVHVKRCA
        TQVKLVVATCAIHNYIRRENPDDW FRLYEQD V HME SLP +E EQ+  +IETP V +AFETEELEI SQLRD IA  +W     G++    HV   +
Subjt:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPIEVHVKRCA

Query:  -------LGVCLPKLFLLYFVSFISLIQHSFGFHLKYP-KGQVPYIL-YWNRFYVISCS-CGGVSFGPWYFLDSLTSS--------MLNPANDLLPPPSS
               L  C+    LL +      I  ++   ++     + P I+   N  ++ S   CGGVSF   Y LDSLTSS        MLNPANDLLPPPSS
Subjt:  -------LGVCLPKLFLLYFVSFISLIQHSFGFHLKYP-KGQVPYIL-YWNRFYVISCS-CGGVSFGPWYFLDSLTSS--------MLNPANDLLPPPSS

Query:  PTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPAITFRLPSQNRDQHAAA---AGGA---SKKPKRKPAAAAALVGDRKRRWWRLCRDDGVKPASLG
        PTNSSISSSDLDTESTGSFFHDRSTSLG LMGVSFPAITFR+PSQNRDQH AA   AGG    SKK KRK   A ALV DRKRRWWRLCRDDGVKPASLG
Subjt:  PTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPAITFRLPSQNRDQHAAA---AGGA---SKKPKRKPAAAAALVGDRKRRWWRLCRDDGVKPASLG

Query:  EFLEVERRFGDGGFFGNGVDLDGVVAAYHRRTGRNLFADGRILPP---EEEASPAAALCRFSVSLTGICSGGAG
        EFLEVERRFGDG F+GN VDL+GVV A  +R GR+LFADGR+LPP   EE+ S   ALCRFSVSLTGICSGGAG
Subjt:  EFLEVERRFGDGGFFGNGVDLDGVVAAYHRRTGRNLFADGRILPP---EEEASPAAALCRFSVSLTGICSGGAG

XP_022924205.1 putative nuclease HARBI1 isoform X2 [Cucurbita moschata]1.4e-20989.77Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGSYGKY+PREP+HN+V+NGAKFVDEVLNG NERCL+NFRMDKHIFYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG
        YSGETISRHFNNVLNAIMAISLDFFQP GS+VPPEIL+DPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRNK+GLLSQNVLAACSFDLKFHYVLAGWEG
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG

Query:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT
        SA+D QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGFIAPYHD+PY S+E+  GYHPQDAKELFNLRHSLLRNATDRTFGALK RFPIL+ APPYPLQT
Subjt:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT

Query:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI
        QVKLVVATCAIHNYIRRENPDDWLF+LYEQD VSHME SLP +E EQ+  HIETP V IAFETEELEITSQLRD IAT LWSDYIN ISP+
Subjt:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI

XP_023520117.1 putative nuclease HARBI1 [Cucurbita pepo subsp. pepo]3.1e-20989.51Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGSYGKY+PREP+HN+V+NGAKFVDEVLNG NERCL+NFRMDKHIFYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG
        YSGETISRHFNNVLNAIMAISLDFFQP GS+VPPEIL+DPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRNK+GLLSQNVLAACSFDLKFHYVLAGWEG
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG

Query:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT
        SA+D QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGFIAPYHD+PY S+E+  GYHPQDAKELFNLRHSLLRNATDRTFGA+K RFPIL+ APPYPLQT
Subjt:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT

Query:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI
        QVKLVVATCAIHNYIRRENPDDWLF+LYEQD VSHME SLP +E EQ+  HIETP V IAFETEELEITSQLRD IAT LWSDYIN ISP+
Subjt:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI

XP_038895429.1 putative nuclease HARBI1 isoform X1 [Benincasa hispida]1.5e-20890.31Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDG+YGKY+PREP+HN+VSNGAKFVDEVLNG NERCLD FRMDKHIFYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDV-PPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQP GS+V PPEILEDPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRNK GLLSQ VLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDV-PPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE

Query:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ
        GSASD QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGFIAPYHD+PY+SKE+P GYHPQDAKELFNLRHSLLRNATDRTFGALKARFPIL+ APPYPLQ
Subjt:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI
        TQVKLVVATCAIHNYIRRENPDDWLFRLYEQD V HME SLP ++ EQ+ THIETP V IAFETEELEITSQLRD IA  LWSDYIN ISP+
Subjt:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI

TrEMBL top hitse value%identityAlignment
A0A1S4DU98 uncharacterized protein LOC1035028781.2e-20387.24Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDG+YGKY+PREP+HN+VSNGAKFVDEVLNG NERCL++FRMDKH+FYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDV-PPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQP GS+V PPEILEDPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRN +G LSQ VLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDV-PPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE

Query:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ
        GSASD QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGF+APYHD+ Y+SKE+P GYHPQDAKELFNLRHSLLRNAT+RTFGALKARFPIL+ APPYPLQ
Subjt:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI
        TQVKLVVATCAIHNYIRRENPDDW FRLYEQD V HME SLP +E EQ+  +IETP V +AFETEELEI SQLRD IA  +WSDYIN ISP+
Subjt:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI

A0A5D3BLI7 Putative nuclease HARBI18.2e-28576.26Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDG+YGKY+PREP+HN+VSNGAKFVDEVLNG NERCL++FRMDKH+FYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDV-PPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE
        YSGETISRHFNNVLNAIMAISLDFFQP GS+V PPEILEDPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRNK+G LSQ VLAACSFDLKFHYVLAGWE
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDV-PPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE

Query:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ
        GSASD QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGF+APYHD+ Y+SKE+P GYHPQDAKELFNLRHSLLRNAT+RTFGALKARFPIL+ APPYPLQ
Subjt:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPIEVHVKRCA
        TQVKLVVATCAIHNYIRRENPDDW FRLYEQD V HME SLP +E EQ+  +IETP V +AFETEELEI SQLRD IA  +W     G++    HV   +
Subjt:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPIEVHVKRCA

Query:  -------LGVCLPKLFLLYFVSFISLIQHSFGFHLKYP-KGQVPYIL-YWNRFYVISCS-CGGVSFGPWYFLDSLTSS--------MLNPANDLLPPPSS
               L  C+    LL +      I  ++   ++     + P I+   N  ++ S   CGGVSF   Y LDSLTSS        MLNPANDLLPPPSS
Subjt:  -------LGVCLPKLFLLYFVSFISLIQHSFGFHLKYP-KGQVPYIL-YWNRFYVISCS-CGGVSFGPWYFLDSLTSS--------MLNPANDLLPPPSS

Query:  PTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPAITFRLPSQNRDQHAAA---AGGA---SKKPKRKPAAAAALVGDRKRRWWRLCRDDGVKPASLG
        PTNSSISSSDLDTESTGSFFHDRSTSLG LMGVSFPAITFR+PSQNRDQH AA   AGG    SKK KRK   A ALV DRKRRWWRLCRDDGVKPASLG
Subjt:  PTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPAITFRLPSQNRDQHAAA---AGGA---SKKPKRKPAAAAALVGDRKRRWWRLCRDDGVKPASLG

Query:  EFLEVERRFGDGGFFGNGVDLDGVVAAYHRRTGRNLFADGRILPP---EEEASPAAALCRFSVSLTGICSGGAG
        EFLEVERRFGDG F+GN VDL+GVV A  +R GR+LFADGR+LPP   EE+ S   ALCRFSVSLTGICSGGAG
Subjt:  EFLEVERRFGDGGFFGNGVDLDGVVAAYHRRTGRNLFADGRILPP---EEEASPAAALCRFSVSLTGICSGGAG

A0A6J1C7D1 putative nuclease HARBI18.4e-20588.3Show/hide
Query:  MESSDDEKDGSYGKYIPREPT-HNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELF
        MESSDDEKDG+YGKY+PRE + HN+VSNGAKFVDEVL G NE CL+NFRMDKHIFYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELF
Subjt:  MESSDDEKDGSYGKYIPREPT-HNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELF

Query:  RYSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE
        RYSGETISRHFNNVLNAIMAISLDFFQP GS+VP EILEDPRFYPYFKDCVGA+DGIH PVMVGVDEQGPFRNK+GLLSQNVLAACSFDL FHYVLAGWE
Subjt:  RYSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWE

Query:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ
        GSASD QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGF+APYHDVPY+SK+FP GYHPQDAK+LFNLRHSLLRNATDRTFGALKARFPIL+ APPYPLQ
Subjt:  GSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQ

Query:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMM-THIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI
        TQVKLVVATCAIHNYIRRE PDDWLFRLYEQD + HME SLPP+E  Q++ THIETP V IAFETEELEITSQLRD IAT LWSDYIN +SP+
Subjt:  TQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMM-THIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI

A0A6J1EE58 putative nuclease HARBI1 isoform X26.6e-21089.77Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGSYGKY+PREP+HN+V+NGAKFVDEVLNG NERCL+NFRMDKHIFYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG
        YSGETISRHFNNVLNAIMAISLDFFQP GS+VPPEIL+DPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRNK+GLLSQNVLAACSFDLKFHYVLAGWEG
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG

Query:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT
        SA+D QVLNSALTRRNKL VPEGKYYLVDQKYMNMPGFIAPYHD+PY S+E+  GYHPQDAKELFNLRHSLLRNATDRTFGALK RFPIL+ APPYPLQT
Subjt:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT

Query:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI
        QVKLVVATCAIHNYIRRENPDDWLF+LYEQD VSHME SLP +E EQ+  HIETP V IAFETEELEITSQLRD IAT LWSDYIN ISP+
Subjt:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI

A0A6J1KJM0 putative nuclease HARBI12.8e-20889.51Show/hide
Query:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR
        MESSDDEKDGSYGKY+PREP+HN+VSNGAKFVDEVLNG NERCL+NFRMDKHIFYKLCDILQ KGLLRHTNR+KIEEQLAIFMFIIGHNLRTRAVQELFR
Subjt:  MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFR

Query:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG
        YSGETISRHFNNVLNAIMAISLDFFQP GS+VPPEIL+DPRFYPYFKDCVGAIDGIH PVMVGVDEQGPFRNK+GLLSQNVLAACSFDLKFHYVLAGWEG
Subjt:  YSGETISRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEG

Query:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT
        SA+D QVLNSALTRRNKL +PEGKYYLVDQKYMNMPGFIAPYHD+PY S+E+  GYHPQDAKELFNLRHSLLRNATDRTFGALK RFPIL+ APPYPLQT
Subjt:  SASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQT

Query:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI
        QVKLVVATCAIHNYIRRENPDD LFRLYEQD VSHME SLP +E EQ+  HIETP V IAFETEE EITSQLRD IAT LWSDYIN ISP+
Subjt:  QVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPI

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179503.3e-3349.27Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPA---ITFRLPSQNRDQHAAAAGGASK-------KPKRKPAAAAALVGDRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LG LMG SF A   + FR  S+     + A   AS        + KR P+ +A     R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPA---ITFRLPSQNRDQHAAAAGGASK-------KPKRKPAAAAALVGDRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGGFFGNG-VDL-DGVVAAYHRRT----GRNLFADGRILPPEE------EASP-AAALCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD   + +   +L D VVA Y  +      R LFADGR+LPP        E +P A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGGFFGNG-VDL-DGVVAAYHRRT----GRNLFADGRILPPEE------EASP-AAALCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein3.0e-3734.77Show/hide
Query:  YGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFN
        Y +Y  R P       G + +   L      CL   RM    F  LC++LQ    L+ T  + IEE +A+F+ I GHN   R V   F  + ET+ R F 
Subjt:  YGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHFN

Query:  NVLNAIMAISLDFFQ-PSGSD---VPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEGSASDFQV
         VL A   ++ D+ + P+  +   +P  +  D R++PYF   VGA+DG H  V V  D QG + N+    S N++A C   + F Y+  G  GS  D  V
Subjt:  NVLNAIMAISLDFFQ-PSGSD---VPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEGSASDFQV

Query:  LNSALTRRNKLSVPEG-KYYLVDQKYMNMPGFIAPYHD-----VPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTF
        L  A    ++  +P   KYYLVD  Y N  G +APY       V Y+  +F  G  P++  ELFN  H+ LR+  +RTF
Subjt:  LNSALTRRNKLSVPEG-KYYLVDQKYMNMPGFIAPYHD-----VPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTF

AT3G17950.1 unknown protein2.4e-3449.27Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPA---ITFRLPSQNRDQHAAAAGGASK-------KPKRKPAAAAALVGDRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LG LMG SF A   + FR  S+     + A   AS        + KR P+ +A     R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPA---ITFRLPSQNRDQHAAAAGGASK-------KPKRKPAAAAALVGDRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGGFFGNG-VDL-DGVVAAYHRRT----GRNLFADGRILPPEE------EASP-AAALCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD   + +   +L D VVA Y  +      R LFADGR+LPP        E +P A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGGFFGNG-VDL-DGVVAAYHRRT----GRNLFADGRILPPEE------EASP-AAALCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

AT5G28950.1 unknown protein1.2e-2560.87Show/hide
Query:  VPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEGSASDFQVLNSALTRR-NKLSVPE
        VP +I E  R YPYFKDCVGAID  H   MV   +   FRN+ G +SQN+LAAC+FD++F YVL+GWEGSA D +VLN ALTR  N+L VPE
Subjt:  VPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEGSASDFQVLNSALTRR-NKLSVPE

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.0e-2536.89Show/hide
Query:  FHYVLAGWEGSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYH--PQDAKELFNLRHSLLRNATDRTFGALKARFP
        F YVL+GWEGSA D +VL+ AL           K+YLVD  + N   F+AP+  V Y+ +EF AG    P+   ELFNLRH  LRN  +R FG  K+RF 
Subjt:  FHYVLAGWEGSASDFQVLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYH--PQDAKELFNLRHSLLRNATDRTFGALKARFP

Query:  ILMLAPPYPLQTQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGI
        I   APP+  + Q  LV+   A+HN++R+E   D        D+V + EG +   E   M T+       +  + ++ E T+  R  +A  +W D  N  
Subjt:  ILMLAPPYPLQTQVKLVVATCAIHNYIRRENPDDWLFRLYEQDQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGI

Query:  SPIEVH
          +E+H
Subjt:  SPIEVH

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)5.9e-13461.98Show/hide
Query:  EKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETI
        E+D      +P+E +   +S+G KFV ++LNG NE+C +NFRMDK +FYKLCD+LQ +GLLRHTNR+KIE QLAIF+FIIGHNLRTRAVQELF YSGETI
Subjt:  EKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETI

Query:  SRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEGSASDFQ
        SRHFNNVLNA++AIS DFFQP+ +    + LE+    PYFKDCVG +D  H PVMVGVDEQGPFRN +GLL+QNVLAA SFDL+F+YVLAGWEGSASD Q
Subjt:  SRHFNNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEGSASDFQ

Query:  VLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQTQVKLVV
        VLN+ALTRRNKL VP+GKYY+VD KY N+PGFIAPYH V   S+E        +AKE+FN RH LL  A  RTFGALK RFPIL+ APPYPLQTQVKLV+
Subjt:  VLNSALTRRNKLSVPEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQTQVKLVV

Query:  ATCAIHNYIRRENPDDWLFRLYEQDQVSHM-EGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGIS
        A CA+HNY+R E PDD +FR++E++ ++   E     +E EQ    +E       F  EE+E + +LRD+IA+ LW+ Y+  +S
Subjt:  ATCAIHNYIRRENPDDWLFRLYEQDQVSHM-EGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGTTCTGATGATGAAAAGGATGGAAGTTATGGGAAATATATTCCAAGAGAGCCGACTCATAATATAGTATCGAATGGTGCAAAATTTGTAGACGAAGTACTCAA
TGGATCAAATGAACGTTGCCTAGACAATTTCCGCATGGACAAGCACATATTTTATAAGTTGTGTGATATTTTGCAAGGCAAAGGCTTACTGCGTCATACAAATCGGGTTA
AGATTGAAGAGCAATTAGCCATATTCATGTTTATAATTGGTCACAATCTTAGGACACGAGCAGTTCAAGAGCTGTTCAGATATTCAGGAGAAACGATTAGTCGCCATTTT
AACAATGTATTGAATGCAATTATGGCGATATCATTGGACTTCTTTCAGCCTTCAGGATCTGATGTTCCACCAGAAATCTTAGAAGATCCAAGATTCTATCCCTACTTTAA
GGATTGTGTGGGAGCAATCGATGGCATTCACTTCCCTGTGATGGTTGGTGTTGATGAGCAAGGACCTTTTCGAAATAAGGATGGACTACTTTCTCAAAATGTTTTGGCAG
CATGCTCATTTGATCTCAAGTTTCATTACGTTCTAGCAGGCTGGGAAGGATCGGCATCGGATTTTCAAGTTCTGAACTCAGCACTAACGAGACGAAACAAACTAAGTGTT
CCTGAAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTATTGCCCCTTATCATGATGTCCCCTATTATTCAAAGGAATTTCCTGCTGGCTATCA
TCCTCAAGATGCCAAGGAGCTATTTAATCTACGGCATTCATTGTTGCGCAATGCAACCGATAGAACTTTTGGAGCTCTAAAGGCGCGCTTCCCCATACTAATGTTAGCTC
CTCCTTACCCATTACAGACACAAGTTAAGTTGGTCGTTGCGACATGTGCGATTCACAATTACATTCGGAGGGAGAACCCCGATGATTGGCTCTTCAGATTGTATGAACAA
GACCAGGTTTCCCATATGGAGGGTTCATTACCTCCAATGGAAACAGAACAGATGATGACACACATTGAGACCCCAGCTGTGGGCATTGCTTTTGAAACAGAAGAACTAGA
AATTACATCTCAGTTGAGGGATCAAATTGCAACTGGATTGTGGAGTGACTATATTAATGGTATATCACCCATCGAGGTGCATGTAAAAAGGTGCGCTCTAGGTGTGTGCC
TTCCAAAACTGTTTCTTCTTTACTTTGTTTCTTTTATTTCTCTTATACAACATTCCTTTGGTTTCCATCTGAAGTACCCTAAGGGCCAAGTTCCCTACATACTTTATTGG
AATCGTTTTTATGTTATTAGTTGTTCGTGTGGGGGAGTGAGCTTTGGGCCTTGGTATTTCTTGGATTCTCTGACTTCCTCTATGCTCAATCCGGCGAACGATCTGTTACC
GCCGCCGTCTTCTCCGACGAATTCATCAATTTCCTCCTCCGATCTCGACACAGAGTCCACCGGTTCCTTCTTCCATGACCGGAGCACGAGCTTGGGGGCTTTAATGGGAG
TCAGCTTCCCGGCGATTACCTTCCGGTTGCCGTCACAGAACCGAGACCAACACGCCGCCGCAGCCGGAGGAGCTTCCAAGAAGCCGAAGAGAAAACCCGCGGCGGCGGCG
GCACTGGTCGGAGATCGGAAGCGGCGGTGGTGGCGGCTGTGCAGAGACGACGGCGTTAAGCCGGCATCTCTGGGCGAGTTTCTTGAGGTCGAACGGAGGTTTGGGGACGG
TGGTTTTTTCGGAAACGGGGTGGATCTGGATGGCGTGGTTGCCGCATATCATCGGAGGACTGGCCGGAATTTGTTCGCCGACGGGAGAATTCTCCCGCCGGAGGAGGAAG
CATCGCCGGCCGCCGCTCTTTGCCGGTTTTCTGTTTCACTAACCGGGATTTGCAGCGGCGGTGCGGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGTTCTGATGATGAAAAGGATGGAAGTTATGGGAAATATATTCCAAGAGAGCCGACTCATAATATAGTATCGAATGGTGCAAAATTTGTAGACGAAGTACTCAA
TGGATCAAATGAACGTTGCCTAGACAATTTCCGCATGGACAAGCACATATTTTATAAGTTGTGTGATATTTTGCAAGGCAAAGGCTTACTGCGTCATACAAATCGGGTTA
AGATTGAAGAGCAATTAGCCATATTCATGTTTATAATTGGTCACAATCTTAGGACACGAGCAGTTCAAGAGCTGTTCAGATATTCAGGAGAAACGATTAGTCGCCATTTT
AACAATGTATTGAATGCAATTATGGCGATATCATTGGACTTCTTTCAGCCTTCAGGATCTGATGTTCCACCAGAAATCTTAGAAGATCCAAGATTCTATCCCTACTTTAA
GGATTGTGTGGGAGCAATCGATGGCATTCACTTCCCTGTGATGGTTGGTGTTGATGAGCAAGGACCTTTTCGAAATAAGGATGGACTACTTTCTCAAAATGTTTTGGCAG
CATGCTCATTTGATCTCAAGTTTCATTACGTTCTAGCAGGCTGGGAAGGATCGGCATCGGATTTTCAAGTTCTGAACTCAGCACTAACGAGACGAAACAAACTAAGTGTT
CCTGAAGGTAAATACTACCTTGTGGACCAAAAATATATGAACATGCCTGGTTTTATTGCCCCTTATCATGATGTCCCCTATTATTCAAAGGAATTTCCTGCTGGCTATCA
TCCTCAAGATGCCAAGGAGCTATTTAATCTACGGCATTCATTGTTGCGCAATGCAACCGATAGAACTTTTGGAGCTCTAAAGGCGCGCTTCCCCATACTAATGTTAGCTC
CTCCTTACCCATTACAGACACAAGTTAAGTTGGTCGTTGCGACATGTGCGATTCACAATTACATTCGGAGGGAGAACCCCGATGATTGGCTCTTCAGATTGTATGAACAA
GACCAGGTTTCCCATATGGAGGGTTCATTACCTCCAATGGAAACAGAACAGATGATGACACACATTGAGACCCCAGCTGTGGGCATTGCTTTTGAAACAGAAGAACTAGA
AATTACATCTCAGTTGAGGGATCAAATTGCAACTGGATTGTGGAGTGACTATATTAATGGTATATCACCCATCGAGGTGCATGTAAAAAGGTGCGCTCTAGGTGTGTGCC
TTCCAAAACTGTTTCTTCTTTACTTTGTTTCTTTTATTTCTCTTATACAACATTCCTTTGGTTTCCATCTGAAGTACCCTAAGGGCCAAGTTCCCTACATACTTTATTGG
AATCGTTTTTATGTTATTAGTTGTTCGTGTGGGGGAGTGAGCTTTGGGCCTTGGTATTTCTTGGATTCTCTGACTTCCTCTATGCTCAATCCGGCGAACGATCTGTTACC
GCCGCCGTCTTCTCCGACGAATTCATCAATTTCCTCCTCCGATCTCGACACAGAGTCCACCGGTTCCTTCTTCCATGACCGGAGCACGAGCTTGGGGGCTTTAATGGGAG
TCAGCTTCCCGGCGATTACCTTCCGGTTGCCGTCACAGAACCGAGACCAACACGCCGCCGCAGCCGGAGGAGCTTCCAAGAAGCCGAAGAGAAAACCCGCGGCGGCGGCG
GCACTGGTCGGAGATCGGAAGCGGCGGTGGTGGCGGCTGTGCAGAGACGACGGCGTTAAGCCGGCATCTCTGGGCGAGTTTCTTGAGGTCGAACGGAGGTTTGGGGACGG
TGGTTTTTTCGGAAACGGGGTGGATCTGGATGGCGTGGTTGCCGCATATCATCGGAGGACTGGCCGGAATTTGTTCGCCGACGGGAGAATTCTCCCGCCGGAGGAGGAAG
CATCGCCGGCCGCCGCTCTTTGCCGGTTTTCTGTTTCACTAACCGGGATTTGCAGCGGCGGTGCGGGCTAA
Protein sequenceShow/hide protein sequence
MESSDDEKDGSYGKYIPREPTHNIVSNGAKFVDEVLNGSNERCLDNFRMDKHIFYKLCDILQGKGLLRHTNRVKIEEQLAIFMFIIGHNLRTRAVQELFRYSGETISRHF
NNVLNAIMAISLDFFQPSGSDVPPEILEDPRFYPYFKDCVGAIDGIHFPVMVGVDEQGPFRNKDGLLSQNVLAACSFDLKFHYVLAGWEGSASDFQVLNSALTRRNKLSV
PEGKYYLVDQKYMNMPGFIAPYHDVPYYSKEFPAGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILMLAPPYPLQTQVKLVVATCAIHNYIRRENPDDWLFRLYEQ
DQVSHMEGSLPPMETEQMMTHIETPAVGIAFETEELEITSQLRDQIATGLWSDYINGISPIEVHVKRCALGVCLPKLFLLYFVSFISLIQHSFGFHLKYPKGQVPYILYW
NRFYVISCSCGGVSFGPWYFLDSLTSSMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGALMGVSFPAITFRLPSQNRDQHAAAAGGASKKPKRKPAAAA
ALVGDRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGGFFGNGVDLDGVVAAYHRRTGRNLFADGRILPPEEEASPAAALCRFSVSLTGICSGGAG