; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016114 (gene) of Snake gourd v1 genome

Gene IDTan0016114
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationLG05:72660316..72664570
RNA-Seq ExpressionTan0016114
SyntenyTan0016114
Gene Ontology termsNA
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain
IPR040344 - Uncharacterized protein At3g17950-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF9667516.1 hypothetical protein SADUNF_Sadunf15G0031300 [Salix dunnii]1.7e-10643.29Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------
        ME+SDDEKDG +G  + +E S + ++NG KFVDEVL G ++ CLENFRMDK  FY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------

Query:  -----------NSRNG------------------------------KYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATD
                   N  N                               KYYLVD KY NMPGFIAPY+ +  H NE+ GGY P+D++ELFN RH LLRN TD
Subjt:  -----------NSRNG------------------------------KYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATD

Query:  RTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIA
        R FGALKARFPIL+SAPPYPLQTQVKLVVA CA+HNYIRREKPDDW+F++++ D V  ME+SLP +E E  +  VE P +DIAF+TE+LE +SQLRD+IA
Subjt:  RTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIA

Query:  TELWSDYINDISPMKVRFF--STTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFHLKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPA
         ++W DY ND+S +  R    S   K+T S       ++ HV + S                                    +++ F  Q          
Subjt:  TELWSDYINDISPMKVRFF--STTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFHLKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPA

Query:  NDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRL
                             T STGSFFHDRST+LGTLMGV+FPAITFR PSQ R   AA+     V    S      KR+   A A+   R+RRWW L
Subjt:  NDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRL

Query:  CRD-DGVKPASLGDFLEVERRFGDGAFYGNAMDLEGVVAGDQQR----NGRNLFADGRVLPPAQTEEE-----ASAAGALCRFSVSLTGICSGGAG
        CRD  G KPASL +FLEVERRFGD      A+++EGV+  +  R    NGR LFADGRVLPPA+  ++     +S+AGAL RF VSLTGICSGG G
Subjt:  CRD-DGVKPASLGDFLEVERRFGDGAFYGNAMDLEGVVAGDQQR----NGRNLFADGRVLPPAQTEEE-----ASAAGALCRFSVSLTGICSGGAG

KAG6753832.1 hypothetical protein POTOM_041832 [Populus tomentosa]9.6e-10242.61Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------
        ME+SDDEKDG +  Y+ ++ S   + NG KFVDEVL+GQ++ CLENFRMDK  FY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------

Query:  -----------------NSRN------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPP
                           RN      GKYYLVD KY NMPGFIAPY+ I  H NEY GGY P+D++ELFN RH++LRN TDR+FGALK+RFPIL+SAPP
Subjt:  -----------------NSRN------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPP

Query:  YPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRF
        YPLQTQVKLVVA CA+HNYIRREKP D +F++YE D V  ME+SLP +E E  +  VE P +D+AF+TE+LE +SQLRD+IA                  
Subjt:  YPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRF

Query:  FSTTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFHLKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDL
                                                  +P                      F+D                            S L
Subjt:  FSTTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFHLKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDL

Query:  DTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRD-DGVKPASLGDFLEVER
          +STGSFFHDRST+LGTLMGV+F AITFR PSQ+R   AA++         + +    KR+   A ++   R+RRWW LCRD  G KPASLG+FLEVER
Subjt:  DTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRD-DGVKPASLGDFLEVER

Query:  RFGDGAFYGNAMDLEGVVAGDQQR----NGRNLFADGRVLPPAQTEEE----ASAAGALCRFSVSLTGICSGGAG
        RFGD      A +LEGV+  +Q R    NGR LFADGRVLPPA   ++    +S AGAL RF VSLTGICSGG G
Subjt:  RFGDGAFYGNAMDLEGVVAGDQQR----NGRNLFADGRVLPPAQTEEE----ASAAGALCRFSVSLTGICSGGAG

TXG50878.1 hypothetical protein EZV62_023402 [Acer yangbiense]4.9e-10640.75Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------
        MESSDDE+DG YG  I +  SHNV SNG KFVDEVLNG +E CLENFRMDK +FY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------

Query:  -----------------------------------NSRN------------------------------------------------GKYYLVDQKYMNM
                                            SRN                                                GKYYL+D KY NM
Subjt:  -----------------------------------NSRN------------------------------------------------GKYYLVDQKYMNM

Query:  PGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPH
        PGFI+PY D+PYHSN+ P GYHPQDAKELFN R SL R+ATDR FGALK RFPIL+SAPPYPLQTQVKLVV  CA+HN+IRREKPDDW+FR+YEQD    
Subjt:  PGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPH

Query:  MEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRFFSTTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFH
        M++S+P +E EQ + H     +DI+F+ ++L++TSQLRD+IA E+W DYI                                                  
Subjt:  MEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRFFSTTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFH

Query:  LKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHA
                                                                     T STGSFFHDRST+LGTLMGVSFPAITFR PSQ+RD H 
Subjt:  LKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHA

Query:  AA-AGAAGVGGAASR---KSKKPKRKTTTAPALVADRKRRWWRLCRDD-GVKPASLGDFLEVERRFGDGAFYGNAMD-LEGVV---AGDQQ--RNGRNLF
         A A   GV G  S+   K+KK KR         A R+ RWWR+C DD   KPASLG+FLEVERRFGD AF+  A D LEGVV   A ++Q  RNGR LF
Subjt:  AA-AGAAGVGGAASR---KSKKPKRKTTTAPALVADRKRRWWRLCRDD-GVKPASLGDFLEVERRFGDGAFYGNAMD-LEGVV---AGDQQ--RNGRNLF

Query:  ADGRVLPPAQT------EEEASAAGALCRFSVSLTGIC
        ADGRVLPPA +      +E +S A  LC+F VS+T IC
Subjt:  ADGRVLPPAQT------EEEASAAGALCRFSVSLTGIC

TYJ99038.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]7.6e-19256.19Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------
        MESSDDEKDGTYGKY+ REPSHN+VSNGAKFVDEVLNGQNE CLE+FRMDKH+FY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------NSRN------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ
                       RN      GKYYLVDQKYMNMPGF+APYHDI YHS EYPGGYHPQDAKELFNLRHSLLRNAT+RTFGALKARFPILLSAPPYPLQ
Subjt:  -------------NSRN------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRFFSTT
        TQVKLVVATCAIHNYIRRE PDDW FRLYEQDHVPHMEDSLPQ+E EQL A++ETP VD+AFETEELEI SQLRD+IA E+W                  
Subjt:  TQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRFFSTT

Query:  AKETTSGASDLTCKSNHVFIRSETVSFLLLH-----------------------------------------SFGFHLKVPQCGGVSFGLLYLFIGFSDL
               A+ +TCKSNHVF+RS+  +F LL                                          + G+      CGGVSF +LYL     D 
Subjt:  AKETTSGASDLTCKSNHVFIRSETVSFLLLH-----------------------------------------SFGFHLKVPQCGGVSFGLLYLFIGFSDL

Query:  LFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKT
        L         KEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AA   AG G   SRKSKK KRKT
Subjt:  LFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKT

Query:  TTAPALVADRKRRWWRLCRDDGVKPASLGDFLEVERRFGDGAFYGNAMDLEGVVAGDQQRNGRNLFADGRVLPPAQTEEEASAAGALCRFSVSLTGICSG
        TTAPALVADRKRRWWRLCRDDGVKPASLG+FLEVERRFGDGAFYGNA+DLEGVV  DQQRNGR+LFADGRVLPPAQTEE+ SA GALCRFSVSLTGICSG
Subjt:  TTAPALVADRKRRWWRLCRDDGVKPASLGDFLEVERRFGDGAFYGNAMDLEGVVAGDQQRNGRNLFADGRVLPPAQTEEEASAAGALCRFSVSLTGICSG

Query:  GAG
        GAG
Subjt:  GAG

XP_038895429.1 putative nuclease HARBI1 isoform X1 [Benincasa hispida]2.4e-9753.32Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------
        MESSDDEKDGTYGKY+ REPSHN+VSNGAKFVDEVLNGQNE CL+ FRMDKHIFY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------NSRN------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ
                       RN      GKYYLVDQKYMNMPGFIAPYHDIPYHS EYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ
Subjt:  -------------NSRN------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPM
        TQVKLVVATCAIHNYIRRE PDDWLFRLYEQDHVPHMEDSLPQ++ EQL  H+ETP VDIAFETEELEITSQLRD IA ELWSDYINDISPM
Subjt:  TQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPM

TrEMBL top hitse value%identityAlignment
A0A0A0LQI1 DDE Tnp4 domain-containing protein1.8e-23180.15Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFYNSRNGKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKE
        MESSDDEKDGTYGKY+ REPSHN+VSNGAKFVDEVLNGQNE CL++FRMDKH    SRNGKYYLVDQKYMNMPGF+APYHDI Y S EYPGGYHPQDAKE
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFYNSRNGKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKE

Query:  LFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFET
        LFNLRHSLLRNAT+RTF ALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRRE PDDW FRLYEQDHVPHMEDSLPQ+E EQL A++ETP VD+AFET
Subjt:  LFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFET

Query:  EELEITSQLRDAIATELWSDYINDISPMKVRFFSTTAKETTSG-ASDLTCKSNHVFIRSET------VSFLLLHSFGFHLKVPQ----------------
        EELEITSQLRD+IA E+WSDYINDISPMKV+F  T AKE   G A+ +TCKSNHVF+RS        VSFLLLHSFGFHLK PQ                
Subjt:  EELEITSQLRDAIATELWSDYINDISPMKVRFFSTTAKETTSG-ASDLTCKSNHVFIRSET------VSFLLLHSFGFHLKVPQ----------------

Query:  ---CGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAA
           CGGVSF +LYL     D L         KEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AA
Subjt:  ---CGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAA

Query:  AGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRDDGVKPASLGDFLEVERRFGDGAFYGNAMDLEGVVAGDQQRNGRNLFADGRVLPPAQTEE
           AG G   SRKSKK KRKTT APALVADRKRRWWRLCRDDGVKPASLG+FLEVERRFGDGAFYGNA+DLEGVVA DQQRNGR+LFADGRVLPPAQTEE
Subjt:  AGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRDDGVKPASLGDFLEVERRFGDGAFYGNAMDLEGVVAGDQQRNGRNLFADGRVLPPAQTEE

Query:  EASAAGALCRFSVSLTGICSGGAG
        + S A  LCRFSVSLTGICSGGAG
Subjt:  EASAAGALCRFSVSLTGICSGGAG

A0A5C7H1M7 DDE Tnp4 domain-containing protein2.4e-10640.75Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------
        MESSDDE+DG YG  I +  SHNV SNG KFVDEVLNG +E CLENFRMDK +FY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------

Query:  -----------------------------------NSRN------------------------------------------------GKYYLVDQKYMNM
                                            SRN                                                GKYYL+D KY NM
Subjt:  -----------------------------------NSRN------------------------------------------------GKYYLVDQKYMNM

Query:  PGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPH
        PGFI+PY D+PYHSN+ P GYHPQDAKELFN R SL R+ATDR FGALK RFPIL+SAPPYPLQTQVKLVV  CA+HN+IRREKPDDW+FR+YEQD    
Subjt:  PGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPH

Query:  MEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRFFSTTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFH
        M++S+P +E EQ + H     +DI+F+ ++L++TSQLRD+IA E+W DYI                                                  
Subjt:  MEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRFFSTTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFH

Query:  LKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHA
                                                                     T STGSFFHDRST+LGTLMGVSFPAITFR PSQ+RD H 
Subjt:  LKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHA

Query:  AA-AGAAGVGGAASR---KSKKPKRKTTTAPALVADRKRRWWRLCRDD-GVKPASLGDFLEVERRFGDGAFYGNAMD-LEGVV---AGDQQ--RNGRNLF
         A A   GV G  S+   K+KK KR         A R+ RWWR+C DD   KPASLG+FLEVERRFGD AF+  A D LEGVV   A ++Q  RNGR LF
Subjt:  AA-AGAAGVGGAASR---KSKKPKRKTTTAPALVADRKRRWWRLCRDD-GVKPASLGDFLEVERRFGDGAFYGNAMD-LEGVV---AGDQQ--RNGRNLF

Query:  ADGRVLPPAQT------EEEASAAGALCRFSVSLTGIC
        ADGRVLPPA +      +E +S A  LC+F VS+T IC
Subjt:  ADGRVLPPAQT------EEEASAAGALCRFSVSLTGIC

A0A5D3BLI7 Putative nuclease HARBI13.7e-19256.19Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------
        MESSDDEKDGTYGKY+ REPSHN+VSNGAKFVDEVLNGQNE CLE+FRMDKH+FY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------NSRN------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ
                       RN      GKYYLVDQKYMNMPGF+APYHDI YHS EYPGGYHPQDAKELFNLRHSLLRNAT+RTFGALKARFPILLSAPPYPLQ
Subjt:  -------------NSRN------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQ

Query:  TQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRFFSTT
        TQVKLVVATCAIHNYIRRE PDDW FRLYEQDHVPHMEDSLPQ+E EQL A++ETP VD+AFETEELEI SQLRD+IA E+W                  
Subjt:  TQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPMKVRFFSTT

Query:  AKETTSGASDLTCKSNHVFIRSETVSFLLLH-----------------------------------------SFGFHLKVPQCGGVSFGLLYLFIGFSDL
               A+ +TCKSNHVF+RS+  +F LL                                          + G+      CGGVSF +LYL     D 
Subjt:  AKETTSGASDLTCKSNHVFIRSETVSFLLLH-----------------------------------------SFGFHLKVPQCGGVSFGLLYLFIGFSDL

Query:  LFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKT
        L         KEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AA   AG G   SRKSKK KRKT
Subjt:  LFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKT

Query:  TTAPALVADRKRRWWRLCRDDGVKPASLGDFLEVERRFGDGAFYGNAMDLEGVVAGDQQRNGRNLFADGRVLPPAQTEEEASAAGALCRFSVSLTGICSG
        TTAPALVADRKRRWWRLCRDDGVKPASLG+FLEVERRFGDGAFYGNA+DLEGVV  DQQRNGR+LFADGRVLPPAQTEE+ SA GALCRFSVSLTGICSG
Subjt:  TTAPALVADRKRRWWRLCRDDGVKPASLGDFLEVERRFGDGAFYGNAMDLEGVVAGDQQRNGRNLFADGRVLPPAQTEEEASAAGALCRFSVSLTGICSG

Query:  GAG
        GAG
Subjt:  GAG

A0A6J1EE58 putative nuclease HARBI1 isoform X22.6e-9753.45Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFYN--------------------------------------------
        MESSDDEKDG+YGKY+ REPSHN+V+NGAKFVDEVLNGQNE CLENFRMDKHIFY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFYN--------------------------------------------

Query:  -----------------------------------------------------------------------------SRN--------------------
                                                                                     S+N                    
Subjt:  -----------------------------------------------------------------------------SRN--------------------

Query:  ----------------------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQT
                              GKYYLVDQKYMNMPGFIAPYHDIPY S EY GGYHPQDAKELFNLRHSLLRNATDRTFGALK RFPILLSAPPYPLQT
Subjt:  ----------------------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQT

Query:  QVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPM
        QVKLVVATCAIHNYIRRE PDDWLF+LYEQDHV HMEDSLPQ+E EQL AH+ETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPM
Subjt:  QVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPM

A0A6J1KJM0 putative nuclease HARBI18.4e-9653.45Show/hide
Query:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFYN--------------------------------------------
        MESSDDEKDG+YGKY+ REPSHN+VSNGAKFVDEVLNGQNE CLENFRMDKHIFY                                             
Subjt:  MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFYN--------------------------------------------

Query:  -----------------------------------------------------------------------------SRN--------------------
                                                                                     S+N                    
Subjt:  -----------------------------------------------------------------------------SRN--------------------

Query:  ----------------------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQT
                              GKYYLVDQKYMNMPGFIAPYHDIPY S EY GGYHPQDAKELFNLRHSLLRNATDRTFGALK RFPILLSAPPYPLQT
Subjt:  ----------------------GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQT

Query:  QVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPM
        QVKLVVATCAIHNYIRRE PDD LFRLYEQDHV HMEDSLPQ+E EQL AH+ETPTVDIAFETEE EITSQLRDAIATELWSDYINDISPM
Subjt:  QVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDISPM

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179509.5e-3650.72Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRD
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FR  S+     + A   A    A  R++ + KR  + +      R+R+WWR CRD
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRD

Query:  D-------GV-------KPASLGDFLEVERRFGDGAFYGNA-MDLEGVVAG---DQQ--RNGRNLFADGRVLPPAQTE----EEASAAGALCRFSVSLTG
        D       G+       K +SLG++LEVERRFGD A Y +A  +LE  V     DQQ     R LFADGRVLPPA  E    E    A +LCRF VSLTG
Subjt:  D-------GV-------KPASLGDFLEVERRFGDGAFYGNA-MDLEGVVAG---DQQ--RNGRNLFADGRVLPPAQTE----EEASAAGALCRFSVSLTG

Query:  ICSGGAG
        ICSGG G
Subjt:  ICSGGAG

Arabidopsis top hitse value%identityAlignment
AT3G17950.1 unknown protein6.7e-3750.72Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRD
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FR  S+     + A   A    A  R++ + KR  + +      R+R+WWR CRD
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRD

Query:  D-------GV-------KPASLGDFLEVERRFGDGAFYGNA-MDLEGVVAG---DQQ--RNGRNLFADGRVLPPAQTE----EEASAAGALCRFSVSLTG
        D       G+       K +SLG++LEVERRFGD A Y +A  +LE  V     DQQ     R LFADGRVLPPA  E    E    A +LCRF VSLTG
Subjt:  D-------GV-------KPASLGDFLEVERRFGDGAFYGNA-MDLEGVVAG---DQQ--RNGRNLFADGRVLPPAQTE----EEASAAGALCRFSVSLTG

Query:  ICSGGAG
        ICSGG G
Subjt:  ICSGGAG

AT3G17950.2 unknown protein2.3e-2143.68Show/hide
Query:  MGVSFPA---ITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRDD-------GV-------KPASLGDFLEVERRFG
        MG SF A   + FR  S+     + A   A    A  R++ + KR  + +      R+R+WWR CRDD       G+       K +SLG++LEVERRFG
Subjt:  MGVSFPA---ITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRDD-------GV-------KPASLGDFLEVERRFG

Query:  DGAFYGNA-MDLEGVVAG---DQQ--RNGRNLFADGRVLPPAQTE----EEASAAGALCRFSVSLTGICSGGAG
        D A Y +A  +LE  V     DQQ     R LFADGRVLPPA  E    E    A +LCRF VSLTGICSGG G
Subjt:  DGAFYGNA-MDLEGVVAG---DQQ--RNGRNLFADGRVLPPAQTE----EEASAAGALCRFSVSLTGICSGGAG

AT4G10890.1 unknown protein7.0e-1034.82Show/hide
Query:  NVVSNGAKFVDEVLNGQN-------ECCLENFRMDKHIFYNSRNGKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDR
        +V+S   KF  + L  Q        + C  N     H      N KYYLV+  Y    G++ P+  I YH  ++  G  P   +ELFN +H  LR+  DR
Subjt:  NVVSNGAKFVDEVLNGQN-------ECCLENFRMDKHIFYNSRNGKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDR

Query:  TFGALKARFPIL
        TFG  KA++ IL
Subjt:  TFGALKARFPIL

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)7.0e-1833.13Show/hide
Query:  KYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYH-PQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDD
        K+YLVD  + N   F+AP+  + YH  E+ G    P+   ELFNLRH  LRN  +R FG  K+RF I  SAPP+  + Q  LV+   A+HN++R+E   D
Subjt:  KYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYH-PQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDD

Query:  WLFRLYEQDHVPHM--EDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYIN
              E D    +  E  +   E   +  +       +  + ++ E T+  R ++A ++W D  N
Subjt:  WLFRLYEQDHVPHM--EDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYIN

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.7e-4354.76Show/hide
Query:  GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDD
        GKYY+VD KY N+PGFIAPYH +  +S E        +AKE+FN RH LL  A  RTFGALK RFPILLSAPPYPLQTQVKLV+A CA+HNY+R EKPDD
Subjt:  GKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLRNATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDD

Query:  WLFRLYEQDHVPHM-EDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDIS
         +FR++E++ +    ED    +E EQ    VE    +  F  EE+E + +LRD IA+ELW+ Y+ ++S
Subjt:  WLFRLYEQDHVPHM-EDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSDYINDIS

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)6.1e-0648.98Show/hide
Query:  EKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY
        E+D      + +E S   +S+G KFV ++LNG NE C ENFRMDK +FY
Subjt:  EKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATATTCGAAGAGAACCGAGTCATAATGTAGTATCTAATGGTGCTAAATTTGTAGATGAAGTACTCAA
TGGACAAAATGAATGTTGTTTAGAAAATTTCCGCATGGACAAGCACATATTTTATAATAGTAGAAATGGTAAATACTACCTTGTAGACCAAAAATATATGAACATGCCTG
GTTTTATTGCCCCTTATCATGATATTCCCTATCATTCAAATGAATATCCTGGTGGTTATCATCCTCAAGATGCCAAAGAGCTATTTAATCTACGTCATTCATTGTTGCGC
AATGCAACTGATAGAACTTTTGGAGCTCTAAAAGCACGCTTCCCCATATTATTGTCAGCTCCTCCTTACCCGTTACAGACACAAGTTAAGTTGGTTGTTGCGACATGTGC
GATTCACAATTACATTCGGAGGGAGAAACCCGATGATTGGCTCTTTAGATTGTATGAACAAGACCATGTTCCCCATATGGAGGATTCATTGCCTCAAATGGAGACAGAAC
AGCTGATGGCACATGTTGAGACTCCAACGGTGGACATTGCTTTCGAGACAGAAGAACTAGAAATTACATCACAGTTGCGGGATGCTATTGCAACTGAATTGTGGAGTGAC
TATATTAATGATATATCACCCATGAAGGTCAGATTCTTCAGTACGACTGCCAAGGAGACAACATCAGGGGCAAGTGACTTAACTTGCAAGAGCAACCATGTCTTTATTAG
ATCAGAAACTGTTTCTTTTTTACTGCTACATTCCTTTGGCTTCCATCTGAAAGTACCCCAATGTGGGGGAGTGAGCTTTGGGCTTTTGTATTTATTTATTGGATTCTCTG
ACCTCCTCTTCTGCTTCATTGATCAAGAGAGAGCAAAAGAGATGTTGAATCCGGCGAACGATCTGTTACCGCCGCCGTCTTCTCCGACCAATTCATCCATTTCCTCCTCC
GATCTCGACACTGAGTCTACAGGTTCGTTCTTCCATGACAGGAGCACGAGCTTGGGGACCTTAATGGGGGTCAGCTTCCCGGCGATTACCTTCCGAGTGCCCTCACAGAA
TCGAGACCAACACGCCGCCGCGGCCGGGGCTGCCGGCGTTGGCGGCGCCGCTTCCCGCAAGAGCAAGAAGCCGAAGAGAAAAACGACGACGGCGCCGGCACTCGTCGCAG
ATCGGAAGCGGAGGTGGTGGCGACTCTGCAGGGACGACGGCGTTAAGCCGGCATCTCTGGGTGACTTTCTCGAGGTGGAACGGAGATTTGGGGACGGTGCTTTTTATGGC
AACGCGATGGATCTGGAAGGCGTGGTTGCGGGGGATCAACAGAGGAATGGCAGGAATTTGTTCGCTGACGGGAGAGTTCTTCCGCCGGCGCAAACGGAAGAAGAAGCGTC
GGCGGCCGGTGCTCTATGCCGATTTTCCGTATCGCTCACTGGGATTTGCAGCGGCGGTGCCGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGTTCTGATGATGAAAAGGATGGAACTTATGGGAAATATATTCGAAGAGAACCGAGTCATAATGTAGTATCTAATGGTGCTAAATTTGTAGATGAAGTACTCAA
TGGACAAAATGAATGTTGTTTAGAAAATTTCCGCATGGACAAGCACATATTTTATAATAGTAGAAATGGTAAATACTACCTTGTAGACCAAAAATATATGAACATGCCTG
GTTTTATTGCCCCTTATCATGATATTCCCTATCATTCAAATGAATATCCTGGTGGTTATCATCCTCAAGATGCCAAAGAGCTATTTAATCTACGTCATTCATTGTTGCGC
AATGCAACTGATAGAACTTTTGGAGCTCTAAAAGCACGCTTCCCCATATTATTGTCAGCTCCTCCTTACCCGTTACAGACACAAGTTAAGTTGGTTGTTGCGACATGTGC
GATTCACAATTACATTCGGAGGGAGAAACCCGATGATTGGCTCTTTAGATTGTATGAACAAGACCATGTTCCCCATATGGAGGATTCATTGCCTCAAATGGAGACAGAAC
AGCTGATGGCACATGTTGAGACTCCAACGGTGGACATTGCTTTCGAGACAGAAGAACTAGAAATTACATCACAGTTGCGGGATGCTATTGCAACTGAATTGTGGAGTGAC
TATATTAATGATATATCACCCATGAAGGTCAGATTCTTCAGTACGACTGCCAAGGAGACAACATCAGGGGCAAGTGACTTAACTTGCAAGAGCAACCATGTCTTTATTAG
ATCAGAAACTGTTTCTTTTTTACTGCTACATTCCTTTGGCTTCCATCTGAAAGTACCCCAATGTGGGGGAGTGAGCTTTGGGCTTTTGTATTTATTTATTGGATTCTCTG
ACCTCCTCTTCTGCTTCATTGATCAAGAGAGAGCAAAAGAGATGTTGAATCCGGCGAACGATCTGTTACCGCCGCCGTCTTCTCCGACCAATTCATCCATTTCCTCCTCC
GATCTCGACACTGAGTCTACAGGTTCGTTCTTCCATGACAGGAGCACGAGCTTGGGGACCTTAATGGGGGTCAGCTTCCCGGCGATTACCTTCCGAGTGCCCTCACAGAA
TCGAGACCAACACGCCGCCGCGGCCGGGGCTGCCGGCGTTGGCGGCGCCGCTTCCCGCAAGAGCAAGAAGCCGAAGAGAAAAACGACGACGGCGCCGGCACTCGTCGCAG
ATCGGAAGCGGAGGTGGTGGCGACTCTGCAGGGACGACGGCGTTAAGCCGGCATCTCTGGGTGACTTTCTCGAGGTGGAACGGAGATTTGGGGACGGTGCTTTTTATGGC
AACGCGATGGATCTGGAAGGCGTGGTTGCGGGGGATCAACAGAGGAATGGCAGGAATTTGTTCGCTGACGGGAGAGTTCTTCCGCCGGCGCAAACGGAAGAAGAAGCGTC
GGCGGCCGGTGCTCTATGCCGATTTTCCGTATCGCTCACTGGGATTTGCAGCGGCGGTGCCGGCTAA
Protein sequenceShow/hide protein sequence
MESSDDEKDGTYGKYIRREPSHNVVSNGAKFVDEVLNGQNECCLENFRMDKHIFYNSRNGKYYLVDQKYMNMPGFIAPYHDIPYHSNEYPGGYHPQDAKELFNLRHSLLR
NATDRTFGALKARFPILLSAPPYPLQTQVKLVVATCAIHNYIRREKPDDWLFRLYEQDHVPHMEDSLPQMETEQLMAHVETPTVDIAFETEELEITSQLRDAIATELWSD
YINDISPMKVRFFSTTAKETTSGASDLTCKSNHVFIRSETVSFLLLHSFGFHLKVPQCGGVSFGLLYLFIGFSDLLFCFIDQERAKEMLNPANDLLPPPSSPTNSSISSS
DLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAAAGAAGVGGAASRKSKKPKRKTTTAPALVADRKRRWWRLCRDDGVKPASLGDFLEVERRFGDGAFYG
NAMDLEGVVAGDQQRNGRNLFADGRVLPPAQTEEEASAAGALCRFSVSLTGICSGGAG