; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027793 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027793
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon protein
Genome locationchr8:5126470..5127611
RNA-Seq ExpressionLag0027793
SyntenyLag0027793
Gene Ontology termsNA
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035620.1 retrotransposon protein [Cucumis melo var. makuwa]2.5e-11063.55Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FLHILAHDVKNRV++ +F + GET+SRHFN VL  V+RLHE L+KKP PV   CTD RW+WF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        WRG  NAP+T KEFFNMKHSSARNVIERAFG+LKGRWAILR KSYYPV +Q RTI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRD+LAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

KAA0046727.1 retrotransposon protein [Cucumis melo var. makuwa]5.5e-11063.24Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FLHILAHDVKNRV++ +F + GET+SRHFN VL  V+RLH+ L+KKP PV   CTD RW+WF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        WRG  NAP+T KEFFNMKHSSARNVIERAFG+LKGRWAILR KSYYPV +Q RTI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRD+LAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

KAA0048361.1 retrotransposon protein [Cucumis melo var. makuwa]1.1e-11063.86Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FLHILAHDVKNRV++ +F + GET+SRHFN VL  V+RLHE L+KKP PV   CTD RWKWF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        WRG  NAP+T KEFFNMKHSSARNVIERAFG+LKGRWAILR KSYYPV +Q RTI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRD+LAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

KAA0051057.1 retrotransposon protein [Cucumis melo var. makuwa]2.7e-10962.93Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FLHIL HDVKNRV++ +F + GET+SRHFN VL  V+RLHE L+KKP PV   CTD RW+WF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        WRG  NAP+T KEFFNMKHS ARNVIERAFG+LKGRWAILR KSYYPV +Q RTI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRD+LAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

KAA0065893.1 retrotransposon protein [Cucumis melo var. makuwa]3.0e-10862.93Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FL+ILAHDVKNRV++ +F + GET+SRHFN VL  V+RLHE L+KKP PV   CTD RW+WF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        W G  NAP+T KEFFNMKHSSARNVIERAFG+LKGRWAILR KSYYPV +Q  TI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRDDLAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

TrEMBL top hitse value%identityAlignment
A0A5A7TXW1 Retrotransposon protein2.7e-11063.24Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FLHILAHDVKNRV++ +F + GET+SRHFN VL  V+RLH+ L+KKP PV   CTD RW+WF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        WRG  NAP+T KEFFNMKHSSARNVIERAFG+LKGRWAILR KSYYPV +Q RTI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRD+LAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

A0A5A7U476 Retrotransposon protein5.4e-11163.86Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FLHILAHDVKNRV++ +F + GET+SRHFN VL  V+RLHE L+KKP PV   CTD RWKWF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        WRG  NAP+T KEFFNMKHSSARNVIERAFG+LKGRWAILR KSYYPV +Q RTI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRD+LAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

A0A5A7U9H2 Retrotransposon protein1.3e-10962.93Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FLHIL HDVKNRV++ +F + GET+SRHFN VL  V+RLHE L+KKP PV   CTD RW+WF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        WRG  NAP+T KEFFNMKHS ARNVIERAFG+LKGRWAILR KSYYPV +Q RTI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRD+LAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

A0A5A7VCK9 Retrotransposon protein1.5e-10862.93Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FL+ILAHDVKNRV++ +F + GET+SRHFN VL  V+RLHE L+KKP PV   CTD RW+WF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        W G  NAP+T KEFFNMKHSSARNVIERAFG+LKGRWAILR KSYYPV +Q  TI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRDDLAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

A0A5D3BDX0 Retrotransposon protein1.2e-11063.55Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG
        MDRRCF+IL  LLR + GLT T+VVDVEEMVA+FLHILAHDVKNRV++ +F + GET+SRHFN VL  V+RLHE L+KKP PV   CTD RW+WF+NCLG
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLG

Query:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE
        ALDGTY+KVNV   D  RYRTRKGE+ATN+  VCDTK                       +S  + L                 EGFLAPYRG+RYHL E
Subjt:  ALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTK----------------------EISHSSYL--------------GGKEGFLAPYRGERYHLSE

Query:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD
        WRG  NAP+T KEFFNMKHSSARNVIERAFG+LKGRWAILR KSYYPV +Q RTI AC LLHNLINREM + +I D++DEVDS  ATT +++I++IETS+
Subjt:  WRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSD

Query:  EWSQWRDDLAEHMFSEWELRN
        EWSQWRD+LAE MF+EWELRN
Subjt:  EWSQWRDDLAEHMFSEWELRN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein9.0e-1829.32Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPD-------PVTTSCTDPRWK
        M   CF+ L  +L+    L  T  + +EE VA+FL I  H+   R V  +F +  ETV R F  VL     L    I+ P        P         W 
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPD-------PVTTSCTDPRWK

Query:  WFQNCLGALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTKEI--------------------------------SHSSYL-----GGKEGFLAPYR
        +F   +GA+DGT+V V V       Y  R    + N+  +CD K +                                S   YL       K+G LAPYR
Subjt:  WFQNCLGALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTKEI--------------------------------SHSSYL-----GGKEGFLAPYR

Query:  GE-----RYHLSEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGR
               RYH+S++   G  P    E FN  H+S R+VIER F + K +
Subjt:  GE-----RYHLSEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGR

AT4G10890.1 unknown protein3.8e-0847.27Show/hide
Query:  GFLAPYRGERYHLSEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAIL
        G+L P+R   YHL ++ G G  P T +E FN KH   R+VI+R FG+ K +W IL
Subjt:  GFLAPYRGERYHLSEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAIL

AT5G28950.1 unknown protein1.0e-0535.59Show/hide
Query:  WKWFQNCLGALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTKEISHSSYLGGKEG
        + +F++C+GA+D T++   VS    P +R RKG+I+ NM   C+  ++     L G EG
Subjt:  WKWFQNCLGALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTKEISHSSYLGGKEG

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.1e-1835.66Show/hide
Query:  FLAPYRGERYHLSEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINRE--MGDREIPDDL-DEVDS
        FLAP+RG RYHL E+ G    P TP E FN++H S RNVIER FG+ K R+AI +S   +  + Q   +  C  LHN + +E    + + PD++ +E D 
Subjt:  FLAPYRGERYHLSEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINRE--MGDREIPDDL-DEVDS

Query:  ASATTDSENINFIETS----------DEWSQWRDDLAEHMFSE
         +   ++ N N I+            +  + WR  +AE M+ +
Subjt:  ASATTDSENINFIETS----------DEWSQWRDDLAEHMFSE

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.7e-2227.65Show/hide
Query:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIK-KPDPVTTSCTDPRWKWFQNCL
        MD+  F  L  LL+    L  T  + +E  +AIFL I+ H+++ R V+  F   GET+SRHFN+VL+ V+ + +   +   +  T    DP   +F++C+
Subjt:  MDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIK-KPDPVTTSCTDPRWKWFQNCL

Query:  GALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTKEISHSSYLGGKE-------------------------------------GFLAPYRGERYHL
        G +D  ++ V V V +   +R   G +  N+ +   + ++  +  L G E                                     GF+APY G     
Subjt:  GALDGTYVKVNVSVLDCPRYRTRKGEIATNMFVVCDTKEISHSSYLGGKE-------------------------------------GFLAPYRGERYHL

Query:  SEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSE
             + N+    KE FN +H      I R FG LK R+ IL S   YP++ Q + + A   LHN +  E  D  +    +E   A A  D E
Subjt:  SEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTIAACYLLHNLINREMGDREIPDDLDEVDSASATTDSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGACAGAAGATGTTTTTCCATACTTCGCACCTTGTTAAGAATAGTTGTCGGCTTAACTGGAACAAAGGTCGTAGATGTAGAAGAGATGGTTGCCATTTTCTTACA
CATCCTAGCCCATGACGTTAAAAATCGAGTAGTTCGTACGCAGTTCGCTAAGTTTGGGGAGACAGTTTCTAGGCACTTCAACTCCGTCCTTCACGGGGTGTTACGACTAC
ATGAAGTTCTTATAAAAAAACCTGATCCAGTCACGACCTCCTGTACGGATCCGAGGTGGAAATGGTTTCAGAATTGCCTTGGTGCGTTAGATGGAACATACGTGAAGGTC
AATGTTAGTGTCCTCGATTGTCCGAGGTATAGAACAAGGAAAGGTGAAATTGCAACAAACATGTTTGTTGTTTGTGATACAAAGGAGATTTCACATTCGTCCTACCTGGG
TGGGAAGGAGGGTTTCCTGGCACCGTATAGAGGGGAACGTTACCACCTATCTGAATGGCGTGGTGCGGGAAATGCACCAACTACTCCCAAAGAATTTTTCAACATGAAGC
ATTCATCTGCAAGGAATGTGATTGAGAGGGCATTCGGTTTGTTAAAAGGAAGGTGGGCCATCCTCCGCTCGAAATCGTACTATCCAGTTCGAATTCAAGGACGGACCATT
GCAGCGTGCTACTTACTACACAATCTTATCAATAGGGAGATGGGTGACCGTGAAATTCCCGATGATCTAGATGAGGTGGATTCTGCATCGGCTACAACTGATTCTGAGAA
TATCAATTTCATTGAGACTTCCGACGAATGGAGCCAGTGGAGGGATGACTTGGCAGAACATATGTTTTCGGAATGGGAGTTACGTAATACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGACAGAAGATGTTTTTCCATACTTCGCACCTTGTTAAGAATAGTTGTCGGCTTAACTGGAACAAAGGTCGTAGATGTAGAAGAGATGGTTGCCATTTTCTTACA
CATCCTAGCCCATGACGTTAAAAATCGAGTAGTTCGTACGCAGTTCGCTAAGTTTGGGGAGACAGTTTCTAGGCACTTCAACTCCGTCCTTCACGGGGTGTTACGACTAC
ATGAAGTTCTTATAAAAAAACCTGATCCAGTCACGACCTCCTGTACGGATCCGAGGTGGAAATGGTTTCAGAATTGCCTTGGTGCGTTAGATGGAACATACGTGAAGGTC
AATGTTAGTGTCCTCGATTGTCCGAGGTATAGAACAAGGAAAGGTGAAATTGCAACAAACATGTTTGTTGTTTGTGATACAAAGGAGATTTCACATTCGTCCTACCTGGG
TGGGAAGGAGGGTTTCCTGGCACCGTATAGAGGGGAACGTTACCACCTATCTGAATGGCGTGGTGCGGGAAATGCACCAACTACTCCCAAAGAATTTTTCAACATGAAGC
ATTCATCTGCAAGGAATGTGATTGAGAGGGCATTCGGTTTGTTAAAAGGAAGGTGGGCCATCCTCCGCTCGAAATCGTACTATCCAGTTCGAATTCAAGGACGGACCATT
GCAGCGTGCTACTTACTACACAATCTTATCAATAGGGAGATGGGTGACCGTGAAATTCCCGATGATCTAGATGAGGTGGATTCTGCATCGGCTACAACTGATTCTGAGAA
TATCAATTTCATTGAGACTTCCGACGAATGGAGCCAGTGGAGGGATGACTTGGCAGAACATATGTTTTCGGAATGGGAGTTACGTAATACTTAG
Protein sequenceShow/hide protein sequence
MMDRRCFSILRTLLRIVVGLTGTKVVDVEEMVAIFLHILAHDVKNRVVRTQFAKFGETVSRHFNSVLHGVLRLHEVLIKKPDPVTTSCTDPRWKWFQNCLGALDGTYVKV
NVSVLDCPRYRTRKGEIATNMFVVCDTKEISHSSYLGGKEGFLAPYRGERYHLSEWRGAGNAPTTPKEFFNMKHSSARNVIERAFGLLKGRWAILRSKSYYPVRIQGRTI
AACYLLHNLINREMGDREIPDDLDEVDSASATTDSENINFIETSDEWSQWRDDLAEHMFSEWELRNT