CuGenDBv2

Lag0021136 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021136
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon protein
Genome location: chr7:4969647..4972222
RNA-Seq Expression: Lag0021136
Synteny: Lag0021136
Gene Ontology terms: NA
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
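The genome location field uses a `seqid:start..end` notation. A minimal sketch of how such a string could be split into its parts is given here; the names `Region` and `parse_region` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, valid only under the 1-based inclusive assumption.
        return self.end - self.start + 1


# Matches strings such as "chr7:4969647..4972222".
_REGION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' location string into a Region."""
    match = _REGION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(match["seqid"], start, end)


region = parse_region("chr7:4969647..4972222")
print(region.seqid, region.start, region.end, region.length)
```

Under that assumption the locus spans 2576 bp on chr7.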


Homology
GenBank top hits (e value, % identity, alignment)
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo] | e value: 7.8e-122 | % identity: 42.41
Query:  LIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNA
        +IHE DL CR++TRMDRRTF ILC LLR    L +T  VDVEEMVAMFLH++AHDVKNRVI+++FVRSGETVSRHFN+VL AVLRL+  L+K P PVT+ 
Subjt:  LIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNA

Query:  CTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADG
        C D RWK FENCLGALDGT+IKVN+ A DRP +RTRKGEIATNVL VC   G+F++V  GWEGSA+DSR+LRDA+S+ +GL+VP+GYYYLCDAGYPNA+G
Subjt:  CTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADG

Query:  FLAPYRSTRYHLTEWR--------------------------------GRWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQD-S
        FLAPY+  RYHL EWR                                GRW ILRGKSYYP+ VQ +TI AC  LHNLI REM     +E    G+   +
Subjt:  FLAPYRSTRYHLTEWR--------------------------------GRWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQD-S

Query:  NDMGTENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNI---------
            +E+I ++E+T         WR +      +  + +     GG   +C +    S  G  +       G+L  Q+ RMM E+L GC +         
Subjt:  NDMGTENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNI---------

Query:  -----------------------VNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDE----DM
                                NDE KCI AEKE+FD WV   P AKGL N PFP+++EL  VFG+D A G    T  ++    EP    D     D 
Subjt:  -----------------------VNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDE----DM

Query:  NVDFEDCFVPSPPVID-------PTLGEE----LCGTPTGTGE---------WIPDDRTTNQ--KIALWPTKRDELERSRRKELYAELQSIPGVSMEDGL
        N DF   +     ++        P+   E      G+    G           +  D+T  Q  +IA WP +    +   R E +  L+ +P ++  D  
Subjt:  NVDFEDCFVPSPPVID-------PTLGEE----LCGTPTGTGE---------WIPDDRTTNQ--KIALWPTKRDELERSRRKELYAELQSIPGVSMEDGL

Query:  VVARALLSDERMLTHFMDFPPEWKFDYCMEIL
        ++ R LLS    L  F+  P + +  +C  +L
Subjt:  VVARALLSDERMLTHFMDFPPEWKFDYCMEIL
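In BLAST-style pairwise output such as the blocks above, the unlabeled middle row marks each aligned column: a letter means the residues are identical, `+` means a conservative substitution, and a space means a mismatch or gap. A minimal sketch of counting those columns follows; the function names are hypothetical, and the result is not guaranteed to reproduce the % identity reported by the portal, which may use a different denominator.

```python
from typing import Iterable, Tuple


def midline_counts(blocks: Iterable[Tuple[str, str]]) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) over (query_row, midline_row) pairs.

    Rows must already have their 'Query:' prefix and indentation stripped,
    so that column i of the midline sits under column i of the query.
    """
    identities = positives = columns = 0
    for query, midline in blocks:
        # Trailing spaces in the midline are often lost in copied text.
        midline = midline.ljust(len(query))
        if len(midline) != len(query):
            raise ValueError("midline row is longer than the query row")
        for q, m in zip(query, midline):
            columns += 1
            if m == "+":
                positives += 1      # conservative substitution
            elif m != " " and m == q:
                identities += 1     # identical residue
                positives += 1
    return identities, positives, columns


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Identities as a percentage of all aligned columns, gaps included."""
    identities, _, columns = midline_counts(blocks)
    if columns == 0:
        raise ValueError("no aligned columns")
    return 100.0 * identities / columns


# Toy rows for illustration only, not taken from this record.
print(midline_counts([("MDRRTF", "MD+R F")]))
```

Counting over all columns, including gap columns, follows the usual BLAST convention of dividing identities by alignment length.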

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.1e-128 | % identity: 42.77
Query:  ELIAILTIICATQYQFIATVLGILHSGYSWGFRSPTHVRHRIRQLNFFRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHI
        EL +I+    A+Q Q +  +L +L +        P   RHRIRQL +FR+IH  DL CR++TRMDRR F ILC LLRT   L +T  VDVEEMVAMFLHI
Subjt:  ELIAILTIICATQYQFIATVLGILHSGYSWGFRSPTHVRHRIRQLNFFRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHI

Query:  VAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQN
        +AHDVKNRVI+R+F+RSGET+SRHFN+VL AV+RLH  LLK P+PV N CTD RW+WFENCLGALDGT+IKVN+ A DR RYRTRKGE+ATNVL V    
Subjt:  VAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQN

Query:  GEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTEWR--------------------------------GRW
        G+F++V  GWEGSA+DSR+LRDA+SRP+ LKVP+GYYYL DAGYPNA+GFLAPYR  RYHL EWR                                GRW
Subjt:  GEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTEWR--------------------------------GRW

Query:  AILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACA
        AILRGKSY+PV+VQ  TI ACC LHNLI REM               +N    +NI  + S++           E G V L        ++AGG R    
Subjt:  AILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACA

Query:  VPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNI-------------------------------VNDERKCIEAEKEIFDLWVEGHPQAKGLRN
             S +G          G+L NQ+ RMM  ++ GCNI                                NDE+KCI AEKE+FD W   HP AKGL N
Subjt:  VPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNI-------------------------------VNDERKCIEAEKEIFDLWVEGHPQAKGLRN

Query:  RPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVAD---LDEDMNVDFEDCFVPSPPVIDPTLGE-------ELCGTPTGTGEWIPDDRTTN------
        + F  ++EL+ VFGKD A G R  +  +I     P  D    D   + DF   +     +    L E       E     +G+    P   T +      
Subjt:  RPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVAD---LDEDMNVDFEDCFVPSPPVIDPTLGE-------ELCGTPTGTGEWIPDDRTTN------

Query:  ---------QKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL
                  +IA WP  + +     R+E+  +L++IP +++ D   + R L+ +   +  F++ P   K+ YC  IL
Subjt:  ---------QKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL

KAA0048191.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.0e-121 | % identity: 45.26
Query:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFE----
        MD+R FTILCT+LRT G L AT YVDVEEM A+FLHIVAHDVKNRV RR F RS  TVSRHFNVVL+AVLR+H +LLK P+ VT++C+ ++W+WF+    
Subjt:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFE----

Query:  -----NCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPY
               + ALDGTHIKVN+S  D PRYR+RK +I TNVL +CSQNGEFIFV PGWEGSASDSRVLRD VSRP GLKVP+GYYYLCDA Y N +GFLAPY
Subjt:  -----NCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPY

Query:  RSTRYHLTEWRG-------------------------------RWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTEN
        R  RYHL EWRG                               RWAILRG+SYYPVD+Q K ITACC LHNLI REM  + + E  H GE DS++M  EN
Subjt:  RSTRYHLTEWRG-------------------------------RWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTEN

Query:  ITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMAC--RQWDVSGWLPN----------QIGRMMKERLLGCNIVNDE
        I FVE+T         WR     +     E   +M +   ++     T       + C  +  +  GW  +          QI ++MKE++ G NI    
Subjt:  ITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMAC--RQWDVSGWLPN----------QIGRMMKERLLGCNIVNDE

Query:  RKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDE-DMNVDFEDCFVPSPPVIDPTLGEELCGTPTGT
           +E   +I      GHP  + L N+PFP+F +L +VFG+D A G R +T +E+    + V D +E DM+++ E   + +P  ++   GE++  TPT  
Subjt:  RKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDE-DMNVDFEDCFVPSPPVIDPTLGEELCGTPTGT

Query:  GEWIPDDRTTNQ------------------------KIALWPTKRDELERSRRKELYAE
           +   R + +                        KIA W  +  E+E S  K LYAE
Subjt:  GEWIPDDRTTNQ------------------------KIALWPTKRDELERSRRKELYAE

KAA0062747.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 7.5e-117 | % identity: 42.43
Query:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFE---N
        MDRR F ILC LLRTT  L  T  +DVEEMVAMFLHI+AH VKNR+I+R+FVRSGETVSRHFN+VL A  RLH  LLK P+PVTN+CTD RWKWFE   N
Subjt:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFE---N

Query:  CLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYH
        CL + +GT+IKVN+SA DRPRYRTRKGE+ATNVL  C   G+F+FV  GWEGSA+DSR+LRDA+SR +GLKVP+GYYYLCDAGYPNA+GFLAPYR  RYH
Subjt:  CLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYH

Query:  LTEWRGR--------------------------------WAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVE
        L+EWRG                                 WAILRGKSYYPVDVQ +TI ACC LHNLI REM     ++    G+      G + I ++E
Subjt:  LTEWRGR--------------------------------WAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVE

Query:  STTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWD-----VSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFD
        ++         WR +      S  E +  +    R     +         +    W+       G +       M+         N+E +CI AE+++FD
Subjt:  STTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWD-----VSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFD

Query:  LWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITP-------EPEPVADL-DEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTGTGEWIP
         WV+ HP  KGL ++ FP++++L+ VFGKD A G R+ T +++         +  P+ D  DED+   +      SP  +     E +        E + 
Subjt:  LWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITP-------EPEPVADL-DEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTGTGEWIP

Query:  DDRTTNQKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL
              + IA W  ++  +E   R ++  +LQ IP +  +    + + L      +  F+  P E K +YC  +L
Subjt:  DDRTTNQKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL

KAA0065306.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 9.8e-133 | % identity: 46.6
Query:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLG
        MDRR FTILCT+LRT G L AT YVDVEEM+A+FLHIVAHDVKNRV RR F RSGETVSRHFN    AVLRLH +LLK P+PVT +C+ ++W+WF+ CLG
Subjt:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLG

Query:  ALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTE
        ALDGTHIKVN+S  DRPRYR+RKG+I TNVL VC QNGEFIFV PGWEGSASDSRVLRDAVSR +GLKVP+GYYYLCDAGYPNA+GFLAPYR  RYHLTE
Subjt:  ALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTE

Query:  WRG-------------------------------RWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVESTTH
        WRG                               RW IL+G+SYY VD+Q K ITACC LHNLI REMG + + +  H GE DS++M  ENI FVE+T  
Subjt:  WRG-------------------------------RWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVESTTH

Query:  GVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFDL----WVEG
               WR     +     E   +M               S +      +W            +  E L+ C +   E     A+   F L        
Subjt:  GVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFDL----WVEG

Query:  HPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTG-----------------TG
          +  G     F W  E         A G R +TP+EI  +     + ++DM+++ ED  +P+P  ++P  GE++  TPT                  +G
Subjt:  HPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTG-----------------TG

Query:  EWIPDDRTTNQ-------KIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEILGR
        + +   R + +       KIA W  ++ E+E S  K LY ELQ+IPG+ ++D L+VA +LL D  ML  F+D+P EWK+  CM ILGR
Subjt:  EWIPDDRTTNQ-------KIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEILGR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SWD8 Retrotransposon protein | e value: 5.4e-129 | % identity: 42.77
Query:  ELIAILTIICATQYQFIATVLGILHSGYSWGFRSPTHVRHRIRQLNFFRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHI
        EL +I+    A+Q Q +  +L +L +        P   RHRIRQL +FR+IH  DL CR++TRMDRR F ILC LLRT   L +T  VDVEEMVAMFLHI
Subjt:  ELIAILTIICATQYQFIATVLGILHSGYSWGFRSPTHVRHRIRQLNFFRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHI

Query:  VAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQN
        +AHDVKNRVI+R+F+RSGET+SRHFN+VL AV+RLH  LLK P+PV N CTD RW+WFENCLGALDGT+IKVN+ A DR RYRTRKGE+ATNVL V    
Subjt:  VAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQN

Query:  GEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTEWR--------------------------------GRW
        G+F++V  GWEGSA+DSR+LRDA+SRP+ LKVP+GYYYL DAGYPNA+GFLAPYR  RYHL EWR                                GRW
Subjt:  GEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTEWR--------------------------------GRW

Query:  AILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACA
        AILRGKSY+PV+VQ  TI ACC LHNLI REM               +N    +NI  + S++           E G V L        ++AGG R    
Subjt:  AILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACA

Query:  VPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNI-------------------------------VNDERKCIEAEKEIFDLWVEGHPQAKGLRN
             S +G          G+L NQ+ RMM  ++ GCNI                                NDE+KCI AEKE+FD W   HP AKGL N
Subjt:  VPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNI-------------------------------VNDERKCIEAEKEIFDLWVEGHPQAKGLRN

Query:  RPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVAD---LDEDMNVDFEDCFVPSPPVIDPTLGE-------ELCGTPTGTGEWIPDDRTTN------
        + F  ++EL+ VFGKD A G R  +  +I     P  D    D   + DF   +     +    L E       E     +G+    P   T +      
Subjt:  RPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVAD---LDEDMNVDFEDCFVPSPPVIDPTLGE-------ELCGTPTGTGEWIPDDRTTN------

Query:  ---------QKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL
                  +IA WP  + +     R+E+  +L++IP +++ D   + R L+ +   +  F++ P   K+ YC  IL
Subjt:  ---------QKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL

A0A5A7U3R2 Retrotransposon protein | e value: 4.9e-122 | % identity: 45.26
Query:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFE----
        MD+R FTILCT+LRT G L AT YVDVEEM A+FLHIVAHDVKNRV RR F RS  TVSRHFNVVL+AVLR+H +LLK P+ VT++C+ ++W+WF+    
Subjt:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFE----

Query:  -----NCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPY
               + ALDGTHIKVN+S  D PRYR+RK +I TNVL +CSQNGEFIFV PGWEGSASDSRVLRD VSRP GLKVP+GYYYLCDA Y N +GFLAPY
Subjt:  -----NCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPY

Query:  RSTRYHLTEWRG-------------------------------RWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTEN
        R  RYHL EWRG                               RWAILRG+SYYPVD+Q K ITACC LHNLI REM  + + E  H GE DS++M  EN
Subjt:  RSTRYHLTEWRG-------------------------------RWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTEN

Query:  ITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMAC--RQWDVSGWLPN----------QIGRMMKERLLGCNIVNDE
        I FVE+T         WR     +     E   +M +   ++     T       + C  +  +  GW  +          QI ++MKE++ G NI    
Subjt:  ITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMAC--RQWDVSGWLPN----------QIGRMMKERLLGCNIVNDE

Query:  RKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDE-DMNVDFEDCFVPSPPVIDPTLGEELCGTPTGT
           +E   +I      GHP  + L N+PFP+F +L +VFG+D A G R +T +E+    + V D +E DM+++ E   + +P  ++   GE++  TPT  
Subjt:  RKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDE-DMNVDFEDCFVPSPPVIDPTLGEELCGTPTGT

Query:  GEWIPDDRTTNQ------------------------KIALWPTKRDELERSRRKELYAE
           +   R + +                        KIA W  +  E+E S  K LYAE
Subjt:  GEWIPDDRTTNQ------------------------KIALWPTKRDELERSRRKELYAE

A0A5A7VG45 Retrotransposon protein | e value: 4.7e-133 | % identity: 46.6
Query:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLG
        MDRR FTILCT+LRT G L AT YVDVEEM+A+FLHIVAHDVKNRV RR F RSGETVSRHFN    AVLRLH +LLK P+PVT +C+ ++W+WF+ CLG
Subjt:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLG

Query:  ALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTE
        ALDGTHIKVN+S  DRPRYR+RKG+I TNVL VC QNGEFIFV PGWEGSASDSRVLRDAVSR +GLKVP+GYYYLCDAGYPNA+GFLAPYR  RYHLTE
Subjt:  ALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTE

Query:  WRG-------------------------------RWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVESTTH
        WRG                               RW IL+G+SYY VD+Q K ITACC LHNLI REMG + + +  H GE DS++M  ENI FVE+T  
Subjt:  WRG-------------------------------RWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVESTTH

Query:  GVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFDL----WVEG
               WR     +     E   +M               S +      +W            +  E L+ C +   E     A+   F L        
Subjt:  GVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFDL----WVEG

Query:  HPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTG-----------------TG
          +  G     F W  E         A G R +TP+EI  +     + ++DM+++ ED  +P+P  ++P  GE++  TPT                  +G
Subjt:  HPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTG-----------------TG

Query:  EWIPDDRTTNQ-------KIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEILGR
        + +   R + +       KIA W  ++ E+E S  K LY ELQ+IPG+ ++D L+VA +LL D  ML  F+D+P EWK+  CM ILGR
Subjt:  EWIPDDRTTNQ-------KIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEILGR

A0A5D3DG22 Retrotransposon protein | e value: 3.6e-117 | % identity: 42.43
Query:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFE---N
        MDRR F ILC LLRTT  L  T  +DVEEMVAMFLHI+AH VKNR+I+R+FVRSGETVSRHFN+VL A  RLH  LLK P+PVTN+CTD RWKWFE   N
Subjt:  MDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFE---N

Query:  CLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYH
        CL + +GT+IKVN+SA DRPRYRTRKGE+ATNVL  C   G+F+FV  GWEGSA+DSR+LRDA+SR +GLKVP+GYYYLCDAGYPNA+GFLAPYR  RYH
Subjt:  CLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYH

Query:  LTEWRGR--------------------------------WAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVE
        L+EWRG                                 WAILRGKSYYPVDVQ +TI ACC LHNLI REM     ++    G+      G + I ++E
Subjt:  LTEWRGR--------------------------------WAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGTENITFVE

Query:  STTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWD-----VSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFD
        ++         WR +      S  E +  +    R     +         +    W+       G +       M+         N+E +CI AE+++FD
Subjt:  STTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWD-----VSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFD

Query:  LWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITP-------EPEPVADL-DEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTGTGEWIP
         WV+ HP  KGL ++ FP++++L+ VFGKD A G R+ T +++         +  P+ D  DED+   +      SP  +     E +        E + 
Subjt:  LWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITP-------EPEPVADL-DEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTGTGEWIP

Query:  DDRTTNQKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL
              + IA W  ++  +E   R ++  +LQ IP +  +    + + L      +  F+  P E K +YC  +L
Subjt:  DDRTTNQKIALWPTKRDELERSRRKELYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEIL

E5GBB2 Retrotransposon protein | e value: 3.8e-122 | % identity: 42.41
Query:  LIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNA
        +IHE DL CR++TRMDRRTF ILC LLR    L +T  VDVEEMVAMFLH++AHDVKNRVI+++FVRSGETVSRHFN+VL AVLRL+  L+K P PVT+ 
Subjt:  LIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNA

Query:  CTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADG
        C D RWK FENCLGALDGT+IKVN+ A DRP +RTRKGEIATNVL VC   G+F++V  GWEGSA+DSR+LRDA+S+ +GL+VP+GYYYLCDAGYPNA+G
Subjt:  CTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADG

Query:  FLAPYRSTRYHLTEWR--------------------------------GRWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQD-S
        FLAPY+  RYHL EWR                                GRW ILRGKSYYP+ VQ +TI AC  LHNLI REM     +E    G+   +
Subjt:  FLAPYRSTRYHLTEWR--------------------------------GRWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQD-S

Query:  NDMGTENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNI---------
            +E+I ++E+T         WR +      +  + +     GG   +C +    S  G  +       G+L  Q+ RMM E+L GC +         
Subjt:  NDMGTENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNI---------

Query:  -----------------------VNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDE----DM
                                NDE KCI AEKE+FD WV   P AKGL N PFP+++EL  VFG+D A G    T  ++    EP    D     D 
Subjt:  -----------------------VNDERKCIEAEKEIFDLWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDE----DM

Query:  NVDFEDCFVPSPPVID-------PTLGEE----LCGTPTGTGE---------WIPDDRTTNQ--KIALWPTKRDELERSRRKELYAELQSIPGVSMEDGL
        N DF   +     ++        P+   E      G+    G           +  D+T  Q  +IA WP +    +   R E +  L+ +P ++  D  
Subjt:  NVDFEDCFVPSPPVID-------PTLGEE----LCGTPTGTGE---------WIPDDRTTNQ--KIALWPTKRDELERSRRKELYAELQSIPGVSMEDGL

Query:  VVARALLSDERMLTHFMDFPPEWKFDYCMEIL
        ++ R LLS    L  F+  P + +  +C  +L
Subjt:  VVARALLSDERMLTHFMDFPPEWKFDYCMEIL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G43722.1 unknown protein | e value: 9.4e-33 | % identity: 33.47
Query:  RSPTHVRHRIRQLNFFRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAV
        R+P  +   +   N +R + +D  AC +  RM    FT LC +L+T   L+ T  + +EE VAMFL I  H+   R +  +F R+ ETV R F  VL A 
Subjt:  RSPTHVRHRIRQLNFFRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAV

Query:  LRLHSVLLKAPE-------PVTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVS
          L    ++ P        P         W +F   +GA+DGTH+ V +    +  Y  R    + N++A+C     F +++ G  GS  D+ VL+ A  
Subjt:  LRLHSVLLKAPE-------PVTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVS

Query:  RPSGLKVPRG-YYYLCDAGYPNADGFLAPYRST-----RYHLTEW
          S   +P    YYL D+GYPN  G LAPYRS+     RYH++++
Subjt:  RPSGLKVPRG-YYYLCDAGYPNADGFLAPYRST-----RYHLTEW

AT5G28730.1 unknown protein | e value: 7.7e-19 | % identity: 31.28
Query:  IHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPE-----P
        I+ ++++C+   RM    FT LC +L     L++++ + ++E VA+FL I A +   R I  +F  + ET+ R F+ VL A+ RL    ++  +      
Subjt:  IHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPE-----P

Query:  VTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKV-PRGYYYLCDAGY
        ++N   DD   W                      P      G  + NVLA+C  +  F + F G  GS  D+RVL  A+S      V P   YYL D+GY
Subjt:  VTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKV-PRGYYYLCDAGY

Query:  PNADGFLAPYR
         N  G+LAPYR
Subjt:  PNADGFLAPYR

AT5G28950.1 unknown protein | e value: 1.5e-17 | % identity: 39.13
Query:  WKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSG-LKVP---RGYYYLCDAGYPNADGF
        + +F++C+GA+D THI   +S    P +R RKG+I+ N+LA C+ + EF++V  GWEGSA DS+VL DA++R S  L VP        + +    N D  
Subjt:  WKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSG-LKVP---RGYYYLCDAGYPNADGF

Query:  LAPYRSTRYHLTEWR
        L      R +  +WR
Subjt:  LAPYRSTRYHLTEWR

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 7.0e-12 | % identity: 28.4
Query:  FIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTEWRG--------------------------------RWAI
        FI+V  GWEGSA DSRVL DA+ +          +YL D G+ N   FLAP+R  RYHL E+ G                                R+AI
Subjt:  FIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTEWRG--------------------------------RWAI

Query:  LRGKSYYPVDVQVKTITACCYLHNLITREMGQDPS---MEAAHTGEQDSNDMGTENITFVES
         +    +    Q   +  C  LHN + +E   D +    E  + G+  +N+    N   +++
Subjt:  LRGKSYYPVDVQVKTITACCYLHNLITREMGQDPS---MEAAHTGEQDSNDMGTENITFVES

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 6.3e-45 | % identity: 35.36
Query:  FRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVT
        +++++  +  C EN RMD+  F  LC LL+T G LR T+ + +E  +A+FL I+ H+++ R ++  F  SGET+SRHFN VL+AV+ +        +P +
Subjt:  FRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIVAHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVT

Query:  NACT-DDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPN
        N+ T ++   +F++C+G +D  HI V +   ++  +R   G +  NVLA  S +  F +V  GWEGSASD +VL  A++R + L+VP+G YY+ D  YPN
Subjt:  NACT-DDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWEGSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPN

Query:  ADGFLAPYRSTRYHLTE-----------------------WRGRWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQD
          GF+APY     +  E                        + R+ IL     YP+  QVK + A C LHN +  E   D
Subjt:  ADGFLAPYRSTRYHLTE-----------------------WRGRWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQD


Sequences
CDS sequence
ATGGATGCCCAAGATAATGAAATTCAAGAATTGATCGCAATTCTTACTATCATTTGTGCGACACAGTATCAGTTCATCGCGACAGTTCTTGGGATCTTGCACTCGGGTTA
TTCCTGGGGATTTCGATCCCCGACTCATGTACGACACCGAATCAGGCAACTGAACTTCTTCCGTCTTATTCACGAGGATGATTTGGCTTGTCGTGAAAATACACGCATGG
ATCGACGAACATTCACGATCCTGTGTACCCTTTTACGGACCACTGGTAGACTACGAGCCACTAGCTACGTCGATGTGGAGGAGATGGTTGCAATGTTCCTCCACATCGTT
GCACACGACGTTAAGAATCGTGTGATCCGTCGCCAATTTGTGCGTTCGGGAGAGACCGTATCCCGACACTTCAATGTCGTACTGGATGCAGTACTTCGATTGCACTCAGT
CCTATTGAAAGCACCTGAACCAGTTACCAATGCATGTACGGACGATAGGTGGAAGTGGTTTGAGAACTGCTTAGGTGCTCTTGATGGTACACACATCAAGGTCAACATCA
GTGCGGTAGATCGCCCCAGATATCGGACGAGGAAGGGGGAGATTGCTACTAATGTACTGGCCGTTTGTTCTCAGAATGGAGAGTTCATTTTCGTGTTCCCAGGGTGGGAA
GGATCTGCATCAGACTCAAGGGTACTTCGTGATGCAGTCTCACGTCCTTCGGGACTAAAGGTTCCCAGGGGCTACTACTACCTCTGTGATGCCGGATATCCTAACGCGGA
TGGATTCCTAGCACCATATCGTAGCACACGATACCACCTAACCGAGTGGCGAGGGCGATGGGCAATATTGAGGGGCAAATCTTACTACCCTGTTGATGTGCAAGTGAAAA
CCATCACCGCTTGCTGCTACCTTCATAACCTAATAACACGGGAGATGGGACAGGATCCTTCAATGGAGGCCGCTCATACGGGGGAGCAAGATTCTAATGATATGGGAACA
GAGAACATCACGTTCGTTGAATCCACCACGCATGGAGTTCCTGGAGGGATGAATTGGCGAATCGAATATGGTCGCGTCTGCCTCAGCTTCAAAGAAAGAAAAACACATAT
GGACGCCGGAGGAAGACGAAGTGCTTGTGCAGTGCCTACTGCACCTAGTCCAAGTGGGGGGATGGCGTGCAGACAATGGGACGTTTCGGGCTGGTTACCAAATCAGATTG
GAAGGATGATGAAAGAAAGACTACTAGGATGCAACATAGTCAATGATGAGAGAAAATGCATAGAGGCGGAGAAGGAGATATTCGATTTGTGGGTCGAGGGACATCCACAG
GCCAAGGGCCTCCGTAACCGGCCATTCCCATGGTTCAATGAGTTGGCGCTCGTTTTCGGGAAGGACAGCGCTAGAGGAGTGAGAACCCGAACCCCAATTGAGATAACACC
TGAACCAGAGCCGGTAGCTGATTTGGATGAAGACATGAACGTGGATTTTGAGGATTGCTTCGTCCCAAGTCCCCCAGTTATTGATCCCACACTTGGAGAAGAATTATGTG
GGACACCGACTGGTACTGGAGAATGGATTCCAGATGACCGCACAACAAATCAGAAGATTGCGCTCTGGCCGACCAAGAGGGACGAGCTCGAGAGGAGTCGGCGGAAGGAG
TTATATGCGGAGCTACAATCCATTCCTGGGGTGTCTATGGAGGACGGCTTGGTGGTCGCACGGGCACTACTATCAGATGAGAGGATGTTGACTCACTTTATGGACTTCCC
TCCAGAATGGAAGTTCGACTACTGTATGGAGATCCTCGGTAGGGCATCGAGACAGCCCCCGCAGCCATGA
mRNA sequence
ATGGATGCCCAAGATAATGAAATTCAAGAATTGATCGCAATTCTTACTATCATTTGTGCGACACAGTATCAGTTCATCGCGACAGTTCTTGGGATCTTGCACTCGGGTTA
TTCCTGGGGATTTCGATCCCCGACTCATGTACGACACCGAATCAGGCAACTGAACTTCTTCCGTCTTATTCACGAGGATGATTTGGCTTGTCGTGAAAATACACGCATGG
ATCGACGAACATTCACGATCCTGTGTACCCTTTTACGGACCACTGGTAGACTACGAGCCACTAGCTACGTCGATGTGGAGGAGATGGTTGCAATGTTCCTCCACATCGTT
GCACACGACGTTAAGAATCGTGTGATCCGTCGCCAATTTGTGCGTTCGGGAGAGACCGTATCCCGACACTTCAATGTCGTACTGGATGCAGTACTTCGATTGCACTCAGT
CCTATTGAAAGCACCTGAACCAGTTACCAATGCATGTACGGACGATAGGTGGAAGTGGTTTGAGAACTGCTTAGGTGCTCTTGATGGTACACACATCAAGGTCAACATCA
GTGCGGTAGATCGCCCCAGATATCGGACGAGGAAGGGGGAGATTGCTACTAATGTACTGGCCGTTTGTTCTCAGAATGGAGAGTTCATTTTCGTGTTCCCAGGGTGGGAA
GGATCTGCATCAGACTCAAGGGTACTTCGTGATGCAGTCTCACGTCCTTCGGGACTAAAGGTTCCCAGGGGCTACTACTACCTCTGTGATGCCGGATATCCTAACGCGGA
TGGATTCCTAGCACCATATCGTAGCACACGATACCACCTAACCGAGTGGCGAGGGCGATGGGCAATATTGAGGGGCAAATCTTACTACCCTGTTGATGTGCAAGTGAAAA
CCATCACCGCTTGCTGCTACCTTCATAACCTAATAACACGGGAGATGGGACAGGATCCTTCAATGGAGGCCGCTCATACGGGGGAGCAAGATTCTAATGATATGGGAACA
GAGAACATCACGTTCGTTGAATCCACCACGCATGGAGTTCCTGGAGGGATGAATTGGCGAATCGAATATGGTCGCGTCTGCCTCAGCTTCAAAGAAAGAAAAACACATAT
GGACGCCGGAGGAAGACGAAGTGCTTGTGCAGTGCCTACTGCACCTAGTCCAAGTGGGGGGATGGCGTGCAGACAATGGGACGTTTCGGGCTGGTTACCAAATCAGATTG
GAAGGATGATGAAAGAAAGACTACTAGGATGCAACATAGTCAATGATGAGAGAAAATGCATAGAGGCGGAGAAGGAGATATTCGATTTGTGGGTCGAGGGACATCCACAG
GCCAAGGGCCTCCGTAACCGGCCATTCCCATGGTTCAATGAGTTGGCGCTCGTTTTCGGGAAGGACAGCGCTAGAGGAGTGAGAACCCGAACCCCAATTGAGATAACACC
TGAACCAGAGCCGGTAGCTGATTTGGATGAAGACATGAACGTGGATTTTGAGGATTGCTTCGTCCCAAGTCCCCCAGTTATTGATCCCACACTTGGAGAAGAATTATGTG
GGACACCGACTGGTACTGGAGAATGGATTCCAGATGACCGCACAACAAATCAGAAGATTGCGCTCTGGCCGACCAAGAGGGACGAGCTCGAGAGGAGTCGGCGGAAGGAG
TTATATGCGGAGCTACAATCCATTCCTGGGGTGTCTATGGAGGACGGCTTGGTGGTCGCACGGGCACTACTATCAGATGAGAGGATGTTGACTCACTTTATGGACTTCCC
TCCAGAATGGAAGTTCGACTACTGTATGGAGATCCTCGGTAGGGCATCGAGACAGCCCCCGCAGCCATGA
Protein sequence
MDAQDNEIQELIAILTIICATQYQFIATVLGILHSGYSWGFRSPTHVRHRIRQLNFFRLIHEDDLACRENTRMDRRTFTILCTLLRTTGRLRATSYVDVEEMVAMFLHIV
AHDVKNRVIRRQFVRSGETVSRHFNVVLDAVLRLHSVLLKAPEPVTNACTDDRWKWFENCLGALDGTHIKVNISAVDRPRYRTRKGEIATNVLAVCSQNGEFIFVFPGWE
GSASDSRVLRDAVSRPSGLKVPRGYYYLCDAGYPNADGFLAPYRSTRYHLTEWRGRWAILRGKSYYPVDVQVKTITACCYLHNLITREMGQDPSMEAAHTGEQDSNDMGT
ENITFVESTTHGVPGGMNWRIEYGRVCLSFKERKTHMDAGGRRSACAVPTAPSPSGGMACRQWDVSGWLPNQIGRMMKERLLGCNIVNDERKCIEAEKEIFDLWVEGHPQ
AKGLRNRPFPWFNELALVFGKDSARGVRTRTPIEITPEPEPVADLDEDMNVDFEDCFVPSPPVIDPTLGEELCGTPTGTGEWIPDDRTTNQKIALWPTKRDELERSRRKE
LYAELQSIPGVSMEDGLVVARALLSDERMLTHFMDFPPEWKFDYCMEILGRASRQPPQP
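The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` and `CODON_TABLE` are illustrative names rather than part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace and line breaks are ignored, so a sequence wrapped over
    several lines can be passed in directly. Codons containing symbols
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first seven codons of the CDS above give the start of the protein.
print(translate("ATGGATGCCCAAGATAATGAA"))  # → MDAQDNE
```

Passing the complete CDS and comparing the result with the protein sequence is a quick consistency check on copied records.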