; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038813 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038813
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:27557900..27568133
RNA-Seq ExpressionLag0038813
SyntenyLag0038813
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR004147 - UbiB domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040321.1 RNA-directed DNA polymerase-like protein [Cucumis melo var. makuwa]1.8e-6938.65Show/hide
Query:  EWIKHVENFFNYMNTPENKK-------GGLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYSRRT--------------VGDQGNTFNKTMTG
        E+I+         N  EN++       G LR+DIK+++ LQP   L+EAIS A T+EE ++ + K T  + T              + +Q +T     T 
Subjt:  EWIKHVENFFNYMNTPENKK-------GGLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYSRRT--------------VGDQGNTFNKTMTG

Query:  DTKAQ----------------QNAALK----------GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPL
        D + Q                ++ AL+          G  +  VIQR+L+  K + + Q   LFKTRC IN K+ ++IIDS S+EN VA RLV TLNL  
Subjt:  DTKAQ----------------QNAALK----------GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPL

Query:  HPHPAPYK-------------ASPTSIPP---------------------------------KKGQLFTLTSGKNLLSDKNSHILGLVI--KNFSNQAST
          HP PYK                T++ P                                  KGQLFT  S K L+ ++   ILGLVI  K    Q   
Subjt:  HPHPAPYK-------------ASPTSIPP---------------------------------KKGQLFTLTSGKNLLSDKNSHILGLVI--KNFSNQAST

Query:  VEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY-
        +E  L  Q LL EF  +    + LPPLRDIQH ID + GA+LPNL +Y+MSP EY+ILH+ +++LL KGH +PSLSPCA PALLTPKK     M  D   
Subjt:  VEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY-

Query:  --------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR
                                        L+SGYHQI IR GDEWKTTFKTNEGLFEW+VMPFGLSNAP+ FMRLM+QV+   L +
Subjt:  --------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR

KAA0047078.1 reverse transcriptase [Cucumis melo var. makuwa]2.3e-8035.26Show/hide
Query:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQQPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTFQEQQSG
        +EAE   VLSP+ T+ RLLS+E  V  I   L+++  ++   T  Q        N+ ++ QE  R+ G+    + G  +          +    QE++  
Subjt:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQQPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTFQEQQSG

Query:  LQEYNYNPSFRRQ------TEWGGDISSEEEYEE-----IQRAERRNFRHHQYQENDFKMKVDIPTYGGK-------------MDIEIF-----------
         Q+Y  NP  R Q       +W    SS++E +E       R  R  F   + +  + KMK+D+P+Y GK             +D+ ++           
Subjt:  LQEYNYNPSFRRQ------TEWGGDISSEEEYEE-----IQRAERRNFRHHQYQENDFKMKVDIPTYGGK-------------MDIEIF-----------

Query:  --LEWIKHVENFFNYMNTPENKK-------GGLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYS------------------RRTVGDQG--
           ++IK   +    +N  EN++       GGLR+DIK+++ LQP   L+EAIS A T+EE  + R K   +                  +R V ++G  
Subjt:  --LEWIKHVENFFNYMNTPENKK-------GGLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYS------------------RRTVGDQG--

Query:  -NTFNKTMTG----------------------------DTKAQQNAALK----------GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVI
         N +N+   G                            D+ ++ +  L+          G  VS VIQR+LLAPK + + Q H+LFKTRCTINGK+C+VI
Subjt:  -NTFNKTMTG----------------------------DTKAQQNAALK----------GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVI

Query:  IDSGSTENVVASRLVSTLNLPLHPHPAPYKAS----------------PTSIPPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNL
        ID+GS+EN VA +LV+ LNL   PHP PYK                  P SI    G    +      + ++   +LGL+I + SN+     +   +Q L
Subjt:  IDSGSTENVVASRLVSTLNLPLHPHPAPYKAS----------------PTSIPPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNL

Query:  LDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKKMVADEYLKSGYHQIHIRPGDEW
         +EF  +      LPPLRDIQ  ID +PGA+LP L HY+MSP EY+ILH+ +++LL+KGHI+PS SP         +KM+    LKSGYHQ+ IRPGDEW
Subjt:  LDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKKMVADEYLKSGYHQIHIRPGDEW

Query:  KTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR
        KTTFKTNEGLFEW++M FGLSN PSTFMRLM+QV    L +
Subjt:  KTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR

KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]8.7e-8029.88Show/hide
Query:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQ------QPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTF
        +EA    +LSP+ ++  L SVE  + EIR++L  +  ++  +  Q      +P  +Q     G++  E+FR                + FQE     R  
Subjt:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQ------QPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTF

Query:  QEQQSGLQEYNYNPSFRRQTEWGG---DISSEEEYEEIQRAERRNFRHHQYQEN--------DFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPE
         E Q  L +   +    R+ EW     +I +    EE    +   FR H+Y +N        ++KMK+D+P+Y GK +IE FL+W+K+ ENFF YM T +
Subjt:  QEQQSGLQEYNYNPSFRRQTEWGG---DISSEEEYEEIQRAERRNFRHHQYQEN--------DFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPE

Query:  NKK------------------------------------------------------------------------------------------------G
        NKK                                                                                                G
Subjt:  NKK------------------------------------------------------------------------------------------------G

Query:  GLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYSR---------RTVGD--------------------------------------QGNTFN
        GLR+D+K+++ LQP  HL+EAI+ A T+EE I NR K T  R          T G+                                       GN + 
Subjt:  GLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYSR---------RTVGD--------------------------------------QGNTFN

Query:  KTMTG-----------------------------DTKAQQNAALKGEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRL
            G                             D + +   A +G+ +S ++QR+L++PK +   QRH+LFKTRCTI GK+CNVIIDSGS+EN V+ +L
Subjt:  KTMTG-----------------------------DTKAQQNAALKGEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRL

Query:  VSTLNLPLHPHPAPYKAS----------------PTSI--------------------------------------------------------------
        V+ LNL   PH  PYK                  P SI                                                              
Subjt:  VSTLNLPLHPHPAPYKAS----------------PTSI--------------------------------------------------------------

Query:  ---PPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILH
             KKG LF   SGK  L ++ + ILG+V+    +     +IP  ++ L  ++  I      LPPLRDI H I+ L GA+ P+LPHY MSP EYKILH
Subjt:  ---PPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILH

Query:  DQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLF
        D ++ELL KGHI+PS S C VPALLTPKK     M  D                                   L+S YHQI IRPGDEWKT FKTNEGLF
Subjt:  DQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLF

Query:  EWLVMPFGLSNAPSTFMRLMHQVYQARLRR
        EWLVMPF LSNAPSTFMRLM++V    L +
Subjt:  EWLVMPFGLSNAPSTFMRLMHQVYQARLRR

XP_011648447.2 uncharacterized protein LOC105434464 [Cucumis sativus]2.6e-7639Show/hide
Query:  RAERRNFRH--HQYQENDFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPENKKGGL------RYDIKKQLAL-----------------------
        RA R N R+   + + +D+KMK+D+  Y GK +IE FL+WIK  ENFFNYM+TPE KK  L         +++ +A+                       
Subjt:  RAERRNFRH--HQYQENDFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPENKKGGL------RYDIKKQLAL-----------------------

Query:  ----QPI----GHLNEAISAAATIEEQISNRFK----RTYSRRTVG---------------DQGNTF-----NKTMTGDTKAQQN-----AALKGEPVSL
            QP     G   E  +    +E +    FK      YSR  +G                Q  T       + M+ D+K  ++      A  GE VS 
Subjt:  ----QPI----GHLNEAISAAATIEEQISNRFK----RTYSRRTVG---------------DQGNTF-----NKTMTGDTKAQQN-----AALKGEPVSL

Query:  VIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPYKAS----------------PTSIP-PKKGQLFT---
        VIQR+L+ PK +   QRH LFK RCTING++C+VIID+ S++N VA +LV+ LNL    HP  YK                  P SI    K Q+     
Subjt:  VIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPYKAS----------------PTSIP-PKKGQLFT---

Query:  ------LTSGKNLLSDKNS----------------HILGLVIKNFSNQASTVE-IPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHY
              L  G+    D  S                 ++ L I   + +   VE I  E+Q LL EF  I +    LPPLRDIQH ID +PGA+LPNL HY
Subjt:  ------LTSGKNLLSDKNS----------------HILGLVIKNFSNQASTVE-IPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHY

Query:  KMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEW
        +MSP EYK LHD ++ELL KGHI+PSLSPCAVPALLT KK     M  D                                   LKSGYHQI IRPGDEW
Subjt:  KMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEW

Query:  KTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR
        KTTFKT EGLFEW+VMPFGLSNAP+TFMRLM+Q+    L +
Subjt:  KTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]5.2e-7734.31Show/hide
Query:  RAERRNFR--HHQYQENDFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPENKKGGL--------------RYDIKKQ-LALQPI-----------
        RA R N R    + + +D+KMK+D+P Y GK +IE FL+WIK  ENFFNYM+TPE KK  L              + +I +Q    QPI           
Subjt:  RAERRNFR--HHQYQENDFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPENKKGGL--------------RYDIKKQ-LALQPI-----------

Query:  ---------------------------------------GHLNE------AISAAATIEEQISNRFK---------------------------------
                                                +L+E      A     T+EE I+ R K                                 
Subjt:  ---------------------------------------GHLNE------AISAAATIEEQISNRFK---------------------------------

Query:  -------------------RTYSRRTVG-------------------------DQGNTFNKTMTGDTKAQQNAALKGEPVSLVIQRLLLAPKSDPSYQRH
                            +YSR ++G                         + G T   ++  + + +   A  GE VS  IQR+L+ PK + + QRH
Subjt:  -------------------RTYSRRTVG-------------------------DQGNTFNKTMTGDTKAQQNAALKGEPVSLVIQRLLLAPKSDPSYQRH

Query:  ALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPYKAS----------------PTSI-------------------------------
         LFKTRCTING++C+VIIDSGS+EN VA +LV  LNL    HP PYK                  P SI                               
Subjt:  ALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPYKAS----------------PTSI-------------------------------

Query:  ----------------------PPKK---------GQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQ
                              P  K          QLF   SGK +L ++  +ILGLV+   + +    +I  ++Q LL EF  I +    LPPLRDIQ
Subjt:  ----------------------PPKK---------GQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQ

Query:  HAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY--------------------------------
        H ID +PGA+LPNL HY+MSP EYKILHD ++ELL KGHI+PSLSPCAVPALLTPKK     M  D                                  
Subjt:  HAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY--------------------------------

Query:  -LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQ
         LKSGYHQI +RPGDEWKT FKTNEGLFEW+VMPFGLSNAPSTFMRLM+Q
Subjt:  -LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQ

TrEMBL top hitse value%identityAlignment
A0A2N9I9E4 Uncharacterized protein6.1e-6345.63Show/hide
Query:  GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPY-----KASPTSIPPKKGQLFTLTSGKN---
        GE V+ V+QR+L  PK +   QRH++F++ C+IN K+C++I+D+ S EN +A RLV  L LP   HP PY     K  PT +  K   + TL   +    
Subjt:  GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPY-----KASPTSIPPKKGQLFTLTSGKN---

Query:  LLSDKNSHILGLVIKNF--SNQASTVEIPLEVQNLLDEFNSIM--DNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQP
        + ++    +  L IK      +     IP ++  LL+EF  +   D  NNLPP+RDIQ  ID LPGA+LPNLPHY+MSP E ++L ++++ELL KG I+ 
Subjt:  LLSDKNSHILGLVIKNF--SNQASTVEIPLEVQNLLDEFNSIM--DNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQP

Query:  SLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPS
        S+SPCAVPALLTPKK     M  D                                   L+SGYHQI IRPGDEWKTTFK+ +GL+EWLVMPFGLSNAPS
Subjt:  SLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPS

Query:  TFMRLMHQV
        TFMR+M+QV
Subjt:  TFMRLMHQV

A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.62.2e-6531.73Show/hide
Query:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQQ-----PSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTFQ
        +E E I  LS + +TVRLL+VED + ++   ++ M   +   T +      P+ I+    +         +G R S W+                 R F 
Subjt:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQQ-----PSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTFQ

Query:  EQQSGLQEYNYNPSFRRQTE---WGGDISSEEEYEEIQRAERRNFRHHQYQENDFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPENKK-GGLRY
         Q+        N   RR T+      D +S+EEYE  Q  ++ +    + +         +     K + E+ +  +K+      +   P  K+  G + 
Subjt:  EQQSGLQEYNYNPSFRRQTE---WGGDISSEEEYEEIQRAERRNFRHHQYQENDFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPENKK-GGLRY

Query:  DIKKQLALQPIG---HLNEAISAAATIEEQISNRFKRTY------------------SRRTVGDQGNTFNKTMTGDTKAQQNAAL----KGEPVSLVIQR
        D +   ++   G    + E     + +  +  N + R                     R+T+    +        D + ++   L     G+ +S ++QR
Subjt:  DIKKQLALQPIG---HLNEAISAAATIEEQISNRFKRTY------------------SRRTVGDQGNTFNKTMTGDTKAQQNAAL----KGEPVSLVIQR

Query:  LLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPYKASPTSIPPKKGQ----------LFTLTSGK---------
        +L+  K + + QRH+LFKTRCTI+GK+C+VIIDSGS+EN VA +LV++LNL + PHP PYK        K+G+          L  + S K         
Subjt:  LLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPYKASPTSIPPKKGQ----------LFTLTSGK---------

Query:  ----NLLSDK--NSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGH
            +LL D+     +LGLV+   S   ++  +   ++ L  EF  +      LPPL DIQH ID +PGA+LP+LPHY+MSP EY++LHD ++ LL KGH
Subjt:  ----NLLSDK--NSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGH

Query:  IQPSLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSN
        I+PSLSPC VPALLTPKK     M  D                                   L+S YHQI IRP DEWKTTFK NEGLFEWL MPFGLSN
Subjt:  IQPSLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSN

Query:  APSTFMRLMHQVYQARLRRNGEVV
        APSTF     + +   LR+  +V+
Subjt:  APSTFMRLMHQVYQARLRRNGEVV

A0A5D3C3X9 Reverse transcriptase1.1e-8035.26Show/hide
Query:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQQPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTFQEQQSG
        +EAE   VLSP+ T+ RLLS+E  V  I   L+++  ++   T  Q        N+ ++ QE  R+ G+    + G  +          +    QE++  
Subjt:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQQPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTFQEQQSG

Query:  LQEYNYNPSFRRQ------TEWGGDISSEEEYEE-----IQRAERRNFRHHQYQENDFKMKVDIPTYGGK-------------MDIEIF-----------
         Q+Y  NP  R Q       +W    SS++E +E       R  R  F   + +  + KMK+D+P+Y GK             +D+ ++           
Subjt:  LQEYNYNPSFRRQ------TEWGGDISSEEEYEE-----IQRAERRNFRHHQYQENDFKMKVDIPTYGGK-------------MDIEIF-----------

Query:  --LEWIKHVENFFNYMNTPENKK-------GGLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYS------------------RRTVGDQG--
           ++IK   +    +N  EN++       GGLR+DIK+++ LQP   L+EAIS A T+EE  + R K   +                  +R V ++G  
Subjt:  --LEWIKHVENFFNYMNTPENKK-------GGLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYS------------------RRTVGDQG--

Query:  -NTFNKTMTG----------------------------DTKAQQNAALK----------GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVI
         N +N+   G                            D+ ++ +  L+          G  VS VIQR+LLAPK + + Q H+LFKTRCTINGK+C+VI
Subjt:  -NTFNKTMTG----------------------------DTKAQQNAALK----------GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVI

Query:  IDSGSTENVVASRLVSTLNLPLHPHPAPYKAS----------------PTSIPPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNL
        ID+GS+EN VA +LV+ LNL   PHP PYK                  P SI    G    +      + ++   +LGL+I + SN+     +   +Q L
Subjt:  IDSGSTENVVASRLVSTLNLPLHPHPAPYKAS----------------PTSIPPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNL

Query:  LDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKKMVADEYLKSGYHQIHIRPGDEW
         +EF  +      LPPLRDIQ  ID +PGA+LP L HY+MSP EY+ILH+ +++LL+KGHI+PS SP         +KM+    LKSGYHQ+ IRPGDEW
Subjt:  LDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKKMVADEYLKSGYHQIHIRPGDEW

Query:  KTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR
        KTTFKTNEGLFEW++M FGLSN PSTFMRLM+QV    L +
Subjt:  KTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR

A0A5D3DGR0 Reverse transcriptase4.2e-8029.88Show/hide
Query:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQ------QPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTF
        +EA    +LSP+ ++  L SVE  + EIR++L  +  ++  +  Q      +P  +Q     G++  E+FR                + FQE     R  
Subjt:  KEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQ------QPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTF

Query:  QEQQSGLQEYNYNPSFRRQTEWGG---DISSEEEYEEIQRAERRNFRHHQYQEN--------DFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPE
         E Q  L +   +    R+ EW     +I +    EE    +   FR H+Y +N        ++KMK+D+P+Y GK +IE FL+W+K+ ENFF YM T +
Subjt:  QEQQSGLQEYNYNPSFRRQTEWGG---DISSEEEYEEIQRAERRNFRHHQYQEN--------DFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPE

Query:  NKK------------------------------------------------------------------------------------------------G
        NKK                                                                                                G
Subjt:  NKK------------------------------------------------------------------------------------------------G

Query:  GLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYSR---------RTVGD--------------------------------------QGNTFN
        GLR+D+K+++ LQP  HL+EAI+ A T+EE I NR K T  R          T G+                                       GN + 
Subjt:  GLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYSR---------RTVGD--------------------------------------QGNTFN

Query:  KTMTG-----------------------------DTKAQQNAALKGEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRL
            G                             D + +   A +G+ +S ++QR+L++PK +   QRH+LFKTRCTI GK+CNVIIDSGS+EN V+ +L
Subjt:  KTMTG-----------------------------DTKAQQNAALKGEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRL

Query:  VSTLNLPLHPHPAPYKAS----------------PTSI--------------------------------------------------------------
        V+ LNL   PH  PYK                  P SI                                                              
Subjt:  VSTLNLPLHPHPAPYKAS----------------PTSI--------------------------------------------------------------

Query:  ---PPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILH
             KKG LF   SGK  L ++ + ILG+V+    +     +IP  ++ L  ++  I      LPPLRDI H I+ L GA+ P+LPHY MSP EYKILH
Subjt:  ---PPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILH

Query:  DQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLF
        D ++ELL KGHI+PS S C VPALLTPKK     M  D                                   L+S YHQI IRPGDEWKT FKTNEGLF
Subjt:  DQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY---------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLF

Query:  EWLVMPFGLSNAPSTFMRLMHQVYQARLRR
        EWLVMPF LSNAPSTFMRLM++V    L +
Subjt:  EWLVMPFGLSNAPSTFMRLMHQVYQARLRR

A0A5D3DIC3 RNA-directed DNA polymerase-like protein8.8e-7038.65Show/hide
Query:  EWIKHVENFFNYMNTPENKK-------GGLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYSRRT--------------VGDQGNTFNKTMTG
        E+I+         N  EN++       G LR+DIK+++ LQP   L+EAIS A T+EE ++ + K T  + T              + +Q +T     T 
Subjt:  EWIKHVENFFNYMNTPENKK-------GGLRYDIKKQLALQPIGHLNEAISAAATIEEQISNRFKRTYSRRT--------------VGDQGNTFNKTMTG

Query:  DTKAQ----------------QNAALK----------GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPL
        D + Q                ++ AL+          G  +  VIQR+L+  K + + Q   LFKTRC IN K+ ++IIDS S+EN VA RLV TLNL  
Subjt:  DTKAQ----------------QNAALK----------GEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPL

Query:  HPHPAPYK-------------ASPTSIPP---------------------------------KKGQLFTLTSGKNLLSDKNSHILGLVI--KNFSNQAST
          HP PYK                T++ P                                  KGQLFT  S K L+ ++   ILGLVI  K    Q   
Subjt:  HPHPAPYK-------------ASPTSIPP---------------------------------KKGQLFTLTSGKNLLSDKNSHILGLVI--KNFSNQAST

Query:  VEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY-
        +E  L  Q LL EF  +    + LPPLRDIQH ID + GA+LPNL +Y+MSP EY+ILH+ +++LL KGH +PSLSPCA PALLTPKK     M  D   
Subjt:  VEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK-----MVADEY-

Query:  --------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR
                                        L+SGYHQI IR GDEWKTTFKTNEGLFEW+VMPFGLSNAP+ FMRLM+QV+   L +
Subjt:  --------------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRR

SwissProt top hitse value%identityAlignment
P0CT41 Transposon Tf2-12 polyprotein3.0e-1125.46Show/hide
Query:  FTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIM--DNKNNLP-PLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELL
        FT     N+    + H L  +     N+ S +    E+ ++  EF  I    N   LP P++ ++  ++         + +Y + P + + ++D++ + L
Subjt:  FTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIM--DNKNNLP-PLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELL

Query:  DKGHIQPSLSPCAVPALLTPKK-----MVAD-----EY----------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPF
          G I+ S +  A P +  PKK     MV D     +Y                            LKS YH I +R GDE K  F+   G+FE+LVMP+
Subjt:  DKGHIQPSLSPCAVPALLTPKK-----MVAD-----EY----------------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPF

Query:  GLSNAPSTFMRLMHQV
        G+S AP+ F   ++ +
Subjt:  GLSNAPSTFMRLMHQV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein8.8e-1932.65Show/hide
Query:  SNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLR------DIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK
        SN+ +   +P+ +Q    ++  I+  +N+LPP         ++H I+  PGA LP L  Y ++    + ++  VQ+LLD   I PS SPC+ P +L PKK
Subjt:  SNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLR------DIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK

Query:  ----------------MVADEY----------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQ
                         ++D +                      L SGYHQI + P D +KT F T  G +E+ VMPFGL NAPSTF R M   ++
Subjt:  ----------------MVADEY----------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQ

Q94BU1 Uncharacterized aarF domain-containing protein kinase At1g71810, chloroplastic5.7e-2673.42Show/hide
Query:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR
        QVYQARLRR+G+VVAVKVQRPGV+AAI+LD LILRY+AG+ +K  + N+D +AV+DEWATSLFKEMDY  EA+NG KFR
Subjt:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.8e-1932.65Show/hide
Query:  SNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLR------DIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK
        SN+ +   +P+ +Q    ++  I+  +N+LPP         ++H I+  PGA LP L  Y ++    + ++  VQ+LLD   I PS SPC+ P +L PKK
Subjt:  SNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLR------DIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLDKGHIQPSLSPCAVPALLTPKK

Query:  ----------------MVADEY----------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQ
                         ++D +                      L SGYHQI + P D +KT F T  G +E+ VMPFGL NAPSTF R M   ++
Subjt:  ----------------MVADEY----------------------LKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQ

Q9MA15 Protein ACTIVITY OF BC1 COMPLEX KINASE 3, chloroplastic8.0e-1246.25Show/hide
Query:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRK-VAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR
        QVY+A+LR +G+VVAVKVQRPG++ AI LD  ++R +  +  K V  + TD   +IDE+A  +++E++Y +EA+N R+F+
Subjt:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRK-VAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR

Arabidopsis top hitse value%identityAlignment
AT1G71810.1 Protein kinase superfamily protein4.1e-2773.42Show/hide
Query:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR
        QVYQARLRR+G+VVAVKVQRPGV+AAI+LD LILRY+AG+ +K  + N+D +AV+DEWATSLFKEMDY  EA+NG KFR
Subjt:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR

AT1G79600.1 Protein kinase superfamily protein5.7e-1346.25Show/hide
Query:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRK-VAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR
        QVY+A+LR +G+VVAVKVQRPG++ AI LD  ++R +  +  K V  + TD   +IDE+A  +++E++Y +EA+N R+F+
Subjt:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRK-VAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR

AT3G24190.1 Protein kinase superfamily protein1.0e-1430.46Show/hide
Query:  KILHDQVQELLDKGHIQPSLSPCAV---------PALLTPKKMVADEYLKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQ
        K+  ++V   ++   I  SL P  +         P +L+P  M     L+    ++   P D      +   G   W  +   LS +P     L  QVY+
Subjt:  KILHDQVQELLDKGHIQPSLSPCAV---------PALLTPKKMVADEYLKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQ

Query:  ARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKF
         RL+ NG++VAVKVQRP V   +++D+ ++R L    RK  +++ D   ++DEWA   F+E+DY  E +NG  F
Subjt:  ARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKF

AT5G24970.1 Protein kinase superfamily protein5.9e-1032.76Show/hide
Query:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR--VYFDPFFTNLNSNNSPEVA
        QVY+A L  +G++VAVKVQRPG+   ++ D L+ + + G  ++ AK   D    ++E    +F E+DY  EAKN  +F     FD     ++ N  P   
Subjt:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR--VYFDPFFTNLNSNNSPEVA

Query:  ATNPTTAIVAKIPRCH
        + N     + K+P+ +
Subjt:  ATNPTTAIVAKIPRCH

AT5G24970.2 Protein kinase superfamily protein5.9e-1032.76Show/hide
Query:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR--VYFDPFFTNLNSNNSPEVA
        QVY+A L  +G++VAVKVQRPG+   ++ D L+ + + G  ++ AK   D    ++E    +F E+DY  EAKN  +F     FD     ++ N  P   
Subjt:  QVYQARLRRNGEVVAVKVQRPGVQAAISLDILILRYLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFR--VYFDPFFTNLNSNNSPEVA

Query:  ATNPTTAIVAKIPRCH
        + N     + K+P+ +
Subjt:  ATNPTTAIVAKIPRCH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAAAGAGGCGGAGAATATCACCGTCCTCTCACCACAAGAAACGACTGTACGCTTGCTGTCAGTCGAGGATGATGTGCGTGAGATTAGGAAAATCTTAGAGATGAT
GTGCATCAAAATGGGCTGCAAAACTGGACAGCAACCCTCGGAGATTCAAGAAATTGTGAACTTGGGGAAAAAACCACAAGAATTTTTTAGAGAAGGTGGCAGAACAAGCA
ATTGGCTAGAAGGACAAAATTTGGAGGGGAAACCATTCCAAGAAGTGATAACAGCTCCAAGAACATTTCAAGAACAACAATCGGGACTCCAAGAATACAATTATAATCCT
TCATTTCGAAGACAAACAGAATGGGGTGGAGATATTTCAAGTGAAGAAGAGTATGAAGAAATCCAAAGGGCAGAACGAAGAAATTTTCGACATCATCAATATCAAGAAAA
TGATTTCAAGATGAAAGTTGATATCCCAACCTATGGGGGAAAGATGGACATCGAGATCTTTCTTGAGTGGATAAAGCATGTTGAAAACTTCTTTAACTATATGAACACAC
CCGAAAACAAGAAGGGCGGGCTGCGCTATGACATTAAGAAGCAATTGGCCTTACAACCGATAGGACACTTGAATGAAGCTATCTCTGCTGCAGCAACAATCGAAGAACAA
ATTTCAAATCGGTTCAAACGAACCTATTCTAGAAGAACCGTGGGTGATCAAGGGAACACATTCAATAAAACTATGACCGGGGATACAAAAGCTCAACAAAATGCAGCACT
CAAAGGAGAACCAGTTTCCCTAGTAATCCAACGACTTCTTCTTGCCCCAAAATCAGACCCTAGCTATCAACGCCACGCCTTGTTCAAGACTCGTTGTACAATCAACGGGA
AGATCTGCAATGTCATTATAGACAGTGGTAGCACAGAAAACGTGGTCGCTAGTAGACTTGTTTCCACTTTGAACCTCCCATTACATCCACATCCAGCACCATACAAGGCT
TCCCCTACGTCCATTCCTCCAAAGAAAGGTCAGCTTTTTACTTTAACTTCTGGGAAAAATTTGCTTAGTGACAAAAACTCTCATATTCTTGGATTGGTTATCAAGAACTT
TTCTAATCAAGCTTCTACTGTGGAAATACCTTTGGAGGTGCAAAATCTGCTCGATGAATTCAACAGCATCATGGATAACAAGAACAACCTTCCACCATTAAGGGACATTC
AGCATGCCATTGATTTTTTGCCTGGTGCTACGCTCCCAAATTTACCTCACTACAAGATGAGCCCAACAGAGTATAAGATCCTTCACGATCAAGTGCAGGAACTTCTCGAT
AAGGGGCATATTCAACCGAGTTTGAGCCCGTGCGCCGTCCCAGCTTTGTTAACCCCAAAAAAGATGGTAGCTGACGAATACTTAAAAAGTGGATACCATCAAATCCACAT
TCGACCGGGAGATGAGTGGAAAACGACATTTAAAACTAATGAAGGACTCTTTGAGTGGCTCGTGATGCCGTTCGGACTATCTAATGCCCCGAGTACGTTTATGCGGCTTA
TGCACCAGGTTTATCAAGCAAGGCTTCGACGTAATGGAGAAGTGGTTGCTGTCAAAGTCCAAAGGCCTGGAGTTCAGGCTGCTATATCCCTGGATATCTTGATCTTGCGC
TATTTAGCAGGTGTATTTCGCAAAGTTGCCAAGTTAAATACTGATTTTCAGGCAGTTATTGATGAATGGGCAACGAGTCTTTTCAAGGAGATGGATTACAGGAGAGAAGC
AAAAAATGGTCGCAAGTTCAGGGTCTACTTTGATCCCTTCTTCACTAACTTAAACTCTAATAACTCTCCTGAGGTTGCCGCAACAAACCCCACCACTGCCATTGTAGCCA
AAATCCCTCGTTGTCATGAGAAGCCGAAAGTTGTCACCAATCGGATCCTAGCTCGTCTAAGAATGTTTTTGTTCATTAGAATAAGGAATCACAAACAAGGAAAATTCGCC
GTACAACTAAGTTTGTTTGTGAAGTCTAAGTTCGCCCGTACAGCAAACCCCACCCTTGTTGTGATGTTGAAGGATTCTGACAACGGGTGGGCTAGGTTTGCTAAAAGTGG
AAGAGAAGGAGAGAGTTTAGAGAGGAGAAGAAGTGGTGTAGCCAAAACTTATCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCAAAGAGGCGGAGAATATCACCGTCCTCTCACCACAAGAAACGACTGTACGCTTGCTGTCAGTCGAGGATGATGTGCGTGAGATTAGGAAAATCTTAGAGATGAT
GTGCATCAAAATGGGCTGCAAAACTGGACAGCAACCCTCGGAGATTCAAGAAATTGTGAACTTGGGGAAAAAACCACAAGAATTTTTTAGAGAAGGTGGCAGAACAAGCA
ATTGGCTAGAAGGACAAAATTTGGAGGGGAAACCATTCCAAGAAGTGATAACAGCTCCAAGAACATTTCAAGAACAACAATCGGGACTCCAAGAATACAATTATAATCCT
TCATTTCGAAGACAAACAGAATGGGGTGGAGATATTTCAAGTGAAGAAGAGTATGAAGAAATCCAAAGGGCAGAACGAAGAAATTTTCGACATCATCAATATCAAGAAAA
TGATTTCAAGATGAAAGTTGATATCCCAACCTATGGGGGAAAGATGGACATCGAGATCTTTCTTGAGTGGATAAAGCATGTTGAAAACTTCTTTAACTATATGAACACAC
CCGAAAACAAGAAGGGCGGGCTGCGCTATGACATTAAGAAGCAATTGGCCTTACAACCGATAGGACACTTGAATGAAGCTATCTCTGCTGCAGCAACAATCGAAGAACAA
ATTTCAAATCGGTTCAAACGAACCTATTCTAGAAGAACCGTGGGTGATCAAGGGAACACATTCAATAAAACTATGACCGGGGATACAAAAGCTCAACAAAATGCAGCACT
CAAAGGAGAACCAGTTTCCCTAGTAATCCAACGACTTCTTCTTGCCCCAAAATCAGACCCTAGCTATCAACGCCACGCCTTGTTCAAGACTCGTTGTACAATCAACGGGA
AGATCTGCAATGTCATTATAGACAGTGGTAGCACAGAAAACGTGGTCGCTAGTAGACTTGTTTCCACTTTGAACCTCCCATTACATCCACATCCAGCACCATACAAGGCT
TCCCCTACGTCCATTCCTCCAAAGAAAGGTCAGCTTTTTACTTTAACTTCTGGGAAAAATTTGCTTAGTGACAAAAACTCTCATATTCTTGGATTGGTTATCAAGAACTT
TTCTAATCAAGCTTCTACTGTGGAAATACCTTTGGAGGTGCAAAATCTGCTCGATGAATTCAACAGCATCATGGATAACAAGAACAACCTTCCACCATTAAGGGACATTC
AGCATGCCATTGATTTTTTGCCTGGTGCTACGCTCCCAAATTTACCTCACTACAAGATGAGCCCAACAGAGTATAAGATCCTTCACGATCAAGTGCAGGAACTTCTCGAT
AAGGGGCATATTCAACCGAGTTTGAGCCCGTGCGCCGTCCCAGCTTTGTTAACCCCAAAAAAGATGGTAGCTGACGAATACTTAAAAAGTGGATACCATCAAATCCACAT
TCGACCGGGAGATGAGTGGAAAACGACATTTAAAACTAATGAAGGACTCTTTGAGTGGCTCGTGATGCCGTTCGGACTATCTAATGCCCCGAGTACGTTTATGCGGCTTA
TGCACCAGGTTTATCAAGCAAGGCTTCGACGTAATGGAGAAGTGGTTGCTGTCAAAGTCCAAAGGCCTGGAGTTCAGGCTGCTATATCCCTGGATATCTTGATCTTGCGC
TATTTAGCAGGTGTATTTCGCAAAGTTGCCAAGTTAAATACTGATTTTCAGGCAGTTATTGATGAATGGGCAACGAGTCTTTTCAAGGAGATGGATTACAGGAGAGAAGC
AAAAAATGGTCGCAAGTTCAGGGTCTACTTTGATCCCTTCTTCACTAACTTAAACTCTAATAACTCTCCTGAGGTTGCCGCAACAAACCCCACCACTGCCATTGTAGCCA
AAATCCCTCGTTGTCATGAGAAGCCGAAAGTTGTCACCAATCGGATCCTAGCTCGTCTAAGAATGTTTTTGTTCATTAGAATAAGGAATCACAAACAAGGAAAATTCGCC
GTACAACTAAGTTTGTTTGTGAAGTCTAAGTTCGCCCGTACAGCAAACCCCACCCTTGTTGTGATGTTGAAGGATTCTGACAACGGGTGGGCTAGGTTTGCTAAAAGTGG
AAGAGAAGGAGAGAGTTTAGAGAGGAGAAGAAGTGGTGTAGCCAAAACTTATCTTTAA
Protein sequenceShow/hide protein sequence
MGKEAENITVLSPQETTVRLLSVEDDVREIRKILEMMCIKMGCKTGQQPSEIQEIVNLGKKPQEFFREGGRTSNWLEGQNLEGKPFQEVITAPRTFQEQQSGLQEYNYNP
SFRRQTEWGGDISSEEEYEEIQRAERRNFRHHQYQENDFKMKVDIPTYGGKMDIEIFLEWIKHVENFFNYMNTPENKKGGLRYDIKKQLALQPIGHLNEAISAAATIEEQ
ISNRFKRTYSRRTVGDQGNTFNKTMTGDTKAQQNAALKGEPVSLVIQRLLLAPKSDPSYQRHALFKTRCTINGKICNVIIDSGSTENVVASRLVSTLNLPLHPHPAPYKA
SPTSIPPKKGQLFTLTSGKNLLSDKNSHILGLVIKNFSNQASTVEIPLEVQNLLDEFNSIMDNKNNLPPLRDIQHAIDFLPGATLPNLPHYKMSPTEYKILHDQVQELLD
KGHIQPSLSPCAVPALLTPKKMVADEYLKSGYHQIHIRPGDEWKTTFKTNEGLFEWLVMPFGLSNAPSTFMRLMHQVYQARLRRNGEVVAVKVQRPGVQAAISLDILILR
YLAGVFRKVAKLNTDFQAVIDEWATSLFKEMDYRREAKNGRKFRVYFDPFFTNLNSNNSPEVAATNPTTAIVAKIPRCHEKPKVVTNRILARLRMFLFIRIRNHKQGKFA
VQLSLFVKSKFARTANPTLVVMLKDSDNGWARFAKSGREGESLERRRSGVAKTYL