; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011650 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011650
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:29971063..29976592
RNA-Seq ExpressionLag0011650
SyntenyLag0011650
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7588381.1 Integrase catalytic core [Arabidopsis suecica]1.6e-15136.36Show/hide
Query:  TIWQDLVDYRPTYDC-----SCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEAS
        T+W +L       DC     SC   K + +  ++  V+ FL GLN+SYS +R+QI++   +P + ++++L+ Q+  QR   +  P+ S+     ++A  S
Subjt:  TIWQDLVDYRPTYDC-----SCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEAS

Query:  KKQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEAN------------AVTQPQSN---------------------FF
         + + N       Q+ +CSHC   GHTVDKCYKIHGYP G++ +N K  + K V  N            A+T+  +N                     F 
Subjt:  KKQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEAN------------AVTQPQSN---------------------FF

Query:  SSLNQTQYSICSSA----------------------------VHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGD--------
        S L  T  +  SS+                            V +S +WI+DSGA  H+CH  +LF +       +V LPT + + +  +G         
Subjt:  SSLNQTQYSICSSA----------------------------VHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGD--------

Query:  ------------------------------------IQDKNRLMMIGRAESSNGLYIL------LPPDKPCLLSETICSVSM---VLGMIVLDTSHL---
                                            IQD  + +MIGR E  + LY+L       P D+    +  +   S+    LG   ++ S +   
Subjt:  ------------------------------------IQDKNRLMMIGRAESSNGLYIL------LPPDKPCLLSETICSVSM---VLGMIVLDTSHL---

Query:  ------------------RAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKT
                           AKQ+ L F   N+V    FD+VH DVWGPF  PT+ GY+YFLT+VDD +R TW +L+ +KS+ + I P F ++V TQ+  T
Subjt:  ------------------RAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKT

Query:  IKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYS
        +K  RSDNAP+L+F E F  KG +H FSC ETP+QNSV ERKHQH+LNVAR+L+FQ+ VPL +WG+CVLTA +LINR+P PLLK K+P E+L  +  DY 
Subjt:  IKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYS

Query:  GLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSA-----EAIDTLFSDHVLPCSIVDPVALH
        G RVFGCLCY+ST + NR+KF PRAKPC+FLGY PGVKGY+L D+    + +SR+VVF E+ FPF+  + S      ++ID    D  +  + + PV + 
Subjt:  GLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSA-----EAIDTLFSDHVLPCSIVDPVALH

Query:  EANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCN---SACLYPIDDYLS
        E+                          P    DVPN    +V  P       P+ VN+           R    P +LN Y+CN   S+  YP+ DY+S
Subjt:  EANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCN---SACLYPIDDYLS

Query:  YDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFI
        YD  ST ++ +I +V+   EP+ F QA K D W  AM++E++A+E T+TW I  LP  KH +GC+WVY+ K   DG+++RYKARLVAKGY+QQEG+DF  
Subjt:  YDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFI

Query:  LF
         F
Subjt:  LF

KZV17946.1 hypothetical protein F511_10775 [Dorcoceras hygrometricum]3.3e-15237.73Show/hide
Query:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLA----AEASK
        T+W +L D++P   C CG +K  + +   E  M FLMGLN+SY+ +RAQILLM+P+P I+K+FSLV+QEERQR     V     +Q  +++      A K
Subjt:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLA----AEASK

Query:  KQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTK-------------AVEANAVTQPQS-----NFFSS-----------LNQ
           N++  + D  +  CSHC++  HTVDKCYK+HGYPPG+     K S  K             A   N   +P+       F SS           L Q
Subjt:  KQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTK-------------AVEANAVTQPQS-----NFFSS-----------LNQ

Query:  T----------QYSICSS-AVHNSSAWILDSGAARHIC---HQFSLFQNWRRVYGITVVLPTTYRMSVEFMG----------------------------
        T           YS+ +S  +   S+WI+D+GA  HIC   H F  F+     +   V LP    + V  +G                            
Subjt:  T----------QYSICSS-AVHNSSAWILDSGAARHIC---HQFSLFQNWRRVYGITVVLPTTYRMSVEFMG----------------------------

Query:  ----------------DIQDKNRLMMIGRAESSNGLYIL--------------------------LPPDKPCLLSETICSVSMVLG--MIVLDTSHLRAK
                         IQ  N+   IG       LYIL                          +P  K  +L +T+ + S +    +   +  HL +K
Subjt:  ----------------DIQDKNRLMMIGRAESSNGLYIL--------------------------LPPDKPCLLSETICSVSMVLG--MIVLDTSHLRAK

Query:  QRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATK
        Q+RL F  NN +    FD+VH D+WGPF      G+KYFLT+VDD SRYTW  L+ SKS+ I I P F +++  QF K+IK  RSDNAP+L+F EFF  +
Subjt:  QRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATK

Query:  GTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKF
        G V   SC+E PQQNSV ERKHQH+LNVARALLFQS +PL +W +C+LTA YLINR PAPLL +KTPFEL+H +   YS LRVFGCLCY STL N R+KF
Subjt:  GTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKF

Query:  DPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVA
         PRA   +FLGY PG KGY+L ++   ++ ISRDV+F E  FPF +   S+                      H  +N+++   +Q P            
Subjt:  DPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVA

Query:  AQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC-------LYPIDDYLSYDHFSTTHKHFILNVSAAYEPS
           +  T++P                    VN   T +S     R   KPS LN YHC + C        +PI + LS    S  +K  ++N+S+  +P+
Subjt:  AQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC-------LYPIDDYLSYDHFSTTHKHFILNVSAAYEPS

Query:  YFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF
         ++QA+    W +AM +E+ A+E  +TW+IV LP GKH VGC+WVY+ K++ DG+++RYKARLVAKGY+QQEG+++F  F
Subjt:  YFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]7.8e-16237.95Show/hide
Query:  DIKSTPVKLQLVDQSVTIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSS
        D+ S   KL+      T+W +L DY+PT  C+CG ++    +   E VM FLMGLNDSY+ VRAQ+L++ P+P I KVF+LVIQEERQR     V     
Subjt:  DIKSTPVKLQLVDQSVTIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSS

Query:  DQITLLAAEASKKQNNNRFRRNDN------QRPVCSHCNVKGHTVDKCYKIHGYPPGY------------------RSRNTKASSTKAVEANAVTQPQS-
        D   +L+   S        R + N       R +CSHC+ + HTVDKCYK+HGYPPG+                   S  T   + +   ++++TQ Q  
Subjt:  DQITLLAAEASKKQNNNRFRRNDN------QRPVCSHCNVKGHTVDKCYKIHGYPPGY------------------RSRNTKASSTKAVEANAVTQPQS-

Query:  ---NFFSSLNQTQYS----------------ICSSAVH----NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTT--------------------
            F SS  QT+ +                ICS+  H        WI+D+GA  HIC   S+F++ R +    VVLP T                    
Subjt:  ---NFFSSLNQTQYS----------------ICSSAVH----NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTT--------------------

Query:  ---------------------YRMSVEFMGD---IQDKNRLMMIGRAESSNGLYILLPPDK--PCLLSET-------------------ICSVSMVLGMI
                             +  SV FM D   IQD +++ MIG  +    LY+L  PD+  P  +  T                   + S+  VL + 
Subjt:  ---------------------YRMSVEFMGD---IQDKNRLMMIGRAESSNGLYILLPPDK--PCLLSET-------------------ICSVSMVLGMI

Query:  VLD------TSHLRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFR
          D      + HL +KQRRL     N++++ IF+++H D WGPF   +  G+++F T+VDD SRYTW +++ SKSD + I P F ++V TQF  T+K  R
Subjt:  VLD------TSHLRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFR

Query:  SDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVF
        SDNAP+L F +FFA  G  H  SC+E PQQNSV ERKHQH+LNVARALLFQS +PL +W DC+ T+ YLINR P+P+L HKTPFELLH +   YS L+VF
Subjt:  SDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVF

Query:  GCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQE
        GCLCYASTL ++R KF PRA  CVF+GY PG KGY+L ++   ++ ISRDV+F EN FP+ +T   + + D  F   V P S + P      +   D+Q+
Subjt:  GCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQE

Query:  HQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYH-------CNSACLYPIDDYLSYDHFST
        H                                                        R++RPH  PS L  YH       C+++  +PI   ++Y   S+
Subjt:  HQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYH-------CNSACLYPIDDYLSYDHFST

Query:  THKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF
        +H+ F+ N+S+  EP+ F QA+    W++AMD E++A+E   TW+IV LP GK  VGC+WVY+ K+  DG++ RYKARLVAKGY+QQEG+D+   F
Subjt:  THKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF

KZV39348.1 hypothetical protein F511_17540 [Dorcoceras hygrometricum]6.4e-15637.47Show/hide
Query:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLL--AAEASKKQ
        T+W +L D++P   C CG +K  + +   E  M FLMGLN+SY+ +RAQILLM+P+P I+K+FSLV+QEERQR     V     DQ  ++   A  +  +
Subjt:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLL--AAEASKKQ

Query:  NNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKA-------------------------------------------VEANAVT
             +   + +  C+HC++  HTVDKCYK+HGYPPG+     K S  K+                                           +      
Subjt:  NNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKA-------------------------------------------VEANAVT

Query:  QPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHIC---HQFSLFQ-----------------------------------------NWRRVYGITV
        QP  +  S  N T     S     + +WI+D+GA  HIC   H F  F+                                         N   +  +T 
Subjt:  QPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHIC---HQFSLFQ-----------------------------------------NWRRVYGITV

Query:  VLPTTYRMSVEFMGDIQDKNRLMMIGRAESSNGLYILL-PPDKPCLLSETICSVSM---------------VLGMIVLDT------------SHLRAKQR
         +P +   S E +  IQ  N+   IG       LY+L  PP     +  T+ S +                +LG ++ ++             HL +KQ+
Subjt:  VLPTTYRMSVEFMGDIQDKNRLMMIGRAESSNGLYILL-PPDKPCLLSETICSVSM---------------VLGMIVLDT------------SHLRAKQR

Query:  RLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGT
        RL F  NN V    FD+VH D+WGPF      G+KYFLT+VDD SRYTW  L+ SKSD   I P F +++ TQF K+IK  RSDNAP+LQF EFF  +G 
Subjt:  RLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGT

Query:  VHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDP
        V   SC+E PQQNS+ ERKHQH+LNVARALLFQS +PL +W DC+LT+ YLINR+PAP+L +KTPFE++H +  ++S LRVFGCLCY STL ++R+KF P
Subjt:  VHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDP

Query:  RAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQ
        RA   +FLGY PG KGY+L ++   ++ ISRDV F E  FPF +  +SA            PC       L+E + L  +Q  Q P              
Subjt:  RAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQ

Query:  PDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC-------LYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYF
               PN              VD  VVN P        + R   +PS L+ YHC + C        +P+   LS    S  +K  ++N+S+  EP+ +
Subjt:  PDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC-------LYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYF

Query:  HQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF
        +QA+    W +AMD+E+ A+ER +TW+IV LPPGKH VGC+WVY+ K++ DG+++RYKARLVAKGY+QQEG++FF  F
Subjt:  HQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF

RVW82526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.5e-15736.69Show/hide
Query:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLL------AAEA
        ++W +L +++    C+CGG++  ++  + E VM FL+GLN+S++ ++AQILLM P P + KVFSLV+QEE QR   +L  S S    T +      A+ A
Subjt:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLL------AAEA

Query:  SKKQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSR-----------------------------------------------------NT
        S   N++R R++   RP+C+HCN+ GHTVD+CYKIHGY PG+R+R                                                     ++
Subjt:  SKKQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSR-----------------------------------------------------NT

Query:  KASSTKAVEANAVTQPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGD---------------
          SS    ++N + Q  SNF   L+ +     SS+  N S WILDSGA  H+C   S+F +       TV LPT  ++ +  +G                
Subjt:  KASSTKAVEANAVTQPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGD---------------

Query:  -----------------------------IQDKNRLMMIGRAESSNGLYIL-------------------------------------LPPDKPCLLSET
                                     IQD ++  +IG       LY+L                                     L   KP L  ++
Subjt:  -----------------------------IQDKNRLMMIGRAESSNGLYIL-------------------------------------LPPDKPCLLSET

Query:  ICSVSMVLGMIVLDTSHLRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKT
          + ++   +  L      AKQ+RL F  +N+++S  FD++HCD+WGPF  PT+ G++YFLT+VDDC+R TW  L+ +KSD   I P+FF +V T+F  T
Subjt:  ICSVSMVLGMIVLDTSHLRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKT

Query:  IKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYS
        IK  RSDNAP+L     F     +H FSC+ETPQQNSV ERKHQH+LNVARAL FQS +P+ +WGDCVLT+ YLINRIP+PLL +KTPFELLH +S  YS
Subjt:  IKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYS

Query:  GLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNL
         L+ FGCLCY+STL + R KF PRA PCVFLGY  G KGY++ D+   ++ +SR+V F E+ FPF  +  +       FS  VLP               
Subjt:  GLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNL

Query:  LDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSACL-----------YPIDD
                P+  P  S D   + P+      NP  DS           P   +  +T     RS+R    P +L+ YHC+ A             YP+ D
Subjt:  LDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSACL-----------YPIDD

Query:  YLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGID
         +SY+  S + + F +++S   EP+ + +A+    W+ AM +E++A+E  +TW++  LPPGK  VGCKW+YR KY  DG+++RYKARLVAKG++QQEG+D
Subjt:  YLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGID

Query:  FFILF
        FF+ F
Subjt:  FFILF

TrEMBL top hitse value%identityAlignment
A0A2N9GZW3 Integrase catalytic domain-containing protein2.8e-16537.81Show/hide
Query:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNN
        ++W +L ++RP  DCSCG +K ++ + + E+VM FLMGLNDS+S VRAQIL+ +P+P ITK F+LVIQEERQR       + ++D + L     + + N 
Subjt:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNN

Query:  NRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAV--------TQPQSNFFSSLNQTQYSICS--SAVH------------
         + +     RP+CSHC + GHTVDKCYK+HGYPPGY+    KA    A +++AV        TQ Q     S+  +Q S+ S  S+ H            
Subjt:  NRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAV--------TQPQSNFFSSLNQTQYSICS--SAVH------------

Query:  -------------------------------------------NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMM
                                                   + S WILD+GA  H+ H    F +        + LP   ++    +G +Q    L++
Subjt:  -------------------------------------------NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMM

Query:  --------------------------------------------IGRAESSNGLYILLPPDK--PCLLSETICSVSMVLGMIVLDTSHLR----------
                                                    IG     NGLY L       P      + + + V    V D  H R          
Subjt:  --------------------------------------------IGRAESSNGLYILLPPDK--PCLLSETICSVSMVLGMIVLDTSHLR----------

Query:  -----------------------AKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQ
                               +KQ+RL F    H A   FD++HCD+WGP+  PT    +YFLT+VDDC+R TW FLM  KS+   +I  FF L+ TQ
Subjt:  -----------------------AKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQ

Query:  FNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRS
        F+ +IK+ RSDN P+ +   F+A  GT+HQ SC+ TPQQN+  ERKHQHLL VARAL FQ+ +PL FWG CVLTAT+LINRIP PLL +K+PFELL K+ 
Subjt:  FNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRS

Query:  VDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVD-PVAL-
         +YS LRVFGCLCYA+TL++NR KF PR+K CV LGY  G+KGYRL D+  +Q+ +SRDV+F+EN FPFH+   S     T  +  VLP  I D P++L 
Subjt:  VDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVD-PVAL-

Query:  ---HEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQ-PDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSA--------
            + N    S     PL  P S +        + T  P P   +++Q PD+V        N PST  +LR+STR H  PS+L  +HCN+A        
Subjt:  ---HEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQ-PDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSA--------

Query:  -----------CLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVD
                    ++P+ +Y+SY   +  +  F+L+ SA  EP+ FH+A K  +W +AM +E+ A+E   TW++ PLPPGK  +G KWV++ K ++DG+++
Subjt:  -----------CLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVD

Query:  RYKARLVAKGYSQQEGIDFFILF
        RYKARLVAKGY+QQEG D+F  F
Subjt:  RYKARLVAKGYSQQEGIDFFILF

A0A2N9HKE6 Uncharacterized protein1.1e-16640.07Show/hide
Query:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNN
        ++W +L ++RP  DCSCG +K ++ + + E+VM FLMGLNDS+S VRAQIL+ +P+P ITK F+LVIQEERQR       + ++D + L     + + N 
Subjt:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNN

Query:  NRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAV--------TQPQSNFFSSLNQTQYSICS------------------
         + +     RP+CSHC + GHTVDKCYK+HGYPPGY+    KA    A +++AV        TQ Q     S+  +Q S+ S                  
Subjt:  NRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAV--------TQPQSNFFSSLNQTQYSICS------------------

Query:  --SAVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMM---------IGRAESSNGLYILLPPDK--PCLLSETI
          S+  + +A  +    + H+ H    F +        + LP   ++    +G +Q    L++         IG     NGLY L       P     ++
Subjt:  --SAVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMM---------IGRAESSNGLYILLPPDK--PCLLSETI

Query:  CSVSMVLGMIVLDTSHLR---------------------------------AKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCS
         + + V    V D  H R                                 +KQ+RL F    H A   FD++HCD+WGP+  PT    +YFLT+VDDC+
Subjt:  CSVSMVLGMIVLDTSHLR---------------------------------AKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCS

Query:  RYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCV
        R TW FLM  KS+   +I  FF L+ TQF+ +IK+ RSDN P+ +   F+A  GT+HQ SC+ TPQQN+  ERKHQHLL VARAL FQ+ +PL FWG CV
Subjt:  RYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCV

Query:  LTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHST
        LTAT+LINRIP PLL +K  FELL K+  +YS LRVFGCLCYA+TL++NR KF PR+K CV LGY  G+KGYRL D+  +Q+ +SRDV+F+EN FPFH+ 
Subjt:  LTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHST

Query:  DVSAEAIDTLFSDHVLPCSIVD-PVAL----HEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQ-PDLVDLVDPEVVNQPSTHVSLR
          S     T  +  VLP  I D P++L     + N    S     PL  P S +        + T  P P   +++Q PD+V        N PST  +LR
Subjt:  DVSAEAIDTLFSDHVLPCSIVD-PVAL----HEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQ-PDLVDLVDPEVVNQPSTHVSLR

Query:  RSTRPHVKPSFLNQYHCNSA-------------------CLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWT
        +STR H  PS+L  +HCN+A                    ++P+ +Y+SY   +  +  F+L+ SA  EP+ FH+A K  +W +AM +E+ A+E   TW+
Subjt:  RSTRPHVKPSFLNQYHCNSA-------------------CLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWT

Query:  IVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF
        + PLPPGK  +G KWV++ K ++DG+++RYKARLVAKGY+QQEG D+F  F
Subjt:  IVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF

A0A2N9HKX8 Integrase catalytic domain-containing protein2.5e-16639.11Show/hide
Query:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLA-AEASKKQN
        ++W +L ++R   DCSCG +K ++ + + E+VM FLMGLNDS++ VRAQIL+ +P+P ITK F+LV+QEERQR       + + D + L    EA +   
Subjt:  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLA-AEASKKQN

Query:  NNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNT--KASSTKA--------------------VEANAVTQPQSNFFSSLNQTQYSICSSAV
          + +    +RP+CSHC + GHTVDKCYK+HGYPPGY+ +N    A+ T A                    + + A   P     S+    Q    SS+ 
Subjt:  NNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNT--KASSTKA--------------------VEANAVTQPQSNFFSSLNQTQYSICSSAV

Query:  HNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMMIGRAESSNGLYIL-------LPPDKPCLL-------------
         + +A  +    A H+ H  S F +        + LP   +     +G +QD      IG     NGLY L        P   P +              
Subjt:  HNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMMIGRAESSNGLYIL-------LPPDKPCLL-------------

Query:  ---------SETICSVSMVLGMIVLDTS--HLR----AKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDA
                 S  +  +  V+  +V+ ++  H +    +KQ+RL F  + HV +  F+++HCD+WGP+  PT    KYFLT+VDD +R TW FLM  KS+ 
Subjt:  ---------SETICSVSMVLGMIVLDTS--HLR----AKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDA

Query:  IHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPL
        + +I  FF L+ TQF+ TIK  RSDN  +     F+A  GT+HQ SC+ TPQQN+  ERKHQHLL VARAL FQ+ +PL FWG CVLTAT+LINR P PL
Subjt:  IHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPL

Query:  LKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDH
        L +K+PFE+L  +S +YS LRVFGCLCYA+TL++NR KF PR+  C+ LGY  G+KGYRL ++  RQ+ +SRDV+F+EN FPFH++     A   +    
Subjt:  LKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDH

Query:  VLPC-SIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAA---QPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYH
        V    + + P++    N+ L   +H  P +    S    ++    P   T  P  A    +    V + +  V + PS   ++R+STRPH  PS+L ++H
Subjt:  VLPC-SIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAA---QPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYH

Query:  CNSACL------------------YPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVY
        CNSA L                  +P+ ++LSY + +  +  F+LN S   EP+ F +A +   W EAM +E+ A+E  +TWTI PLP GK  +G KWV+
Subjt:  CNSACL------------------YPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVY

Query:  RNKYKTDGTVDRYKARLVAKGYSQQEGIDFF
        + K K+DG+++RYKARLVAKGY+Q+EG D+F
Subjt:  RNKYKTDGTVDRYKARLVAKGYSQQEGIDFF

A0A2N9IZK3 Uncharacterized protein3.2e-16137.04Show/hide
Query:  IWQDLVDYRPTYDCSCGG------IKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNL---VPSTSSDQITLLAA
        +W + ++YRP   C+CG        K +I++   ++V  FLMGLN+++++VR QILLM P+P I KVFSL+   E+Q+ AG L   V  +S D   L   
Subjt:  IWQDLVDYRPTYDCSCGG------IKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNL---VPSTSSDQITLLAA

Query:  EASKKQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEA-----------------------------------------
         AS+K            +P+CSHC  KGH  +KCYK+HGYPPG++ +   A +   V                                           
Subjt:  EASKKQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEA-----------------------------------------

Query:  -----------------------NAVTQPQSNF---------FSSLNQTQYSICSS-----AVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLP
                                A  QP SN          FS  N   YS+ S+        ++S W++D+GA  H+      F   + V+ +TV LP
Subjt:  -----------------------NAVTQPQSNF---------FSSLNQTQYSICSS-----AVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLP

Query:  TTYRMSVEFMGD--------------------------------------------IQDKNRLMMIGRAESSNGLYIL------------LPPDK-----
            ++V  +G                                             IQD  +  MIG     NGLY+L              PD      
Subjt:  TTYRMSVEFMGD--------------------------------------------IQDKNRLMMIGRAESSNGLYIL------------LPPDK-----

Query:  ----------------PCLLSETICSVSMVLGMIVLDTSHL-----------RAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDD
                         C L     S    L  ++ D SH             AKQ+RL FP NNHV+S  FD++H D+WGP+  PT  GYKYFLTLVDD
Subjt:  ----------------PCLLSETICSVSMVLGMIVLDTSHL-----------RAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDD

Query:  CSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGD
        C+R TW +LM SKS+   ++  F  ++ TQF   +K  RSDN  +    +F+AT+G +HQ SC+ETPQQNSV ERKHQH+LNVAR+L FQS +PL+FWG 
Subjt:  CSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGD

Query:  CVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFH
         VLTA YLINR+P+P+L HK+P+E L  ++  YS LRVFGCLC+ASTL+N+R+KFDPRAKPCVFLGY  GVKGY+L D+    +IISRDV+F E+ FPF 
Subjt:  CVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFH

Query:  ST--------DVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQT------DVPNPAQDSVVQPDLVDLVDPEVV
        +T        D +       FSD  L  +I  P+     N  L S+E       P S +  +   P A++      DVP P  +SV  P           
Subjt:  ST--------DVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQT------DVPNPAQDSVVQPDLVDLVDPEVV

Query:  NQPSTHVSLRRSTRPHVKPSFLNQYHC---------------NSACLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAME
                LRRSTR    P++L  YHC               ++  LYP+   LSYDH S +H+ F L+V+A  EP+ F QA +  HW++AM  E++A+E
Subjt:  NQPSTHVSLRRSTRPHVKPSFLNQYHC---------------NSACLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAME

Query:  RTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF
          +TW++  LPPGKH +GCKWVY+ K K DG+++RYKARLVAKGY+QQEG+D+
Subjt:  RTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 83.8e-16237.95Show/hide
Query:  DIKSTPVKLQLVDQSVTIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSS
        D+ S   KL+      T+W +L DY+PT  C+CG ++    +   E VM FLMGLNDSY+ VRAQ+L++ P+P I KVF+LVIQEERQR     V     
Subjt:  DIKSTPVKLQLVDQSVTIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSS

Query:  DQITLLAAEASKKQNNNRFRRNDN------QRPVCSHCNVKGHTVDKCYKIHGYPPGY------------------RSRNTKASSTKAVEANAVTQPQS-
        D   +L+   S        R + N       R +CSHC+ + HTVDKCYK+HGYPPG+                   S  T   + +   ++++TQ Q  
Subjt:  DQITLLAAEASKKQNNNRFRRNDN------QRPVCSHCNVKGHTVDKCYKIHGYPPGY------------------RSRNTKASSTKAVEANAVTQPQS-

Query:  ---NFFSSLNQTQYS----------------ICSSAVH----NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTT--------------------
            F SS  QT+ +                ICS+  H        WI+D+GA  HIC   S+F++ R +    VVLP T                    
Subjt:  ---NFFSSLNQTQYS----------------ICSSAVH----NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTT--------------------

Query:  ---------------------YRMSVEFMGD---IQDKNRLMMIGRAESSNGLYILLPPDK--PCLLSET-------------------ICSVSMVLGMI
                             +  SV FM D   IQD +++ MIG  +    LY+L  PD+  P  +  T                   + S+  VL + 
Subjt:  ---------------------YRMSVEFMGD---IQDKNRLMMIGRAESSNGLYILLPPDK--PCLLSET-------------------ICSVSMVLGMI

Query:  VLD------TSHLRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFR
          D      + HL +KQRRL     N++++ IF+++H D WGPF   +  G+++F T+VDD SRYTW +++ SKSD + I P F ++V TQF  T+K  R
Subjt:  VLD------TSHLRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFR

Query:  SDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVF
        SDNAP+L F +FFA  G  H  SC+E PQQNSV ERKHQH+LNVARALLFQS +PL +W DC+ T+ YLINR P+P+L HKTPFELLH +   YS L+VF
Subjt:  SDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVF

Query:  GCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQE
        GCLCYASTL ++R KF PRA  CVF+GY PG KGY+L ++   ++ ISRDV+F EN FP+ +T   + + D  F   V P S + P      +   D+Q+
Subjt:  GCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQE

Query:  HQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYH-------CNSACLYPIDDYLSYDHFST
        H                                                        R++RPH  PS L  YH       C+++  +PI   ++Y   S+
Subjt:  HQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYH-------CNSACLYPIDDYLSYDHFST

Query:  THKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF
        +H+ F+ N+S+  EP+ F QA+    W++AMD E++A+E   TW+IV LP GK  VGC+WVY+ K+  DG++ RYKARLVAKGY+QQEG+D+   F
Subjt:  THKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.5e-4629.09Show/hide
Query:  LRAKQRRLAF---PFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKL--
        L  KQ RL F       H+   +F VVH DV GP    T     YF+  VD  + Y  T+L+  KSD   +   F       FN  +     DN  +   
Subjt:  LRAKQRRLAF---PFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKL--

Query:  -QFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLL--KHKTPFELLHKRSVDYSGLRVFGCLC
         + ++F   KG  +  +   TPQ N V+ER  + +   AR ++  +K+   FWG+ VLTATYLINRIP+  L    KTP+E+ H +      LRVFG   
Subjt:  -QFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLL--KHKTPFELLHKRSVDYSGLRVFGCLC

Query:  YASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFEN--------KFPFHSTDVSAEAIDTLF-SDHVLPCSIVDPVALHEANNL
        Y   + N + KFD ++   +F+GY P   G++L D V  + I++RDVV  E         KF       S E+ +  F +D         P    E +N+
Subjt:  YASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFEN--------KFPFHSTDVSAEAIDTLF-SDHVLPCSIVDPVALHEANNL

Query:  --LDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRR---------------------STRPHVKPSFLNQY
          L   +  E   FP  S   +      QT+ PN +++     ++  L D +  N+   + S +R                      T  H+K   ++  
Subjt:  --LDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRR---------------------STRPHVKPSFLNQY

Query:  HCNSAC--------LYPIDDYLSYDHFSTTHKHFILNVSAAYE--PSYFHQAIKFD---HWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKY
          N                  +SY+    +    +LN    +   P+ F +    D    W+EA+++E+ A +  +TWTI   P  K+IV  +WV+  KY
Subjt:  HCNSAC--------LYPIDDYLSYDHFSTTHKHFILNVSAAYE--PSYFHQAIKFD---HWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKY

Query:  KTDGTVDRYKARLVAKGYSQQEGIDF
           G   RYKARLVA+G++Q+  ID+
Subjt:  KTDGTVDRYKARLVAKGYSQQEGIDF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.3e-6230.58Show/hide
Query:  LRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKL---QF
        L  KQ R++F  ++    +I D+V+ DV GP    +  G KYF+T +DD SR  W +++ +K     +  +F  LV  +  + +K  RSDN  +    +F
Subjt:  LRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKL---QF

Query:  KEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTL
        +E+ ++ G  H+ +   TPQ N VAER ++ ++   R++L  +K+P  FWG+ V TA YLINR P+  L  + P  +   + V YS L+VFGC  +A   
Subjt:  KEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTL

Query:  ANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHV-LPCSIVDPVALHEANNLLDSQEHQEPLIFP
           R+K D ++ PC+F+GY     GYRL D V++++I SRDVVF E++    + D+S +  + +  + V +P +  +P +     + +  Q  Q     P
Subjt:  ANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHV-LPCSIVDPVALHEANNLLDSQEHQEPLIFP

Query:  GSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSACLYPIDDYLSYDHFSTTHKHFILNVSAAYEP
        G   +      +   +V +P Q                      H  LRRS RP V+           +  YP  +Y+               +S   EP
Subjt:  GSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSACLYPIDDYLSYDHFSTTHKHFILNVSAAYEP

Query:  SYFHQAIKF---DHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF
            + +     +   +AM  E+ ++++  T+ +V LP GK  + CKWV++ K   D  + RYKARLV KG+ Q++GIDF  +F
Subjt:  SYFHQAIKF---DHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF

P92520 Uncharacterized mitochondrial protein AtMg008201.6e-1650.63Show/hide
Query:  EPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF
        EP     A+K   W +AM  E+ A+ R  TW +VP P  ++I+GCKWV++ K  +DGT+DR KARLVAKG+ Q+EGI F
Subjt:  EPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-5529.4Show/hide
Query:  PCLLSETICSVSMVLGMIVLDTSH--------LRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHI
        P +L+  I + S    + VL+ SH        L  K  ++ F  +   ++   + ++ DVW      ++  Y+Y++  VD  +RYTW + +  KS     
Subjt:  PCLLSETICSVSMVLGMIVLDTSH--------LRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHI

Query:  IPRFFQLVLTQFNKTIKVFRSDNAPK-LQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLK
           F  L+  +F   I  F SDN  + +   E+F+  G  H  S   TP+ N ++ERKH+H++     LL  + +P  +W      A YLINR+P PLL+
Subjt:  IPRFFQLVLTQFNKTIKVFRSDNAPK-LQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLK

Query:  HKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAI-------DT
         ++PF+ L   S +Y  LRVFGC CY      N+ K D +++ CVFLGYS     Y    +   +L ISR V F EN FPF +   +   +         
Subjt:  HKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAI-------DT

Query:  LFSDH--------VLPC-SIVDPVALHEA-----------------NNLLDS----------------QEHQEPLIFP-GSSTDFVAAQPDAQTDVPNPA
        ++S H        VLP  S  DP   H A                 ++ LDS                Q   +P   P  + T   ++Q  +Q +  N +
Subjt:  LFSDH--------VLPC-SIVDPVALHEA-----------------NNLLDS----------------QEHQEPLIFP-GSSTDFVAAQPDAQTDVPNPA

Query:  QDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRP-------HVKPSFLNQYHCNSAC---LYPIDDYLSYDHFSTTHKHFI-LNVSAAYEPSYFHQAIKFD
           + Q              P+T  S   ST P       H  P      + N+      + +             K+ + ++++A  EP    QA+K +
Subjt:  QDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRP-------HVKPSFLNQYHCNSAC---LYPIDDYLSYDHFSTTHKHFI-LNVSAAYEPSYFHQAIKFD

Query:  HWKEAMDSEIRAMERTSTWTIVPLPPGK-HIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF
         W+ AM SEI A     TW +VP PP    IVGC+W++  KY +DG+++RYKARLVAKGY+Q+ G+D+
Subjt:  HWKEAMDSEIRAMERTSTWTIVPLPPGK-HIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-5925.28Show/hide
Query:  LMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGN---LVPSTSSDQITLLAAEASKKQNNNRFRRN--------------------DNQRPV
        L  L D Y  V  QI   +  P +T++   +I  E + +A N   +VP T ++ +T      ++ QNN    RN                    DN++P 
Subjt:  LMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGN---LVPSTSSDQITLLAAEASKKQNNNRFRRN--------------------DNQRPV

Query:  -----CSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAVTQPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHICHQFSLFQNWRRVY
             C  C+V+GH+  +C ++H     ++S   +  ST         QP++N           +  ++ +N++ W+LDSGA  HI   F+   ++ + Y
Subjt:  -----CSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAVTQPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHICHQFSLFQNWRRVY

Query:  ----------GITV--------VLPT----------------------------TYRMSVEFMG---DIQDKNRLMMIGRAESSNGLY--------ILLP
                  G T+         LPT                            T R+SVEF      ++D N  + + + ++ + LY         +  
Subjt:  ----------GITV--------VLPT----------------------------TYRMSVEFMG---DIQDKNRLMMIGRAESSNGLY--------ILLP

Query:  PDKPC-----------LLSETICSVSMVL---GMIVLDTSH--------LRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYA--GYKYFLTLVDD
           PC           L   ++  ++ V+    + VL+ SH           K  ++ F  +   +S   + ++ DVW    +P  +   Y+Y++  VD 
Subjt:  PDKPC-----------LLSETICSVSMVL---GMIVLDTSH--------LRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYA--GYKYFLTLVDD

Query:  CSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPK-LQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWG
         +RYTW + +  KS        F  LV  +F   I    SDN  + +  +++ +  G  H  S   TP+ N ++ERKH+H++ +   LL  + VP  +W 
Subjt:  CSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPK-LQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWG

Query:  DCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPF
             A YLINR+P PLL+ ++PF+ L  +  +Y  L+VFGC CY      NR K + ++K C F+GYS     Y    I   +L  SR V F E  FPF
Subjt:  DCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPF

Query:  HSTDVSAEAIDTLFSDH---------------VLPC---------------SIVDPVALHE-ANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPA
         +T+          SD                VLP                S   P+   + +++ L S     P     ++      QP AQ   P+  
Subjt:  HSTDVSAEAIDTLFSDH---------------VLPC---------------SIVDPVALHE-ANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPA

Query:  QDSVVQPDLVDLVDPE--VVNQPSTHVSLRRS--TRPHVKPSFLNQYHCNSAC-----------LYPIDDYLSYDHFSTTHKH-----------------
        Q+S     +++  +P     N P+ +  L +S  + PH+     +    NS             + P    +  +  +  + H                 
Subjt:  QDSVVQPDLVDLVDPE--VVNQPSTHVSLRRS--TRPHVKPSFLNQYHCNSAC-----------LYPIDDYLSYDHFSTTHKH-----------------

Query:  -FILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIV-PLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF
         +  +++A  EP    QA+K D W++AM SEI A     TW +V P PP   IVGC+W++  K+ +DG+++RYKARLVAKGY+Q+ G+D+
Subjt:  -FILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIV-PLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.8e-3449.32Show/hide
Query:  VNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC---LYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLP
        V +PS H S RR+     KP++L  Y+C+S     ++ I  +LSY+  S  +  F++ ++ A EPS +++A +F  W  AMD EI AME T TW I  LP
Subjt:  VNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC---LYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLP

Query:  PGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF
        P K  +GCKWVY+ KY +DGT++RYKARLVAKGY+QQEGIDF   F
Subjt:  PGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-1750.63Show/hide
Query:  EPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF
        EP     A+K   W +AM  E+ A+ R  TW +VP P  ++I+GCKWV++ K  +DGT+DR KARLVAKG+ Q+EGI F
Subjt:  EPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTTTGGAGTCATCTCGGTGTCTTGGACGGCGAGGAGGTGAAAAGACCCAATTGTGCTGAAGTGAACCTGAAGAATGACACCACACGCCGAAGATTGAAGCCCCG
TACCGAGAAGAAACGAGACAAGACCGAGAAAATAAATAGTAAGATAAGGATCAGGGACGACGTCTCGACGTCGACCCGATTTTCATCTGGAATATGGGACTCTGTTGTCC
AAGGCTATGGAGATGCACGTACATTACTGGAGGATATGGCCACAAACAGCTATCAGTGGCCATCTGAGCAGTCTACACCAAAGAAGGTTGCAGCTGGGGTGTTCGAGATC
GATAATGTAAGTGCCCTCCAAGCTCAAATGTCATCCCTTGCTAATGCTTTATTGAAGTTTTCAGAGTCAAGTAATAGGACCAATAAGCTTGAGGAGGCAATGATCGCCAT
AACACCACAGTGTTTGGCCACAGTAAGGCCTCAGCTGAGCAGGAGAGACCCCAAATGGAGTACTGTAAGGCCATCACTGTGCACTAGGAGGAGGAGATTGAAAAAGCTTA
GGAACGTGAGACTGATGAGTATGATACTCCCACTAGAGAAGCTGAGGAGGGCATATCCTCAGACGAAGCTGCAAAGCTTGACCCAAGCCCCTATCCCTTCTCCTACTATT
TTGGTTCCTAAAAGGAAGAAAAAGAAGAAGAAAACAATCAGAAGCATAAGAGATGCCTTAGTACGACAAATTCATGAAGGAATGACTTTCAAAGAAGAAGAAGGAAAAGC
AGGTGCTAGTATCAATATTATTCCTTTATCTTTATGTAAGAAGTTGAATATAGGAGATATTAAATCTACCCCTGTTAAACTGCAATTAGTTGATCAATCTGTGACAATTT
GGCAAGATTTGGTTGATTATCGTCCTACATACGATTGTTCCTGTGGAGGAATTAAGCCAATCATACAACACATGGAGTCTGAGTTCGTGATGATCTTCTTGATGGGACTC
AACGATTCATACTCCTCCGTTCGCGCCCAGATTCTTTTAATGAATCCTATTCCTGACATTACTAAGGTGTTTTCACTGGTTATACAAGAAGAGCGTCAGAGAATTGCTGG
TAATCTTGTGCCATCTACTTCCTCTGATCAGATTACTCTTCTTGCTGCTGAAGCCTCCAAGAAACAAAATAATAATCGCTTTAGGAGAAATGATAATCAAAGGCCTGTTT
GTTCTCATTGCAATGTCAAAGGTCATACAGTGGATAAATGCTACAAAATACATGGTTATCCACCTGGCTACCGGTCTCGGAATACTAAAGCTTCGTCTACTAAGGCTGTT
GAAGCAAACGCTGTTACTCAGCCTCAGTCAAATTTTTTCTCAAGCCTCAACCAAACTCAATACAGTATTTGCTCCTCTGCTGTGCATAATTCTAGTGCTTGGATTTTAGA
TTCTGGTGCAGCTCGACACATATGTCATCAGTTTTCTTTGTTTCAAAATTGGCGTCGGGTTTATGGAATTACTGTTGTTCTTCCTACTACCTATCGTATGAGTGTTGAGT
TTATGGGAGATATTCAGGACAAGAATCGCTTGATGATGATTGGCAGGGCTGAGTCTTCTAATGGCCTTTATATTTTGCTTCCTCCAGATAAGCCTTGTTTACTTTCTGAA
ACTATATGTTCTGTTAGTATGGTACTTGGCATGATCGTCTTGGACACATCTCACCTCAGAGCTAAGCAACGGAGGCTTGCTTTCCCTTTTAACAATCATGTCGCATCTGA
TATTTTTGATGTTGTCCATTGTGATGTTTGGGGACCCTTTAGAACCCCCACTTATGCTGGTTACAAATATTTTTTGACACTAGTCGATGACTGTTCGAGGTATACATGGA
CTTTCTTAATGCATTCCAAATCTGATGCTATCCATATTATACCTCGTTTCTTCCAGCTTGTCCTTACCCAATTTAATAAAACCATTAAGGTTTTTCGTTCAGACAATGCC
CCTAAGCTTCAGTTTAAGGAATTCTTTGCTACAAAGGGAACGGTTCATCAATTCTCTTGCATTGAAACTCCCCAACAAAATTCTGTGGCTGAAAGGAAACACCAGCACCT
CCTTAACGTTGCCAGAGCTTTACTCTTTCAGTCCAAAGTTCCCCTCAGATTCTGGGGAGATTGTGTGTTAACAGCCACATACCTCATCAATCGGATTCCAGCTCCCTTGT
TAAAGCATAAGACTCCTTTTGAACTCTTGCACAAACGATCTGTTGATTACTCTGGACTCCGAGTCTTTGGTTGTCTATGTTATGCTTCTACGCTTGCTAACAACCGTTCA
AAGTTTGATCCTCGTGCCAAACCTTGTGTGTTTCTTGGCTATTCGCCTGGTGTTAAAGGGTATCGATTGTCTGATATCGTTAGGAGACAACTTATCATATCTCGGGACGT
TGTTTTCTTCGAAAATAAGTTTCCTTTTCATTCAACTGATGTCTCCGCTGAGGCCATAGATACTTTATTCTCTGATCATGTTTTGCCATGCTCAATCGTGGATCCAGTTG
CTTTACATGAAGCAAATAATCTTTTGGATTCTCAAGAACATCAGGAGCCTTTGATATTTCCAGGTTCATCTACTGATTTTGTTGCAGCACAACCTGATGCTCAAACTGAT
GTTCCTAATCCTGCACAGGATTCTGTTGTACAACCTGATTTGGTTGATTTGGTTGATCCTGAGGTTGTTAATCAGCCATCTACTCATGTTTCTTTGAGGCGGTCCACTCG
TCCCCATGTCAAGCCAAGCTTTCTCAATCAGTACCATTGCAACTCAGCTTGCTTATATCCTATTGATGATTATTTGTCCTATGATCATTTTTCTACAACACATAAGCATT
TCATTCTGAATGTGTCTGCTGCTTATGAGCCATCTTACTTCCATCAAGCTATTAAATTTGATCATTGGAAGGAGGCTATGGACTCTGAAATTCGTGCAATGGAACGTACT
TCTACGTGGACTATTGTTCCTTTACCTCCTGGCAAGCACATTGTTGGATGTAAATGGGTCTATCGGAATAAATATAAAACTGATGGTACCGTAGACCGTTATAAGGCCCG
GCTCGTTGCCAAGGGTTACAGTCAACAAGAGGGCATTGATTTTTTTATACTTTTTCCCCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTTTGGAGTCATCTCGGTGTCTTGGACGGCGAGGAGGTGAAAAGACCCAATTGTGCTGAAGTGAACCTGAAGAATGACACCACACGCCGAAGATTGAAGCCCCG
TACCGAGAAGAAACGAGACAAGACCGAGAAAATAAATAGTAAGATAAGGATCAGGGACGACGTCTCGACGTCGACCCGATTTTCATCTGGAATATGGGACTCTGTTGTCC
AAGGCTATGGAGATGCACGTACATTACTGGAGGATATGGCCACAAACAGCTATCAGTGGCCATCTGAGCAGTCTACACCAAAGAAGGTTGCAGCTGGGGTGTTCGAGATC
GATAATGTAAGTGCCCTCCAAGCTCAAATGTCATCCCTTGCTAATGCTTTATTGAAGTTTTCAGAGTCAAGTAATAGGACCAATAAGCTTGAGGAGGCAATGATCGCCAT
AACACCACAGTGTTTGGCCACAGTAAGGCCTCAGCTGAGCAGGAGAGACCCCAAATGGAGTACTGTAAGGCCATCACTGTGCACTAGGAGGAGGAGATTGAAAAAGCTTA
GGAACGTGAGACTGATGAGTATGATACTCCCACTAGAGAAGCTGAGGAGGGCATATCCTCAGACGAAGCTGCAAAGCTTGACCCAAGCCCCTATCCCTTCTCCTACTATT
TTGGTTCCTAAAAGGAAGAAAAAGAAGAAGAAAACAATCAGAAGCATAAGAGATGCCTTAGTACGACAAATTCATGAAGGAATGACTTTCAAAGAAGAAGAAGGAAAAGC
AGGTGCTAGTATCAATATTATTCCTTTATCTTTATGTAAGAAGTTGAATATAGGAGATATTAAATCTACCCCTGTTAAACTGCAATTAGTTGATCAATCTGTGACAATTT
GGCAAGATTTGGTTGATTATCGTCCTACATACGATTGTTCCTGTGGAGGAATTAAGCCAATCATACAACACATGGAGTCTGAGTTCGTGATGATCTTCTTGATGGGACTC
AACGATTCATACTCCTCCGTTCGCGCCCAGATTCTTTTAATGAATCCTATTCCTGACATTACTAAGGTGTTTTCACTGGTTATACAAGAAGAGCGTCAGAGAATTGCTGG
TAATCTTGTGCCATCTACTTCCTCTGATCAGATTACTCTTCTTGCTGCTGAAGCCTCCAAGAAACAAAATAATAATCGCTTTAGGAGAAATGATAATCAAAGGCCTGTTT
GTTCTCATTGCAATGTCAAAGGTCATACAGTGGATAAATGCTACAAAATACATGGTTATCCACCTGGCTACCGGTCTCGGAATACTAAAGCTTCGTCTACTAAGGCTGTT
GAAGCAAACGCTGTTACTCAGCCTCAGTCAAATTTTTTCTCAAGCCTCAACCAAACTCAATACAGTATTTGCTCCTCTGCTGTGCATAATTCTAGTGCTTGGATTTTAGA
TTCTGGTGCAGCTCGACACATATGTCATCAGTTTTCTTTGTTTCAAAATTGGCGTCGGGTTTATGGAATTACTGTTGTTCTTCCTACTACCTATCGTATGAGTGTTGAGT
TTATGGGAGATATTCAGGACAAGAATCGCTTGATGATGATTGGCAGGGCTGAGTCTTCTAATGGCCTTTATATTTTGCTTCCTCCAGATAAGCCTTGTTTACTTTCTGAA
ACTATATGTTCTGTTAGTATGGTACTTGGCATGATCGTCTTGGACACATCTCACCTCAGAGCTAAGCAACGGAGGCTTGCTTTCCCTTTTAACAATCATGTCGCATCTGA
TATTTTTGATGTTGTCCATTGTGATGTTTGGGGACCCTTTAGAACCCCCACTTATGCTGGTTACAAATATTTTTTGACACTAGTCGATGACTGTTCGAGGTATACATGGA
CTTTCTTAATGCATTCCAAATCTGATGCTATCCATATTATACCTCGTTTCTTCCAGCTTGTCCTTACCCAATTTAATAAAACCATTAAGGTTTTTCGTTCAGACAATGCC
CCTAAGCTTCAGTTTAAGGAATTCTTTGCTACAAAGGGAACGGTTCATCAATTCTCTTGCATTGAAACTCCCCAACAAAATTCTGTGGCTGAAAGGAAACACCAGCACCT
CCTTAACGTTGCCAGAGCTTTACTCTTTCAGTCCAAAGTTCCCCTCAGATTCTGGGGAGATTGTGTGTTAACAGCCACATACCTCATCAATCGGATTCCAGCTCCCTTGT
TAAAGCATAAGACTCCTTTTGAACTCTTGCACAAACGATCTGTTGATTACTCTGGACTCCGAGTCTTTGGTTGTCTATGTTATGCTTCTACGCTTGCTAACAACCGTTCA
AAGTTTGATCCTCGTGCCAAACCTTGTGTGTTTCTTGGCTATTCGCCTGGTGTTAAAGGGTATCGATTGTCTGATATCGTTAGGAGACAACTTATCATATCTCGGGACGT
TGTTTTCTTCGAAAATAAGTTTCCTTTTCATTCAACTGATGTCTCCGCTGAGGCCATAGATACTTTATTCTCTGATCATGTTTTGCCATGCTCAATCGTGGATCCAGTTG
CTTTACATGAAGCAAATAATCTTTTGGATTCTCAAGAACATCAGGAGCCTTTGATATTTCCAGGTTCATCTACTGATTTTGTTGCAGCACAACCTGATGCTCAAACTGAT
GTTCCTAATCCTGCACAGGATTCTGTTGTACAACCTGATTTGGTTGATTTGGTTGATCCTGAGGTTGTTAATCAGCCATCTACTCATGTTTCTTTGAGGCGGTCCACTCG
TCCCCATGTCAAGCCAAGCTTTCTCAATCAGTACCATTGCAACTCAGCTTGCTTATATCCTATTGATGATTATTTGTCCTATGATCATTTTTCTACAACACATAAGCATT
TCATTCTGAATGTGTCTGCTGCTTATGAGCCATCTTACTTCCATCAAGCTATTAAATTTGATCATTGGAAGGAGGCTATGGACTCTGAAATTCGTGCAATGGAACGTACT
TCTACGTGGACTATTGTTCCTTTACCTCCTGGCAAGCACATTGTTGGATGTAAATGGGTCTATCGGAATAAATATAAAACTGATGGTACCGTAGACCGTTATAAGGCCCG
GCTCGTTGCCAAGGGTTACAGTCAACAAGAGGGCATTGATTTTTTTATACTTTTTCCCCTGTAG
Protein sequenceShow/hide protein sequence
MEFWSHLGVLDGEEVKRPNCAEVNLKNDTTRRRLKPRTEKKRDKTEKINSKIRIRDDVSTSTRFSSGIWDSVVQGYGDARTLLEDMATNSYQWPSEQSTPKKVAAGVFEI
DNVSALQAQMSSLANALLKFSESSNRTNKLEEAMIAITPQCLATVRPQLSRRDPKWSTVRPSLCTRRRRLKKLRNVRLMSMILPLEKLRRAYPQTKLQSLTQAPIPSPTI
LVPKRKKKKKKTIRSIRDALVRQIHEGMTFKEEEGKAGASINIIPLSLCKKLNIGDIKSTPVKLQLVDQSVTIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGL
NDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAV
EANAVTQPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMMIGRAESSNGLYILLPPDKPCLLSE
TICSVSMVLGMIVLDTSHLRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNA
PKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRS
KFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTD
VPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSACLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERT
STWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILFPL