; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015847 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015847
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr12:27119675..27128343
RNA-Seq ExpressionLag0015847
SyntenyLag0015847
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.5e-6131.3Show/hide
Query:  AFETLKAALISAPILCAPNWSFPFEVMCDASDVA------------------------------------------------------------------
        AF  LK  LISAPI+  P+WSFPFE+MCDASD A                                                                  
Subjt:  AFETLKAALISAPILCAPNWSFPFEVMCDASDVA------------------------------------------------------------------

Query:  --------------------EFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGP-AATLFFYLQP
                            EFDLEI+D+KG+EN +ADHLS L   +   E ++I+D+FPD++L A+                  +D P  A +  YL  
Subjt:  --------------------EFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGP-AATLFFYLQP

Query:  PQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKG---P
           P F+L  +      F      +  +    P L    P  +    V        + ++D  EQ  A        SP        R+ +  LQ G   P
Subjt:  PQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKG---P

Query:  NYGHRILRVRLNHAERWIW-CDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVTRF
        N            A  ++  CD      C+   +++      +   L   + D           V G   +GP   F PSF         G M  LV   
Subjt:  NYGHRILRVRLNHAERWIW-CDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVTRF

Query:  FELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQA
                                   +DYVSKWVE  A   ND+K+V  F++  IF RFGTPRA++SD GTHF N     LL KYG+KH+I+ PYHPQ 
Subjt:  FELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQA

Query:  NGQAEDS--------------------------------AYKTPLG------------------------AI-------------RMLQLNELEEFRQFS
        +GQ E S                                AYKTP+G                        AI             R+LQLNEL+EFR  +
Subjt:  NGQAEDS--------------------------------AYKTPLG------------------------AI-------------RMLQLNELEEFRQFS

Query:  YENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGE
        YENAK+YKEK KRWH K I  + F  GQ VLL+NSRLKLFPGKLK +WSGPF + EVFPHGA+ L ++     FKVN QR+KHYWGE
Subjt:  YENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGE

PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]7.2e-0378.57Show/hide
Query:  MKEVVKKEAIKWLDVGIIYPIADSNWIT
        MKEVVKKE IKWLD GIIYPI+DS+W++
Subjt:  MKEVVKKEAIKWLDVGIIYPIADSNWIT

PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.5e-6129.97Show/hide
Query:  MKEVVKKEAIKWLDVGIIYPIADSNWITIASEDQEKNHFHLPLWDICFQAYAFWPLQCSSNILAVKAFETLKAALISAPILCAPNWSFPFEVMCDASDVA
        MKEVV+ E +KWLD GIIYPI+DS+WI+     +                YA++          +  +  LK  L+SAPI+ AP+WS PFE+MCDASD A
Subjt:  MKEVVKKEAIKWLDVGIIYPIADSNWITIASEDQEKNHFHLPLWDICFQAYAFWPLQCSSNILAVKAFETLKAALISAPILCAPNWSFPFEVMCDASDVA

Query:  -----------------------------------------------------EFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAV
                                                             EFDLEI+DK+G ENV+ADHLS L   S   ++  I++SFPD++L  V
Subjt:  -----------------------------------------------------EFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAV

Query:  TTTKPAFNDGQKKRGGAFNDGPAATLFFYLQPPQRPFFNLRR--RDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDTGEQA
        +   P + D                L   + PP   +   ++  RD     +  P         A  +++   PQ     +     L     ++  G  +
Subjt:  TTTKPAFNDGQKKRGGAFNDGPAATLFFYLQPPQRPFFNLRR--RDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDTGEQA

Query:  VANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGL
         +N +   W     ++ FC  +     ++  +   R  RV  N ++R       +L       +V L  L               W          G   
Subjt:  VANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGL

Query:  LGPKSGFSPSFFSGVVANSEGRMIGLVTRFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDE
        +GP   F  SF      N++  ++ +V                                YVSKWVE  A   ND+++V RF++  IF+RFG PRA++SDE
Subjt:  LGPKSGFSPSFFSGVVANSEGRMIGLVTRFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDE

Query:  GTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE--------------------------------DSAYKTPLG------------------------
         +HF N     LL KYG+ H++ + YHPQ NGQ E                                 +A+KTPLG                        
Subjt:  GTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE--------------------------------DSAYKTPLG------------------------

Query:  AI-------------RMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKD
        AI             R+LQL+ELEEFR  +YEN ++YKEKTK WH K+++ + F  GQ+VLL+NSRLKLFPGKL+ +WSGPF V +V+P+GA+ +R E  
Subjt:  AI-------------RMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKD

Query:  GRVFKVNGQRVKHY
        G  FKVNGQR+K Y
Subjt:  GRVFKVNGQRVKHY

XP_009784499.1 PREDICTED: uncharacterized protein LOC104232916 [Nicotiana sylvestris]3.4e-6929.88Show/hide
Query:  MKEVVKKEAIKWLDVGIIYPIADSNWITIASEDQEKNHFHLPLWDICFQAYAFWPLQCSSNIL-----------AVKAFETLKAALISAPILCAPNWSFP
        MKEVV+KE IKWL+ GI++PI+DS W++      +K    +    +  +     P +  + +L            +KAFE LK  L+ API+ AP+W  P
Subjt:  MKEVVKKEAIKWLDVGIIYPIADSNWITIASEDQEKNHFHLPLWDICFQAYAFWPLQCSSNIL-----------AVKAFETLKAALISAPILCAPNWSFP

Query:  FEVMCDASDVA---------------------------------EFDL---------------------EIKDKKGSENVLADHLSCLVPSSFLPEQSVI
        FE+MCDASD+A                                 E  L                     EI+D+KG+EN + DHLS L   + + E   I
Subjt:  FEVMCDASDVA---------------------------------EFDL---------------------EIKDKKGSENVLADHLSCLVPSSFLPEQSVI

Query:  SDSFPDKKLFAVT-TTKPAFNDGQKKRGGAFNDGPAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTF
         ++FPD++L A+T +T P + D                           + N                                                
Subjt:  SDSFPDKKLFAVT-TTKPAFNDGQKKRGGAFNDGPAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTF

Query:  VLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPK-QLSPIIIDCSWSS
                   +A+ +    ++   R RF                       L+    +IW +  +  +C        L    +PK ++S I+  C  SS
Subjt:  VLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPK-QLSPIIIDCSWSS

Query:  SGSKIHVIGHGLLGPKSGFS-PSFF----SGVVANSEGRMIGLVTRFFELLDICLLCDEAKEI----------LEQCHSSSYGEIDYVSKWVEVIACHQN
         G             +SGF  P+FF    + V      + IG +T+  E+    +L  E  ++              H      +DYVSKWVEVIA   N
Subjt:  SGSKIHVIGHGLLGPKSGFS-PSFF----SGVVANSEGRMIGLVTRFFELLDICLLCDEAKEI----------LEQCHSSSYGEIDYVSKWVEVIACHQN

Query:  DAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAEDS--------------------------------AYKT
        DAK+V  F++  IF  FGTPR L+ D GTHF N +L  +L KYG+KH+++  YHPQ +GQ + S                                AYKT
Subjt:  DAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAEDS--------------------------------AYKT

Query:  PL-------------------------------------GAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGK
        P+                                     G  R+LQL+ELEEFR  +YENAK+YKEKTKRWH K+I+ +EF  GQ VLL+NSRLKLFP K
Subjt:  PL-------------------------------------GAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGK

Query:  LKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYW-GEEFCLKYPS
        LK  WSGPF+V  V PHGA+ LRD      F VNGQR+KHYW G+E  +K PS
Subjt:  LKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYW-GEEFCLKYPS

XP_019241380.1 PREDICTED: uncharacterized protein LOC109221357 [Nicotiana attenuata]5.3e-6230.81Show/hide
Query:  MKEVVKKEAIKWLDVGIIYPIADSNWIT-------------IASEDQE------------------------KNHFHLPLWDICFQAYAFWPLQCSSNIL
        MKEVVKKE I  LD GII+PI+DSN ++             + +E  E                        K+HF LP  D          +  + +  
Subjt:  MKEVVKKEAIKWLDVGIIYPIADSNWIT-------------IASEDQE------------------------KNHFHLPLWDICFQAYAFWPLQCSSNIL

Query:  AVKAFETLKAALISAPILCAPNWSFPFEVMCDASDVAEFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGG
         + AFE LK  L++API+ AP+WS PFE+MCD                                       ++ +  +K+L AV      F         
Subjt:  AVKAFETLKAALISAPILCAPNWSFPFEVMCDASDVAEFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGG

Query:  AFNDGPAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDT--GEQAVANNLRTAWVSPRRRA
        A+  G    +     P  R  F  +              S +       LLQ                  F + + D    E  VA++L           
Subjt:  AFNDGPAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDT--GEQAVANNLRTAWVSPRRRA

Query:  RFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIP-KQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGV
           +    + +++G     +  R R  H   + + D    Y+ K   D   L    IP K++  ++ DC  S  G       HG     +    S F G 
Subjt:  RFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIP-KQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGV

Query:  VANSEGRMIGLVTRFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVK
           S G    L+                              +DYVSKWVE IA   NDA +V+ F++  IF+RFGTPRAL+SDEGTHF N +L+ LL K
Subjt:  VANSEGRMIGLVTRFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVK

Query:  YGIKHRITIPYHPQANGQAEDS--------------------------------AYKTPLGAI-------------------------------------
        YG+ HR+   YHPQ +GQAE S                                AYKTP+GA                                      
Subjt:  YGIKHRITIPYHPQANGQAEDS--------------------------------AYKTPLGAI-------------------------------------

Query:  RMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYW
        R++QLNEL+EFR  SYENAK+YKEKTKRWH K+IKP+ F   Q+VLL+NSRL+LFPGKLK +WSGPF V  V P+GAI LR     R F VNG RVKHYW
Subjt:  RMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYW

Query:  G
        G
Subjt:  G

XP_038687540.1 uncharacterized protein LOC119986923 [Tripterygium wilfordii]5.6e-6429.6Show/hide
Query:  MKEVVKKEAIKWLDVGIIYPIADSNWITIASE-DQEKNHFHLPLWDICFQAYAFWPLQCSSNILAVKAFETLKAALISAPILCAPNWSFPFEVMCDASD-
        MKEVV+ E +K LDVG+IYPI+DS W+    E DQ   H                            AF TLK  LISAPI+  P+W+ PFE+MCDASD 
Subjt:  MKEVVKKEAIKWLDVGIIYPIADSNWITIASE-DQEKNHFHLPLWDICFQAYAFWPLQCSSNILAVKAFETLKAALISAPILCAPNWSFPFEVMCDASD-

Query:  ------------------VAEFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGPAATLFFYLQPP
                           AEFD+E++DKKG+EN++ADHLS L  +    +   ++++FPD+KLF +    P + D                +  YL   
Subjt:  ------------------VAEFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGPAATLFFYLQPP

Query:  QRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGH
        + P                PN                                                     +S ++R +F                 
Subjt:  QRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGH

Query:  RILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVTRFFELLD
              L + + + W +  +   C           +QI ++  P I                                                      
Subjt:  RILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVTRFFELLD

Query:  ICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE
              E   +LE  H  ++  ++YVSKWVE +A   ND K+V +FLQ  IF RFG PRA++SD G HFVN     LL KY   H++  PYHPQ +GQ E
Subjt:  ICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE

Query:  DS--------------------------------AYKTPLG------------------------AI-------------RMLQLNELEEFRQFSYENAK
         S                                AY+TP+G                        AI             R LQLNELEE R  +YENA 
Subjt:  DS--------------------------------AYKTPLG------------------------AI-------------RMLQLNELEEFRQFSYENAK

Query:  MYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHY
        +YKE+TK +H K I  KEF+ GQ+VLL+NSRL+LFPGKLK +W GPF+V +V PHGA+ +++ KDG +FKVNG R+K Y
Subjt:  MYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHY

TrEMBL top hitse value%identityAlignment
A0A1B5Z879 Reverse transcriptase (Fragment)4.1e-6028.41Show/hide
Query:  MKEVVKKEAIKWLDVGIIYPIADSNWIT-------------IASEDQE------------------------KNHFHLPLWDICFQA------YAFWPLQ
        MKEV++KE +K L+ G+IYPI+DS+W++             + +E  E                        K+HF LP  D   +       Y F    
Subjt:  MKEVVKKEAIKWLDVGIIYPIADSNWIT-------------IASEDQE------------------------KNHFHLPLWDICFQA------YAFWPLQ

Query:  CSSNILAVKAFETLKAA--------------------------LISAPILCAPNWSFPFEVMCDASDVA-------------------------------
           N +AV   +  K A                          L++AP++ AP+WS PFE+MCDASD+A                               
Subjt:  CSSNILAVKAFETLKAA--------------------------LISAPILCAPNWSFPFEVMCDASDVA-------------------------------

Query:  -----------EFDL---------------------EIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDG
                    FD                       I+DKKGSEN +ADHLS L       ++  I D F D+ + AVT+  P F D      G     
Subjt:  -----------EFDL---------------------EIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDG

Query:  PAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQL---LFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCK
           T+ +     QR  F     D     +  P       G A  L + +A  L   LF+P +     T+V   D                   R  R   
Subjt:  PAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQL---LFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCK

Query:  RSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSE
         S+ N + + P     +L V +                    W +  +                 + SS SKI+++                   VA   
Subjt:  RSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSE

Query:  GRMIGLVTRFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKH
                                             ++YVSKWVE IA   NDA++V  FL+  IF+ FG PRAL+SDEGTHF+N  +  LL KY + H
Subjt:  GRMIGLVTRFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKH

Query:  RITIPYHPQANGQAEDS--------------------------------AYKTPL-------------------------------------GAIRMLQL
        RI+ PYHPQ +GQ E S                                A+KTP+                                     G  R+LQL
Subjt:  RITIPYHPQANGQAEDS--------------------------------AYKTPL-------------------------------------GAIRMLQL

Query:  NELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGEEF
        +EL+EFR ++YENAK++KEKTK+WH K I+ +EF +GQ VLL+NSRLKLFPGKLK +WSGPF V +VFP+GAI ++D    R FKVNGQR+K Y+G EF
Subjt:  NELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGEEF

A0A1U7XC36 uncharacterized protein LOC1042329161.6e-6929.88Show/hide
Query:  MKEVVKKEAIKWLDVGIIYPIADSNWITIASEDQEKNHFHLPLWDICFQAYAFWPLQCSSNIL-----------AVKAFETLKAALISAPILCAPNWSFP
        MKEVV+KE IKWL+ GI++PI+DS W++      +K    +    +  +     P +  + +L            +KAFE LK  L+ API+ AP+W  P
Subjt:  MKEVVKKEAIKWLDVGIIYPIADSNWITIASEDQEKNHFHLPLWDICFQAYAFWPLQCSSNIL-----------AVKAFETLKAALISAPILCAPNWSFP

Query:  FEVMCDASDVA---------------------------------EFDL---------------------EIKDKKGSENVLADHLSCLVPSSFLPEQSVI
        FE+MCDASD+A                                 E  L                     EI+D+KG+EN + DHLS L   + + E   I
Subjt:  FEVMCDASDVA---------------------------------EFDL---------------------EIKDKKGSENVLADHLSCLVPSSFLPEQSVI

Query:  SDSFPDKKLFAVT-TTKPAFNDGQKKRGGAFNDGPAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTF
         ++FPD++L A+T +T P + D                           + N                                                
Subjt:  SDSFPDKKLFAVT-TTKPAFNDGQKKRGGAFNDGPAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTF

Query:  VLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPK-QLSPIIIDCSWSS
                   +A+ +    ++   R RF                       L+    +IW +  +  +C        L    +PK ++S I+  C  SS
Subjt:  VLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPK-QLSPIIIDCSWSS

Query:  SGSKIHVIGHGLLGPKSGFS-PSFF----SGVVANSEGRMIGLVTRFFELLDICLLCDEAKEI----------LEQCHSSSYGEIDYVSKWVEVIACHQN
         G             +SGF  P+FF    + V      + IG +T+  E+    +L  E  ++              H      +DYVSKWVEVIA   N
Subjt:  SGSKIHVIGHGLLGPKSGFS-PSFF----SGVVANSEGRMIGLVTRFFELLDICLLCDEAKEI----------LEQCHSSSYGEIDYVSKWVEVIACHQN

Query:  DAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAEDS--------------------------------AYKT
        DAK+V  F++  IF  FGTPR L+ D GTHF N +L  +L KYG+KH+++  YHPQ +GQ + S                                AYKT
Subjt:  DAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAEDS--------------------------------AYKT

Query:  PL-------------------------------------GAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGK
        P+                                     G  R+LQL+ELEEFR  +YENAK+YKEKTKRWH K+I+ +EF  GQ VLL+NSRLKLFP K
Subjt:  PL-------------------------------------GAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGK

Query:  LKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYW-GEEFCLKYPS
        LK  WSGPF+V  V PHGA+ LRD      F VNGQR+KHYW G+E  +K PS
Subjt:  LKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYW-GEEFCLKYPS

A0A2G9FWY3 Reverse transcriptase7.4e-6231.3Show/hide
Query:  AFETLKAALISAPILCAPNWSFPFEVMCDASDVA------------------------------------------------------------------
        AF  LK  LISAPI+  P+WSFPFE+MCDASD A                                                                  
Subjt:  AFETLKAALISAPILCAPNWSFPFEVMCDASDVA------------------------------------------------------------------

Query:  --------------------EFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGP-AATLFFYLQP
                            EFDLEI+D+KG+EN +ADHLS L   +   E ++I+D+FPD++L A+                  +D P  A +  YL  
Subjt:  --------------------EFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGP-AATLFFYLQP

Query:  PQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKG---P
           P F+L  +      F      +  +    P L    P  +    V        + ++D  EQ  A        SP        R+ +  LQ G   P
Subjt:  PQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKG---P

Query:  NYGHRILRVRLNHAERWIW-CDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVTRF
        N            A  ++  CD      C+   +++      +   L   + D           V G   +GP   F PSF         G M  LV   
Subjt:  NYGHRILRVRLNHAERWIW-CDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVTRF

Query:  FELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQA
                                   +DYVSKWVE  A   ND+K+V  F++  IF RFGTPRA++SD GTHF N     LL KYG+KH+I+ PYHPQ 
Subjt:  FELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQA

Query:  NGQAEDS--------------------------------AYKTPLG------------------------AI-------------RMLQLNELEEFRQFS
        +GQ E S                                AYKTP+G                        AI             R+LQLNEL+EFR  +
Subjt:  NGQAEDS--------------------------------AYKTPLG------------------------AI-------------RMLQLNELEEFRQFS

Query:  YENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGE
        YENAK+YKEK KRWH K I  + F  GQ VLL+NSRLKLFPGKLK +WSGPF + EVFPHGA+ L ++     FKVN QR+KHYWGE
Subjt:  YENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGE

A0A2G9FWY3 Reverse transcriptase3.5e-0378.57Show/hide
Query:  MKEVVKKEAIKWLDVGIIYPIADSNWIT
        MKEVVKKE IKWLD GIIYPI+DS+W++
Subjt:  MKEVVKKEAIKWLDVGIIYPIADSNWIT

A0A2G9FWY3 Reverse transcriptase1.1e-6052.7Show/hide
Query:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAEDS--------------------
        +DYVSKWVE +AC  +DAK+V++FL   IF RFGTPRAL+SDEG+HF+N V+A LL KY IKH+++  YHPQ NG AE S                    
Subjt:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAEDS--------------------

Query:  ------------AYKTPLGAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAIT
                    A+KTPLG +R LQL ELE+F + +Y+NAK+YKEKTK+WH   I P++F KGQ+VLL+NSRLKL PGKLK +  GPF+V+ VF HGA+ 
Subjt:  ------------AYKTPLGAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKLKCKWSGPFIVNEVFPHGAIT

Query:  LRDEKDGRVFKVNGQRVKHYWG
        +++   GR F VNGQR+KH++G
Subjt:  LRDEKDGRVFKVNGQRVKHYWG

A0A4Y1R6P1 Reverse transcriptase2.4e-6031.86Show/hide
Query:  AFETLKAALISAPILCAPNWSFPFEVMCDASDVA------------------------------------------------------------------
        AF  LK  L +API+  P+WS PFE+MCDASD A                                                                  
Subjt:  AFETLKAALISAPILCAPNWSFPFEVMCDASDVA------------------------------------------------------------------

Query:  --------------------EFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSV-ISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGP-AATLFFYLQ
                            EFDLEIKDKKGSENV+ADHLS LV  S   E S+ + +SFPD++LF++                + N  P  A +  YL 
Subjt:  --------------------EFDLEIKDKKGSENVLADHLSCLVPSSFLPEQSV-ISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGP-AATLFFYLQ

Query:  PPQRPFFNLRRRDSHSSTFSRP--NSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVL---HVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQ
          + P        +  STF R      +  +    P L    P  +    V       +L   H    G    A                 K++ S  LQ
Subjt:  PPQRPFFNLRRRDSHSSTFSRP--NSSSSTFGAATPLLQPSAPQLLFFPLVVVFPLTFVL---HVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQ

Query:  KGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVT
         G  +            + +++C +     C     +  L         + +IID           V G   +GP   F  S+                 
Subjt:  KGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVT

Query:  RFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHP
          FE + +                     +DYVSKWVE IA   NDAK+V  FL+  IF RFGTPRA++SD G+HFVN   A LL KYGI H++  PYHP
Subjt:  RFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHP

Query:  QANGQAEDS--------------------------------AYKTPL--GAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLY
        Q +GQ E S                                AYKTP+  G  R LQLNELEE R  +YENAK+YKEKTK++H K I  K F KGQ+VLL+
Subjt:  QANGQAEDS--------------------------------AYKTPL--GAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLY

Query:  NSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGEEF
        NSRLKLFPGKL+ +W GPF++  VF HGA+ +++ KDG  FKVNG R+K Y+   F
Subjt:  NSRLKLFPGKLKCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGEEF

SwissProt top hitse value%identityAlignment
O92815 Gag-Pol polyprotein8.3e-1038.46Show/hide
Query:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE
        ID  SKW E+I C++ DAK V   L   I  R+G P  + SD+GTHF   +  +L    G+  ++  P HP+++G  E
Subjt:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE

P10272 Gag-Pol polyprotein1.3e-0737.18Show/hide
Query:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE
        +D  S WVE     Q  A +V++ +  +IF RFG P+ + SD G  FV+ V   L    GI  ++   Y PQ++GQ E
Subjt:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE

P11227 Gag-Pol polyprotein1.3e-0737.18Show/hide
Query:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE
        +D  S WVE        AK+V++ L  +IF RFG P+ L +D G  FV+ V   +    GI  ++   Y PQ++GQ E
Subjt:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE

P31792 Pol polyprotein (Fragment)3.5e-0838.46Show/hide
Query:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE
        +D  S WVE     Q  A MV++ +  +IF RFG P+ + SD G  FV+ V   L    GI  ++   Y PQ++GQ E
Subjt:  IDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRALVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAE

Q8VZD4 Glyoxysomal processing protease, glyoxysomal2.3e-2055.95Show/hide
Query:  GHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVA
        GHR +RVRL H + W WC A V+YICK   D+ALLQLE +P +L PI  + S    G+  HV+GHGL GP+ G SPS  SGVVA
Subjt:  GHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVA

Arabidopsis top hitse value%identityAlignment
AT1G28320.1 protease-related1.6e-2155.95Show/hide
Query:  GHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVA
        GHR +RVRL H + W WC A V+YICK   D+ALLQLE +P +L PI  + S    G+  HV+GHGL GP+ G SPS  SGVVA
Subjt:  GHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCSWSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAAGTTGTTAAGAAGGAGGCGATTAAATGGTTGGATGTTGGGATTATCTATCCAATTGCAGACAGCAATTGGATTACTATTGCTTCTGAGGATCAGGAAAAAAA
CCATTTTCACCTGCCCTTATGGGACATTTGCTTTCAGGCATATGCCTTTTGGCCTTTGCAATGCTCCAGCAACATTTTAGCGGTGAAGGCTTTTGAGACTTTAAAGGCTG
CTTTGATCTCAGCACCCATTCTTTGTGCACCCAACTGGAGTTTTCCATTCGAGGTGATGTGTGATGCTAGTGATGTAGCGGAATTCGACTTGGAGATAAAGGACAAGAAG
GGATCAGAGAATGTTCTTGCAGATCATTTGTCTTGTCTTGTTCCATCATCATTTTTGCCAGAGCAATCTGTCATTTCAGATTCATTTCCAGATAAGAAACTTTTTGCTGT
CACTACTACAAAACCAGCCTTTAATGACGGACAGAAAAAAAGGGGCGGGGCCTTTAATGACGGACCCGCCGCAACCCTCTTCTTCTACCTTCAGCCGCCCCAACGCCCCT
TCTTCAACCTTCGCCGCCGTGACTCCCATTCTTCTACCTTCAGCCGCCCCAACTCCTCTTCTTCAACCTTCGGCGCCGCAACTCCCCTTCTTCAACCTTCAGCGCCGCAG
CTCTTGTTCTTCCCTTTGGTCGTGGTCTTTCCTTTAACATTCGTCCTCCACGTGGACGACACCGGAGAACAAGCTGTTGCAAATAACCTTCGGACAGCGTGGGTTTCACC
TCGCCGGAGAGCAAGATTTTGCAAACGATCGCGTTCTAACAGGTTGCAGAAAGGACCTAACTATGGACATAGAATCTTGCGTGTTCGCTTGAATCATGCAGAGCGTTGGA
TTTGGTGTGATGCTAAAGTGTTATACATATGCAAAGGACCTTGGGATGTTGCCCTGTTGCAGCTTGAGCAAATTCCGAAGCAACTCTCACCTATTATTATTGATTGTTCG
TGGTCGTCTTCAGGATCAAAGATACATGTTATTGGACATGGACTGTTGGGACCAAAATCTGGCTTCTCCCCATCTTTTTTCTCTGGTGTGGTGGCCAATTCAGAAGGTCG
TATGATTGGACTTGTTACAAGGTTCTTCGAGTTGTTGGACATATGTTTGTTATGTGATGAAGCAAAGGAGATCCTGGAGCAATGTCATTCTTCCTCGTATGGAGAAATTG
ACTATGTGTCCAAGTGGGTAGAGGTCATTGCATGCCATCAGAATGATGCTAAGATGGTGTCAAGGTTTCTTCAATCGCAAATTTTTGCGCGGTTTGGAACACCTAGGGCT
CTTGTGAGCGATGAAGGCACACACTTTGTGAATAATGTTTTAGCTAAGCTTTTAGTTAAATATGGAATTAAGCATAGGATTACTATCCCTTATCACCCACAAGCAAATGG
TCAAGCTGAAGACTCTGCCTATAAGACTCCTCTAGGTGCAATAAGAATGCTGCAGCTAAATGAGTTAGAGGAATTTCGTCAGTTTTCTTATGAAAATGCTAAAATGTATA
AGGAGAAAACAAAGCGGTGGCATCATAAGAATATAAAACCTAAAGAATTTGTTAAGGGTCAGAGAGTCTTGCTATACAACTCTAGATTGAAATTGTTTCCTGGAAAATTA
AAATGTAAATGGTCTGGACCGTTTATTGTGAATGAAGTTTTTCCTCATGGAGCAATTACTTTGCGAGATGAAAAAGATGGGCGAGTGTTTAAGGTTAATGGACAACGAGT
GAAGCATTATTGGGGTGAGGAGTTTTGTTTGAAATATCCTTCCCTAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAAGTTGTTAAGAAGGAGGCGATTAAATGGTTGGATGTTGGGATTATCTATCCAATTGCAGACAGCAATTGGATTACTATTGCTTCTGAGGATCAGGAAAAAAA
CCATTTTCACCTGCCCTTATGGGACATTTGCTTTCAGGCATATGCCTTTTGGCCTTTGCAATGCTCCAGCAACATTTTAGCGGTGAAGGCTTTTGAGACTTTAAAGGCTG
CTTTGATCTCAGCACCCATTCTTTGTGCACCCAACTGGAGTTTTCCATTCGAGGTGATGTGTGATGCTAGTGATGTAGCGGAATTCGACTTGGAGATAAAGGACAAGAAG
GGATCAGAGAATGTTCTTGCAGATCATTTGTCTTGTCTTGTTCCATCATCATTTTTGCCAGAGCAATCTGTCATTTCAGATTCATTTCCAGATAAGAAACTTTTTGCTGT
CACTACTACAAAACCAGCCTTTAATGACGGACAGAAAAAAAGGGGCGGGGCCTTTAATGACGGACCCGCCGCAACCCTCTTCTTCTACCTTCAGCCGCCCCAACGCCCCT
TCTTCAACCTTCGCCGCCGTGACTCCCATTCTTCTACCTTCAGCCGCCCCAACTCCTCTTCTTCAACCTTCGGCGCCGCAACTCCCCTTCTTCAACCTTCAGCGCCGCAG
CTCTTGTTCTTCCCTTTGGTCGTGGTCTTTCCTTTAACATTCGTCCTCCACGTGGACGACACCGGAGAACAAGCTGTTGCAAATAACCTTCGGACAGCGTGGGTTTCACC
TCGCCGGAGAGCAAGATTTTGCAAACGATCGCGTTCTAACAGGTTGCAGAAAGGACCTAACTATGGACATAGAATCTTGCGTGTTCGCTTGAATCATGCAGAGCGTTGGA
TTTGGTGTGATGCTAAAGTGTTATACATATGCAAAGGACCTTGGGATGTTGCCCTGTTGCAGCTTGAGCAAATTCCGAAGCAACTCTCACCTATTATTATTGATTGTTCG
TGGTCGTCTTCAGGATCAAAGATACATGTTATTGGACATGGACTGTTGGGACCAAAATCTGGCTTCTCCCCATCTTTTTTCTCTGGTGTGGTGGCCAATTCAGAAGGTCG
TATGATTGGACTTGTTACAAGGTTCTTCGAGTTGTTGGACATATGTTTGTTATGTGATGAAGCAAAGGAGATCCTGGAGCAATGTCATTCTTCCTCGTATGGAGAAATTG
ACTATGTGTCCAAGTGGGTAGAGGTCATTGCATGCCATCAGAATGATGCTAAGATGGTGTCAAGGTTTCTTCAATCGCAAATTTTTGCGCGGTTTGGAACACCTAGGGCT
CTTGTGAGCGATGAAGGCACACACTTTGTGAATAATGTTTTAGCTAAGCTTTTAGTTAAATATGGAATTAAGCATAGGATTACTATCCCTTATCACCCACAAGCAAATGG
TCAAGCTGAAGACTCTGCCTATAAGACTCCTCTAGGTGCAATAAGAATGCTGCAGCTAAATGAGTTAGAGGAATTTCGTCAGTTTTCTTATGAAAATGCTAAAATGTATA
AGGAGAAAACAAAGCGGTGGCATCATAAGAATATAAAACCTAAAGAATTTGTTAAGGGTCAGAGAGTCTTGCTATACAACTCTAGATTGAAATTGTTTCCTGGAAAATTA
AAATGTAAATGGTCTGGACCGTTTATTGTGAATGAAGTTTTTCCTCATGGAGCAATTACTTTGCGAGATGAAAAAGATGGGCGAGTGTTTAAGGTTAATGGACAACGAGT
GAAGCATTATTGGGGTGAGGAGTTTTGTTTGAAATATCCTTCCCTAAATTAG
Protein sequenceShow/hide protein sequence
MKEVVKKEAIKWLDVGIIYPIADSNWITIASEDQEKNHFHLPLWDICFQAYAFWPLQCSSNILAVKAFETLKAALISAPILCAPNWSFPFEVMCDASDVAEFDLEIKDKK
GSENVLADHLSCLVPSSFLPEQSVISDSFPDKKLFAVTTTKPAFNDGQKKRGGAFNDGPAATLFFYLQPPQRPFFNLRRRDSHSSTFSRPNSSSSTFGAATPLLQPSAPQ
LLFFPLVVVFPLTFVLHVDDTGEQAVANNLRTAWVSPRRRARFCKRSRSNRLQKGPNYGHRILRVRLNHAERWIWCDAKVLYICKGPWDVALLQLEQIPKQLSPIIIDCS
WSSSGSKIHVIGHGLLGPKSGFSPSFFSGVVANSEGRMIGLVTRFFELLDICLLCDEAKEILEQCHSSSYGEIDYVSKWVEVIACHQNDAKMVSRFLQSQIFARFGTPRA
LVSDEGTHFVNNVLAKLLVKYGIKHRITIPYHPQANGQAEDSAYKTPLGAIRMLQLNELEEFRQFSYENAKMYKEKTKRWHHKNIKPKEFVKGQRVLLYNSRLKLFPGKL
KCKWSGPFIVNEVFPHGAITLRDEKDGRVFKVNGQRVKHYWGEEFCLKYPSLN