; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005550 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005550
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr6:21625386..21628580
RNA-Seq ExpressionLag0005550
SyntenyLag0005550
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.3e-10835.67Show/hide
Query:  PGSTSNSTGFSSPPL--NQLLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXX
        P S S     ++ PL  + +L  I   KLD G    +   P    +SS ++   N  +  W   DQ LLGW+ NSMT+E+ATQ++  E  K         
Subjt:  PGSTSNSTGFSSPPL--NQLLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXX

Query:  XXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQN-----------
                                  W+  QSL G  +R++  YL+  F   RKG  KM DYL  MK+  D L  AG+PV+  +LI Q            
Subjt:  XXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQN-----------

Query:  ----TQKTVVTFNHTTAVNVARNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRG------RGGRGRGYGQYNNSYGNYNNSNRSVCQVCG
            + +T +++    A  +   SR    +N  N       N+   +  +S  RG S+ N  RG      RGGRGRG             S ++ CQVCG
Subjt:  ----TQKTVVTFNHTTAVNVARNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRG------RGGRGRGYGQYNNSYGNYNNSNRSVCQVCG

Query:  KQGHSALVCYHRW--------------------SLFGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD------------------
           H A+ C+HR+                    +   +  +V D +WY DSGASNHVT       +  E+ G   + +GNG+                  
Subjt:  KQGHSALVCYHRW--------------------SLFGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD------------------

Query:  -----------------------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGN
                                           KDK + +V+LKG LKDGLYQL S T    S+         + + +  ++VLD V++ C + V  +
Subjt:  -----------------------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGN

Query:  ESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDN
        ++ SFCEACQ+GK H LPF  S++ A++P +LVH+D+WGPAPI++  GF++YV FVDD SRFTW+YPLK+KS+T+ AF  F +L + QF+ +IK  Q D 
Subjt:  ESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDN

Query:  GGEF----------------------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGS
        GGE+                        NGRAERKHRH+ E GLTLLAQA MPL YWW+AF  A+ +IN LP+ V   +SP  ++L K+PD+   + FG 
Subjt:  GGEF----------------------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGS

Query:  SCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF
        +CYPCL+ Y QHK  +H+   V+LG+S SHKG+KCLN+ GRIFISR+V FNE  FPF
Subjt:  SCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.7e-2863.64Show/hide
Query:  SKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELP
        +++ W+  L+ EL      KPILWCDNLSA ALA+NPV HAR+KHIEIDVH++RDQVL+  + V YVPT DQIADCLTKPLSH++F+ LR KLGV+  P
Subjt:  SKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELP

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]6.4e-10835.91Show/hide
Query:  LLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXX
        +L+ I   KLD G    ++  P    +S+  +  VNP +  WI  DQ LLGWL NSM  ++ATQ++  E  K                            
Subjt:  LLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXX

Query:  XXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFN---------------HTTAVN
               W+  QSL G  +++   YL+  F  TRKG  KM +YL  MK+ SD L  AGSP++  +L+ Q        +N                  A  
Subjt:  XXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFN---------------HTTAVN

Query:  VARNSR-GSFNSNPG--NSKGWNFNNSQRGMNPKSGQRGG---SNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW------
        +A  SR   FN+  G   +   NF N       K   RG    SNF G RG    GRG G+ +N          + CQVC   GH A+ C +R+      
Subjt:  VARNSR-GSFNSNPG--NSKGWNFNNSQRGMNPKSGQRGG---SNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW------

Query:  -------SLFGNSETVI-------DPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-------------------------------------
                  G+    I       D  WY DSGA+NHVT   +      E+ G   + +GNG+                                     
Subjt:  -------SLFGNSETVI-------DPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-------------------------------------

Query:  ----------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPF
                        KDK + + LLKG LKDGLYQL +       S+ +S     + + +  ++VLD V+K+CN+ +  ++  SFCEACQFGK H LPF
Subjt:  ----------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPF

Query:  SLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF---------------
          S++  ++P  L+HSD+WGPAPI+SP GF++YV F+DD SRFTW++PLK+KSDT+ AF  F +L + QF+ +IK  Q D GGE+               
Subjt:  SLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF---------------

Query:  -------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSE
                 NGRAERKHRHV E GLTLLAQA MPL YWW+AF  A+ +IN LP+ V   +SP  ++  ++PD+++ + FG +CYPCL+ Y QHK  FH+ 
Subjt:  -------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSE

Query:  NHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF
          V++G+S SHKG+KC+N+ GRIF+SR+V FNE  FPF
Subjt:  NHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]3.0e-2563.04Show/hide
Query:  LMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELPA
        L+ EL      KP+LWCDNLSA ALA+NPV HAR+KHIEID+H++RDQVL+  + + YVPTADQIADCLTKPL H++F  +R KLGV   P+
Subjt:  LMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELPA

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]8.3e-10835.04Show/hide
Query:  SSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYL
        +++ ++   NP +E W   DQ LLGWL NSMT  +ATQ++  E                                      W+  QSL G  +R++  YL
Subjt:  SSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYL

Query:  RQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQ-------------------------NTQKTVVTFNH-----TTAVNVARNSRGSF-N
        +  F  TRKG  KM DYL  MK+ +D L  AG+P++  +LI Q                         + Q  ++TF +      +  N+  N+  +   
Subjt:  RQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQ-------------------------NTQKTVVTFNH-----TTAVNVARNSRGSF-N

Query:  SNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW--------------------SL
         +      +N NN+ RG N       GSNF G RG  GRGR +              ++ CQVCG   H A+ C++R+                    + 
Subjt:  SNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW--------------------SL

Query:  FGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-----------------------------------------------------
          +  ++ D +WY DSGASNHVT   +   N  E+ G   + +GNG+                                                     
Subjt:  FGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-----------------------------------------------------

Query:  KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHS
        KDK + + +L+G LKDGLYQL    +    S+ +S     + + +  ++VLD V+K CN+ +  ++  SFCEACQ+GK H LPF  S + AK+  +LVH+
Subjt:  KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHS

Query:  DLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF----------------------VHNGRAERK
        D+WGPAPI+S  GF++YV F+DD +RFTW+YPLK+KSDT  AF  F ++V+ QF  +IK  Q D GGE+                        NGRAERK
Subjt:  DLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF----------------------VHNGRAERK

Query:  HRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKC
        HRH+ E GLTLLAQA MPL YWW+AF  A+ +IN LP+ V   KSP  +L  ++PD++S + FG +CYP L+ Y +HK  FH+   V+LG+S SHKG+KC
Subjt:  HRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKC

Query:  LNASGRIFISRNVQFNEQEFPF
        +N+ GRIFISR+V FNE  FPF
Subjt:  LNASGRIFISRNVQFNEQEFPF

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.1e-11139.44Show/hide
Query:  WEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFN----------HTTAVNVAR---------
        WE  QSL G  +R+   +L+  F +TRKG  KM +YL  MK  +D+L  AGS V+  +L++Q        +N          H T V +           
Subjt:  WEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFN----------HTTAVNVAR---------

Query:  ---NSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW------------
           N++ +   NP ++      N +   N   G RGG    G RG  GRGR               +R VCQVC K GH+A  CYHR+            
Subjt:  ---NSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW------------

Query:  -------------SLFGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD--------------------------------------
                     +   +  TV D +WY DSGASNHVT D N +    E  G   +T+GNG                                       
Subjt:  -------------SLFGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD--------------------------------------

Query:  -------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQ---KSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGK
                           KDK + R+LL+G +KDGLYQL   +       H   ++ +   + + +  S+VL+ V+K CN+     E+  FCEACQFGK
Subjt:  -------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQ---KSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGK

Query:  AHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF---------
        AHNLPF  S + AK+P DLVHSD+WGPAPI S  GF++YVLF+DD SRFTW+YPLK+KSD   AF  F +LV+ QF+ +IK  Q D GGEF         
Subjt:  AHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF---------

Query:  -------------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHK
                       NGRAERKHRHV+E+GLTLLAQA MPL YWW+AF  A+ +IN LPT V+  KSP + L +K PD+++ + FG +CYPCL+ Y QHK
Subjt:  -------------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHK

Query:  FMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF
          FH+   V+LG+S SHKG+KCLN++GRIFISR+V FNE  FPF
Subjt:  FMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]3.1e-11036.22Show/hide
Query:  SSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYL
        +S   T  +NP Y+ W   DQ LLGWL NSMT ++ATQV+  E  K                                   W+  QSL G  +R+   YL
Subjt:  SSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYL

Query:  RQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQN---------------TQKTVVTFNHTTAVNVARNSR----GSFNSNPGNSKGWNFN
        +  F  T K   KM  YL  MK+ +D L  AGSP++  +L+ Q                + +T +++    A  +A  SR     +FN+   N+   NF 
Subjt:  RQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQN---------------TQKTVVTFNHTTAVNVARNSR----GSFNSNPGNSKGWNFN

Query:  NSQRGMNPKSGQRGGSNFNGGRG-RGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRWSLF------------GNSETVIDP------NWY
        +       K G RGG   +  RG RGGRGR            +   R +CQ+CGK GH+A  CY+R+                +S  V  P       WY
Subjt:  NSQRGMNPKSGQRGGSNFNGGRG-RGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRWSLF------------GNSETVIDP------NWY

Query:  ADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-----------------------------------------------------KDKGSDRVLLKGT
         DSGASNHVT     L +  E  G   + +GNG+                                                     KDK + + LLKG 
Subjt:  ADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-----------------------------------------------------KDKGSDRVLLKGT

Query:  LKDGLYQL----ESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIV
        LKDGLYQL    E  TN    +      +  + + +  ++VL+ V+K+ N+ +  ++  +FCEACQFGK H LPF  S++ AK+P DL+H+D+WGPAPI+
Subjt:  LKDGLYQL----ESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIV

Query:  SPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF----------------------VHNGRAERKHRHVIETGL
        S   F++YV F+DD SRFTW++PLK+KS+T+ AF  F +LV+ QF+ +IK  + D GGE+                        NGRAERKHRHV E GL
Subjt:  SPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF----------------------VHNGRAERKHRHVIETGL

Query:  TLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFI
        TLLAQA MPL+YWW+AF  A+ +IN LP+ V   +SP  ++  K+PD+++ + FG +CYPCL+ Y QHK  FH+   V+LG+S SHKG+KC+N+ GR+F+
Subjt:  TLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFI

Query:  SRNVQFNEQEFPFSK
        SR+V FNE  FPF +
Subjt:  SRNVQFNEQEFPFSK

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-11139.44Show/hide
Query:  WEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFN----------HTTAVNVAR---------
        WE  QSL G  +R+   +L+  F +TRKG  KM +YL  MK  +D+L  AGS V+  +L++Q        +N          H T V +           
Subjt:  WEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFN----------HTTAVNVAR---------

Query:  ---NSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW------------
           N++ +   NP ++      N +   N   G RGG    G RG  GRGR               +R VCQVC K GH+A  CYHR+            
Subjt:  ---NSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW------------

Query:  -------------SLFGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD--------------------------------------
                     +   +  TV D +WY DSGASNHVT D N +    E  G   +T+GNG                                       
Subjt:  -------------SLFGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD--------------------------------------

Query:  -------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQ---KSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGK
                           KDK + R+LL+G +KDGLYQL   +       H   ++ +   + + +  S+VL+ V+K CN+     E+  FCEACQFGK
Subjt:  -------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQ---KSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGK

Query:  AHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF---------
        AHNLPF  S + AK+P DLVHSD+WGPAPI S  GF++YVLF+DD SRFTW+YPLK+KSD   AF  F +LV+ QF+ +IK  Q D GGEF         
Subjt:  AHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF---------

Query:  -------------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHK
                       NGRAERKHRHV+E+GLTLLAQA MPL YWW+AF  A+ +IN LPT V+  KSP + L +K PD+++ + FG +CYPCL+ Y QHK
Subjt:  -------------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHK

Query:  FMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF
          FH+   V+LG+S SHKG+KCLN++GRIFISR+V FNE  FPF
Subjt:  FMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF

A0A2Z6MBG6 Integrase catalytic domain-containing protein6.2e-10935.67Show/hide
Query:  PGSTSNSTGFSSPPL--NQLLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXX
        P S S     ++ PL  + +L  I   KLD G    +   P    +SS ++   N  +  W   DQ LLGW+ NSMT+E+ATQ++  E  K         
Subjt:  PGSTSNSTGFSSPPL--NQLLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXX

Query:  XXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQN-----------
                                  W+  QSL G  +R++  YL+  F   RKG  KM DYL  MK+  D L  AG+PV+  +LI Q            
Subjt:  XXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQN-----------

Query:  ----TQKTVVTFNHTTAVNVARNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRG------RGGRGRGYGQYNNSYGNYNNSNRSVCQVCG
            + +T +++    A  +   SR    +N  N       N+   +  +S  RG S+ N  RG      RGGRGRG             S ++ CQVCG
Subjt:  ----TQKTVVTFNHTTAVNVARNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRG------RGGRGRGYGQYNNSYGNYNNSNRSVCQVCG

Query:  KQGHSALVCYHRW--------------------SLFGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD------------------
           H A+ C+HR+                    +   +  +V D +WY DSGASNHVT       +  E+ G   + +GNG+                  
Subjt:  KQGHSALVCYHRW--------------------SLFGNSETVIDPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD------------------

Query:  -----------------------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGN
                                           KDK + +V+LKG LKDGLYQL S T    S+         + + +  ++VLD V++ C + V  +
Subjt:  -----------------------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGN

Query:  ESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDN
        ++ SFCEACQ+GK H LPF  S++ A++P +LVH+D+WGPAPI++  GF++YV FVDD SRFTW+YPLK+KS+T+ AF  F +L + QF+ +IK  Q D 
Subjt:  ESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDN

Query:  GGEF----------------------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGS
        GGE+                        NGRAERKHRH+ E GLTLLAQA MPL YWW+AF  A+ +IN LP+ V   +SP  ++L K+PD+   + FG 
Subjt:  GGEF----------------------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGS

Query:  SCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF
        +CYPCL+ Y QHK  +H+   V+LG+S SHKG+KCLN+ GRIFISR+V FNE  FPF
Subjt:  SCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF

A0A2Z6MBG6 Integrase catalytic domain-containing protein8.3e-2963.64Show/hide
Query:  SKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELP
        +++ W+  L+ EL      KPILWCDNLSA ALA+NPV HAR+KHIEIDVH++RDQVL+  + V YVPT DQIADCLTKPLSH++F+ LR KLGV+  P
Subjt:  SKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELP

A0A2Z6MBG6 Integrase catalytic domain-containing protein3.1e-10835.91Show/hide
Query:  LLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXX
        +L+ I   KLD G    ++  P    +S+  +  VNP +  WI  DQ LLGWL NSM  ++ATQ++  E  K                            
Subjt:  LLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXX

Query:  XXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFN---------------HTTAVN
               W+  QSL G  +++   YL+  F  TRKG  KM +YL  MK+ SD L  AGSP++  +L+ Q        +N                  A  
Subjt:  XXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFN---------------HTTAVN

Query:  VARNSR-GSFNSNPG--NSKGWNFNNSQRGMNPKSGQRGG---SNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW------
        +A  SR   FN+  G   +   NF N       K   RG    SNF G RG    GRG G+ +N          + CQVC   GH A+ C +R+      
Subjt:  VARNSR-GSFNSNPG--NSKGWNFNNSQRGMNPKSGQRGG---SNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRW------

Query:  -------SLFGNSETVI-------DPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-------------------------------------
                  G+    I       D  WY DSGA+NHVT   +      E+ G   + +GNG+                                     
Subjt:  -------SLFGNSETVI-------DPNWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-------------------------------------

Query:  ----------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPF
                        KDK + + LLKG LKDGLYQL +       S+ +S     + + +  ++VLD V+K+CN+ +  ++  SFCEACQFGK H LPF
Subjt:  ----------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPF

Query:  SLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF---------------
          S++  ++P  L+HSD+WGPAPI+SP GF++YV F+DD SRFTW++PLK+KSDT+ AF  F +L + QF+ +IK  Q D GGE+               
Subjt:  SLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF---------------

Query:  -------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSE
                 NGRAERKHRHV E GLTLLAQA MPL YWW+AF  A+ +IN LP+ V   +SP  ++  ++PD+++ + FG +CYPCL+ Y QHK  FH+ 
Subjt:  -------VHNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSE

Query:  NHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF
          V++G+S SHKG+KC+N+ GRIF+SR+V FNE  FPF
Subjt:  NHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPF

A0A2Z6P4D5 Integrase catalytic domain-containing protein1.5e-2563.04Show/hide
Query:  LMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELPA
        L+ EL      KP+LWCDNLSA ALA+NPV HAR+KHIEID+H++RDQVL+  + + YVPTADQIADCLTKPL H++F  +R KLGV   P+
Subjt:  LMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELPA

A0A803QCY3 Uncharacterized protein5.1e-11137.25Show/hide
Query:  STTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYLRQ
        S T  T+NP +E WI  DQLL+GWLY+SMT  +AT+VMG                                        W A++ L+G  S+++ D  R 
Subjt:  STTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYLRQ

Query:  IFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFNHTTAVNVARNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGG
        + Q T+KG T M +YLR  KS +D+L  AG P     L +    +  + +  T  + +   ++ S+          +F +    +   +    G +FN  
Subjt:  IFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFNHTTAVNVARNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGG

Query:  RGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRWSLFGNSETVIDPN----------------------------WYADSGASNHVTGD
        RG GGR RG G+ NNS        +  CQVCGK  HSA+VCY+ W  F +S    DP+                            W+ADSGASN++T D
Subjt:  RGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRWSLFGNSETVIDPN----------------------------WYADSGASNHVTGD

Query:  FNNLINPREYGGNEQVTIGNGD----------------------------------------------------------KDKGSDRVLLKGTLKDGLYQ
         + +   +EYGG E+VT+GNGD                                                          KD  + RVLL+G LKDGLYQ
Subjt:  FNNLINPREYGGNEQVTIGNGD----------------------------------------------------------KDKGSDRVLLKGTLKDGLYQ

Query:  LESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLF
        L++  N                +  STS+    ++ +CN P   +    FC+ACQ+GK+H+LPF  SN++A K  DLVH+DLWGP+PI S   F++YV F
Subjt:  LESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLF

Query:  VDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF----------------------VHNGRAERKHRHVIETGLTLLAQANMPLA
        VDD +RFTW+YPLK KS+   AF  F SL + QF  +IKA ++D GGE+                        NGRAERKHRH++E GLTLLAQ+ MPL 
Subjt:  VDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF----------------------VHNGRAERKHRHVIETGLTLLAQANMPLA

Query:  YWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEF
        YWWDAF  A+ +IN LPTP+L  K+P E+L  K PD+   + FG +C+PCLR YQ HKF FHS   V LG+S +HKG+KCL+ +GRI+I R+V FNE EF
Subjt:  YWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEF

Query:  PFSKVMWLNQLMNE
        PF ++ +LN   +E
Subjt:  PFSKVMWLNQLMNE

A0A803QCY3 Uncharacterized protein7.8e-2753.64Show/hide
Query:  SKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELPA
        +K+ W+  L+ E+G       + WCDNL A ALA+NPVFHAR KHIEID+HFVRD+VL+  LEVRY+P++DQ+ADCLTK L+ S+F +L  K+G +  P 
Subjt:  SKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELPA

Query:  RLRRDIKDKD
        RLR D+++ +
Subjt:  RLRRDIKDKD

A0A803QCY3 Uncharacterized protein1.5e-11036.22Show/hide
Query:  SSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYL
        +S   T  +NP Y+ W   DQ LLGWL NSMT ++ATQV+  E  K                                   W+  QSL G  +R+   YL
Subjt:  SSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYL

Query:  RQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQN---------------TQKTVVTFNHTTAVNVARNSR----GSFNSNPGNSKGWNFN
        +  F  T K   KM  YL  MK+ +D L  AGSP++  +L+ Q                + +T +++    A  +A  SR     +FN+   N+   NF 
Subjt:  RQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQN---------------TQKTVVTFNHTTAVNVARNSR----GSFNSNPGNSKGWNFN

Query:  NSQRGMNPKSGQRGGSNFNGGRG-RGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRWSLF------------GNSETVIDP------NWY
        +       K G RGG   +  RG RGGRGR            +   R +CQ+CGK GH+A  CY+R+                +S  V  P       WY
Subjt:  NSQRGMNPKSGQRGGSNFNGGRG-RGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRWSLF------------GNSETVIDP------NWY

Query:  ADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-----------------------------------------------------KDKGSDRVLLKGT
         DSGASNHVT     L +  E  G   + +GNG+                                                     KDK + + LLKG 
Subjt:  ADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-----------------------------------------------------KDKGSDRVLLKGT

Query:  LKDGLYQL----ESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIV
        LKDGLYQL    E  TN    +      +  + + +  ++VL+ V+K+ N+ +  ++  +FCEACQFGK H LPF  S++ AK+P DL+H+D+WGPAPI+
Subjt:  LKDGLYQL----ESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIV

Query:  SPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF----------------------VHNGRAERKHRHVIETGL
        S   F++YV F+DD SRFTW++PLK+KS+T+ AF  F +LV+ QF+ +IK  + D GGE+                        NGRAERKHRHV E GL
Subjt:  SPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEF----------------------VHNGRAERKHRHVIETGL

Query:  TLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFI
        TLLAQA MPL+YWW+AF  A+ +IN LP+ V   +SP  ++  K+PD+++ + FG +CYPCL+ Y QHK  FH+   V+LG+S SHKG+KC+N+ GR+F+
Subjt:  TLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFI

Query:  SRNVQFNEQEFPFSK
        SR+V FNE  FPF +
Subjt:  SRNVQFNEQEFPFSK

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-2529.59Show/hide
Query:  SVSFCEACQFGKAHNLPFSLSNNRA--KKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSD
        S   CE C  GK   LPF    ++   K+P  +VHSD+ GP   V+ D   ++V+FVD  + +   Y +K KSD    FQ F++  +  F+ ++     D
Subjt:  SVSFCEACQFGKAHNLPFSLSNNRA--KKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSD

Query:  NGGEFVH------------------------NGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVL--GGKSPVEILLNKKPDFSSF
        NG E++                         NG +ER  R + E   T+++ A +  ++W +A L A  +IN +P+  L    K+P E+  NKKP     
Subjt:  NGGEFVH------------------------NGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVL--GGKSPVEILLNKKPDFSSF

Query:  RVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFI-SRNVQFNEQEFPFSKVM
        RVFG++ Y  ++  +Q KF   S   +++G+ P+  G K  +A    FI +R+V  +E     S+ +
Subjt:  RVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFI-SRNVQFNEQEFPFSKVM

P04146 Copia protein1.0e-1236.46Show/hide
Query:  KVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVE
        + +WL  L+  +     +   ++ DN    ++A NP  H R KHI+I  HF R+QV    + + Y+PT +Q+AD  TKPL  ++F  LR KLG+++
Subjt:  KVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-3228.79Show/hide
Query:  KGSDRVLLKGTLKDGLYQLESATNLPESSMHQ---STTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVH
        KGS  V+ KG  +  LY+  +     E +  Q   S  +  K + + + + L  + K+  +      +V  C+ C FGK H + F  S+ R     DLV+
Subjt:  KGSDRVLLKGTLKDGLYQLESATNLPESSMHQ---STTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVH

Query:  SDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEFV------------------------HNGRA
        SD+ GP  I S  G +++V F+DD SR  W+Y LK K      FQ F +LV+ +   ++K  +SDNGGE+                         HNG A
Subjt:  SDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEFV------------------------HNGRA

Query:  ERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKG
        ER +R ++E   ++L  A +P ++W +A   A  +IN  P+  L  + P  +  NK+  +S  +VFG   +  + + Q+ K    S   +++G+     G
Subjt:  ERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKG

Query:  HKCLN-ASGRIFISRNVQFNEQE
        ++  +    ++  SR+V F E E
Subjt:  HKCLN-ASGRIFISRNVQFNEQE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-1034.04Show/hide
Query:  KVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGV
        +++WL + + ELG H   + +++CD+ SA  L+ N ++HARTKHI++  H++R+ V   +L+V  + T +  AD LTK +  ++F   +  +G+
Subjt:  KVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGV

Q12491 Transposon Ty2-B Gag-Pol polyprotein3.1e-1225.12Show/hide
Query:  KSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKA----HNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPL--K
        +SI  S  +     +KE ++    N S   C  C  GK+    H     L    + +PF  +H+D++GP   +      +++ F D+ +RF W+YPL  +
Subjt:  KSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKA----HNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPL--K

Query:  RKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEFVH------------------------NGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQI
        R+   L  F   L+ ++ QF++++   Q D G E+ +                        +G AER +R ++    TLL  + +P   W+ A   +  I
Subjt:  RKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEFVH------------------------NGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQI

Query:  INGLPTP
         N L +P
Subjt:  INGLPTP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.2e-6629.58Show/hide
Query:  WEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPV----------------------------TPRNLIS-----QNTQKT
        WE ++ ++   S      LR   +Q  KG   + DY++ + +  D L   G P+                            TP  L        N +  
Subjt:  WEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPV----------------------------TPRNLIS-----QNTQKT

Query:  VVTFNHTTAVNVA------RNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALV
        ++  +  T + +       RN+  + N+N GN     ++N     N K  Q+  +NF+               NN   N +      CQ+CG QGHSA  
Subjt:  VVTFNHTTAVNVA------RNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALV

Query:  CYHRWSLFG--NSETVIDP-------------------NWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-------------------------
        C          NS+    P                   NW  DSGA++H+T DFNNL   + Y G + V + +G                          
Subjt:  CYHRWSLFG--NSETVIDP-------------------NWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD-------------------------

Query:  --------------------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSM-HQSTTVTQKS----INNSTSRVLDHVIKECNLPVK
                                        KD  +   LL+G  KD LY+   A++ P S     S+  T  S    + +    +L+ VI   +L V 
Subjt:  --------------------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSM-HQSTTVTQKS----INNSTSRVLDHVIKECNLPVK

Query:  GNESVSF--CEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAF
         N S  F  C  C   K++ +PFS S   + +P + ++SD+W  +PI+S D +++YV+FVD  +R+TWLYPLK+KS     F  F +L++ +F ++I  F
Subjt:  GNESVSF--CEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAF

Query:  QSDNGGEFV----------------------HNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFR
         SDNGGEFV                      HNG +ERKHRH++ETGLTLL+ A++P  YW  AF VA+ +IN LPTP+L  +SP + L    P++   R
Subjt:  QSDNGGEFV----------------------HNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFR

Query:  VFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLN-ASGRIFISRNVQFNEQEFPFSKVM
        VFG +CYP LR Y QHK    S   V+LG+S +   + CL+  + R++ISR+V+F+E  FPFS  +
Subjt:  VFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLN-ASGRIFISRNVQFNEQEFPFSKVM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-2443.7Show/hide
Query:  VYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPFSKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEV
        VYLG  P     K     G +  S   ++       S++ W+  L+ ELG   +  P+++CDN+ A  L ANPVFH+R KHI ID HF+R+QV  GAL V
Subjt:  VYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPFSKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEV

Query:  RYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELP
         +V T DQ+AD LTKPLS + F    SK+GV  +P
Subjt:  RYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.1e-6130.66Show/hide
Query:  NTQKTVVTFNHTTAVNVARNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVC
        N +  ++  N    V +  N     N+N       N N + RG N         N+N    R    +     + S           CQ+C  QGHSA  C
Subjt:  NTQKTVVTFNHTTAVNVARNSRGSFNSNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVC

Query:  --YHRWSLFGNSETVIDP-------------------NWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD--------------------------
           H++    N +    P                   NW  DSGA++H+T DFNNL   + Y G + V I +G                           
Subjt:  --YHRWSLFGNSETVIDP-------------------NWYADSGASNHVTGDFNNLINPREYGGNEQVTIGNGD--------------------------

Query:  -------------------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKS-----INNSTSRVLDHVIKECNLPV-K
                                       KD  +   LL+G  KD LY+   A++   S      +    S     + + +  +L+ VI   +LPV  
Subjt:  -------------------------------KDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKS-----INNSTSRVLDHVIKECNLPV-K

Query:  GNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQS
         +  +  C  C   K+H +PFS S   + KP + ++SD+W  +PI+S D +++YV+FVD  +R+TWLYPLK+KS     F  F SLV+ +F ++I    S
Subjt:  GNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQS

Query:  DNGGEFV----------------------HNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVF
        DNGGEFV                      HNG +ERKHRH++E GLTLL+ A++P  YW  AF VA+ +IN LPTP+L  +SP + L  + P++   +VF
Subjt:  DNGGEFV----------------------HNGRAERKHRHVIETGLTLLAQANMPLAYWWDAFLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVF

Query:  GSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLN-ASGRIFISRNVQFNEQEFPFS
        G +CYP LR Y +HK    S+   ++G+S +   + CL+  +GR++ SR+VQF+E+ FPFS
Subjt:  GSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLN-ASGRIFISRNVQFNEQEFPFS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.5e-2442.96Show/hide
Query:  VYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPFSKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEV
        VYLG  P     K     G +  S   ++       S++ W+  L+ ELG   S  P+++CDN+ A  L ANPVFH+R KHI +D HF+R+QV  GAL V
Subjt:  VYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPFSKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEV

Query:  RYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELP
         +V T DQ+AD LTKPLS   F     K+GV+++P
Subjt:  RYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.0e-1034.19Show/hide
Query:  SRNVQFNEQEFPFSKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQ-VLKGALEVRYVPTADQIADCLTK---PLSH
        S   ++    F   ++MWL Q   EL    S   +L+CDN +A  +A N VFH RTKHIE D H VR++ V +  L   +    +Q  D  T+   P+  
Subjt:  SRNVQFNEQEFPFSKVMWLNQLMNELGCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQ-VLKGALEVRYVPTADQIADCLTK---PLSH

Query:  SQFAYLRSKLGVVELPA
            Y+ S  G+  L A
Subjt:  SQFAYLRSKLGVVELPA

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.0e-1034.58Show/hide
Query:  RVLLKGTLKDGLYQLESATNLPESSMHQS----TTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDL
        R +LKG   D LY L+ +    ES++ ++    T +    + + + R ++ ++K+  L      S+ FCE C +GK H + FS   +  K P D VHSDL
Subjt:  RVLLKGTLKDGLYQLESATNLPESSMHQS----TTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKKPFDLVHSDL

Query:  WGPAPIV
        WG AP V
Subjt:  WGPAPIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAACGTCACAGCTTCTGGGCCTGGTTCAACCAGCAATTCAACTGGGTTCAGCTCACCTCCTCTCAATCAGCTTTTGAACCAGATCACCACTATCAAACTAGACAG
AGGTGCAGATGCATCGAGTTCGACGGCACCAGCTGGTGGAGCATCAAGCTCTACAACCACGATGACAGTTAATCCTTTGTATGAAGCGTGGATCACCACAGATCAACTAC
TTCTCGGATGGCTTTACAATTCCATGACGTCTGAAGTTGCCACACAAGTCATGGGATTCGAAATGCAAAAGATCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGTGGGAAGCGATTCAGAGTTTGTTTGGCATACAATCAAGAGCGGA
GGAAGACTACCTTCGACAAATTTTTCAGCAAACTCGAAAAGGTAACACTAAAATGGCCGATTATTTGAGACTAATGAAATCTCATTCTGATAACCTAGGGCAAGCTGGTA
GTCCTGTCACGCCGAGAAATTTAATTTCCCAAAATACTCAAAAAACTGTTGTCACCTTCAATCACACAACTGCAGTGAATGTAGCAAGGAACAGCAGAGGTTCATTTAAT
TCAAACCCAGGGAATTCCAAAGGATGGAACTTCAACAATAGTCAGCGAGGCATGAATCCTAAAAGTGGTCAAAGAGGTGGATCTAATTTCAATGGAGGAAGAGGCCGTGG
TGGACGCGGCAGGGGTTATGGGCAATACAATAACTCCTATGGGAACTACAATAACTCCAACAGGTCAGTGTGTCAAGTCTGTGGTAAGCAGGGACACTCAGCCCTTGTTT
GTTATCACAGGTGGTCCTTATTTGGAAATTCTGAGACAGTAATCGACCCCAACTGGTATGCCGACAGTGGGGCCTCTAATCATGTCACTGGAGACTTCAACAATCTGATC
AATCCACGAGAGTATGGAGGTAATGAACAAGTTACTATAGGCAATGGTGATAAGGACAAGGGCTCGGATCGGGTGCTGCTGAAAGGAACCCTTAAAGATGGGCTATACCA
ACTAGAAAGTGCCACCAATCTACCTGAGAGTTCTATGCATCAGTCTACAACCGTTACACAGAAGTCTATAAATAACTCAACCTCTAGAGTACTTGACCATGTTATTAAAG
AGTGTAATCTTCCAGTAAAAGGAAATGAAAGTGTTAGCTTTTGTGAGGCATGTCAGTTTGGCAAGGCACATAATCTTCCTTTCTCTTTATCTAACAATCGAGCTAAGAAA
CCATTTGATCTGGTTCATTCTGATCTATGGGGTCCAGCACCAATAGTTTCACCTGATGGCTTTCAGTTTTACGTGTTATTTGTGGATGATCATAGCAGATTTACATGGTT
GTATCCTTTAAAACGCAAGAGTGACACTCTGCTTGCTTTCCAACACTTCCTTAGTCTAGTTCAGACTCAATTTCATAGCCAAATTAAGGCATTTCAGTCTGACAATGGGG
GTGAATTTGTTCATAATGGCCGAGCTGAACGCAAACACAGACATGTTATTGAAACTGGCCTCACACTTCTTGCTCAGGCTAATATGCCACTAGCCTACTGGTGGGATGCG
TTTCTTGTTGCTATTCAGATCATAAATGGGCTACCTACTCCGGTTCTAGGTGGTAAGTCGCCAGTTGAAATTTTGCTCAATAAGAAGCCTGATTTTTCTTCTTTCCGTGT
GTTTGGCAGCTCATGTTATCCTTGTTTGCGGCAATATCAGCAACACAAGTTCATGTTTCATTCGGAGAATCACGTGTATTTGGGATTCAGCCCTTCTCACAAAGGGCACA
AATGCCTCAATGCGTCTGGGCGGATCTTTATCTCTCGCAATGTTCAGTTCAATGAACAAGAATTCCCATTCTCTAAAGTCATGTGGCTAAATCAGTTGATGAATGAACTT
GGCTGCCACTCTTCCTCAAAGCCGATCCTATGGTGTGACAACCTCAGCGCAGGTGCACTAGCTGCCAACCCAGTTTTTCACGCACGAACTAAACATATTGAGATCGACGT
TCACTTTGTTCGTGATCAAGTTCTCAAGGGTGCTCTGGAAGTCAGGTATGTTCCCACGGCAGATCAAATTGCAGACTGTTTAACTAAACCTTTGTCGCACTCTCAGTTTG
CCTACCTACGTTCCAAACTCGGTGTAGTTGAACTACCAGCTCGTTTGAGGAGGGATATTAAGGACAAGGATCATAAATGCCACGTCAAGGAAATTAATACAGTCTGCCAG
CCATGTCACAATACAAAGCAAGCAGATTCCAGAAACTTCTACTCAACGAATTCTGTTATTCCTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCAACGTCACAGCTTCTGGGCCTGGTTCAACCAGCAATTCAACTGGGTTCAGCTCACCTCCTCTCAATCAGCTTTTGAACCAGATCACCACTATCAAACTAGACAG
AGGTGCAGATGCATCGAGTTCGACGGCACCAGCTGGTGGAGCATCAAGCTCTACAACCACGATGACAGTTAATCCTTTGTATGAAGCGTGGATCACCACAGATCAACTAC
TTCTCGGATGGCTTTACAATTCCATGACGTCTGAAGTTGCCACACAAGTCATGGGATTCGAAATGCAAAAGATCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGTGGGAAGCGATTCAGAGTTTGTTTGGCATACAATCAAGAGCGGA
GGAAGACTACCTTCGACAAATTTTTCAGCAAACTCGAAAAGGTAACACTAAAATGGCCGATTATTTGAGACTAATGAAATCTCATTCTGATAACCTAGGGCAAGCTGGTA
GTCCTGTCACGCCGAGAAATTTAATTTCCCAAAATACTCAAAAAACTGTTGTCACCTTCAATCACACAACTGCAGTGAATGTAGCAAGGAACAGCAGAGGTTCATTTAAT
TCAAACCCAGGGAATTCCAAAGGATGGAACTTCAACAATAGTCAGCGAGGCATGAATCCTAAAAGTGGTCAAAGAGGTGGATCTAATTTCAATGGAGGAAGAGGCCGTGG
TGGACGCGGCAGGGGTTATGGGCAATACAATAACTCCTATGGGAACTACAATAACTCCAACAGGTCAGTGTGTCAAGTCTGTGGTAAGCAGGGACACTCAGCCCTTGTTT
GTTATCACAGGTGGTCCTTATTTGGAAATTCTGAGACAGTAATCGACCCCAACTGGTATGCCGACAGTGGGGCCTCTAATCATGTCACTGGAGACTTCAACAATCTGATC
AATCCACGAGAGTATGGAGGTAATGAACAAGTTACTATAGGCAATGGTGATAAGGACAAGGGCTCGGATCGGGTGCTGCTGAAAGGAACCCTTAAAGATGGGCTATACCA
ACTAGAAAGTGCCACCAATCTACCTGAGAGTTCTATGCATCAGTCTACAACCGTTACACAGAAGTCTATAAATAACTCAACCTCTAGAGTACTTGACCATGTTATTAAAG
AGTGTAATCTTCCAGTAAAAGGAAATGAAAGTGTTAGCTTTTGTGAGGCATGTCAGTTTGGCAAGGCACATAATCTTCCTTTCTCTTTATCTAACAATCGAGCTAAGAAA
CCATTTGATCTGGTTCATTCTGATCTATGGGGTCCAGCACCAATAGTTTCACCTGATGGCTTTCAGTTTTACGTGTTATTTGTGGATGATCATAGCAGATTTACATGGTT
GTATCCTTTAAAACGCAAGAGTGACACTCTGCTTGCTTTCCAACACTTCCTTAGTCTAGTTCAGACTCAATTTCATAGCCAAATTAAGGCATTTCAGTCTGACAATGGGG
GTGAATTTGTTCATAATGGCCGAGCTGAACGCAAACACAGACATGTTATTGAAACTGGCCTCACACTTCTTGCTCAGGCTAATATGCCACTAGCCTACTGGTGGGATGCG
TTTCTTGTTGCTATTCAGATCATAAATGGGCTACCTACTCCGGTTCTAGGTGGTAAGTCGCCAGTTGAAATTTTGCTCAATAAGAAGCCTGATTTTTCTTCTTTCCGTGT
GTTTGGCAGCTCATGTTATCCTTGTTTGCGGCAATATCAGCAACACAAGTTCATGTTTCATTCGGAGAATCACGTGTATTTGGGATTCAGCCCTTCTCACAAAGGGCACA
AATGCCTCAATGCGTCTGGGCGGATCTTTATCTCTCGCAATGTTCAGTTCAATGAACAAGAATTCCCATTCTCTAAAGTCATGTGGCTAAATCAGTTGATGAATGAACTT
GGCTGCCACTCTTCCTCAAAGCCGATCCTATGGTGTGACAACCTCAGCGCAGGTGCACTAGCTGCCAACCCAGTTTTTCACGCACGAACTAAACATATTGAGATCGACGT
TCACTTTGTTCGTGATCAAGTTCTCAAGGGTGCTCTGGAAGTCAGGTATGTTCCCACGGCAGATCAAATTGCAGACTGTTTAACTAAACCTTTGTCGCACTCTCAGTTTG
CCTACCTACGTTCCAAACTCGGTGTAGTTGAACTACCAGCTCGTTTGAGGAGGGATATTAAGGACAAGGATCATAAATGCCACGTCAAGGAAATTAATACAGTCTGCCAG
CCATGTCACAATACAAAGCAAGCAGATTCCAGAAACTTCTACTCAACGAATTCTGTTATTCCTCAATGA
Protein sequenceShow/hide protein sequence
MTNVTASGPGSTSNSTGFSSPPLNQLLNQITTIKLDRGADASSSTAPAGGASSSTTTMTVNPLYEAWITTDQLLLGWLYNSMTSEVATQVMGFEMQKIXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXWEAIQSLFGIQSRAEEDYLRQIFQQTRKGNTKMADYLRLMKSHSDNLGQAGSPVTPRNLISQNTQKTVVTFNHTTAVNVARNSRGSFN
SNPGNSKGWNFNNSQRGMNPKSGQRGGSNFNGGRGRGGRGRGYGQYNNSYGNYNNSNRSVCQVCGKQGHSALVCYHRWSLFGNSETVIDPNWYADSGASNHVTGDFNNLI
NPREYGGNEQVTIGNGDKDKGSDRVLLKGTLKDGLYQLESATNLPESSMHQSTTVTQKSINNSTSRVLDHVIKECNLPVKGNESVSFCEACQFGKAHNLPFSLSNNRAKK
PFDLVHSDLWGPAPIVSPDGFQFYVLFVDDHSRFTWLYPLKRKSDTLLAFQHFLSLVQTQFHSQIKAFQSDNGGEFVHNGRAERKHRHVIETGLTLLAQANMPLAYWWDA
FLVAIQIINGLPTPVLGGKSPVEILLNKKPDFSSFRVFGSSCYPCLRQYQQHKFMFHSENHVYLGFSPSHKGHKCLNASGRIFISRNVQFNEQEFPFSKVMWLNQLMNEL
GCHSSSKPILWCDNLSAGALAANPVFHARTKHIEIDVHFVRDQVLKGALEVRYVPTADQIADCLTKPLSHSQFAYLRSKLGVVELPARLRRDIKDKDHKCHVKEINTVCQ
PCHNTKQADSRNFYSTNSVIPQ