CuGenDBv2

CSPI07G08350 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G08350
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Chr7:6122940..6124956
RNA-Seq Expression: CSPI07G08350
Synteny: CSPI07G08350
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
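
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string might be split and the span length computed, the Python below assumes the coordinates are 1-based and inclusive (the page does not state this); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr7:6122940..6124956")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2017 bp on Chr7, which is longer than the 1668 nt CDS listed under Sequences.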


Homology
GenBank top hits | e value | % identity | Alignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum] | 9.4e-74 | 33.76
Query:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNS-SGESQSGIDGTTEAT-----GGASSSSTTSKNEQRTYGKPS-------Q
        N LP    VKLD  NY L ++L LP+++  KL+G++ G   CP +FIT+S S ++++      +A      G   +S TT    Q  + + S       Q
Subjt:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNS-SGESQSGIDGTTEAT-----GGASSSSTTSKNEQRTYGKPS-------Q

Query:  GLFGIQSR-------------------------------------------------VLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNS
         L G  +R                                                  L  LD  YNP  V +  ++ +SW+D+Q++LL FE R+E  N+
Subjt:  GLFGIQSR-------------------------------------------------VLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNS

Query:  QKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKPTCQVCGKYGHSALNP-TPFVTTYNT-------------NPF-ATPETVID
          N   N      N   H    SN N+ RG N+    G R  GRG+  K  CQVCG   H A++    F  TY+              N F A+  +V D
Subjt:  QKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKPTCQVCGKYGHSALNP-TPFVTTYNT-------------NPF-ATPETVID

Query:  SNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD------------VLEKSGYRKLQ
         +WY DSGA+NHVT       + +E+ G                          L D LY      +L SV+ +A D             ++     K+ 
Subjt:  SNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD------------VLEKSGYRKLQ

Query:  FKHINKNASTFVLSKKANDGASKTV---WYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTS
         K + K+    +   K N  A  +V   W+RRLGH   +VL+ + + C + +  + +F FCE+CQ GK H LPF  S S   +  +L+H+D+WGPAP  +
Subjt:  FKHINKNASTFVLSKKANDGASKTV---WYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTS

Query:  TDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLS
        +  ++YY+ FVDDFS++TWIYPLK KS  ++AF  F    + QF K IK  Q D GGE+  +  +  + GI  R+S PYTS QNG+AERKHRH+TE  L+
Subjt:  TDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLS

Query:  LLAQATLPLNFWWDAFITAKSLINGLPT
        LLAQA +PL++WW+AF TA  LIN LP+
Subjt:  LLAQATLPLNFWWDAFITAKSLINGLPT
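
In BLAST-style alignments like the one above, the middle line shows a letter where query and subject residues are identical and `+` where the substitution is conservative. The following Python is a rough sketch of counting both for a single block, using the final block of this alignment; `midline_stats` is a hypothetical helper. The % identity reported on the page covers the whole alignment, gaps included, so a single block will not reproduce it.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    A letter in the midline marks an identical residue; '+' marks a
    conservative substitution. The midline is padded in case trailing
    spaces were stripped during copy and paste.
    """
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {"length": len(query), "identities": identities, "positives": positives}


query = "LLAQATLPLNFWWDAFITAKSLINGLPT"
midline = "LLAQA +PL++WW+AF TA  LIN LP+"
stats = midline_stats(query, midline)
print(stats)
print(f"block identity: {100 * stats['identities'] / stats['length']:.1f}%")
```

Summing the per-block counts over every block of a hit, and dividing by the total number of alignment columns, would give a figure comparable to the % identity column.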

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | 1.9e-74 | 37.5
Query:  SRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNSQKNHANNDARQPGNSYPHNSNQSN--GNSQRGGNNFHNSGSRGPGRGRGNKPTCQ
        ++ L  LD  YNP  V +  K +++W++MQ++LL +E RLE  N+Q N   N +        +   +SN  G  + G  N    G RG GR   ++  CQ
Subjt:  SRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNSQKNHANNDARQPGNSYPHNSNQSN--GNSQRGGNNFHNSGSRGPGRGRGNKPTCQ

Query:  VCGKYGHSA-------------------LNPTPFVTTYNTNPF-ATPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------
        VC K GH+A                    +       YN N + A+P TV D +WY DSGA+NHVT D + +   +E  G                    
Subjt:  VCGKYGHSA-------------------LNPTPFVTTYNTNPF-ATPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------

Query:  ----------LRDGLY------HLESVAVIADDVLEKSGYRKLQFKHINKNASTFVLSKKANDG-------------------ASKTVWYRRLGHSTMQV
                  L+D LY      +L S++ +  D      +  +     +K     +L  K  DG                   + K  W+R+LGH   +V
Subjt:  ----------LRDGLY------HLESVAVIADDVLEKSGYRKLQFKHINKNASTFVLSKKANDG-------------------ASKTVWYRRLGHSTMQV

Query:  LNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYV
        LN + K+CN+  +   +F FCE+CQ GK HNLPF  S S   +  DL+HSD+WGPAP +S   ++YY+LF+DD+S++TWIYPLK KS   +AF  F   V
Subjt:  LNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYV

Query:  KTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
        + QF K IK  Q D GGEF  +  +  K GI  R S PYTSAQNG+AERKHRHV E+ L+LLAQA +PL++WW+AF TA  LIN LPT
Subjt:  KTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense] | 1.1e-74 | 34.12
Query:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFIT---------------------------NS--------------------SGE
        N LP    VKLD  NY L Q++ LPI++  +L+G++ G+  CP +FIT                           NS                      +
Subjt:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFIT---------------------------NS--------------------SGE

Query:  SQSGIDGTTEATGGASSSSTTSKNEQ----------------RTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN
        S +G    ++ T   S   +T K E                 +  G P      +  + L  LD  YNP  V +  ++ +SW+D+Q++LL FE R+E  N
Subjt:  SQSGIDGTTEATGGASSSSTTSKNEQ----------------RTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN

Query:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSR------GPGRGRGNKPTCQVCGKYGHSALNP-TPFVTTYNT-------------NPF-A
        S  N   N          H  N+ N N+   G+N +  GS       G GRGR  K TCQVCG   H A++    F  TY+              N F A
Subjt:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSR------GPGRGRGNKPTCQVCGKYGHSALNP-TPFVTTYNT-------------NPF-A

Query:  TPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD--VLEKSGYR------
        +  ++ D +WY DSGA+NHVT       N SE+ G                          L D LY      +L SV+ +A D  +L +          
Subjt:  TPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD--VLEKSGYR------

Query:  KLQFKHINKNA---STFVLSKKANDG--ASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGP
        KL  K I +       + LS+K +    + K  W+R+LGH   +VL+ + K CN+ ++ +  F FCE+CQ GK H LPF  S S   +  +L+H+D+WGP
Subjt:  KLQFKHINKNA---STFVLSKKANDG--ASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGP

Query:  APFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVT
        AP  S+  ++YY+ F+DDF+++TWIYPLK KS    AF  F   V+ QF+K IK  Q D GGE+  +     + GI  R+S PYTS QNG+AERKHRH+ 
Subjt:  APFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVT

Query:  ETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
        E  L+LLAQA +PLN+WW+AF TA  LIN LP+
Subjt:  ETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

PNX78574.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] | 1.8e-69 | 33.12
Query:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITN--SSGESQSGIDGTTEA-----TGGASSSSTTSKNEQRTYGKPSQGLF---
        N LP    V LD  N+ L ++L LPI++  +L+G++ G   CP +FIT+  +SG+  +   G  +A      G   ++ TT    Q  + + S+ L+   
Subjt:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITN--SSGESQSGIDGTTEA-----TGGASSSSTTSKNEQRTYGKPSQGLF---

Query:  ------GIQSRVL-LR----------------------------------------------LDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN
                +SRV+ LR                                              LD  YNP  V +  + N+SW+D+Q++LL FE RL+  N
Subjt:  ------GIQSRVL-LR----------------------------------------------LDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN

Query:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKPTCQVCGKYGHSAL------NPTPFVTTYNT---------NPF-ATPETV
        S  N   N      N      N  N      G++F N+   G G+GR +   CQVC K+GH+A+      + +   ++Y+          N F A+    
Subjt:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKPTCQVCGKYGHSAL------NPTPFVTTYNT---------NPF-ATPETV

Query:  IDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD---VLEKSG-----YRKLQFK
         D  WY DSGA+NHVT         +E SG                          L D LY      +L SV+ +  D   ++E          KL  K
Subjt:  IDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD---VLEKSG-----YRKLQFK

Query:  HINKNASTFVLSKKANDGAS-----------KTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWG
         + +      L + +N  +            K  W+R+LGH +  VL+ + K+CN+  + +  F FCE+CQLGK+H LPF  S S   +  +LIH+D+WG
Subjt:  HINKNASTFVLSKKANDGAS-----------KTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWG

Query:  PAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHV
        PAP  S   ++YY+ F+DD S++TWIYPLK KS  + AF  F   V+ QF K IK  Q D GGEF  +  +  + GI  R+S PYTS QNG+AERKHRHV
Subjt:  PAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHV

Query:  TETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
         E  L+LLAQA + L++WW+AF TA  LIN LP+
Subjt:  TETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense] | 5.3e-69 | 32.03
Query:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNSSG-------------------------------------ESQSGIDGTTE
        N LP    VKLD  N+ L ++L LP+++  K +G++ G   CP +F+T+                                        E+   +    +
Subjt:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNSSG-------------------------------------ESQSGIDGTTE

Query:  ATGGASSSS----------TTSKNEQ----------------RTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN
        +  GA + S           T K E                 +  G P      +  + L  LD  YNP  V +  ++NISW+D Q++LL FE RL+  N
Subjt:  ATGGASSSS----------TTSKNEQ----------------RTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN

Query:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGN-----KPTCQVCGKYGHSALN-PTPFVTTYNTNP------------FATPE
           N  N +     N    N +  N    RGG    NS     GRGR       +P CQ+CGK+GH+A      F  +Y                 A+P 
Subjt:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGN-----KPTCQVCGKYGHSALN-PTPFVTTYNTNP------------FATPE

Query:  TVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADDVLEKSGYRKLQFKHINKNA
           D  WY DSGA+NHVT     L + +E +G                          LR+ LY      +L SV+ +  D      + +      +K  
Subjt:  TVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADDVLEKSGYRKLQFKHINKNA

Query:  STFVLSKKANDG--------------------ASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSD
           +L  +  DG                    + K +W+R+LGH   +VL  + K  N+ I+ +  F FCE+CQ GK H LPF  S S   +  DLIH+D
Subjt:  STFVLSKKANDG--------------------ASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSD

Query:  LWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKH
        +WGPAP  S  +++YY+ F+DDFS++TWI+PLK KS  + AF  F   V+ QF K IK  + D GGE+  +       GI  ++S PYTS QNG+AERKH
Subjt:  LWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKH

Query:  RHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
        RHVTE  L+LLAQA +PL++WW+AF TA  LIN LP+
Subjt:  RHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

TrEMBL top hits | e value | % identity | Alignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 9.2e-75 | 37.5
Query:  SRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNSQKNHANNDARQPGNSYPHNSNQSN--GNSQRGGNNFHNSGSRGPGRGRGNKPTCQ
        ++ L  LD  YNP  V +  K +++W++MQ++LL +E RLE  N+Q N   N +        +   +SN  G  + G  N    G RG GR   ++  CQ
Subjt:  SRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNSQKNHANNDARQPGNSYPHNSNQSN--GNSQRGGNNFHNSGSRGPGRGRGNKPTCQ

Query:  VCGKYGHSA-------------------LNPTPFVTTYNTNPF-ATPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------
        VC K GH+A                    +       YN N + A+P TV D +WY DSGA+NHVT D + +   +E  G                    
Subjt:  VCGKYGHSA-------------------LNPTPFVTTYNTNPF-ATPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------

Query:  ----------LRDGLY------HLESVAVIADDVLEKSGYRKLQFKHINKNASTFVLSKKANDG-------------------ASKTVWYRRLGHSTMQV
                  L+D LY      +L S++ +  D      +  +     +K     +L  K  DG                   + K  W+R+LGH   +V
Subjt:  ----------LRDGLY------HLESVAVIADDVLEKSGYRKLQFKHINKNASTFVLSKKANDG-------------------ASKTVWYRRLGHSTMQV

Query:  LNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYV
        LN + K+CN+  +   +F FCE+CQ GK HNLPF  S S   +  DL+HSD+WGPAP +S   ++YY+LF+DD+S++TWIYPLK KS   +AF  F   V
Subjt:  LNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYV

Query:  KTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
        + QF K IK  Q D GGEF  +  +  K GI  R S PYTSAQNG+AERKHRHV E+ L+LLAQA +PL++WW+AF TA  LIN LPT
Subjt:  KTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) | 5.4e-75 | 34.12
Query:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFIT---------------------------NS--------------------SGE
        N LP    VKLD  NY L Q++ LPI++  +L+G++ G+  CP +FIT                           NS                      +
Subjt:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFIT---------------------------NS--------------------SGE

Query:  SQSGIDGTTEATGGASSSSTTSKNEQ----------------RTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN
        S +G    ++ T   S   +T K E                 +  G P      +  + L  LD  YNP  V +  ++ +SW+D+Q++LL FE R+E  N
Subjt:  SQSGIDGTTEATGGASSSSTTSKNEQ----------------RTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN

Query:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSR------GPGRGRGNKPTCQVCGKYGHSALNP-TPFVTTYNT-------------NPF-A
        S  N   N          H  N+ N N+   G+N +  GS       G GRGR  K TCQVCG   H A++    F  TY+              N F A
Subjt:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSR------GPGRGRGNKPTCQVCGKYGHSALNP-TPFVTTYNT-------------NPF-A

Query:  TPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD--VLEKSGYR------
        +  ++ D +WY DSGA+NHVT       N SE+ G                          L D LY      +L SV+ +A D  +L +          
Subjt:  TPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD--VLEKSGYR------

Query:  KLQFKHINKNA---STFVLSKKANDG--ASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGP
        KL  K I +       + LS+K +    + K  W+R+LGH   +VL+ + K CN+ ++ +  F FCE+CQ GK H LPF  S S   +  +L+H+D+WGP
Subjt:  KLQFKHINKNA---STFVLSKKANDG--ASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGP

Query:  APFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVT
        AP  S+  ++YY+ F+DDF+++TWIYPLK KS    AF  F   V+ QF+K IK  Q D GGE+  +     + GI  R+S PYTS QNG+AERKHRH+ 
Subjt:  APFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVT

Query:  ETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
        E  L+LLAQA +PLN+WW+AF TA  LIN LP+
Subjt:  ETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

A0A2K3LJ49 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 8.9e-70 | 33.12
Query:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITN--SSGESQSGIDGTTEA-----TGGASSSSTTSKNEQRTYGKPSQGLF---
        N LP    V LD  N+ L ++L LPI++  +L+G++ G   CP +FIT+  +SG+  +   G  +A      G   ++ TT    Q  + + S+ L+   
Subjt:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITN--SSGESQSGIDGTTEA-----TGGASSSSTTSKNEQRTYGKPSQGLF---

Query:  ------GIQSRVL-LR----------------------------------------------LDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN
                +SRV+ LR                                              LD  YNP  V +  + N+SW+D+Q++LL FE RL+  N
Subjt:  ------GIQSRVL-LR----------------------------------------------LDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN

Query:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKPTCQVCGKYGHSAL------NPTPFVTTYNT---------NPF-ATPETV
        S  N   N      N      N  N      G++F N+   G G+GR +   CQVC K+GH+A+      + +   ++Y+          N F A+    
Subjt:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKPTCQVCGKYGHSAL------NPTPFVTTYNT---------NPF-ATPETV

Query:  IDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD---VLEKSG-----YRKLQFK
         D  WY DSGA+NHVT         +E SG                          L D LY      +L SV+ +  D   ++E          KL  K
Subjt:  IDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD---VLEKSG-----YRKLQFK

Query:  HINKNASTFVLSKKANDGAS-----------KTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWG
         + +      L + +N  +            K  W+R+LGH +  VL+ + K+CN+  + +  F FCE+CQLGK+H LPF  S S   +  +LIH+D+WG
Subjt:  HINKNASTFVLSKKANDGAS-----------KTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWG

Query:  PAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHV
        PAP  S   ++YY+ F+DD S++TWIYPLK KS  + AF  F   V+ QF K IK  Q D GGEF  +  +  + GI  R+S PYTS QNG+AERKHRHV
Subjt:  PAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHV

Query:  TETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
         E  L+LLAQA + L++WW+AF TA  LIN LP+
Subjt:  TETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment) | 2.6e-69 | 32.03
Query:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNSSG-------------------------------------ESQSGIDGTTE
        N LP    VKLD  N+ L ++L LP+++  K +G++ G   CP +F+T+                                        E+   +    +
Subjt:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNSSG-------------------------------------ESQSGIDGTTE

Query:  ATGGASSSS----------TTSKNEQ----------------RTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN
        +  GA + S           T K E                 +  G P      +  + L  LD  YNP  V +  ++NISW+D Q++LL FE RL+  N
Subjt:  ATGGASSSS----------TTSKNEQ----------------RTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQN

Query:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGN-----KPTCQVCGKYGHSALN-PTPFVTTYNTNP------------FATPE
           N  N +     N    N +  N    RGG    NS     GRGR       +P CQ+CGK+GH+A      F  +Y                 A+P 
Subjt:  SQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGN-----KPTCQVCGKYGHSALN-PTPFVTTYNTNP------------FATPE

Query:  TVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADDVLEKSGYRKLQFKHINKNA
           D  WY DSGA+NHVT     L + +E +G                          LR+ LY      +L SV+ +  D      + +      +K  
Subjt:  TVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADDVLEKSGYRKLQFKHINKNA

Query:  STFVLSKKANDG--------------------ASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSD
           +L  +  DG                    + K +W+R+LGH   +VL  + K  N+ I+ +  F FCE+CQ GK H LPF  S S   +  DLIH+D
Subjt:  STFVLSKKANDG--------------------ASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSD

Query:  LWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKH
        +WGPAP  S  +++YY+ F+DDFS++TWI+PLK KS  + AF  F   V+ QF K IK  + D GGE+  +       GI  ++S PYTS QNG+AERKH
Subjt:  LWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKH

Query:  RHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
        RHVTE  L+LLAQA +PL++WW+AF TA  LIN LP+
Subjt:  RHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

A0A2Z6MBG6 Integrase catalytic domain-containing protein | 4.6e-74 | 33.76
Query:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNS-SGESQSGIDGTTEAT-----GGASSSSTTSKNEQRTYGKPS-------Q
        N LP    VKLD  NY L ++L LP+++  KL+G++ G   CP +FIT+S S ++++      +A      G   +S TT    Q  + + S       Q
Subjt:  NQLP---YVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNS-SGESQSGIDGTTEAT-----GGASSSSTTSKNEQRTYGKPS-------Q

Query:  GLFGIQSR-------------------------------------------------VLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNS
         L G  +R                                                  L  LD  YNP  V +  ++ +SW+D+Q++LL FE R+E  N+
Subjt:  GLFGIQSR-------------------------------------------------VLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNS

Query:  QKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKPTCQVCGKYGHSALNP-TPFVTTYNT-------------NPF-ATPETVID
          N   N      N   H    SN N+ RG N+    G R  GRG+  K  CQVCG   H A++    F  TY+              N F A+  +V D
Subjt:  QKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKPTCQVCGKYGHSALNP-TPFVTTYNT-------------NPF-ATPETVID

Query:  SNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD------------VLEKSGYRKLQ
         +WY DSGA+NHVT       + +E+ G                          L D LY      +L SV+ +A D             ++     K+ 
Subjt:  SNWYIDSGATNHVTVDYSNLSNPSEYSGL-------------------------LRDGLY------HLESVAVIADD------------VLEKSGYRKLQ

Query:  FKHINKNASTFVLSKKANDGASKTV---WYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTS
         K + K+    +   K N  A  +V   W+RRLGH   +VL+ + + C + +  + +F FCE+CQ GK H LPF  S S   +  +L+H+D+WGPAP  +
Subjt:  FKHINKNASTFVLSKKANDGASKTV---WYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTS

Query:  TDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLS
        +  ++YY+ FVDDFS++TWIYPLK KS  ++AF  F    + QF K IK  Q D GGE+  +  +  + GI  R+S PYTS QNG+AERKHRH+TE  L+
Subjt:  TDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLS

Query:  LLAQATLPLNFWWDAFITAKSLINGLPT
        LLAQA +PL++WW+AF TA  LIN LP+
Subjt:  LLAQATLPLNFWWDAFITAKSLINGLPT

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 1.6e-23 | 30
Query:  VWYRRLGH---------------STMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKP--AQSFDLIHSDLWGPAPFTSTDDYRYYILF
        +W+ R GH               S   +LN +   C +          CE C  GK   LPF   + K    +   ++HSD+ GP    + DD  Y+++F
Subjt:  VWYRRLGH---------------STMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKP--AQSFDLIHSDLWGPAPFTSTDDYRYYILF

Query:  VDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFI--QIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLP
        VD F+ Y   Y +K+KS     F+ F+   +  F   +     DNG E++  ++   C K GI   L+ P+T   NG +ER  R +TE + ++++ A L 
Subjt:  VDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFI--QIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLP

Query:  LNFWWDAFITAKSLINGLPT
         +FW +A +TA  LIN +P+
Subjt:  LNFWWDAFITAKSLINGLPT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 4.3e-29 | 25.31
Query:  NPATVVIQGKSNISWLDMQSELLIFEKRLEHQNSQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNK--PTCQVCGKYGH---S
        N AT ++ GK+ I   D+ S LL+ EK  +   +Q      + R  G SY           QR  NN+  SG+RG  + R       C  C + GH    
Subjt:  NPATVVIQGKSNISWLDMQSELLIFEKRLEHQNSQKNHANNDARQPGNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNK--PTCQVCGKYGH---S

Query:  ALNP---------------TPFVTTYNTNP--FATPETVI------DSNWYIDSGATNHVT---------------------VDYSNLSNPSEY------
          NP               T  +   N N   F   E         +S W +D+ A++H T                       YS ++   +       
Subjt:  ALNP---------------TPFVTTYNTNP--FATPETVI------DSNWYIDSGATNHVT---------------------VDYSNLSNPSEY------

Query:  -SGLLRDGLYHLESVA--VIADDVLEKSGYRK-LQFKHINKNASTFVLSK--------------------KANDGASKTVWYRRLGHSTMQVLNYIAKVC
           L+   + H+  +   +I+   L++ GY      +       + V++K                     A D  S  +W++R+GH + + L  +AK  
Subjt:  -SGLLRDGLYHLESVA--VIADDVLEKSGYRK-LQFKHINKNASTFVLSK--------------------KANDGASKTVWYRRLGHSTMQVLNYIAKVC

Query:  NLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTI
         +          C+ C  GK H + F  S  +     DL++SD+ GP    S    +Y++ F+DD S+  W+Y LK K    + F+ F   V+ +  + +
Subjt:  NLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTI

Query:  KAFQFDNGGEFI--QIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT
        K  + DNGGE+   +    C+  GI    + P T   NG AER +R + E   S+L  A LP +FW +A  TA  LIN  P+
Subjt:  KAFQFDNGGEFI--QIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLINGLPT

Q07163 Transposon TyH3 Gag-Pol polyprotein | 7.9e-15 | 24.53
Query:  YRRLGHSTMQVLNYIAKVCNLPINGNGDFMF-------CESCQLGKT----HNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYT
        +R L H+  Q + Y  K   +      D  +       C  C +GK+    H     +      + F  +H+D++GP          Y+I F D+ +K+ 
Subjt:  YRRLGHSTMQVLNYIAKVCNLPINGNGDFMF-------CESCQLGKT----HNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYT

Query:  WIYPL--KHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQ--IHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWD
        W+YPL  + +   L+ F   + ++K QF  ++   Q D G E+    +H    K GI    +    S  +G AER +R + +   + L  + LP + W+ 
Subjt:  WIYPL--KHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQ--IHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWD

Query:  AFITAKSLINGL
        A   +  + N L
Subjt:  AFITAKSLINGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.6e-47 | 29.82
Query:  RVLLRLDEVYNPATVVIQGKSNISWL-DMQSELLIFEKRL----------------EHQNSQKNHANNDARQPGNSYPHNSNQSNGNS-QRGGNNFH-NS
        RVL  L E Y P    I  K     L ++   LL  E ++                 H+N+   + NN+  +  N Y + +N +N    Q+   NFH N+
Subjt:  RVLLRLDEVYNPATVVIQGKSNISWL-DMQSELLIFEKRL----------------EHQNSQKNHANNDARQPGNSYPHNSNQSNGNS-QRGGNNFH-NS

Query:  GSRGPGRGRGNKPTCQVCGKYGHSALN----------------PTPFVTTYNTNPFATPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSG----LLRD
            P  G+     CQ+CG  GHSA                  P+PF         A       +NW +DSGAT+H+T D++NLS    Y+G    ++ D
Subjt:  GSRGPGRGRGNKPTCQVCGKYGHSALN----------------PTPFVTTYNTNPFATPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSG----LLRD

Query:  G-------------------------LY----HLESVAVI----ADDVLEKSGYRKLQFKHIN-------------------KNASTFVLSKKANDGASK
        G                         LY    H   ++V     A+ V  +      Q K +N                    ++    L    +  A+ 
Subjt:  G-------------------------LY----HLESVAVI----ADDVLEKSGYRKLQFKHIN-------------------KNASTFVLSKKANDGASK

Query:  TVWYRRLGHSTMQVLNYIAKVCNLPI-NGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKH
        + W+ RLGH    +LN +    +L + N +  F+ C  C + K++ +PF+ S     +  + I+SD+W  +P  S D+YRYY++FVD F++YTW+YPLK 
Subjt:  TVWYRRLGHSTMQVLNYIAKVCNLPI-NGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDDFSKYTWIYPLKH

Query:  KSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLING
        KS   E F  F   ++ +F   I  F  DNGGEF+ +    ++ GI    S P+T   NG +ERKHRH+ ET L+LL+ A++P  +W  AF  A  LIN 
Subjt:  KSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLING

Query:  LPT
        LPT
Subjt:  LPT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.5e-45 | 28.68
Query:  LNQLPYVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNSSGES---------------QSGIDGTTE-----ATGGASSSSTTSKNEQRTY
        +N     KL   NYL+       +   Y+L G L G  P PP  I   +                   S I G        A   A++++   +  ++ Y
Subjt:  LNQLPYVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNSSGES---------------QSGIDGTTE-----ATGGASSSSTTSKNEQRTY

Query:  GKPSQG---------------LFGIQ-------SRVLLRLDEVYNPATVVIQGKSN-ISWLDMQSELLIFEKRLEHQNSQK---------NHANNDARQP
          PS G               L G          RVL  L + Y P    I  K    S  ++   L+  E +L   NS +          H N +  + 
Subjt:  GKPSQG---------------LFGIQ-------SRVLLRLDEVYNPATVVIQGKSN-ISWLDMQSELLIFEKRLEHQNSQK---------NHANNDARQP

Query:  GNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKP---TCQVCGKYGHSALNPT---PFVTTYN----TNPF---------ATPETVIDSNWYIDSG
         N+   N N +N N++       +SGSR     R  KP    CQ+C   GHSA        F +T N    T+PF         A       +NW +DSG
Subjt:  GNSYPHNSNQSNGNSQRGGNNFHNSGSRGPGRGRGNKP---TCQVCGKYGHSALNPT---PFVTTYN----TNPF---------ATPETVIDSNWYIDSG

Query:  ATNHVTVDYSNLSNPSEYSG----LLRDG----LYHLESVA------------VIADDVLEK---SGYR--------------KLQFKHINKN-------
        AT+H+T D++NLS    Y+G    ++ DG    + H  S +            V+    + K   S YR                Q K +N         
Subjt:  ATNHVTVDYSNLSNPSEYSG----LLRDG----LYHLESVA------------VIADDVLEK---SGYR--------------KLQFKHINKN-------

Query:  ----------ASTFVLSKKAN--DGASKTVWYRRLGHSTMQVLNYIAKVCNLPI-NGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAP
                  AS+  +S  A+    A+ + W+ RLGH ++ +LN +    +LP+ N +   + C  C + K+H +PF+ S    ++  + I+SD+W  +P
Subjt:  ----------ASTFVLSKKAN--DGASKTVWYRRLGHSTMQVLNYIAKVCNLPI-NGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAP

Query:  FTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTET
          S D+YRYY++FVD F++YTW+YPLK KS   + F  F + V+ +F   I     DNGGEF+ +    ++ GI    S P+T   NG +ERKHRH+ E 
Subjt:  FTSTDDYRYYILFVDDFSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTET

Query:  SLSLLAQATLPLNFWWDAFITAKSLINGLPT
         L+LL+ A++P  +W  AF  A  LIN LPT
Subjt:  SLSLLAQATLPLNFWWDAFITAKSLINGLPT

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGAGTACCGTTATTGAATGAAATAGAGAAAATTTCTCCCAAACTTGAAAACCCACAATTTCGTTCTTGGTATCAGAGCTTTGAAATGGCCAACGCTGTCTCCCTCGT
CAGATCTTCAACCACTAACTTCAGTAATCCTCCTTTAAACCAATTACTAAATCAACTACCCTATGTAAAACTTGATCCTGGAAATTATTTGTTGTGCCAAACTCTAGCCC
TTCCCATTCTGAAGAGTTACAAACTCGAAGGTCATTTGACAGGAGAAAATCCTTGTCCTCCAAAATTCATCACAAATTCATCCGGTGAATCACAATCTGGAATCGATGGC
ACTACTGAAGCCACTGGTGGAGCATCCTCAAGTTCCACTACATCTAAAAACGAGCAAAGGACGTATGGGAAGCCTTCCCAAGGTTTATTTGGAATACAATCAAGGGTCTT
ATTGAGGTTAGATGAAGTTTACAACCCCGCAACTGTTGTTATTCAAGGAAAATCCAACATCTCTTGGCTGGATATGCAGTCGGAATTACTAATATTTGAAAAACGCCTTG
AGCATCAGAACTCACAAAAAAATCATGCCAATAATGATGCTCGACAACCTGGAAACTCATATCCACACAATAGTAATCAATCAAATGGAAATAGTCAGCGTGGAGGTAAC
AACTTCCATAATAGTGGTAGTCGTGGTCCTGGCCGAGGAAGAGGAAACAAACCAACGTGCCAAGTATGTGGCAAATATGGGCACTCTGCACTGAATCCCACTCCCTTTGT
CACCACTTATAATACTAATCCCTTTGCAACACCAGAGACGGTTATTGATTCTAACTGGTATATTGATAGTGGGGCAACAAACCATGTAACAGTGGACTACTCAAACTTGT
CAAACCCTTCTGAATACTCAGGTCTTCTTAGAGATGGGCTCTATCACCTCGAGAGCGTAGCTGTGATAGCTGATGATGTTTTGGAGAAAAGTGGCTACAGGAAACTACAG
TTTAAGCACATAAATAAGAATGCATCGACATTTGTTTTATCTAAAAAGGCTAATGATGGTGCATCTAAAACTGTTTGGTATAGGCGTTTGGGTCATTCCACAATGCAAGT
TTTGAATTATATAGCTAAGGTTTGTAATTTGCCTATTAATGGCAATGGAGATTTTATGTTTTGTGAGTCATGCCAATTGGGTAAAACACACAATCTTCCCTTCAATATTT
CTCGAAGTAAACCAGCTCAATCTTTTGATCTCATTCACTCTGACTTATGGGGACCGGCTCCCTTTACATCAACCGATGACTATCGATATTACATTCTTTTTGTTGATGAC
TTTAGCAAGTATACTTGGATTTATCCACTCAAACATAAAAGTGTAGCCCTTGAAGCATTCAAACATTTTATTACCTATGTGAAAACACAGTTCACTAAAACAATCAAAGC
CTTTCAATTTGATAATGGTGGCGAATTCATCCAAATTCATCACATGTGCAATAAAATGGGAATTGTCTCTAGACTCTCTCGTCCCTATACATCTGCCCAAAATGGTCAAG
CTGAAAGGAAACATCGACATGTTACAGAAACAAGTCTCTCTCTATTGGCTCAAGCTACATTGCCTCTTAATTTTTGGTGGGATGCTTTTATAACAGCAAAATCATTGATC
AATGGATTACCTACATAG
mRNA sequence
ATGAGAGTACCGTTATTGAATGAAATAGAGAAAATTTCTCCCAAACTTGAAAACCCACAATTTCGTTCTTGGTATCAGAGCTTTGAAATGGCCAACGCTGTCTCCCTCGT
CAGATCTTCAACCACTAACTTCAGTAATCCTCCTTTAAACCAATTACTAAATCAACTACCCTATGTAAAACTTGATCCTGGAAATTATTTGTTGTGCCAAACTCTAGCCC
TTCCCATTCTGAAGAGTTACAAACTCGAAGGTCATTTGACAGGAGAAAATCCTTGTCCTCCAAAATTCATCACAAATTCATCCGGTGAATCACAATCTGGAATCGATGGC
ACTACTGAAGCCACTGGTGGAGCATCCTCAAGTTCCACTACATCTAAAAACGAGCAAAGGACGTATGGGAAGCCTTCCCAAGGTTTATTTGGAATACAATCAAGGGTCTT
ATTGAGGTTAGATGAAGTTTACAACCCCGCAACTGTTGTTATTCAAGGAAAATCCAACATCTCTTGGCTGGATATGCAGTCGGAATTACTAATATTTGAAAAACGCCTTG
AGCATCAGAACTCACAAAAAAATCATGCCAATAATGATGCTCGACAACCTGGAAACTCATATCCACACAATAGTAATCAATCAAATGGAAATAGTCAGCGTGGAGGTAAC
AACTTCCATAATAGTGGTAGTCGTGGTCCTGGCCGAGGAAGAGGAAACAAACCAACGTGCCAAGTATGTGGCAAATATGGGCACTCTGCACTGAATCCCACTCCCTTTGT
CACCACTTATAATACTAATCCCTTTGCAACACCAGAGACGGTTATTGATTCTAACTGGTATATTGATAGTGGGGCAACAAACCATGTAACAGTGGACTACTCAAACTTGT
CAAACCCTTCTGAATACTCAGGTCTTCTTAGAGATGGGCTCTATCACCTCGAGAGCGTAGCTGTGATAGCTGATGATGTTTTGGAGAAAAGTGGCTACAGGAAACTACAG
TTTAAGCACATAAATAAGAATGCATCGACATTTGTTTTATCTAAAAAGGCTAATGATGGTGCATCTAAAACTGTTTGGTATAGGCGTTTGGGTCATTCCACAATGCAAGT
TTTGAATTATATAGCTAAGGTTTGTAATTTGCCTATTAATGGCAATGGAGATTTTATGTTTTGTGAGTCATGCCAATTGGGTAAAACACACAATCTTCCCTTCAATATTT
CTCGAAGTAAACCAGCTCAATCTTTTGATCTCATTCACTCTGACTTATGGGGACCGGCTCCCTTTACATCAACCGATGACTATCGATATTACATTCTTTTTGTTGATGAC
TTTAGCAAGTATACTTGGATTTATCCACTCAAACATAAAAGTGTAGCCCTTGAAGCATTCAAACATTTTATTACCTATGTGAAAACACAGTTCACTAAAACAATCAAAGC
CTTTCAATTTGATAATGGTGGCGAATTCATCCAAATTCATCACATGTGCAATAAAATGGGAATTGTCTCTAGACTCTCTCGTCCCTATACATCTGCCCAAAATGGTCAAG
CTGAAAGGAAACATCGACATGTTACAGAAACAAGTCTCTCTCTATTGGCTCAAGCTACATTGCCTCTTAATTTTTGGTGGGATGCTTTTATAACAGCAAAATCATTGATC
AATGGATTACCTACATAG
Protein sequence
MRVPLLNEIEKISPKLENPQFRSWYQSFEMANAVSLVRSSTTNFSNPPLNQLLNQLPYVKLDPGNYLLCQTLALPILKSYKLEGHLTGENPCPPKFITNSSGESQSGIDG
TTEATGGASSSSTTSKNEQRTYGKPSQGLFGIQSRVLLRLDEVYNPATVVIQGKSNISWLDMQSELLIFEKRLEHQNSQKNHANNDARQPGNSYPHNSNQSNGNSQRGGN
NFHNSGSRGPGRGRGNKPTCQVCGKYGHSALNPTPFVTTYNTNPFATPETVIDSNWYIDSGATNHVTVDYSNLSNPSEYSGLLRDGLYHLESVAVIADDVLEKSGYRKLQ
FKHINKNASTFVLSKKANDGASKTVWYRRLGHSTMQVLNYIAKVCNLPINGNGDFMFCESCQLGKTHNLPFNISRSKPAQSFDLIHSDLWGPAPFTSTDDYRYYILFVDD
FSKYTWIYPLKHKSVALEAFKHFITYVKTQFTKTIKAFQFDNGGEFIQIHHMCNKMGIVSRLSRPYTSAQNGQAERKHRHVTETSLSLLAQATLPLNFWWDAFITAKSLI
NGLPT
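
The protein sequence above should correspond to a translation of the CDS. As a hedged sketch of how that relationship could be checked, the Python below translates nucleotide strings with the standard genetic code (assumed here; the page does not name a translation table) and applies it to the first 30 and last 18 nucleotides of the CDS. `translate` and `CODON_TABLE` are illustrative names only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGAGAGTACCGTTATTGAATGAAATAGAG"))  # first 30 nt of the CDS
print(translate("AATGGATTACCTACATAG"))              # last 18 nt of the CDS
```

Pasting the full CDS into `translate` and stripping the trailing `*` should give a string that can be compared directly against the protein sequence.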