CuGenDBv2

Tan0002170 (gene) of Snake gourd v1 genome

Gene ID: Tan0002170
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG06:40301363..40303713
RNA-Seq Expression: Tan0002170
Synteny: Tan0002170
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
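
The genome location in this record is written as `seqid:start..end`. The following is a minimal sketch of how such a string could be split into its parts. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the `Location` type and `parse_location` helper are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG06:40301363..40303713")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for this gene covers 2351 positions on LG06.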


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] (e value: 7.7e-145; % identity: 42.61)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT
        M++S + LL  E+L G+ ++ WKSNL+ ILVVD+LRFVLTEECPQ PA +A ++V++A+ RW KAN+KA  YILA  +DVLA++   + +A+ IM  L+ 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNG------------------------RNAELNKKEYNLTSLLNELQSFESLIKNKR
        +  QPS  ++HE+IK++Y  RMKEGTSVREHVLD+++HFN+AE+NG                         NA LNK E+NLT+LLNELQ F++L  +K 
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNG------------------------RNAELNKKEYNLTSLLNELQSFESLIKNKR

Query:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTRKKRKGGKGKGSATADIE------------------------LKEKK------------------------
          + E N+    ++F +GSSS  K   S  + +K GKGK   T+ ++                        L EKK                        
Subjt:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTRKKRKGGKGKGSATADIE------------------------LKEKK------------------------

Query:  ------GATNHVCSSFQETSSFKELEE-------------------------------------------------------------------------
              GATNH+C SFQETSS+K+L+E                                                                         
Subjt:  ------GATNHVCSSFQETSSFKELEE-------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------VNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTP
                    +N K RGGY+YFISFI D+SRYG++YL+HHKSE  EKFKEYKAEVEN +GKTIKTL+SDRGGEYMD +FQDY+IE GI+SQLSAP+TP
Subjt:  ------------VNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTP

Query:  QQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKE
        QQNGVSERRNRTLLDMVRSMMSYAQLP SFWGYA+ETA+ ILN+VPSKSV ETP+ELWKGRK SL++FRIWGCP HVLV NPKKLEPRS+LC FVGY KE
Subjt:  QQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKE

Query:  TRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNE----ATHEP---TRVVNQAGPSSR
        +RGGLFY PQENKV VSTN TFL+EDH RNH+ RSK+VL E    AT +P   T+VV++A  S +
Subjt:  TRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNE----ATHEP---TRVVNQAGPSSR

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 4.3e-156; % identity: 54.34)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT
        +SS+ + +L  ++L G  + +WK+ ++ +L++D+LRFVL E+CPQ+PA +A ++V++ + RWAKANEKA AYILA  S+VLA++ + M++AREIM  LQ 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMN------------------------GRNAELNKKEYNLTSLLNELQSFESLIKNKR
        +  Q S QI+H+++KY+YNARM EG SVREHVL+++VHFNVA MN                          NA +NK  Y LT+LLNELQ+FESL+K K 
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMN------------------------GRNAELNKKEYNLTSLLNELQSFESLIKNKR

Query:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKE-------------------------
           GE N+   +++F +GS+SGTK   S+      KK+KGG+G  +  A  +  +K  A   +C    +   +K                          
Subjt:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKE-------------------------

Query:  -------LEEV--------NAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL
               LE V        N K RGG++YFI+F  DYSRYGY+YLM HKSE LEKFKEYKAEVENAL KTIKT +SDRGGEYMDL+FQ+Y++E GI SQL
Subjt:  -------LEEV--------NAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL

Query:  SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQF
        SAP+TPQQNGVSERRNRTLLDMVRSMMSYA LP SFWGYAV+TAV ILN VPSKSVSETP +LW G K SL+HFRIWGCP HVL  NPKKLEPRS+LC F
Subjt:  SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQF

Query:  VGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGPSSRV
        VGY K TRGG FYDP++NKV VSTN TFL+EDH+R HK RSK+VLNE + E T       PS+RV
Subjt:  VGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGPSSRV

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.0e-157; % identity: 54.13)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT
        M+S+ + +L  ++L G  + +WK+ ++ +L++D+LRFVL EECPQ+PA +A ++V++ + RWAKANEKA AYILA  S+VLA++ + M++AREIM  LQ 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNG------------------------RNAELNKKEYNLTSLLNELQSFESLIKNKR
        +  Q S QI+H+++KY+YNARM EG SVREHVL+++VHFNVAEMNG                         NA +NK  Y LT+LLNELQ+FESL+K K 
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNG------------------------RNAELNKKEYNLTSLLNELQSFESLIKNKR

Query:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKE-------------------------
           GE N+   +++F +GS+SGTK   S+      KK+KGG+G  +  A  +  +K  A   +C    +   +K                          
Subjt:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKE-------------------------

Query:  -------LEEV--------NAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL
               LE V        N K RGG++YFI+F  DYSRYGY+YLM HKSE LEKFKEYKAEVENAL KTIKT +SDRGGEYMDL+FQ+Y++E GI SQL
Subjt:  -------LEEV--------NAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL

Query:  SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQF
        S P TPQQNGVS+RRNRTLLDMVRSMMSY  LP SFWGYAV+TAV ILN VPSKSVS+TP +LW GRK SL+HFRIWGCP HVL  NPKKLEPRS+LC F
Subjt:  SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQF

Query:  VGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLN----EATHEPTRVVNQAGPSSRV
        VGY K TRGG FYDP++NKV VSTN TFL+EDH+R HK RSK+VLN    E T   TRVV +    +RV
Subjt:  VGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLN----EATHEPTRVVNQAGPSSRV

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 9.1e-146; % identity: 50.17)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT
        M+++ + +L  ++L G  + +WK+ ++ +L++D+L+FVL EECPQ+PA +A Q+V++ + RWAK NEK  AYILA  S+VLA++ + M++AREIM  LQ 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTK
        +  Q S QI H+++KY+YNARM EG SVREHVL+++VHFNVAEMNG             ++++E    +          GE N+   +++F +GS+SGTK
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTK

Query:  PCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFK---------------------------------ELEE-------------
           S+      KK+KGG+G  +  A  +  +K  AT  +C  + +   +K                                 ELEE             
Subjt:  PCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFK---------------------------------ELEE-------------

Query:  ---------------------------VNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYM
                                   +N K RG ++YFI+F  DYSRYGY+YLM HKSE LEKFKEYKAEVENAL KTIKT +SDRGGEYMDL+FQ+Y+
Subjt:  ---------------------------VNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYM

Query:  IEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKL
        +E  I SQLSAP TPQQNGVSERRNRTLLDMVRSM+SYA LP SFWGYAV+TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGCP HVL  NPKKL
Subjt:  IEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKL

Query:  EPRSRLCQFVGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLN----EATHEPTRVVNQAGPSSRV
        EPRS+LC FVGY K TRGG FYDP++NKV VSTN TFL+EDH+R HK RSK+VLN    E T   TRVV +    +RV
Subjt:  EPRSRLCQFVGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLN----EATHEPTRVVNQAGPSSRV

TYK00843.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-145; % identity: 58.94)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKAA-YILAGTSDVLARRLQGMVSAREIMGYLQT
        MSSS IALLK E LTGE +  WKS L+ ILV+ +LRF+L EECP  P Q+A +S++DA+ R  KAN+KA  YILA  SD+L+++ + MV+AR+IM   + 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKAA-YILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTK
        L RQPS QI+ E+IKYVYNA MKEG SVREH LD+IV+FNVAEMNG  A +++K  +L                K    GE N+ AH +RF   SS   K
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTK

Query:  PCGSTRKKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKELEEVNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGK
              +K KG KGKG   AD                                             DYSRYGYLYLM HKS+ LEKFKEYKA+V+N L +
Subjt:  PCGSTRKKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKELEEVNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGK

Query:  TIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKP
          K LQSD+GGEYMDLRFQ YMIEHGI+SQLSAP TPQ N VS+RR RTLLDMV SMMSY QLP++FWGY +ETAV ILN+V SKSVSETPF+LWK RKP
Subjt:  TIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKP

Query:  SLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGP
        SL HFRIWGCPTHV VTN KKLEPRSRLCQF+GY KETRGGLF+DPQEN+V VSTN TFL+EDHMR+HK +SKLVLNE T E TRVV++ GP
Subjt:  SLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SNP8 Gag/pol protein (e value: 2.1e-156; % identity: 54.34)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT
        +SS+ + +L  ++L G  + +WK+ ++ +L++D+LRFVL E+CPQ+PA +A ++V++ + RWAKANEKA AYILA  S+VLA++ + M++AREIM  LQ 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMN------------------------GRNAELNKKEYNLTSLLNELQSFESLIKNKR
        +  Q S QI+H+++KY+YNARM EG SVREHVL+++VHFNVA MN                          NA +NK  Y LT+LLNELQ+FESL+K K 
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMN------------------------GRNAELNKKEYNLTSLLNELQSFESLIKNKR

Query:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKE-------------------------
           GE N+   +++F +GS+SGTK   S+      KK+KGG+G  +  A  +  +K  A   +C    +   +K                          
Subjt:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKE-------------------------

Query:  -------LEEV--------NAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL
               LE V        N K RGG++YFI+F  DYSRYGY+YLM HKSE LEKFKEYKAEVENAL KTIKT +SDRGGEYMDL+FQ+Y++E GI SQL
Subjt:  -------LEEV--------NAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL

Query:  SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQF
        SAP+TPQQNGVSERRNRTLLDMVRSMMSYA LP SFWGYAV+TAV ILN VPSKSVSETP +LW G K SL+HFRIWGCP HVL  NPKKLEPRS+LC F
Subjt:  SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQF

Query:  VGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGPSSRV
        VGY K TRGG FYDP++NKV VSTN TFL+EDH+R HK RSK+VLNE + E T       PS+RV
Subjt:  VGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGPSSRV

A0A5A7U869 Gag/pol protein (e value: 1.5e-157; % identity: 54.13)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT
        M+S+ + +L  ++L G  + +WK+ ++ +L++D+LRFVL EECPQ+PA +A ++V++ + RWAKANEKA AYILA  S+VLA++ + M++AREIM  LQ 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNG------------------------RNAELNKKEYNLTSLLNELQSFESLIKNKR
        +  Q S QI+H+++KY+YNARM EG SVREHVL+++VHFNVAEMNG                         NA +NK  Y LT+LLNELQ+FESL+K K 
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNG------------------------RNAELNKKEYNLTSLLNELQSFESLIKNKR

Query:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKE-------------------------
           GE N+   +++F +GS+SGTK   S+      KK+KGG+G  +  A  +  +K  A   +C    +   +K                          
Subjt:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKE-------------------------

Query:  -------LEEV--------NAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL
               LE V        N K RGG++YFI+F  DYSRYGY+YLM HKSE LEKFKEYKAEVENAL KTIKT +SDRGGEYMDL+FQ+Y++E GI SQL
Subjt:  -------LEEV--------NAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL

Query:  SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQF
        S P TPQQNGVS+RRNRTLLDMVRSMMSY  LP SFWGYAV+TAV ILN VPSKSVS+TP +LW GRK SL+HFRIWGCP HVL  NPKKLEPRS+LC F
Subjt:  SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQF

Query:  VGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLN----EATHEPTRVVNQAGPSSRV
        VGY K TRGG FYDP++NKV VSTN TFL+EDH+R HK RSK+VLN    E T   TRVV +    +RV
Subjt:  VGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLN----EATHEPTRVVNQAGPSSRV

A0A5D3BHG7 Gag/pol protein (e value: 4.4e-146; % identity: 50.17)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT
        M+++ + +L  ++L G  + +WK+ ++ +L++D+L+FVL EECPQ+PA +A Q+V++ + RWAK NEK  AYILA  S+VLA++ + M++AREIM  LQ 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTK
        +  Q S QI H+++KY+YNARM EG SVREHVL+++VHFNVAEMNG             ++++E    +          GE N+   +++F +GS+SGTK
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTK

Query:  PCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFK---------------------------------ELEE-------------
           S+      KK+KGG+G  +  A  +  +K  AT  +C  + +   +K                                 ELEE             
Subjt:  PCGSTR-----KKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFK---------------------------------ELEE-------------

Query:  ---------------------------VNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYM
                                   +N K RG ++YFI+F  DYSRYGY+YLM HKSE LEKFKEYKAEVENAL KTIKT +SDRGGEYMDL+FQ+Y+
Subjt:  ---------------------------VNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYM

Query:  IEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKL
        +E  I SQLSAP TPQQNGVSERRNRTLLDMVRSM+SYA LP SFWGYAV+TAV ILN VPSKSVSETP +LW GRK SL+HFRIWGCP HVL  NPKKL
Subjt:  IEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKL

Query:  EPRSRLCQFVGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLN----EATHEPTRVVNQAGPSSRV
        EPRS+LC FVGY K TRGG FYDP++NKV VSTN TFL+EDH+R HK RSK+VLN    E T   TRVV +    +RV
Subjt:  EPRSRLCQFVGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLN----EATHEPTRVVNQAGPSSRV

A0A5D3BM47 Gag/pol protein (e value: 1.7e-145; % identity: 58.94)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKAA-YILAGTSDVLARRLQGMVSAREIMGYLQT
        MSSS IALLK E LTGE +  WKS L+ ILV+ +LRF+L EECP  P Q+A +S++DA+ R  KAN+KA  YILA  SD+L+++ + MV+AR+IM   + 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKAA-YILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTK
        L RQPS QI+ E+IKYVYNA MKEG SVREH LD+IV+FNVAEMNG  A +++K  +L                K    GE N+ AH +RF   SS   K
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTK

Query:  PCGSTRKKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKELEEVNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGK
              +K KG KGKG   AD                                             DYSRYGYLYLM HKS+ LEKFKEYKA+V+N L +
Subjt:  PCGSTRKKRKGGKGKGSATADIELKEKKGATNHVCSSFQETSSFKELEEVNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGK

Query:  TIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKP
          K LQSD+GGEYMDLRFQ YMIEHGI+SQLSAP TPQ N VS+RR RTLLDMV SMMSY QLP++FWGY +ETAV ILN+V SKSVSETPF+LWK RKP
Subjt:  TIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKP

Query:  SLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGP
        SL HFRIWGCPTHV VTN KKLEPRSRLCQF+GY KETRGGLF+DPQEN+V VSTN TFL+EDHMR+HK +SKLVLNE T E TRVV++ GP
Subjt:  SLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKETRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGP

E2GK51 Gag/pol protein (Fragment) (e value: 3.7e-145; % identity: 42.61)
Query:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT
        M++S + LL  E+L G+ ++ WKSNL+ ILVVD+LRFVLTEECPQ PA +A ++V++A+ RW KAN+KA  YILA  +DVLA++   + +A+ IM  L+ 
Subjt:  MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKA-AYILAGTSDVLARRLQGMVSAREIMGYLQT

Query:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNG------------------------RNAELNKKEYNLTSLLNELQSFESLIKNKR
        +  QPS  ++HE+IK++Y  RMKEGTSVREHVLD+++HFN+AE+NG                         NA LNK E+NLT+LLNELQ F++L  +K 
Subjt:  LSRQPSEQIQHESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNG------------------------RNAELNKKEYNLTSLLNELQSFESLIKNKR

Query:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTRKKRKGGKGKGSATADIE------------------------LKEKK------------------------
          + E N+    ++F +GSSS  K   S  + +K GKGK   T+ ++                        L EKK                        
Subjt:  HADGEVNLFAHSKRFQKGSSSGTKPCGSTRKKRKGGKGKGSATADIE------------------------LKEKK------------------------

Query:  ------GATNHVCSSFQETSSFKELEE-------------------------------------------------------------------------
              GATNH+C SFQETSS+K+L+E                                                                         
Subjt:  ------GATNHVCSSFQETSSFKELEE-------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------VNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTP
                    +N K RGGY+YFISFI D+SRYG++YL+HHKSE  EKFKEYKAEVEN +GKTIKTL+SDRGGEYMD +FQDY+IE GI+SQLSAP+TP
Subjt:  ------------VNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTP

Query:  QQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKE
        QQNGVSERRNRTLLDMVRSMMSYAQLP SFWGYA+ETA+ ILN+VPSKSV ETP+ELWKGRK SL++FRIWGCP HVLV NPKKLEPRS+LC FVGY KE
Subjt:  QQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKE

Query:  TRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNE----ATHEP---TRVVNQAGPSSR
        +RGGLFY PQENKV VSTN TFL+EDH RNH+ RSK+VL E    AT +P   T+VV++A  S +
Subjt:  TRGGLFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNE----ATHEP---TRVVNQAGPSSR

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 9.1e-32; % identity: 35.51)
Query:  YFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMS
        YF+ F+  ++ Y   YL+ +KS+V   F+++ A+ E      +  L  D G EY+    + + ++ GI   L+ P+TPQ NGVSER  RT+ +  R+M+S
Subjt:  YFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSMMS

Query:  YAQLPASFWGYAVETAVQILNSVPSKSV---SETPFELWKGRKPSLQHFRIWGCPTHVLVTNPK-KLEPRSRLCQFVGYSKETRGGLFYDPQENKVIVST
         A+L  SFWG AV TA  ++N +PS+++   S+TP+E+W  +KP L+H R++G   +V + N + K + +S    FVGY  E  G   +D    K IV+ 
Subjt:  YAQLPASFWGYAVETAVQILNSVPSKSV---SETPFELWKGRKPSLQHFRIWGCPTHVLVTNPK-KLEPRSRLCQFVGYSKETRGGLFYDPQENKVIVST

Query:  NTTFLKEDHMRNHK
        +   + E +M N +
Subjt:  NTTFLKEDHMRNHK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.7e-42; % identity: 39.91)
Query:  KTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLD
        ++ GG KYF++FI D SR  ++Y++  K +V + F+++ A VE   G+ +K L+SD GGEY    F++Y   HGI+ + + P TPQ NGV+ER NRT+++
Subjt:  KTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLD

Query:  MVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCP--THVLVTNPKKLEPRSRLCQFVGYSKETRGGLFYDPQEN
         VRSM+  A+LP SFWG AV+TA  ++N  PS  ++ E P  +W  ++ S  H +++GC    HV      KL+ +S  C F+GY  E  G   +DP + 
Subjt:  MVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCP--THVLVTNPKKLEPRSRLCQFVGYSKETRGGLFYDPQEN

Query:  KVIVSTNTTFLKEDHMRNHKLRSKLVLN
        KVI S +  F +E  +R     S+ V N
Subjt:  KVIVSTNTTFLKEDHMRNHKLRSKLVLN

Q07163 Transposon TyH3 Gag-Pol polyprotein (e value: 1.3e-14; % identity: 27.27)
Query:  YFISFIGDYSRYGYLYLMHHKSE--VLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSM
        YFISF  + +++ ++Y +H + E  +L+ F    A ++N    ++  +Q DRG EY +     ++ ++GI    +     + +GV+ER NRTLLD  R+ 
Subjt:  YFISFIGDYSRYGYLYLMHHKSE--VLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSM

Query:  MSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRI----------WGCPTHVLVTNP-KKLEPRSRLCQFVGYSKETRGGLFYDP
        +  + LP   W  A+E +  + NS+ S           K +K + QH  +          +G P  V   NP  K+ PR      +  S+ + G + Y P
Subjt:  MSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRI----------WGCPTHVLVTNP-KKLEPRSRLCQFVGYSKETRGGLFYDP

Query:  QENKVIVSTNTTFLKEDHMR
           K + +TN   L+    R
Subjt:  QENKVIVSTNTTFLKEDHMR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.8e-30; % identity: 33.66)
Query:  YKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSM
        Y+Y++ F+  ++RY +LY +  KS+V E F  +K  +EN     I T  SD GGE++ L   +Y  +HGI    S P+TP+ NG+SER++R +++   ++
Subjt:  YKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSM

Query:  MSYAQLPASFWGYAVETAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCPTHVLVT--NPKKLEPRSRLCQFVGYSKETRGGLFYDPQENKVIVS
        +S+A +P ++W YA   AV ++N +P+  +  E+PF+   G  P+    R++GC  +  +   N  KL+ +SR C F+GYS      L    Q +++ +S
Subjt:  MSYAQLPASFWGYAVETAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCPTHVLVT--NPKKLEPRSRLCQFVGYSKETRGGLFYDPQENKVIVS

Query:  TNTTF
         +  F
Subjt:  TNTTF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 3.5e-31; % identity: 37.02)
Query:  YKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSM
        Y+Y++ F+  ++RY +LY +  KS+V + F  +K+ VEN     I TL SD GGE++ LR  DY+ +HGI    S P+TP+ NG+SER++R +++M  ++
Subjt:  YKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQLSAPNTPQQNGVSERRNRTLLDMVRSM

Query:  MSYAQLPASFWGYAVETAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCPTHVLVT--NPKKLEPRSRLCQFVGYS
        +S+A +P ++W YA   AV ++N +P+  +  ++PF+   G+ P+ +  +++GC  +  +   N  KLE +S+ C F+GYS
Subjt:  MSYAQLPASFWGYAVETAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCPTHVLVT--NPKKLEPRSRLCQFVGYS

Arabidopsis top hits (e value, % identity, alignment)
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 9.4e-08; % identity: 35.37)
Query:  NRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSR
        NRT+++ VRSM+    LP +F   A  TAV I+N  PS +++   P E+W    P+  + R +GC  ++   +  KL+PR++
Subjt:  NRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVS-ETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSR


Sequences
CDS sequence
ATGTCGTCCTCATTCATAGCCTTACTCAAAATGGAACGTTTAACTGGTGAGAAATTTACTACGTGGAAGTCAAACCTGGATGCGATCCTTGTTGTTGACAACCTTCGGTT
CGTACTAACTGAGGAATGTCCTCAGATTCCTGCTCAAGACGCACCTCAATCAGTTAAGGATGCGTTTTACCGCTGGGCCAAGGCTAATGAAAAAGCTGCCTATATCCTGG
CTGGGACATCTGATGTTTTAGCCAGGAGATTGCAGGGTATGGTCTCAGCTCGTGAGATCATGGGATATCTGCAAACTTTATCTAGACAACCGTCTGAACAAATTCAGCAC
GAATCCATCAAATACGTTTATAACGCGCGCATGAAGGAGGGAACTTCAGTAAGAGAACATGTTCTCGATCTTATAGTTCACTTCAACGTGGCCGAAATGAATGGACGAAA
TGCGGAGTTGAACAAAAAGGAGTATAACCTGACTTCCCTCCTAAATGAACTACAATCTTTCGAGTCTCTTATTAAGAATAAGAGACATGCTGATGGAGAGGTAAATCTGT
TTGCCCATTCCAAAAGATTCCAGAAGGGTTCATCCTCTGGGACTAAGCCCTGTGGTTCGACTCGGAAAAAGAGGAAAGGAGGCAAAGGGAAAGGTTCTGCCACTGCAGAC
ATTGAGCTCAAAGAGAAGAAAGGAGCCACTAATCACGTTTGCTCTTCATTTCAGGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGTGAATGCCAAAACTCGAGGAGGGTA
CAAATATTTCATCTCTTTCATAGGTGATTATTCGAGGTATGGTTATCTATACCTAATGCATCACAAGTCTGAAGTTCTTGAAAAGTTCAAAGAGTATAAGGCAGAAGTAG
AGAATGCATTAGGAAAAACCATTAAAACACTTCAATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCAAATCTCAACTC
TCAGCACCAAATACACCACAACAAAATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCTATGATGAGCTATGCTCAATTGCCTGCCTCGTTTTG
GGGATACGCAGTAGAGACCGCAGTTCAAATCTTGAACAGTGTTCCATCAAAAAGTGTTTCAGAAACACCTTTTGAACTTTGGAAGGGGCGTAAACCTAGTTTACAACACT
TCAGGATTTGGGGTTGTCCGACACACGTGCTCGTGACAAACCCAAAGAAACTGGAACCTCGTTCAAGATTGTGCCAATTTGTTGGCTATTCCAAAGAAACGAGAGGTGGT
CTTTTCTACGACCCACAAGAAAACAAGGTGATTGTATCGACAAACACAACGTTCTTGAAGGAAGATCACATGAGGAACCACAAACTGCGTAGTAAATTAGTACTAAATGA
AGCTACACATGAACCAACAAGAGTTGTTAATCAAGCTGGACCTTCATCAAGAGTTGATGGAAGAGCGACACCTCAAGTCGATCTCGTCCTTCTCAATCGTTGGGAATGCC
TCGACGCGATGGGAGGATTGTTTCCCAACCCGACCGCTACTTGGTTTTATTTGAAACTCAAGTTGTCATACACGATGTGGCGTAGAAGATCCATTGTCTTATAA
mRNA sequence
ATGTCGTCCTCATTCATAGCCTTACTCAAAATGGAACGTTTAACTGGTGAGAAATTTACTACGTGGAAGTCAAACCTGGATGCGATCCTTGTTGTTGACAACCTTCGGTT
CGTACTAACTGAGGAATGTCCTCAGATTCCTGCTCAAGACGCACCTCAATCAGTTAAGGATGCGTTTTACCGCTGGGCCAAGGCTAATGAAAAAGCTGCCTATATCCTGG
CTGGGACATCTGATGTTTTAGCCAGGAGATTGCAGGGTATGGTCTCAGCTCGTGAGATCATGGGATATCTGCAAACTTTATCTAGACAACCGTCTGAACAAATTCAGCAC
GAATCCATCAAATACGTTTATAACGCGCGCATGAAGGAGGGAACTTCAGTAAGAGAACATGTTCTCGATCTTATAGTTCACTTCAACGTGGCCGAAATGAATGGACGAAA
TGCGGAGTTGAACAAAAAGGAGTATAACCTGACTTCCCTCCTAAATGAACTACAATCTTTCGAGTCTCTTATTAAGAATAAGAGACATGCTGATGGAGAGGTAAATCTGT
TTGCCCATTCCAAAAGATTCCAGAAGGGTTCATCCTCTGGGACTAAGCCCTGTGGTTCGACTCGGAAAAAGAGGAAAGGAGGCAAAGGGAAAGGTTCTGCCACTGCAGAC
ATTGAGCTCAAAGAGAAGAAAGGAGCCACTAATCACGTTTGCTCTTCATTTCAGGAAACTAGTTCCTTCAAGGAGCTCGAAGAGGTGAATGCCAAAACTCGAGGAGGGTA
CAAATATTTCATCTCTTTCATAGGTGATTATTCGAGGTATGGTTATCTATACCTAATGCATCACAAGTCTGAAGTTCTTGAAAAGTTCAAAGAGTATAAGGCAGAAGTAG
AGAATGCATTAGGAAAAACCATTAAAACACTTCAATCCGATCGAGGTGGAGAGTATATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCAAATCTCAACTC
TCAGCACCAAATACACCACAACAAAATGGTGTGTCAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCTATGATGAGCTATGCTCAATTGCCTGCCTCGTTTTG
GGGATACGCAGTAGAGACCGCAGTTCAAATCTTGAACAGTGTTCCATCAAAAAGTGTTTCAGAAACACCTTTTGAACTTTGGAAGGGGCGTAAACCTAGTTTACAACACT
TCAGGATTTGGGGTTGTCCGACACACGTGCTCGTGACAAACCCAAAGAAACTGGAACCTCGTTCAAGATTGTGCCAATTTGTTGGCTATTCCAAAGAAACGAGAGGTGGT
CTTTTCTACGACCCACAAGAAAACAAGGTGATTGTATCGACAAACACAACGTTCTTGAAGGAAGATCACATGAGGAACCACAAACTGCGTAGTAAATTAGTACTAAATGA
AGCTACACATGAACCAACAAGAGTTGTTAATCAAGCTGGACCTTCATCAAGAGTTGATGGAAGAGCGACACCTCAAGTCGATCTCGTCCTTCTCAATCGTTGGGAATGCC
TCGACGCGATGGGAGGATTGTTTCCCAACCCGACCGCTACTTGGTTTTATTTGAAACTCAAGTTGTCATACACGATGTGGCGTAGAAGATCCATTGTCTTATAA
Protein sequence
MSSSFIALLKMERLTGEKFTTWKSNLDAILVVDNLRFVLTEECPQIPAQDAPQSVKDAFYRWAKANEKAAYILAGTSDVLARRLQGMVSAREIMGYLQTLSRQPSEQIQH
ESIKYVYNARMKEGTSVREHVLDLIVHFNVAEMNGRNAELNKKEYNLTSLLNELQSFESLIKNKRHADGEVNLFAHSKRFQKGSSSGTKPCGSTRKKRKGGKGKGSATAD
IELKEKKGATNHVCSSFQETSSFKELEEVNAKTRGGYKYFISFIGDYSRYGYLYLMHHKSEVLEKFKEYKAEVENALGKTIKTLQSDRGGEYMDLRFQDYMIEHGIKSQL
SAPNTPQQNGVSERRNRTLLDMVRSMMSYAQLPASFWGYAVETAVQILNSVPSKSVSETPFELWKGRKPSLQHFRIWGCPTHVLVTNPKKLEPRSRLCQFVGYSKETRGG
LFYDPQENKVIVSTNTTFLKEDHMRNHKLRSKLVLNEATHEPTRVVNQAGPSSRVDGRATPQVDLVLLNRWECLDAMGGLFPNPTATWFYLKLKLSYTMWRRRSIVL
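
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below does this for the first 39 nucleotides of the CDS, copied from this record. It assumes the standard nuclear genetic code and translation from the first base; the `translate` helper and `CODON_TABLE` names are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumed here; the record does not name a translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGTCGTCCTCATTCATAGCCTTACTCAAAATGGAACGT"
print(translate(cds_start))
```

Passing the full CDS to the same function, with its line breaks left in, allows the result to be compared against the complete protein sequence.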