CuGenDBv2

Lag0005116 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005116
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr6:10871265..10878857
RNA-Seq Expression: Lag0005116
Synteny: Lag0005116
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 7.8e-152 | 41.48
Query:  NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELI--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSES
        N+E S  S   QI   GNKIS VKL D+ FL+WKFQILTALE +DL++ +  +S+P  + +    S +A+    PNPAYKVWK+QD+L+SSW++GSMSE 
Subjt:  NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELI--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSES

Query:  ILEQVLH---------------------------------------------------------------------------------------------
        IL Q+LH                                                                                             
Subjt:  ILEQVLH---------------------------------------------------------------------------------------------

Query:  --YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK
           V++LLLT ES+ ESK +  S+  LPS N+  Q   + +        N   N+   N   GRG  RSN G+        NRN+PQCQ+C K+G++A +
Subjt:  --YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK

Query:  CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF------------------------------------------------NGLAISHLGYASFTSSN-
        C+ R   P + ++ ++P     + + +N  PQ                                                  +GL I+H G  SF SS  
Subjt:  CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF------------------------------------------------NGLAISHLGYASFTSSN-

Query:  -NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNT-AVFSTLPCVSNDVSA
            F LNNLL VPSITKNLISVSQFAKDN VFFEFHPT C VKDL TG+ LL+G L++GLY+F +      L+ S     +NT  VF+T+         
Subjt:  -NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNT-AVFSTLPCVSNDVSA

Query:  LYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA
            V  SN   +D+WH+RLGHP + IVK V+   +    T   ++FC ACA+GKHHA+PFS S T Y+ PLQLI  DLWGPA  +S +GF+YYISFVDA
Subjt:  LYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA

Query:  FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWD
        +SRYTWIYFL +KS+AF AF KFKT VEK  G  I S                            +   +N IVERKHR+I+++GLTLLS +++PL+FWD
Subjt:  FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWD

Query:  DAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA
        +AFSTSVYLINRLP+ VL  +SPLEKLF ++P++ +L+VFGCKC+P LRPY SHKLS RS+PCTF+GYS  HKGYKCL+SDGRL+ISRHVLFDENSFP+A
Subjt:  DAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA

Query:  SLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADH
        S  SHSS +P       P L S+  S+ +  + D       T ++    DH
Subjt:  SLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADH

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 8.1e-133 | 35.76
Query:  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQV---------
        +++P +++ T++L D+NFLMWK+QI  A+ G+ L+  + G  Q  P+++    +   V  PNP ++ +++QD L+ SW++ S+  + L QV         
Subjt:  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQV---------

Query:  ---------------------------------------------------------LHYVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQ--NS
                                                                 L YV + L+ HE RI  K   N  +V  ++  + +  S   NS
Subjt:  ---------------------------------------------------------LHYVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQ--NS

Query:  VPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ------------MPGA----------------------
           PS   Q +N   G   +R +F  NRG   GR+     +PQCQLCNK GHT  +C+ R               PG                       
Subjt:  VPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ------------MPGA----------------------

Query:  -YATQFNPP---------------------------------GQMNPSGLNFSPQ------QFNGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKN
         Y  Q N                                   G +N SG  ++           GL ISH+G + F SS+  N +  L N+L VP+I KN
Subjt:  -YATQFNPP---------------------------------GQMNPSGLNFSPQ------QFNGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKN

Query:  LISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQR
        L+SVSQFA+DN V+FEFHP  C VKD +    LL+G LH+GLY+FNL + L    + + +  + N         V ND S       SS ++  D+WH+R
Subjt:  LISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQR

Query:  LGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQA
        LGHP+  IV QV+       ST +  S C AC +GK H +PF  S T Y+ PLQL+V+DLWGPA   S++GF YY+SFVDA+SRYTW+YFL+TKS+  +A
Subjt:  LGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQA

Query:  FLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLG
        FL FK   E QFG  + +F+                              +NGI+ERKHRHIV++GLTLL+ +S+PL +W DAFST+V+LINRLP+ VL 
Subjt:  FLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLG

Query:  GMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA-------SLTSHSSVSPNC
           P E LF  +P+YS LKVFGC CFP LRPYN HKL FRSSPCTF+GYS  HKGYKCL+  GR++ISR V+FDE  FPFA        + SHS+V   C
Subjt:  GMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA-------SLTSHSSVSPNC

Query:  V-------------TQSLPTLSSVSSSTTVES----------------SSDAHLSISETSSIPST---------------ADHPTN--NSPSPCFLNRTH
        +             + SLPT S+ SS    E+                SS     ++E++SIPS+               +D P    N+    F  + H
Subjt:  V-------------TQSLPTLSSVSSSTTVES----------------SSDAHLSISETSSIPST---------------ADHPTN--NSPSPCFLNRTH

Query:  HMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDAL
        HM+TRSK GIFKPK +       EP   +E +    WK+AM +E+ AL
Subjt:  HMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDAL

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 1.9e-129 | 34.7
Query:  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQV---------
        +++P +++ T++L D+NFLMWK+QI  A+ G+ L+  + G  Q  P+++    +   V  PNP ++ +++QD L+ SW++ S+  + L QV         
Subjt:  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQV---------

Query:  --------------------------------------------------------------------------------------LHYVVALLLTHESR
                                                                                              L YV + L+ HE R
Subjt:  --------------------------------------------------------------------------------------LHYVVALLLTHESR

Query:  IESKSVINSDNVLPSANLAVQNVSQ--NSVPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ---------
        I  K   N  +V  ++  + +  S   NS   PS   Q +N   G   +R +F  NRG   GR+     +PQCQLCNK GHT  +C+ R           
Subjt:  IESKSVINSDNVLPSANLAVQNVSQ--NSVPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ---------

Query:  ---MPGA-----------------------YATQFNPP---------------------------------GQMNPSGLNFSPQ------QFNGLAISHL
            PG                        Y  Q N                                   G +N SG  ++           GL ISH+
Subjt:  ---MPGA-----------------------YATQFNPP---------------------------------GQMNPSGLNFSPQ------QFNGLAISHL

Query:  GYASFTSSN--NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFST
        G + F SS+  N +  L N+L VP+I KNL+SVSQFA+DN V+FEFHP  C VKD +    LL+G LH+GLY+FNL + L    + + +  + N      
Subjt:  GYASFTSSN--NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFST

Query:  LPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHG
           V ND S       SS ++  D+WH+RLGHP+  IV QV+       ST +  S C AC +GK H +PF  S T Y+ PLQL+V+DLWGPA   S++G
Subjt:  LPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHG

Query:  FQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHRHIVDVGLTLLS
        F YY+SFVDA+SRYTW+YFL+TKS+  +AFL FK   E QFG  + +F+                              +NGI+ERKHRHIV++GLTLL+
Subjt:  FQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHRHIVDVGLTLLS

Query:  HSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHV
         +S+PL +W DAFST+V+LINRLP+ VL    P E LF  +P+YS LKVFGC CFP LRPYN HKL FRSSPCTF+GYS  HKGYKCL+  GR++ISR V
Subjt:  HSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHV

Query:  LFDENSFPFA-------SLTSHSSVSPNCV-------------TQSLPTLSSVSSSTTVES----------------SSDAHLSISETSSIPST------
        +FDE  FPFA        + SHS+V   C+             + SLPT S+ SS    E+                SS     ++E++SIPS+      
Subjt:  LFDENSFPFA-------SLTSHSSVSPNCV-------------TQSLPTLSSVSSSTTVES----------------SSDAHLSISETSSIPST------

Query:  ---------ADHPTN--NSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDAL
                 +D P    N+    F  + HHM+TRSK GIFKPK +       EP   +E +    WK+AM +E+ AL
Subjt:  ---------ADHPTN--NSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDAL

RVW80632.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 3.1e-132 | 38.06
Query:  KLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLH------------------
        KL + NFL+W+ QILT L GH L  H   ++  LP    +S +  T +  NP ++ W++QD+L+ SW++ S+++++L ++++                  
Subjt:  KLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLH------------------

Query:  -------------------------------------------------------------------------YVV----ALLLTHESRIESKSVINSDN
                                                                                 Y V     LLL  ESRIE K++  +D 
Subjt:  -------------------------------------------------------------------------YVV----ALLLTHESRIESKSVINSDN

Query:  VLPS-ANLAVQNVSQNSVPNPSPNSQQQNF------GNGRGRSRSNF---GQNRGGR-SWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQF---NP
          PS A+L   N + +   N   +++  NF      GNG    R NF   G+ R GR SW   N+PQCQLC +IGH  M+CY R        +Q     P
Subjt:  VLPS-ANLAVQNVSQNSVPNPSPNSQQQNF------GNGRGRSRSNF---GQNRGGR-SWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQF---NP

Query:  PGQM----NPSGLNFSP--------------------------------------QQF-----------NGLAISHLGYASFTSS--NNHMFHLNNLLHV
         G M         NF P                                       QF            GL I H+G+ SF+SS   +    L  LLHV
Subjt:  PGQM----NPSGLNFSP--------------------------------------QQF-----------NGLAISHLGYASFTSS--NNHMFHLNNLLHV

Query:  PSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSID
        P ITKNL+SVS+FA DN VFFEFHPT C VKDL+T   L+ G L  GLY F+        NT +  P  N++ F++    S +      +V +S+     
Subjt:  PSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSID

Query:  VWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKS
        +WH RLGHPS  IV  V+  CN           C AC +GK H  PF  S +SY+ PL+LI TDLWGP    S+HG QYYI F+DA+SR+TWIY L+ KS
Subjt:  VWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKS

Query:  EAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLP
        EAFQ FL FK+ VE Q G  I +                            +   +NG+ ERKHRHIV+ G+ LL+ +S+P  +WD+AF TSVYLINRLP
Subjt:  EAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLP

Query:  SIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA---------SLTSH
        + VL   SPLE LF ++P YS LKVFGC C+P LRP+N HKL FRS PCTF+GYS  HKGYKCLS +G + ISR V+FDE++FPFA         S  S 
Subjt:  SIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA---------SLTSH

Query:  SSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQ
        SS S  C T SLP L  + SST+  +SS  + SI   +S  + A  P  +S  P     +HHMITRSK GIFKPKA+L   +   P +V E L+ SHWKQ
Subjt:  SSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQ

Query:  AMQDEYDAL
        AM DEY AL
Subjt:  AMQDEYDAL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 3.5e-152 | 41.48
Query:  NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELI--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSES
        N+E S  S   QI   GNKIS VKL D+ FL+WKFQILTALE +DL++ +  +S+P  + +    S +A+    PNPAYKVWK+QD+L+SSW++GSMSE 
Subjt:  NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELI--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSES

Query:  ILEQVLH---------------------------------------------------------------------------------------------
        IL Q+LH                                                                                             
Subjt:  ILEQVLH---------------------------------------------------------------------------------------------

Query:  --YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK
           V++LLLT ES+ ESK +  S+  LPS N+  Q   + +        N   N+   N   GRG  RSN G+        NRN+PQCQ+C K+G++A +
Subjt:  --YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK

Query:  CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF------------------------------------------------NGLAISHLGYASFTSSN-
        C+ R   P + ++ ++P     + + +N  PQ                                                  +GL I+H G  SF SS  
Subjt:  CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF------------------------------------------------NGLAISHLGYASFTSSN-

Query:  -NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNT-AVFSTLPCVSNDVSA
            F LNNLL VPSITKNLISVSQFAKDN VFFEFHPT C VKDL TG+ LL+G L++GLY+F +      L+ S     +NT  VF+T+         
Subjt:  -NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNT-AVFSTLPCVSNDVSA

Query:  LYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA
            V  SN   +D+WH+RLGHP + IVK V+   +    T   ++FC ACA+GKHHA+PFS S T Y+ PLQLI  DLWGPA  +S +GF+YYISFVDA
Subjt:  LYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA

Query:  FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWD
        +SRYTWIYFL +KS+AF AF KFKT VEK  G  I S                            +   +N IVERKHR+I+++GLTLLS +++PL+FWD
Subjt:  FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWD

Query:  DAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA
        +AFSTSVYLINRLP+ VL  +SPLEKLF ++P++ +L+VFGCKC+P LRPY SHKLS RS+PCTF+GYS  HKGYKCL+SDGRL+ISRHVLFDENSFP+A
Subjt:  DAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA

Query:  SLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADH
        S  SHSS+ P       P L S+  S+ +  + D       T ++    DH
Subjt:  SLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADH

TrEMBL top hits | e value | % identity | Alignment
A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.9e-133 | 35.76
Query:  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQV---------
        +++P +++ T++L D+NFLMWK+QI  A+ G+ L+  + G  Q  P+++    +   V  PNP ++ +++QD L+ SW++ S+  + L QV         
Subjt:  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQV---------

Query:  ---------------------------------------------------------LHYVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQ--NS
                                                                 L YV + L+ HE RI  K   N  +V  ++  + +  S   NS
Subjt:  ---------------------------------------------------------LHYVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQ--NS

Query:  VPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ------------MPGA----------------------
           PS   Q +N   G   +R +F  NRG   GR+     +PQCQLCNK GHT  +C+ R               PG                       
Subjt:  VPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ------------MPGA----------------------

Query:  -YATQFNPP---------------------------------GQMNPSGLNFSPQ------QFNGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKN
         Y  Q N                                   G +N SG  ++           GL ISH+G + F SS+  N +  L N+L VP+I KN
Subjt:  -YATQFNPP---------------------------------GQMNPSGLNFSPQ------QFNGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKN

Query:  LISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQR
        L+SVSQFA+DN V+FEFHP  C VKD +    LL+G LH+GLY+FNL + L    + + +  + N         V ND S       SS ++  D+WH+R
Subjt:  LISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQR

Query:  LGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQA
        LGHP+  IV QV+       ST +  S C AC +GK H +PF  S T Y+ PLQL+V+DLWGPA   S++GF YY+SFVDA+SRYTW+YFL+TKS+  +A
Subjt:  LGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQA

Query:  FLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLG
        FL FK   E QFG  + +F+                              +NGI+ERKHRHIV++GLTLL+ +S+PL +W DAFST+V+LINRLP+ VL 
Subjt:  FLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLG

Query:  GMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA-------SLTSHSSVSPNC
           P E LF  +P+YS LKVFGC CFP LRPYN HKL FRSSPCTF+GYS  HKGYKCL+  GR++ISR V+FDE  FPFA        + SHS+V   C
Subjt:  GMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA-------SLTSHSSVSPNC

Query:  V-------------TQSLPTLSSVSSSTTVES----------------SSDAHLSISETSSIPST---------------ADHPTN--NSPSPCFLNRTH
        +             + SLPT S+ SS    E+                SS     ++E++SIPS+               +D P    N+    F  + H
Subjt:  V-------------TQSLPTLSSVSSSTTVES----------------SSDAHLSISETSSIPST---------------ADHPTN--NSPSPCFLNRTH

Query:  HMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDAL
        HM+TRSK GIFKPK +       EP   +E +    WK+AM +E+ AL
Subjt:  HMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDAL

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 9.1e-130 | 34.7
Query:  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQV---------
        +++P +++ T++L D+NFLMWK+QI  A+ G+ L+  + G  Q  P+++    +   V  PNP ++ +++QD L+ SW++ S+  + L QV         
Subjt:  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQV---------

Query:  --------------------------------------------------------------------------------------LHYVVALLLTHESR
                                                                                              L YV + L+ HE R
Subjt:  --------------------------------------------------------------------------------------LHYVVALLLTHESR

Query:  IESKSVINSDNVLPSANLAVQNVSQ--NSVPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ---------
        I  K   N  +V  ++  + +  S   NS   PS   Q +N   G   +R +F  NRG   GR+     +PQCQLCNK GHT  +C+ R           
Subjt:  IESKSVINSDNVLPSANLAVQNVSQ--NSVPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ---------

Query:  ---MPGA-----------------------YATQFNPP---------------------------------GQMNPSGLNFSPQ------QFNGLAISHL
            PG                        Y  Q N                                   G +N SG  ++           GL ISH+
Subjt:  ---MPGA-----------------------YATQFNPP---------------------------------GQMNPSGLNFSPQ------QFNGLAISHL

Query:  GYASFTSSN--NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFST
        G + F SS+  N +  L N+L VP+I KNL+SVSQFA+DN V+FEFHP  C VKD +    LL+G LH+GLY+FNL + L    + + +  + N      
Subjt:  GYASFTSSN--NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFST

Query:  LPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHG
           V ND S       SS ++  D+WH+RLGHP+  IV QV+       ST +  S C AC +GK H +PF  S T Y+ PLQL+V+DLWGPA   S++G
Subjt:  LPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHG

Query:  FQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHRHIVDVGLTLLS
        F YY+SFVDA+SRYTW+YFL+TKS+  +AFL FK   E QFG  + +F+                              +NGI+ERKHRHIV++GLTLL+
Subjt:  FQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHRHIVDVGLTLLS

Query:  HSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHV
         +S+PL +W DAFST+V+LINRLP+ VL    P E LF  +P+YS LKVFGC CFP LRPYN HKL FRSSPCTF+GYS  HKGYKCL+  GR++ISR V
Subjt:  HSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHV

Query:  LFDENSFPFA-------SLTSHSSVSPNCV-------------TQSLPTLSSVSSSTTVES----------------SSDAHLSISETSSIPST------
        +FDE  FPFA        + SHS+V   C+             + SLPT S+ SS    E+                SS     ++E++SIPS+      
Subjt:  LFDENSFPFA-------SLTSHSSVSPNCV-------------TQSLPTLSSVSSSTTVES----------------SSDAHLSISETSSIPST------

Query:  ---------ADHPTN--NSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDAL
                 +D P    N+    F  + HHM+TRSK GIFKPK +       EP   +E +    WK+AM +E+ AL
Subjt:  ---------ADHPTN--NSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDAL

A0A438H844 Retrovirus-related Pol polyprotein from transposon RE1 | 1.5e-132 | 38.06
Query:  KLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLH------------------
        KL + NFL+W+ QILT L GH L  H   ++  LP    +S +  T +  NP ++ W++QD+L+ SW++ S+++++L ++++                  
Subjt:  KLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLH------------------

Query:  -------------------------------------------------------------------------YVV----ALLLTHESRIESKSVINSDN
                                                                                 Y V     LLL  ESRIE K++  +D 
Subjt:  -------------------------------------------------------------------------YVV----ALLLTHESRIESKSVINSDN

Query:  VLPS-ANLAVQNVSQNSVPNPSPNSQQQNF------GNGRGRSRSNF---GQNRGGR-SWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQF---NP
          PS A+L   N + +   N   +++  NF      GNG    R NF   G+ R GR SW   N+PQCQLC +IGH  M+CY R        +Q     P
Subjt:  VLPS-ANLAVQNVSQNSVPNPSPNSQQQNF------GNGRGRSRSNF---GQNRGGR-SWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQF---NP

Query:  PGQM----NPSGLNFSP--------------------------------------QQF-----------NGLAISHLGYASFTSS--NNHMFHLNNLLHV
         G M         NF P                                       QF            GL I H+G+ SF+SS   +    L  LLHV
Subjt:  PGQM----NPSGLNFSP--------------------------------------QQF-----------NGLAISHLGYASFTSS--NNHMFHLNNLLHV

Query:  PSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSID
        P ITKNL+SVS+FA DN VFFEFHPT C VKDL+T   L+ G L  GLY F+        NT +  P  N++ F++    S +      +V +S+     
Subjt:  PSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSID

Query:  VWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKS
        +WH RLGHPS  IV  V+  CN           C AC +GK H  PF  S +SY+ PL+LI TDLWGP    S+HG QYYI F+DA+SR+TWIY L+ KS
Subjt:  VWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKS

Query:  EAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLP
        EAFQ FL FK+ VE Q G  I +                            +   +NG+ ERKHRHIV+ G+ LL+ +S+P  +WD+AF TSVYLINRLP
Subjt:  EAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLP

Query:  SIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA---------SLTSH
        + VL   SPLE LF ++P YS LKVFGC C+P LRP+N HKL FRS PCTF+GYS  HKGYKCLS +G + ISR V+FDE++FPFA         S  S 
Subjt:  SIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA---------SLTSH

Query:  SSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQ
        SS S  C T SLP L  + SST+  +SS  + SI   +S  + A  P  +S  P     +HHMITRSK GIFKPKA+L   +   P +V E L+ SHWKQ
Subjt:  SSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQ

Query:  AMQDEYDAL
        AM DEY AL
Subjt:  AMQDEYDAL

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.8e-152 | 41.48
Query:  NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELI--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSES
        N+E S  S   QI   GNKIS VKL D+ FL+WKFQILTALE +DL++ +  +S+P  + +    S +A+    PNPAYKVWK+QD+L+SSW++GSMSE 
Subjt:  NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELI--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSES

Query:  ILEQVLH---------------------------------------------------------------------------------------------
        IL Q+LH                                                                                             
Subjt:  ILEQVLH---------------------------------------------------------------------------------------------

Query:  --YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK
           V++LLLT ES+ ESK +  S+  LPS N+  Q   + +        N   N+   N   GRG  RSN G+        NRN+PQCQ+C K+G++A +
Subjt:  --YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK

Query:  CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF------------------------------------------------NGLAISHLGYASFTSSN-
        C+ R   P + ++ ++P     + + +N  PQ                                                  +GL I+H G  SF SS  
Subjt:  CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF------------------------------------------------NGLAISHLGYASFTSSN-

Query:  -NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNT-AVFSTLPCVSNDVSA
            F LNNLL VPSITKNLISVSQFAKDN VFFEFHPT C VKDL TG+ LL+G L++GLY+F +      L+ S     +NT  VF+T+         
Subjt:  -NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNT-AVFSTLPCVSNDVSA

Query:  LYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA
            V  SN   +D+WH+RLGHP + IVK V+   +    T   ++FC ACA+GKHHA+PFS S T Y+ PLQLI  DLWGPA  +S +GF+YYISFVDA
Subjt:  LYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA

Query:  FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWD
        +SRYTWIYFL +KS+AF AF KFKT VEK  G  I S                            +   +N IVERKHR+I+++GLTLLS +++PL+FWD
Subjt:  FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWD

Query:  DAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA
        +AFSTSVYLINRLP+ VL  +SPLEKLF ++P++ +L+VFGCKC+P LRPY SHKLS RS+PCTF+GYS  HKGYKCL+SDGRL+ISRHVLFDENSFP+A
Subjt:  DAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA

Query:  SLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADH
        S  SHSS +P       P L S+  S+ +  + D       T ++    DH
Subjt:  SLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADH

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.7e-152 | 41.48
Query:  NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELI--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSES
        N+E S  S   QI   GNKIS VKL D+ FL+WKFQILTALE +DL++ +  +S+P  + +    S +A+    PNPAYKVWK+QD+L+SSW++GSMSE 
Subjt:  NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELI--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSES

Query:  ILEQVLH---------------------------------------------------------------------------------------------
        IL Q+LH                                                                                             
Subjt:  ILEQVLH---------------------------------------------------------------------------------------------

Query:  --YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK
           V++LLLT ES+ ESK +  S+  LPS N+  Q   + +        N   N+   N   GRG  RSN G+        NRN+PQCQ+C K+G++A +
Subjt:  --YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK

Query:  CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF------------------------------------------------NGLAISHLGYASFTSSN-
        C+ R   P + ++ ++P     + + +N  PQ                                                  +GL I+H G  SF SS  
Subjt:  CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF------------------------------------------------NGLAISHLGYASFTSSN-

Query:  -NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNT-AVFSTLPCVSNDVSA
            F LNNLL VPSITKNLISVSQFAKDN VFFEFHPT C VKDL TG+ LL+G L++GLY+F +      L+ S     +NT  VF+T+         
Subjt:  -NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNT-AVFSTLPCVSNDVSA

Query:  LYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA
            V  SN   +D+WH+RLGHP + IVK V+   +    T   ++FC ACA+GKHHA+PFS S T Y+ PLQLI  DLWGPA  +S +GF+YYISFVDA
Subjt:  LYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA

Query:  FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWD
        +SRYTWIYFL +KS+AF AF KFKT VEK  G  I S                            +   +N IVERKHR+I+++GLTLLS +++PL+FWD
Subjt:  FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWD

Query:  DAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA
        +AFSTSVYLINRLP+ VL  +SPLEKLF ++P++ +L+VFGCKC+P LRPY SHKLS RS+PCTF+GYS  HKGYKCL+SDGRL+ISRHVLFDENSFP+A
Subjt:  DAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFA

Query:  SLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADH
        S  SHSS+ P       P L S+  S+ +  + D       T ++    DH
Subjt:  SLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADH

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 8.5e-16 | 20.1
Query:  DKLVSSWIVGSMSESILEQVLHYVVALLLTHESRIESKSVINSDNV---LPSANLAVQNVS-QNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGR----
        D+L+S  +        ++++ H ++ L   ++  I +   ++ +N+        L  Q +  +N   + S         N     ++N  +NR  +    
Subjt:  DKLVSSWIVGSMSESILEQVLHYVVALLLTHESRIESKSVINSDNV---LPSANLAVQNVS-QNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGR----

Query:  -SWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQFNPPGQMNPS-GLNFSPQQFNGLAI-SHLGYASFTSSNNHMFHLNNLLH-----VPSITKNLI
           N++ + +C  C + GH    C+   ++      +     Q   S G+ F  ++ N  ++  + G+   + +++H+ +  +L       VP +   + 
Subjt:  -SWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQFNPPGQMNPS-GLNFSPQQFNGLAI-SHLGYASFTSSNNHMFHLNNLLH-----VPSITKNLI

Query:  SVSQF-----------AKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNL
           +F             D+ +  E    FC     A G  +    L E        +   +++ + +    N+ + + +P ++        S+ + +  
Subjt:  SVSQF-----------AKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNL

Query:  SIDVWHQRLGHPSISIVKQVVRS---CNPKVSTNAIMS--FCHACAIGKHHAMPFS--PSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRY
        +  +WH+R GH S   + ++ R     +  +  N  +S   C  C  GK   +PF      T    PL ++ +D+ GP   ++     Y++ FVD F+ Y
Subjt:  SIDVWHQRLGHPSISIVKQVVRS---CNPKVSTNAIMS--FCHACAIGKHHAMPFS--PSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRY

Query:  TWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIV------------------------SFKL------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDA
           Y ++ KS+ F  F  F    E  F   +V                        S+ L        NG+ ER  R I +   T++S + +  +FW +A
Subjt:  TWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIV------------------------SFKL------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDA

Query:  FSTSVYLINRLPS--IVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSS-DGRLYISRHVLFDENS
          T+ YLINR+PS  +V    +P E    K+P    L+VFG   +  ++     K   +S    F+GY     G+K   + + +  ++R V+ DE +
Subjt:  FSTSVYLINRLPS--IVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSS-DGRLYISRHVLFDENS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.0e-37; % identity: 26.74)
Query:  LNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVK-DLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQ
        L ++ HVP +  NLIS     +D    +  +  + + K  L   + + RGTL+               N  + + E N A                    
Subjt:  LNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVK-DLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQ

Query:  SSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTW
          + +S+D+WH+R+GH S   ++ + +      +    +  C  C  GK H + F  S+      L L+ +D+ GP    S  G +Y+++F+D  SR  W
Subjt:  SSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTW

Query:  IYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL------------------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFS
        +Y L+TK + FQ F KF   VE++ G  +   +                                 NG+ ER +R IV+   ++L  + +P +FW +A  
Subjt:  IYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL------------------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFS

Query:  TSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYI-SRHVLFDENSFPFASLT
        T+ YLINR PS+ L    P      K+  YS LKVFGC+ F  +      KL  +S PC FIGY     GY+      +  I SR V+F E+    A+  
Subjt:  TSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYI-SRHVLFDENSFPFASLT

Query:  SH---SSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIP--------------STADHPTNNSPSPCFLNRTHHMITRSKRGIFKPKAFLATF
        S    + + PN VT  +P+ S  ++ T+ ES++D    +SE    P                 +HPT        L R+      S+R  +    ++   
Subjt:  SH---SSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIP--------------STADHPTNNSPSPCFLNRTHHMITRSKRGIFKPKAFLATF

Query:  VDVEPPNVKETLKCSHWKQ---AMQDEYDAL
         D EP ++KE L      Q   AMQ+E ++L
Subjt:  VDVEPPNVKETLKCSHWKQ---AMQDEYDAL

Q12491 Transposon Ty2-B Gag-Pol polyprotein (e value: 3.2e-07; % identity: 21.51)
Query:  SGLNFSPQQFNGLAISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVK---DLATGRALLRGTLHEGLYRFNLPQPLP
        S +N    Q   + I+ +G   F   N     +   LH P+I  +L+S+S+ A  N        T C  +   + + G  L     H   Y  +    +P
Subjt:  SGLNFSPQQFNGLAISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVK---DLATGRALLRGTLHEGLYRFNLPQPLP

Query:  SLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSI-SIVKQVVRSCNPKVS------TNAIMSFCHACAIGK----HHAMPF
        S  + +     N +                   +S N     + H+ LGH +  SI K + ++    +       +NA    C  C IGK     H    
Subjt:  SLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSI-SIVKQVVRSCNPKVS------TNAIMSFCHACAIGK----HHAMPF

Query:  SPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSE--AFQAFLKFKTHVEKQFGTPIVSFKL-------------------
                 P Q + TD++GP + L      Y+ISF D  +R+ W+Y L  + E      F      ++ QF   ++  ++                   
Subjt:  SPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSE--AFQAFLKFKTHVEKQFGTPIVSFKL-------------------

Query:  -----------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPS
                     +G+ ER +R +++   TLL  S +P   W  A   S  + N L S
Subjt:  -----------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.3e-76; % identity: 30)
Query:  NGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLHYV--
        N  I+N  N  +  KLT  N+LMW  Q+    +G++L   + G +   P  I          + NP Y  WK+QDKL+ S ++G++S S+   V      
Subjt:  NGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLHYV--

Query:  -------------------------------------------------VALL---LTHESRIES---------KSVI---------------------N
                                                         +ALL   + H+ ++E          K VI                     +
Subjt:  -------------------------------------------------VALL---LTHESRIES---------KSVI---------------------N

Query:  SDNVLPSANLAVQNVSQNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSW----------NNRNRP---QCQLCNKIGHTAMKCYSRVQMPGAYATQF
           +L  ++  V  ++ N+V + +  +   N    R     N   N   + W          NN+++P   +CQ+C   GH+A +C S++Q   +     
Subjt:  SDNVLPSANLAVQNVSQNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSW----------NNRNRP---QCQLCNKIGHTAMKCYSRVQMPGAYATQF

Query:  NPPGQMNP----------------------SGLNFSPQQFNGLA--------------------ISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVS
         PP    P                         +     FN L+                    ISH G  S  S+ +   +L+N+L+VP+I KNLISV 
Subjt:  NPPGQMNP----------------------SGLNFSPQQFNGLA--------------------ISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVS

Query:  QFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSI
        +    N V  EF P    VKDL TG  LL+G   + LY +                          P  S+   +L++S   S+  +   WH RLGHP+ 
Subjt:  QFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSI

Query:  SIVKQVVRSCNPKV--STNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKF
        SI+  V+ + +  V   ++  +S C  C I K + +PFS ST + + PL+ I +D+W     LS   ++YY+ FVD F+RYTW+Y L+ KS+  + F+ F
Subjt:  SIVKQVVRSCNPKV--STNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKF

Query:  KTHVEKQFGTPIVSF---------KLME-------------------NGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSP
        K  +E +F T I +F          L E                   NG+ ERKHRHIV+ GLTLLSH+SIP T+W  AF+ +VYLINRLP+ +L   SP
Subjt:  KTHVEKQFGTPIVSF---------KLME-------------------NGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSP

Query:  LEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLS-SDGRLYISRHVLFDENSFPFAS-LTSHSSV-----SPNCVTQS
         +KLF   P+Y  L+VFGC C+P LRPYN HKL  +S  C F+GYS     Y CL     RLYISRHV FDEN FPF++ L + S V       +CV   
Subjt:  LEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLS-SDGRLYISRHVLFDENSFPFAS-LTSHSSV-----SPNCVTQS

Query:  LPTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNS
          TL + +      S SD H      ++ PS+   P  NS
Subjt:  LPTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.0e-81; % identity: 31.21)
Query:  NGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESI----------
        N  I+N  N  +  KLT  N+LMW  Q+    +G++L   + G S P+P     +     V + NP Y  W++QDKL+ S I+G++S S+          
Subjt:  NGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESI----------

Query:  -------------------------------------------LEQVLHYV-----------------VALLLTHESRIESKS---VINSDNVLPSANLA
                                                   +E+VL  +                  +L   HE  I  +S    +NS  V+P     
Subjt:  -------------------------------------------LEQVLHYV-----------------VALLLTHESRIESKS---VINSDNVLPSANLA

Query:  VQNVSQNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRP---QCQLCNKIGHTAMKCYSRVQMPG--------------------AYATQFN
        V + + N+  N +     +N+ N   RS S    + G RS N + +P   +CQ+C+  GH+A +C    Q                       A  + +N
Subjt:  VQNVSQNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRP---QCQLCNKIGHTAMKCYSRVQMPG--------------------AYATQFN

Query:  PPGQMNPSG-LNFSPQQFNGLA--------------------ISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDL
            +  SG  +     FN L+                    I+H G AS  +S+  +  LN +L+VP+I KNLISV +    N V  EF P    VKDL
Subjt:  PPGQMNPSG-LNFSPQQFNGLA--------------------ISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDL

Query:  ATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCN-PKVSTNAIMS
         TG  LL+G   + LY +                          P  S+   ++++S  S    S   WH RLGHPS++I+  V+ + + P ++ +  + 
Subjt:  ATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCN-PKVSTNAIMS

Query:  FCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS--------F
         C  C I K H +PFS ST + S PL+ I +D+W     LS   ++YY+ FVD F+RYTW+Y L+ KS+    F+ FK+ VE +F T I +        F
Subjt:  FCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS--------F

Query:  KLM--------------------ENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFP
         ++                     NG+ ERKHRHIV++GLTLLSH+S+P T+W  AFS +VYLINRLP+ +L   SP +KLF + P+Y  LKVFGC C+P
Subjt:  KLM--------------------ENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFP

Query:  CLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLS-SDGRLYISRHVLFDENSFPFASLT------------------SHSSV--------SPNCVTQSL--
         LRPYN HKL  +S  C F+GYS     Y CL    GRLY SRHV FDE  FPF++                    SH+++        +P C+   L  
Subjt:  CLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLS-SDGRLYISRHVLFDENSFPFASLT------------------SHSSV--------SPNCVTQSL--

Query:  ----PTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNSPSP
            P+  S   +T V SS+    SIS  SS   TA  P++N P P
Subjt:  ----PTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNSPSP

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 1.2e-04; % identity: 28.77)
Query:  VVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTN
        V+K L   R +L+G  H+ LY          L  SV   E+N A        + D + L              WH RL H S   ++ +V+      S  
Subjt:  VVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTN

Query:  AIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWG-PAYKLS
        + + FC  C  GK H + FS    +   PL  + +DLWG P+  LS
Subjt:  AIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWG-PAYKLS


Sequences
CDS sequence
ATGGAAACCTCTGAGAACAATTCTGAGATCTCAAGCGGCTCGCAAAATGGCCAAATCGTCAATCCTGGGAATAAGATCTCAACTGTGAAGCTGACCGATGAAAACTTCCT
CATGTGGAAATTTCAGATCCTCACTGCTCTCGAAGGTCATGATCTTGATGACCATATCAGTGGCGATTCTCAACCACTGCCTGAGCTAATCCAGGTAAGTGAAAATGCGA
CGACGGTCAGTAAGCCTAACCCTGCCTATAAAGTTTGGAAAAAGCAAGATAAATTAGTGTCCTCGTGGATTGTTGGGTCTATGTCTGAATCCATCCTCGAGCAAGTCCTT
CACTATGTTGTAGCCCTGTTATTGACTCATGAGAGTAGAATAGAGAGTAAGTCAGTGATTAATTCTGATAATGTTTTACCTTCTGCGAACCTAGCGGTTCAAAATGTGAG
TCAAAACTCAGTTCCAAATCCTTCCCCTAACTCTCAACAGCAAAATTTTGGTAATGGTAGAGGTAGGAGTCGGTCTAATTTTGGTCAAAACAGAGGAGGAAGGTCCTGGA
ACAATCGTAATAGGCCTCAGTGTCAATTGTGTAATAAAATTGGCCATACTGCTATGAAGTGCTACTCTCGAGTTCAGATGCCAGGAGCCTATGCAACTCAATTTAATCCT
CCTGGGCAAATGAATCCCTCAGGCCTGAATTTTAGTCCACAACAATTTAATGGTTTGGCTATTTCTCATCTTGGATATGCTTCTTTTACTTCTTCAAATAATCATATGTT
TCATTTAAATAACCTTTTACATGTTCCCTCCATTACAAAAAATCTTATCAGTGTCAGTCAATTCGCTAAGGATAATGCTGTTTTCTTTGAATTTCATCCAACTTTCTGTG
TTGTGAAGGATCTAGCAACTGGACGGGCACTCCTTCGAGGGACTCTACATGAAGGACTATATAGGTTCAACCTGCCGCAGCCTTTGCCATCGTTAAACACCTCTGTTGTT
CGACCGGAAACTAATACTGCTGTTTTTTCTACTTTACCTTGTGTTTCCAATGATGTGTCTGCTTTATATTCCTCTGTTCAGTCTTCAAATAATTTGTCCATAGATGTTTG
GCATCAACGTCTTGGACATCCATCTATTTCTATTGTTAAGCAAGTTGTTCGTTCCTGTAATCCAAAAGTTTCTACTAATGCCATTATGTCATTTTGTCATGCATGTGCAA
TAGGCAAACATCATGCCATGCCTTTCTCTCCCTCTACTACATCTTACTCTGCTCCTTTGCAACTTATAGTTACTGATTTATGGGGTCCAGCTTATAAACTGTCTACCCAT
GGATTTCAGTATTACATTAGCTTTGTGGATGCTTTTTCGAGATATACATGGATTTATTTTCTTCAAACCAAGTCTGAAGCATTTCAGGCTTTTCTCAAATTTAAAACTCA
TGTGGAAAAACAGTTTGGAACTCCTATTGTTTCCTTCAAACTGATGGAGAATGGCATTGTAGAACGTAAACATCGTCACATTGTTGATGTTGGTCTCACCTTGTTATCAC
ATTCTTCTATACCTCTAACATTCTGGGATGATGCTTTTTCCACCAGTGTTTATCTTATCAACAGGTTACCCTCTATAGTTCTTGGTGGCATGAGTCCCTTGGAGAAGCTC
TTTCGGAAGCAACCAGATTATTCCACACTTAAAGTCTTTGGTTGTAAGTGTTTTCCTTGCCTTCGCCCATATAATTCTCATAAGTTGAGTTTTCGGTCGAGTCCCTGTAC
ATTCATTGGTTATAGTCATATTCATAAAGGCTATAAATGTTTGTCCTCTGACGGTAGACTTTATATCTCTAGACATGTATTGTTTGATGAAAATTCTTTTCCATTTGCTT
CTCTTACTTCTCATTCTTCTGTTTCTCCCAATTGTGTAACTCAAAGTTTACCTACATTGTCTTCTGTTTCTTCCTCTACTACAGTTGAGTCTTCTTCTGATGCACACTTA
AGTATCAGTGAGACATCCTCTATTCCATCAACTGCTGATCATCCTACTAATAATAGTCCTTCACCCTGTTTCCTGAACCGTACTCACCACATGATTACACGAAGTAAAAG
AGGCATATTCAAACCTAAGGCTTTTCTTGCTACCTTTGTTGATGTTGAACCGCCTAATGTTAAAGAGACCCTTAAATGTTCTCATTGGAAACAAGCAATGCAAGATGAGT
ATGATGCTCTTAACGCAATTTTGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACTAAGAAGGCAAAACCGGCAAATG
GGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTGGTCCCGTCTGGTCTCCACCGCCTCTGGATGCC
CCGGCCACGTCTTCCCCCCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGGTATGGAAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACC
AGCGGAGGTCACGTGACGAGGATAGCATCCGGGGGTCACCGAGACAAGCAGGCCGAGGCCGAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAG
GATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAAATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAA
GGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCACACGA
TTTTGAGGAGCCCAGAGTCAAGCACACTTAAAGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGC
AGAAGTGCCAAAACTGCCGAGTCCGAAGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGA
GGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACT
TGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAA
GACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGA
GCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACA
GCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCA
CAGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATACAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGAC
CGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCCGACCAGACAATGGCCGAGGCCGACCAGACCAAGGAGCACCTCCTTTCGGTAAGTTTGAGAAATACACCC
CAACTGCTGTTCCGCAGGAGCAAGTACTGATGGAGATCCGAAATACGGGCCTCCTGAAATTCCCAGGGAGGATGAAGTCGAGTGCCGATAGAAGAGACAAGAGCCAGTAT
TGTCTTTTCCACCGGGACCACGGGCATTCAACCAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATTTGAAAGAGTTCGTCGGTGAGCC
AAAGGCCGAGGCCGACCACGGATGGCCGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGAACCATCTTTGGAGGACCAGCAGGAG
GAGGTTCGAGCAGGAAGAGGAAAGCTATGGTCAGGGAAGCAAGGTCCGAACCAGAATATCGAGGTATGTACTCTGTCCATCTATCAAAGGCACACCCCCCTTTGGAGTTC
ACTGAGGCTGAGGCAGCGAGCATTCATCAGCCACATAATGATGCTCTGGTGGTCACTCTAATCGTAGCCAATGTGAAAATCCATCGGATCCTAATTGATGGGGGAAGCTC
GGCTGATGTCCTTTCTTAA
mRNA sequence
ATGGAAACCTCTGAGAACAATTCTGAGATCTCAAGCGGCTCGCAAAATGGCCAAATCGTCAATCCTGGGAATAAGATCTCAACTGTGAAGCTGACCGATGAAAACTTCCT
CATGTGGAAATTTCAGATCCTCACTGCTCTCGAAGGTCATGATCTTGATGACCATATCAGTGGCGATTCTCAACCACTGCCTGAGCTAATCCAGGTAAGTGAAAATGCGA
CGACGGTCAGTAAGCCTAACCCTGCCTATAAAGTTTGGAAAAAGCAAGATAAATTAGTGTCCTCGTGGATTGTTGGGTCTATGTCTGAATCCATCCTCGAGCAAGTCCTT
CACTATGTTGTAGCCCTGTTATTGACTCATGAGAGTAGAATAGAGAGTAAGTCAGTGATTAATTCTGATAATGTTTTACCTTCTGCGAACCTAGCGGTTCAAAATGTGAG
TCAAAACTCAGTTCCAAATCCTTCCCCTAACTCTCAACAGCAAAATTTTGGTAATGGTAGAGGTAGGAGTCGGTCTAATTTTGGTCAAAACAGAGGAGGAAGGTCCTGGA
ACAATCGTAATAGGCCTCAGTGTCAATTGTGTAATAAAATTGGCCATACTGCTATGAAGTGCTACTCTCGAGTTCAGATGCCAGGAGCCTATGCAACTCAATTTAATCCT
CCTGGGCAAATGAATCCCTCAGGCCTGAATTTTAGTCCACAACAATTTAATGGTTTGGCTATTTCTCATCTTGGATATGCTTCTTTTACTTCTTCAAATAATCATATGTT
TCATTTAAATAACCTTTTACATGTTCCCTCCATTACAAAAAATCTTATCAGTGTCAGTCAATTCGCTAAGGATAATGCTGTTTTCTTTGAATTTCATCCAACTTTCTGTG
TTGTGAAGGATCTAGCAACTGGACGGGCACTCCTTCGAGGGACTCTACATGAAGGACTATATAGGTTCAACCTGCCGCAGCCTTTGCCATCGTTAAACACCTCTGTTGTT
CGACCGGAAACTAATACTGCTGTTTTTTCTACTTTACCTTGTGTTTCCAATGATGTGTCTGCTTTATATTCCTCTGTTCAGTCTTCAAATAATTTGTCCATAGATGTTTG
GCATCAACGTCTTGGACATCCATCTATTTCTATTGTTAAGCAAGTTGTTCGTTCCTGTAATCCAAAAGTTTCTACTAATGCCATTATGTCATTTTGTCATGCATGTGCAA
TAGGCAAACATCATGCCATGCCTTTCTCTCCCTCTACTACATCTTACTCTGCTCCTTTGCAACTTATAGTTACTGATTTATGGGGTCCAGCTTATAAACTGTCTACCCAT
GGATTTCAGTATTACATTAGCTTTGTGGATGCTTTTTCGAGATATACATGGATTTATTTTCTTCAAACCAAGTCTGAAGCATTTCAGGCTTTTCTCAAATTTAAAACTCA
TGTGGAAAAACAGTTTGGAACTCCTATTGTTTCCTTCAAACTGATGGAGAATGGCATTGTAGAACGTAAACATCGTCACATTGTTGATGTTGGTCTCACCTTGTTATCAC
ATTCTTCTATACCTCTAACATTCTGGGATGATGCTTTTTCCACCAGTGTTTATCTTATCAACAGGTTACCCTCTATAGTTCTTGGTGGCATGAGTCCCTTGGAGAAGCTC
TTTCGGAAGCAACCAGATTATTCCACACTTAAAGTCTTTGGTTGTAAGTGTTTTCCTTGCCTTCGCCCATATAATTCTCATAAGTTGAGTTTTCGGTCGAGTCCCTGTAC
ATTCATTGGTTATAGTCATATTCATAAAGGCTATAAATGTTTGTCCTCTGACGGTAGACTTTATATCTCTAGACATGTATTGTTTGATGAAAATTCTTTTCCATTTGCTT
CTCTTACTTCTCATTCTTCTGTTTCTCCCAATTGTGTAACTCAAAGTTTACCTACATTGTCTTCTGTTTCTTCCTCTACTACAGTTGAGTCTTCTTCTGATGCACACTTA
AGTATCAGTGAGACATCCTCTATTCCATCAACTGCTGATCATCCTACTAATAATAGTCCTTCACCCTGTTTCCTGAACCGTACTCACCACATGATTACACGAAGTAAAAG
AGGCATATTCAAACCTAAGGCTTTTCTTGCTACCTTTGTTGATGTTGAACCGCCTAATGTTAAAGAGACCCTTAAATGTTCTCATTGGAAACAAGCAATGCAAGATGAGT
ATGATGCTCTTAACGCAATTTTGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACTAAGAAGGCAAAACCGGCAAATG
GGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTGGTCCCGTCTGGTCTCCACCGCCTCTGGATGCC
CCGGCCACGTCTTCCCCCCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGGTATGGAAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACC
AGCGGAGGTCACGTGACGAGGATAGCATCCGGGGGTCACCGAGACAAGCAGGCCGAGGCCGAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAG
GATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAAATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAA
GGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCACACGA
TTTTGAGGAGCCCAGAGTCAAGCACACTTAAAGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGC
AGAAGTGCCAAAACTGCCGAGTCCGAAGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGA
GGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACT
TGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAA
GACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGA
GCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACA
GCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCA
CAGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATACAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGAC
CGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCCGACCAGACAATGGCCGAGGCCGACCAGACCAAGGAGCACCTCCTTTCGGTAAGTTTGAGAAATACACCC
CAACTGCTGTTCCGCAGGAGCAAGTACTGATGGAGATCCGAAATACGGGCCTCCTGAAATTCCCAGGGAGGATGAAGTCGAGTGCCGATAGAAGAGACAAGAGCCAGTAT
TGTCTTTTCCACCGGGACCACGGGCATTCAACCAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATTTGAAAGAGTTCGTCGGTGAGCC
AAAGGCCGAGGCCGACCACGGATGGCCGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGAACCATCTTTGGAGGACCAGCAGGAG
GAGGTTCGAGCAGGAAGAGGAAAGCTATGGTCAGGGAAGCAAGGTCCGAACCAGAATATCGAGGTATGTACTCTGTCCATCTATCAAAGGCACACCCCCCTTTGGAGTTC
ACTGAGGCTGAGGCAGCGAGCATTCATCAGCCACATAATGATGCTCTGGTGGTCACTCTAATCGTAGCCAATGTGAAAATCCATCGGATCCTAATTGATGGGGGAAGCTC
GGCTGATGTCCTTTCTTAA
Protein sequence
METSENNSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVL
HYVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQFNP
PGQMNPSGLNFSPQQFNGLAISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVV
RPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTH
GFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKL
FRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFASLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHL
SISETSSIPSTADHPTNNSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDALNAILDHPDTQGADEDNRGEIGLKDGLRRQNRQM
GRAKTEGVGFSARPPARPARAGRVRLVPSGLHRLWMPRPRLPPSTTNLPLVAREGYGKKDQDVNIENSDGDRHQRRSRDEDSIRGSPRQAGRGRGRGRAEDADTKIAALE
DEVKGMNRSLSKILQILDKPGPSTKVHEGSLIRDPRKGKEPMEHTAESGTRSRGKKTDSMTSKVRGLKPTDHTILRSPESSTLKGRHYTVSTPSFGHTKTDLRNLIVEKR
RSAKTAESEAKAAEAEARAAEAEARAAEAEARLAEAEAKKDDLPWKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKK
DPVQHLNAYRSWMDFHGVSDAIRVSPSIPCTVHGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRA
QKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRTDEGGRGRADHGRGRPDNGRGRPDQGAPPFGKFEKYTPTAVPQEQVLMEIRNTGLLKFPGRMKSSADRRDKSQY
CLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGGPAGGGSSRKRKAMVREARSEPEYRGMYSVHLSKAHPPLEF
TEAEAASIHQPHNDALVVTLIVANVKIHRILIDGGSSADVLS