; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022097 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022097
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:18304233..18308315
RNA-Seq ExpressionLag0022097
SyntenyLag0022097
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AFP55537.1 retrotransposon polyprotein [Rosa rugosa]1.3e-4031.62Show/hide
Query:  VPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRI-ESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFG-------N
        VP  D +  +++ +G  YET V    A+    T + +  LLL+ E R+ E      A+G+        +   +    + R  S      S G       +
Subjt:  VPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRI-ESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFG-------N

Query:  GRGRGRFNSSQG------------------RGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMP----------GAYLTQFNPSSQFFLGQNSGQMY---
         RGRG F+ S+G                    G S+ +  + QCQIC++ GH+ + C++R+ +            A     +PS + +L       +   
Subjt:  GRGRGRFNSSQG------------------RGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMP----------GAYLTQFNPSSQFFLGQNSGQMY---

Query:  --GKQFSP--MQAMTATPSFNQDS-LAINRFGYASLTSPSNHVFHLNNLLHVPSITKNLISVRLYKFNLSK-------PQHSIPSNANLAVFKSFNNSVS
          G   SP   + +      N DS L+I   G + L S S+  FHLN++LH PS + NL+SV  YKF +         P +    +           S +
Subjt:  --GKQFSP--MQAMTATPSFNQDS-LAINRFGYASLTSPSNHVFHLNNLLHVPSITKNLISVRLYKFNLSK-------PQHSIPSNANLAVFKSFNNSVS

Query:  AVSESILPSNSIALLASVQSLNDK-SYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPS--YK-------LSRNGF--RVTCPYTAQQN
         +     PS +  + +S+  ++ + S + WH RLGH   +IV +VL        G+ N SFC +C +G S  YK       LS NG   R +CP   +QN
Subjt:  AVSESILPSNSIALLASVQSLNDK-SYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPS--YK-------LSRNGF--RVTCPYTAQQN

Query:  GIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        G+ ERKHRHIVD GLTLL+ + MP T+W D+F+T+V++IN+LPS VL  ASP E+L  R P Y SLKV
Subjt:  GIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.0e-6128.8Show/hide
Query:  LKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGT--STVSKPNPHYKL
        +  T+SLL +         N+E SS   + Q    GNKIS +KL ++ FLLWKF++ TALE ++LE  +  + +PP + +  +E +  S    PNP YK+
Subjt:  LKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGT--STVSKPNPHYKL

Query:  WKR---------------------------------------------------------------------------------KTVPVEDHIMYILSGL
        WKR                                                                                 K V  +DHI+YIL+GL
Subjt:  WKR---------------------------------------------------------------------------------KTVPVEDHIMYILSGL

Query:  GAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQS--QQQQSFGNGRGRGRFNSSQGRGGKSWNS
        G++Y++M+ VI+A+  S +VQ+V++LLLT ES+ ESK  + ++  LP+ N+  Q  T+K  E+    +Q+      S+    GRG   S++GR G    +
Subjt:  GAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQS--QQQQSFGNGRGRGRFNSSQGRGGKSWNS

Query:  RNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDS-------------------------------
        RNKPQCQIC K+G++  +C+ R   P +  + ++P+S      N+          M AM A    N DS                               
Subjt:  RNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDS-------------------------------

Query:  -------LAINRFGYASLTSPS--NHVFHLNNLLHVPSITKNLISVR----------------LYKFNLSKPQHSIPSNANLAVFK-----------SFN
               L I  +G  S  S +     F LNNLL VPSITKNLISV                  Y  +L   Q  +    N  ++K             N
Subjt:  -------LAINRFGYASLTSPS--NHVFHLNNLLHVPSITKNLISVR----------------LYKFNLSKPQHSIPSNANLAVFK-----------SFN

Query:  NSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNEN-ISFCNACAI---------------------------GP
        ++   V  +++P ++  LL           D+WH+RLGHP   IV+ VL   + N SG  N ++FC ACA+                           GP
Subjt:  NSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNEN-ISFCNACAI---------------------------GP

Query:  SYKLSRNGF---------------------------------------------------------------------RVTCPYTAQQNGIIERKHRHIV
        +  +S NGF                                                                     R+TCPYT++QN I+ERKHR+I+
Subjt:  SYKLSRNGF---------------------------------------------------------------------RVTCPYTAQQNGIIERKHRHIV

Query:  DVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        ++GLTLLSQ+++PL+FWD+AFSTSV+LIN+LP+ VL   SP+EKL  R+P++ SL+V
Subjt:  DVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]7.8e-4630.54Show/hide
Query:  TVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRGRF
        ++P +D I++IL G+G EYE++V+ +T+++ S ++ +V ALLL  E RIE+ +        P+ N+T   P+Q+  EN      + Q Q    GRGRGR 
Subjt:  TVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRGRF

Query:  NSSQGRGG-KSWNSRNKPQCQICNKIGHTTLKCYSRV------QMPGAYLT---QFNPSS-----QFFLGQNSGQMYGKQFSP-----------MQAMTA
            GRGG K W++  +P CQIC   GH    CY R       +  G   T   QFN SS       F    S     + + P           +  ++ 
Subjt:  NSSQGRGG-KSWNSRNKPQCQICNKIGHTTLKCYSRV------QMPGAYLT---QFNPSS-----QFFLGQNSGQMYGKQFSP-----------MQAMTA

Query:  TPSF---------NQDSLAINRFGYASLTS-PSNHVFHLNNLLHVPSITKNLISVR--------LYKFNLSKPQHSIPSNANLAVFKSFNNSV---SAVS
        +  +         N   L+I+  G ++L   PS+  F L NLLHVP ITKNLISV          ++F+ S      P+   + +  + +N +   +  S
Subjt:  TPSF---------NQDSLAINRFGYASLTS-PSNHVFHLNNLLHVPSITKNLISVR--------LYKFNLSKPQHSIPSNANLAVFKSFNNSV---SAVS

Query:  ESILPSNSIALLASVQS-----------LNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAI--------------------------
            P +S A L S  S           L   + D WH RLGHP  + V+QVL  CN   S N+NISFC++C +                          
Subjt:  ESILPSNSIALLASVQS-----------LNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAI--------------------------

Query:  -GPSYKLSRNG---------------------------------------------------------------------FRVTCPYTAQQNGIIERKHR
         GP++  SRNG                                                                      R +CPYT++QNG++ERKHR
Subjt:  -GPSYKLSRNG---------------------------------------------------------------------FRVTCPYTAQQNGIIERKHR

Query:  HIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        H+VD GL+LL+ +S+P  FW+DAF ++V+LIN+LPS  LG  SP   L  R+PDY+ L+V
Subjt:  HIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

RVW44519.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.2e-4326.28Show/hide
Query:  SETNSEISSGNQVT------QAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGTSTVSKPNPHYKLWKRK--------
        S+ ++E   GN  T        I+P +++ T++L ++NFL+WK+++  A+ G+ LE  +    Q P + +    G   V  PNP ++ ++R+        
Subjt:  SETNSEISSGNQVT------QAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGTSTVSKPNPHYKLWKRK--------

Query:  --------------------------------------------TVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSIN
                                                     +   DHI+ I+ GLG EYE+++ VI++K  S ++Q V + L+  E RI  K S N
Subjt:  --------------------------------------------TVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSIN

Query:  ADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSF-GNGRGRGRFNSSQGRGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLG
           V  T+  + + P+     N    S  Q +  F GN   RG F  ++GRG        KPQCQ+CNK GHT  +C+ R           N  +   LG
Subjt:  ADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSF-GNGRGRGRFNSSQGRGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLG

Query:  QN-----SGQM--------------YGKQFSPMQAMTATP-----------------------------SFNQDS---------LAINRFGYASLTSPS-
               SG +                + +S M+AM ATP                              +N +S         L I+  G +   S S 
Subjt:  QN-----SGQM--------------YGKQFSPMQAMTATP-----------------------------SFNQDS---------LAINRFGYASLTSPS-

Query:  -NHVFHLNNLLHVPSITKNLISVR------------------------------------LYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSESILPSN
         N V  L N+L VP+I KNL+SV                                     LY+FNLSK      S  +L+  K   N ++  + S++ ++
Subjt:  -NHVFHLNNLLHVPSITKNLISVR------------------------------------LYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSESILPSN

Query:  SIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPSYKL--------------------------------------
        +        S +   +D+WH+RLGHP   IV QVL      FS     S C+AC +G S+ L                                      
Subjt:  SIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPSYKL--------------------------------------

Query:  --------------------------------------------------------SRNGF--RVTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLT
                                                                 +NG   R++CP+T++QNGIIERKHRHIV++GLTLL+Q+S+PL 
Subjt:  --------------------------------------------------------SRNGF--RVTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLT

Query:  FWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        +W DAFST+VFLIN+LP+ VL    P E L   +P+Y+ LKV
Subjt:  FWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.0e-6128.8Show/hide
Query:  LKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGT--STVSKPNPHYKL
        +  T+SLL +         N+E SS   + Q    GNKIS +KL ++ FLLWKF++ TALE ++LE  +  + +PP + +  +E +  S    PNP YK+
Subjt:  LKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGT--STVSKPNPHYKL

Query:  WKR---------------------------------------------------------------------------------KTVPVEDHIMYILSGL
        WKR                                                                                 K V  +DHI+YIL+GL
Subjt:  WKR---------------------------------------------------------------------------------KTVPVEDHIMYILSGL

Query:  GAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQS--QQQQSFGNGRGRGRFNSSQGRGGKSWNS
        G++Y++M+ VI+A+  S +VQ+V++LLLT ES+ ESK  + ++  LP+ N+  Q  T+K  E+    +Q+      S+    GRG   S++GR G    +
Subjt:  GAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQS--QQQQSFGNGRGRGRFNSSQGRGGKSWNS

Query:  RNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDS-------------------------------
        RNKPQCQIC K+G++  +C+ R   P +  + ++P+S      N+          M AM A    N DS                               
Subjt:  RNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDS-------------------------------

Query:  -------LAINRFGYASLTSPS--NHVFHLNNLLHVPSITKNLISVR----------------LYKFNLSKPQHSIPSNANLAVFK-----------SFN
               L I  +G  S  S +     F LNNLL VPSITKNLISV                  Y  +L   Q  +    N  ++K             N
Subjt:  -------LAINRFGYASLTSPS--NHVFHLNNLLHVPSITKNLISVR----------------LYKFNLSKPQHSIPSNANLAVFK-----------SFN

Query:  NSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNEN-ISFCNACAI---------------------------GP
        ++   V  +++P ++  LL           D+WH+RLGHP   IV+ VL   + N SG  N ++FC ACA+                           GP
Subjt:  NSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNEN-ISFCNACAI---------------------------GP

Query:  SYKLSRNGF---------------------------------------------------------------------RVTCPYTAQQNGIIERKHRHIV
        +  +S NGF                                                                     R+TCPYT++QN I+ERKHR+I+
Subjt:  SYKLSRNGF---------------------------------------------------------------------RVTCPYTAQQNGIIERKHRHIV

Query:  DVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        ++GLTLLSQ+++PL+FWD+AFSTSV+LIN+LP+ VL   SP+EKL  R+P++ SL+V
Subjt:  DVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein3.8e-4630.54Show/hide
Query:  TVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRGRF
        ++P +D I++IL G+G EYE++V+ +T+++ S ++ +V ALLL  E RIE+ +        P+ N+T   P+Q+  EN      + Q Q    GRGRGR 
Subjt:  TVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRGRF

Query:  NSSQGRGG-KSWNSRNKPQCQICNKIGHTTLKCYSRV------QMPGAYLT---QFNPSS-----QFFLGQNSGQMYGKQFSP-----------MQAMTA
            GRGG K W++  +P CQIC   GH    CY R       +  G   T   QFN SS       F    S     + + P           +  ++ 
Subjt:  NSSQGRGG-KSWNSRNKPQCQICNKIGHTTLKCYSRV------QMPGAYLT---QFNPSS-----QFFLGQNSGQMYGKQFSP-----------MQAMTA

Query:  TPSF---------NQDSLAINRFGYASLTS-PSNHVFHLNNLLHVPSITKNLISVR--------LYKFNLSKPQHSIPSNANLAVFKSFNNSV---SAVS
        +  +         N   L+I+  G ++L   PS+  F L NLLHVP ITKNLISV          ++F+ S      P+   + +  + +N +   +  S
Subjt:  TPSF---------NQDSLAINRFGYASLTS-PSNHVFHLNNLLHVPSITKNLISVR--------LYKFNLSKPQHSIPSNANLAVFKSFNNSV---SAVS

Query:  ESILPSNSIALLASVQS-----------LNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAI--------------------------
            P +S A L S  S           L   + D WH RLGHP  + V+QVL  CN   S N+NISFC++C +                          
Subjt:  ESILPSNSIALLASVQS-----------LNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAI--------------------------

Query:  -GPSYKLSRNG---------------------------------------------------------------------FRVTCPYTAQQNGIIERKHR
         GP++  SRNG                                                                      R +CPYT++QNG++ERKHR
Subjt:  -GPSYKLSRNG---------------------------------------------------------------------FRVTCPYTAQQNGIIERKHR

Query:  HIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        H+VD GL+LL+ +S+P  FW+DAF ++V+LIN+LPS  LG  SP   L  R+PDY+ L+V
Subjt:  HIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

A0A438EA49 Retrovirus-related Pol polyprotein from transposon TNT 1-946.0e-4426.28Show/hide
Query:  SETNSEISSGNQVT------QAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGTSTVSKPNPHYKLWKRK--------
        S+ ++E   GN  T        I+P +++ T++L ++NFL+WK+++  A+ G+ LE  +    Q P + +    G   V  PNP ++ ++R+        
Subjt:  SETNSEISSGNQVT------QAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGTSTVSKPNPHYKLWKRK--------

Query:  --------------------------------------------TVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSIN
                                                     +   DHI+ I+ GLG EYE+++ VI++K  S ++Q V + L+  E RI  K S N
Subjt:  --------------------------------------------TVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSIN

Query:  ADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSF-GNGRGRGRFNSSQGRGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLG
           V  T+  + + P+     N    S  Q +  F GN   RG F  ++GRG        KPQCQ+CNK GHT  +C+ R           N  +   LG
Subjt:  ADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSF-GNGRGRGRFNSSQGRGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLG

Query:  QN-----SGQM--------------YGKQFSPMQAMTATP-----------------------------SFNQDS---------LAINRFGYASLTSPS-
               SG +                + +S M+AM ATP                              +N +S         L I+  G +   S S 
Subjt:  QN-----SGQM--------------YGKQFSPMQAMTATP-----------------------------SFNQDS---------LAINRFGYASLTSPS-

Query:  -NHVFHLNNLLHVPSITKNLISVR------------------------------------LYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSESILPSN
         N V  L N+L VP+I KNL+SV                                     LY+FNLSK      S  +L+  K   N ++  + S++ ++
Subjt:  -NHVFHLNNLLHVPSITKNLISVR------------------------------------LYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSESILPSN

Query:  SIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPSYKL--------------------------------------
        +        S +   +D+WH+RLGHP   IV QVL      FS     S C+AC +G S+ L                                      
Subjt:  SIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPSYKL--------------------------------------

Query:  --------------------------------------------------------SRNGF--RVTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLT
                                                                 +NG   R++CP+T++QNGIIERKHRHIV++GLTLL+Q+S+PL 
Subjt:  --------------------------------------------------------SRNGF--RVTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLT

Query:  FWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        +W DAFST+VFLIN+LP+ VL    P E L   +P+Y+ LKV
Subjt:  FWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-6128.8Show/hide
Query:  LKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGT--STVSKPNPHYKL
        +  T+SLL +         N+E SS   + Q    GNKIS +KL ++ FLLWKF++ TALE ++LE  +  + +PP + +  +E +  S    PNP YK+
Subjt:  LKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGT--STVSKPNPHYKL

Query:  WKR---------------------------------------------------------------------------------KTVPVEDHIMYILSGL
        WKR                                                                                 K V  +DHI+YIL+GL
Subjt:  WKR---------------------------------------------------------------------------------KTVPVEDHIMYILSGL

Query:  GAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQS--QQQQSFGNGRGRGRFNSSQGRGGKSWNS
        G++Y++M+ VI+A+  S +VQ+V++LLLT ES+ ESK  + ++  LP+ N+  Q  T+K  E+    +Q+      S+    GRG   S++GR G    +
Subjt:  GAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQS--QQQQSFGNGRGRGRFNSSQGRGGKSWNS

Query:  RNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDS-------------------------------
        RNKPQCQIC K+G++  +C+ R   P +  + ++P+S      N+          M AM A    N DS                               
Subjt:  RNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDS-------------------------------

Query:  -------LAINRFGYASLTSPS--NHVFHLNNLLHVPSITKNLISVR----------------LYKFNLSKPQHSIPSNANLAVFK-----------SFN
               L I  +G  S  S +     F LNNLL VPSITKNLISV                  Y  +L   Q  +    N  ++K             N
Subjt:  -------LAINRFGYASLTSPS--NHVFHLNNLLHVPSITKNLISVR----------------LYKFNLSKPQHSIPSNANLAVFK-----------SFN

Query:  NSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNEN-ISFCNACAI---------------------------GP
        ++   V  +++P ++  LL           D+WH+RLGHP   IV+ VL   + N SG  N ++FC ACA+                           GP
Subjt:  NSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNEN-ISFCNACAI---------------------------GP

Query:  SYKLSRNGF---------------------------------------------------------------------RVTCPYTAQQNGIIERKHRHIV
        +  +S NGF                                                                     R+TCPYT++QN I+ERKHR+I+
Subjt:  SYKLSRNGF---------------------------------------------------------------------RVTCPYTAQQNGIIERKHRHIV

Query:  DVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        ++GLTLLSQ+++PL+FWD+AFSTSV+LIN+LP+ VL   SP+EKL  R+P++ SL+V
Subjt:  DVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-6128.8Show/hide
Query:  LKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGT--STVSKPNPHYKL
        +  T+SLL +         N+E SS   + Q    GNKIS +KL ++ FLLWKF++ TALE ++LE  +  + +PP + +  +E +  S    PNP YK+
Subjt:  LKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGT--STVSKPNPHYKL

Query:  WKR---------------------------------------------------------------------------------KTVPVEDHIMYILSGL
        WKR                                                                                 K V  +DHI+YIL+GL
Subjt:  WKR---------------------------------------------------------------------------------KTVPVEDHIMYILSGL

Query:  GAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQS--QQQQSFGNGRGRGRFNSSQGRGGKSWNS
        G++Y++M+ VI+A+  S +VQ+V++LLLT ES+ ESK  + ++  LP+ N+  Q  T+K  E+    +Q+      S+    GRG   S++GR G    +
Subjt:  GAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQS--QQQQSFGNGRGRGRFNSSQGRGGKSWNS

Query:  RNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDS-------------------------------
        RNKPQCQIC K+G++  +C+ R   P +  + ++P+S      N+          M AM A    N DS                               
Subjt:  RNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDS-------------------------------

Query:  -------LAINRFGYASLTSPS--NHVFHLNNLLHVPSITKNLISVR----------------LYKFNLSKPQHSIPSNANLAVFK-----------SFN
               L I  +G  S  S +     F LNNLL VPSITKNLISV                  Y  +L   Q  +    N  ++K             N
Subjt:  -------LAINRFGYASLTSPS--NHVFHLNNLLHVPSITKNLISVR----------------LYKFNLSKPQHSIPSNANLAVFK-----------SFN

Query:  NSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNEN-ISFCNACAI---------------------------GP
        ++   V  +++P ++  LL           D+WH+RLGHP   IV+ VL   + N SG  N ++FC ACA+                           GP
Subjt:  NSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNEN-ISFCNACAI---------------------------GP

Query:  SYKLSRNGF---------------------------------------------------------------------RVTCPYTAQQNGIIERKHRHIV
        +  +S NGF                                                                     R+TCPYT++QN I+ERKHR+I+
Subjt:  SYKLSRNGF---------------------------------------------------------------------RVTCPYTAQQNGIIERKHRHIV

Query:  DVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        ++GLTLLSQ+++PL+FWD+AFSTSV+LIN+LP+ VL   SP+EKL  R+P++ SL+V
Subjt:  DVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

J7G0P5 Retrotransposon polyprotein6.2e-4131.62Show/hide
Query:  VPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRI-ESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFG-------N
        VP  D +  +++ +G  YET V    A+    T + +  LLL+ E R+ E      A+G+        +   +    + R  S      S G       +
Subjt:  VPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRI-ESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFG-------N

Query:  GRGRGRFNSSQG------------------RGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMP----------GAYLTQFNPSSQFFLGQNSGQMY---
         RGRG F+ S+G                    G S+ +  + QCQIC++ GH+ + C++R+ +            A     +PS + +L       +   
Subjt:  GRGRGRFNSSQG------------------RGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMP----------GAYLTQFNPSSQFFLGQNSGQMY---

Query:  --GKQFSP--MQAMTATPSFNQDS-LAINRFGYASLTSPSNHVFHLNNLLHVPSITKNLISVRLYKFNLSK-------PQHSIPSNANLAVFKSFNNSVS
          G   SP   + +      N DS L+I   G + L S S+  FHLN++LH PS + NL+SV  YKF +         P +    +           S +
Subjt:  --GKQFSP--MQAMTATPSFNQDS-LAINRFGYASLTSPSNHVFHLNNLLHVPSITKNLISVRLYKFNLSK-------PQHSIPSNANLAVFKSFNNSVS

Query:  AVSESILPSNSIALLASVQSLNDK-SYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPS--YK-------LSRNGF--RVTCPYTAQQN
         +     PS +  + +S+  ++ + S + WH RLGH   +IV +VL        G+ N SFC +C +G S  YK       LS NG   R +CP   +QN
Subjt:  AVSESILPSNSIALLASVQSLNDK-SYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPS--YK-------LSRNGF--RVTCPYTAQQN

Query:  GIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        G+ ERKHRHIVD GLTLL+ + MP T+W D+F+T+V++IN+LPS VL  ASP E+L  R P Y SLKV
Subjt:  GIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.8e-0632.93Show/hide
Query:  FRVTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPS--IVLGGASPMEKLLQRQPDYTSLKV
        + +T P+T Q NG+ ER  R I +   T++S + +  +FW +A  T+ +LIN++PS  +V    +P E    ++P    L+V
Subjt:  FRVTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPS--IVLGGASPMEKLLQRQPDYTSLKV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-0736.36Show/hide
Query:  TCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        T P T Q NG+ ER +R IV+   ++L  + +P +FW +A  T+ +LIN+ PS+ L    P      ++  Y+ LKV
Subjt:  TCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-1924.03Show/hide
Query:  KTVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRGR
        K +  ++ +  +L  L  EY+ ++  I AK    T+ ++   LL  ES+I + SS     V+P     + +       N  N +++ +  +  N      
Subjt:  KTVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRGR

Query:  FNSSQGRGGKSWNSRNKP---QCQICNKIGHTTLKCYS--------RVQMPGAYLTQFNPSSQFFLGQ---------NSGQMYG-----KQFSPMQAMTA
        +  S      + N+++KP   +CQIC   GH+  +C            Q P +  T + P +   LG          +SG  +         S  Q  T 
Subjt:  FNSSQGRGGKSWNSRNKP---QCQICNKIGHTTLKCYS--------RVQMPGAYLTQFNPSSQFFLGQ---------NSGQMYG-----KQFSPMQAMTA

Query:  TPSF---NQDSLAINRFGYASLTSPSNHVFHLNNLLHVPSITKNLISV-RLYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSES---------ILPSNS
               +  ++ I+  G  SL++ S  + +L+N+L+VP+I KNLISV RL   N        P++  +   K  N  V  +            I  S  
Subjt:  TPSF---NQDSLAINRFGYASLTSPSNHVFHLNNLLHVPSITKNLISV-RLYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSES---------ILPSNS

Query:  IALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPN-FSGNENISFCNACAIGPSYK--------------------------LSRNGFR------
        ++L AS  S    ++  WH RLGHP  SI+  V+   + +  + +     C+ C I  S K                          LS + +R      
Subjt:  IALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPN-FSGNENISFCNACAIGPSYK--------------------------LSRNGFR------

Query:  ---------------------------------------------------------------VTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLTF
                                                                        + P+T + NG+ ERKHRHIV+ GLTLLS +S+P T+
Subjt:  ---------------------------------------------------------------VTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLTF

Query:  WDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        W  AF+ +V+LIN+LP+ +L   SP +KL    P+Y  L+V
Subjt:  WDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.3e-2123.8Show/hide
Query:  KTVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLP-TANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRG
        K +  ++ +  +L  L  +Y+ ++  I AK    ++ ++   L+  ES++    ++N+  V+P TAN+     T ++    RN +     +++ N   R 
Subjt:  KTVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLP-TANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRG

Query:  RFNSSQGRGGKSWNSRNKP---QCQICNKIGHTTLKCYSRVQ-----------------MPGAYLTQFNP--SSQFFLGQNSGQMYGKQFS------PMQ
                G +S N + KP   +CQIC+  GH+  +C    Q                  P A L   +P  ++ + L   +       F+      P  
Subjt:  RFNSSQGRGGKSWNSRNKP---QCQICNKIGHTTLKCYSRVQ-----------------MPGAYLTQFNP--SSQFFLGQNSGQMYGKQFS------PMQ

Query:  AMTATPSFNQDSLAINRFGYASLTSPSNHVFHLNNLLHVPSITKNLISV-RLYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSES---------ILPSN
                +  ++ I   G ASL + S+    LN +L+VP+I KNLISV RL   N    +   P++  +   K  N  V  +            I  S 
Subjt:  AMTATPSFNQDSLAINRFGYASLTSPSNHVFHLNNLLHVPSITKNLISV-RLYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSES---------ILPSN

Query:  SIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCN-PNFSGNENISFCNACAIGPSYK--------------------------------------
        ++++ AS    +  ++  WH RLGHP  +I+  V+   + P  + +  +  C+ C I  S+K                                      
Subjt:  SIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCN-PNFSGNENISFCNACAIGPSYK--------------------------------------

Query:  -------------------------------------------------------LSRNGFR--VTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLT
                                                               LS++G     + P+T + NG+ ERKHRHIV++GLTLLS +S+P T
Subjt:  -------------------------------------------------------LSRNGFR--VTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLT

Query:  FWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV
        +W  AFS +V+LIN+LP+ +L   SP +KL  + P+Y  LKV
Subjt:  FWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGCGGGATCCCTTTGTTCAAGCCCCGGAGTCAGCACTTAAGGGAACAACCTCTCTACTATCCCTTATTTGGCAATATGGATTTTCAGAGACGAACTCTGAAAT
TTCAAGTGGAAATCAAGTCACTCAAGCAATTAATCCTGGAAACAAGATCTCAACAATAAAGCTAACTGAGGAAAATTTCCTACTCTGGAAATTTGAAGTTTTCACGGCGC
TCGAAGGACATGAATTGGAAGAGCATATCGGTGAAGATTGCCAACCTCCTTTAGAGAAAATCCAAGTAAGTGAAGGTACCTCCACCGTGAGTAAACCTAACCCTCATTAT
AAGTTATGGAAGAGGAAAACTGTACCAGTTGAAGATCATATTATGTATATCCTATCTGGTCTTGGTGCAGAATATGAAACTATGGTTTTAGTGATTACTGCCAAAATAGG
GTCACAAACTGTTCAAGATGTTATTGCATTGCTTTTAACAGATGAAAGTAGGATAGAAAGCAAATCTTCTATTAATGCTGATGGAGTCTTACCTACTGCTAATCTCACAA
TTCAAAATCCTACACAAAAAGATATAGAAAATTTGAGAAATGTGAGTCAGTCGCAACAACAACAAAGTTTCGGTAATGGTAGAGGCAGAGGAAGATTTAATTCTAGTCAA
GGTAGAGGAGGAAAGTCTTGGAATAGTAGAAACAAACCCCAGTGTCAAATCTGTAACAAAATTGGTCATACAACCTTAAAATGTTATTCTCGTGTTCAGATGCCTGGGGC
TTACTTGACTCAATTCAACCCCTCTAGTCAGTTTTTTCTTGGACAGAATTCTGGTCAGATGTATGGGAAACAGTTCTCTCCAATGCAGGCTATGACAGCTACTCCAAGCT
TTAATCAAGATTCTTTGGCTATAAATCGGTTTGGATATGCTTCTCTTACTTCGCCTAGTAATCATGTCTTTCATCTAAATAATCTTTTACACGTTCCATCCATTACCAAG
AATTTGATTAGTGTCAGACTATACAAGTTCAACCTATCCAAGCCTCAACACTCTATCCCATCAAATGCTAATCTTGCTGTCTTCAAGTCATTTAATAATTCTGTTTCTGC
TGTTTCTGAATCTATTTTGCCTTCCAACTCTATTGCTTTGCTTGCTTCTGTCCAATCTTTGAATGATAAATCTTATGATATTTGGCATCAAAGGCTAGGTCATCCAATCT
TTTCTATTGTTGAGCAAGTTCTTCAAAAGTGTAATCCCAATTTCTCTGGTAATGAAAACATTTCATTCTGTAATGCATGTGCAATTGGACCTTCATACAAATTGTCTAGA
AATGGCTTCAGAGTTACTTGTCCATATACCGCTCAACAAAATGGTATTATCGAACGTAAGCATAGGCATATCGTTGATGTAGGTCTTACATTGTTGTCTCAATCATCCAT
GCCTCTAACATTTTGGGACGATGCCTTTTCTACTAGTGTTTTTCTTATCAACAAGTTGCCTTCTATAGTTCTTGGTGGAGCAAGTCCCATGGAGAAACTCTTGCAGCGAC
AACCCGATTATACATCTCTCAAGGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGCGGGATCCCTTTGTTCAAGCCCCGGAGTCAGCACTTAAGGGAACAACCTCTCTACTATCCCTTATTTGGCAATATGGATTTTCAGAGACGAACTCTGAAAT
TTCAAGTGGAAATCAAGTCACTCAAGCAATTAATCCTGGAAACAAGATCTCAACAATAAAGCTAACTGAGGAAAATTTCCTACTCTGGAAATTTGAAGTTTTCACGGCGC
TCGAAGGACATGAATTGGAAGAGCATATCGGTGAAGATTGCCAACCTCCTTTAGAGAAAATCCAAGTAAGTGAAGGTACCTCCACCGTGAGTAAACCTAACCCTCATTAT
AAGTTATGGAAGAGGAAAACTGTACCAGTTGAAGATCATATTATGTATATCCTATCTGGTCTTGGTGCAGAATATGAAACTATGGTTTTAGTGATTACTGCCAAAATAGG
GTCACAAACTGTTCAAGATGTTATTGCATTGCTTTTAACAGATGAAAGTAGGATAGAAAGCAAATCTTCTATTAATGCTGATGGAGTCTTACCTACTGCTAATCTCACAA
TTCAAAATCCTACACAAAAAGATATAGAAAATTTGAGAAATGTGAGTCAGTCGCAACAACAACAAAGTTTCGGTAATGGTAGAGGCAGAGGAAGATTTAATTCTAGTCAA
GGTAGAGGAGGAAAGTCTTGGAATAGTAGAAACAAACCCCAGTGTCAAATCTGTAACAAAATTGGTCATACAACCTTAAAATGTTATTCTCGTGTTCAGATGCCTGGGGC
TTACTTGACTCAATTCAACCCCTCTAGTCAGTTTTTTCTTGGACAGAATTCTGGTCAGATGTATGGGAAACAGTTCTCTCCAATGCAGGCTATGACAGCTACTCCAAGCT
TTAATCAAGATTCTTTGGCTATAAATCGGTTTGGATATGCTTCTCTTACTTCGCCTAGTAATCATGTCTTTCATCTAAATAATCTTTTACACGTTCCATCCATTACCAAG
AATTTGATTAGTGTCAGACTATACAAGTTCAACCTATCCAAGCCTCAACACTCTATCCCATCAAATGCTAATCTTGCTGTCTTCAAGTCATTTAATAATTCTGTTTCTGC
TGTTTCTGAATCTATTTTGCCTTCCAACTCTATTGCTTTGCTTGCTTCTGTCCAATCTTTGAATGATAAATCTTATGATATTTGGCATCAAAGGCTAGGTCATCCAATCT
TTTCTATTGTTGAGCAAGTTCTTCAAAAGTGTAATCCCAATTTCTCTGGTAATGAAAACATTTCATTCTGTAATGCATGTGCAATTGGACCTTCATACAAATTGTCTAGA
AATGGCTTCAGAGTTACTTGTCCATATACCGCTCAACAAAATGGTATTATCGAACGTAAGCATAGGCATATCGTTGATGTAGGTCTTACATTGTTGTCTCAATCATCCAT
GCCTCTAACATTTTGGGACGATGCCTTTTCTACTAGTGTTTTTCTTATCAACAAGTTGCCTTCTATAGTTCTTGGTGGAGCAAGTCCCATGGAGAAACTCTTGCAGCGAC
AACCCGATTATACATCTCTCAAGGTATAG
Protein sequenceShow/hide protein sequence
MRGRDPFVQAPESALKGTTSLLSLIWQYGFSETNSEISSGNQVTQAINPGNKISTIKLTEENFLLWKFEVFTALEGHELEEHIGEDCQPPLEKIQVSEGTSTVSKPNPHY
KLWKRKTVPVEDHIMYILSGLGAEYETMVLVITAKIGSQTVQDVIALLLTDESRIESKSSINADGVLPTANLTIQNPTQKDIENLRNVSQSQQQQSFGNGRGRGRFNSSQ
GRGGKSWNSRNKPQCQICNKIGHTTLKCYSRVQMPGAYLTQFNPSSQFFLGQNSGQMYGKQFSPMQAMTATPSFNQDSLAINRFGYASLTSPSNHVFHLNNLLHVPSITK
NLISVRLYKFNLSKPQHSIPSNANLAVFKSFNNSVSAVSESILPSNSIALLASVQSLNDKSYDIWHQRLGHPIFSIVEQVLQKCNPNFSGNENISFCNACAIGPSYKLSR
NGFRVTCPYTAQQNGIIERKHRHIVDVGLTLLSQSSMPLTFWDDAFSTSVFLINKLPSIVLGGASPMEKLLQRQPDYTSLKV