; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026081 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026081
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:28836157..28838362
RNA-Seq ExpressionLag0026081
SyntenyLag0026081
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU17915.1 hypothetical protein TSUD_330400, partial [Trifolium subterraneum]4.5e-8232.13Show/hide
Query:  IKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQEY
        +KLD+ N+ LW+++VL +++  +L+G++ GK   P+  I    S       K  NPE++ W A DQ L+GWL NSMT  IATQ++  E +  LW+     
Subjt:  IKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQEY

Query:  YGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQALK
         G  +RSQ  Y +     TRKG M M +Y   MK   D L++ G P+     I     GLD EY P+V  + +Q  + W ++Q +LL+FE R ++L  L 
Subjt:  YGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQALK

Query:  TNISVNQASANLAEVLLTSDTQKSHYQTNNQ-QNRNFSSNFSNQNRGHF----------------------------SNRGNRYREKRSNEVYMVGKLEN
        TN+++N A+AN+A     S+ + + + +NN  +  NF      + RG                              SN  +   ++ S+ V++  +   
Subjt:  TNISVNQASANLAEVLLTSDTQKSHYQTNNQ-QNRNFSSNFSNQNRGHF----------------------------SNRGNRYREKRSNEVYMVGKLEN

Query:  GLYRLLEEPQASTDSQMEIKGLEDASIVKIQRRL--DDGRQVNLVSYVLTTCK---------------------MDMWHKRLGHPSFKILSQIL------
          Y    +  AS     +    +D +    +  L   +G ++ +V+   T  +                      + WH++LGHP+ K + +++      
Subjt:  GLYRLLEEPQASTDSQMEIKGLEDASIVKIQRRL--DDGRQVNLVSYVLTTCK---------------------MDMWHKRLGHPSFKILSQIL------

Query:  ----------------------------------------------------QLCKVPIKSNGKPDFCEACKLEHGIDIQISCPYASAQNGRIERKHRHI
                                                            ++  +     G+    +   +E GI  ++SCPY S QNGR ERKHRHI
Subjt:  ----------------------------------------------------QLCKVPIKSNGKPDFCEACKLEHGIDIQISCPYASAQNGRIERKHRHI

Query:  VETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKT
         E GL LLAQAKMPL++WW+AF TAV+LINRLPS V   +SP+ LL  ++PDY  LK FG ACYPCL+PY   K +FHT KCVFLG +N+HKGY+C++  
Subjt:  VETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKT

Query:  GLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTT
        G ++ISRHV FNE  FP+ + F++     P+ T T
Subjt:  GLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTT

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]8.5e-8130.65Show/hide
Query:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ
        +S+KLD+ N+ LW+++VL V++  KL+G++ G    P+  I    S       K  N  +  W A DQ L+GW+ NSMT EIATQ++  E +K LWD  Q
Subjt:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ

Query:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQA
           G  +RSQ  Y +      RKG M M +Y   MK   D L++ G P+     I     GLD EY P+V  + +Q  ++W ++Q +LL+FE R ++L  
Subjt:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQA

Query:  LKTNISVNQASANLA------------------------------------EVLLTS------------------------DTQKSHYQTNNQQN--RNF
        L TN+++N A+AN+A                                    +V   S                        D Q SH      QN   ++
Subjt:  LKTNISVNQASANLA------------------------------------EVLLTS------------------------DTQKSHYQTNNQQN--RNF

Query:  SSNFSNQNRGHFSNRGNRYR---EKRSNEVYMVG------------------KLENGLY------RLLEEPQASTDSQMEIKGLEDASIVK--------I
           F +    H +++  +++   E       +VG                   L + LY       LL   + + D+ + ++  E+   VK        +
Subjt:  SSNFSNQNRGHFSNRGNRYR---EKRSNEVYMVG------------------KLENGLY------RLLEEPQASTDSQMEIKGLEDASIVK--------I

Query:  QRRLDD------GRQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK---------------------------------
        +  L D      G + N  ++V      + WH+RLGHP+ K+L ++L+ CKV +  +    FCEAC+                                 
Subjt:  QRRLDD------GRQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK---------------------------------

Query:  --------------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKHRHIVETG
                                                                            +E GI  ++SCPY S QNGR ERKHRHI E G
Subjt:  --------------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKHRHIVETG

Query:  LALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIY
        L LLAQA+MPLH+WW+AF TAV+LINRLPS V   +SP+ L+  ++PDYK LKTFG ACYPCL+PY   K ++HT +CVFLG +N+HKGY+C++  G I+
Subjt:  LALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIY

Query:  ISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINP
        ISRHV FNE  FP+ + F++     P+ TT     + +PST  P
Subjt:  ISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINP

GAU21262.1 hypothetical protein TSUD_286720 [Trifolium subterraneum]9.0e-8334.59Show/hide
Query:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ
        +S+KLD+ N+ LW+++VL +++  +L+G++ G    P+  +    +       K  NPEY+ W+A DQ L+GWL NSM  +IATQ++  E +K LWD  Q
Subjt:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ

Query:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQ-
           G  +RS+  Y +     TRKG M M +Y   MK   D L+M G P+     +     GLD E+ P+V  + +Q N++W ++Q +LL+FE R ++L  
Subjt:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQ-

Query:  --ALKTNISVNQASANLAEVLLTSDTQK-----------SHYQTNNQQN-----------RNFSSNFSNQNRGHFSNRGNRYREKRSNEVYMVGKLE---
          +L  N S N AS N     L   T++           + Y  NN+ N           +++   F +    H +++  + ++         GKL    
Subjt:  --ALKTNISVNQASANLAEVLLTSDTQK-----------SHYQTNNQQN-----------RNFSSNFSNQNRGHFSNRGNRYREKRSNEVYMVGKLE---

Query:  -NGLYRLLEEPQASTDSQM--EIKGLEDASIVKIQRRLDDGRQVNLV---SYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACKLE
         N      +EP     S +      L  ++I      +DD  +   +        T       K +    F    +++Q         G+    +   +E
Subjt:  -NGLYRLLEEPQASTDSQM--EIKGLEDASIVKIQRRLDDGRQVNLV---SYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACKLE

Query:  HGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQK
         GI  ++SCPY S QNGR ERKHRH+ E G+ LLAQAKMPLH+WW+AF T+V+LINRLPSSV   +SP+ LL  ++PDY  LK FG ACYPCL+PY   K
Subjt:  HGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQK

Query:  FEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPS-----TINPHIHPT
         +FH  KCVFLG +N+HKGY+C++    I++SR V FNE  FP+ + F+  +   P+ T T ++ +  PS     T +  I PT
Subjt:  FEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPS-----TINPHIHPT

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]2.9e-8129.58Show/hide
Query:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ
        +S+KLD+ N+ LW+++VL +++  KL+G++ G T  P+  +            K  NP++  W+A DQ L+GWL NSM  +IATQ++  E +K LWD  Q
Subjt:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ

Query:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDR---
           G  ++S+  Y +     TRKG M M EY   MK   D L++ G P+     +     GLD EY P+V  + +Q N++W ++Q +LL+FE R D+   
Subjt:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDR---

Query:  LQALKTNISVNQASANLAEVLLTSDTQKSHYQTNN--------------------------------------QQNRNFSS-------------------
           L  N S N   AN  E        + +++ +N                                         RN+S+                   
Subjt:  LQALKTNISVNQASANLAEVLLTSDTQKSHYQTNN--------------------------------------QQNRNFSS-------------------

Query:  -------NFSNQNRGHFSNRGNRYREKRSNEVYMVG-------------KLEN-GLYRLLEEPQASTDSQMEIKGLEDASIV------------------
               + +N +  H +++   + E       MVG             KL N  L+ +L  PQ + +     K   D +I+                  
Subjt:  -------NFSNQNRGHFSNRGNRYREKRSNEVYMVG-------------KLEN-GLYRLLEEPQASTDSQMEIKGLEDASIV------------------

Query:  KIQRRLDDG-RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK------------------------------------
         ++ RL DG  Q++     +     + WH++LGHP+ K+L ++L+ C V I  + +  FCEAC+                                    
Subjt:  KIQRRLDDG-RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK------------------------------------

Query:  -----------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKHRHIVETGLAL
                                                                         +E GI  ++SCPY S QNGR ERKHRH+ E GL L
Subjt:  -----------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKHRHIVETGLAL

Query:  LAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISR
        LAQAKMPL +WW+AF TAV+LINRLPSSV   +SP+ L+  ++PDY  LK FG ACYPCL+PY   K +FHT +CVF+G +N+HKGY+C++  G I++SR
Subjt:  LAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISR

Query:  HVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPS
        HV FNE  FP+   F+  + + P+ T T  S + LP+
Subjt:  HVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPS

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]2.9e-8130.22Show/hide
Query:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ
        +S+KLD+ N+ LWQ++VL +++  +L+G++ GK   P+  I    S       K  NPE++ W A DQ L+GWL NSMT  IATQ++  E +  LWD  Q
Subjt:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ

Query:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQA
           G  +RSQ  Y +     TRKG M M +Y   MK   D L++ G P+     I     GLD EY P+V  + +Q  ++W ++Q +LL+FE R ++L +
Subjt:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQA

Query:  LKTNISVNQASANLAEVLLTSDTQKSHYQTNNQ---QNRNF-SSNF---------------------------------------------SNQNRG---
        L TN+++N A+AN+A+    SD + + + +NN     N N+  SNF                                             +N  +G   
Subjt:  LKTNISVNQASANLAEVLLTSDTQKSHYQTNNQ---QNRNF-SSNF---------------------------------------------SNQNRG---

Query:  -------------------------HFSNRGNRYREKRSNEVYMVG------------------KLENGLY------RLLEEPQASTDSQMEIKGLEDAS
                                 H +++     E       +VG                   L + LY       LL   + + D+ + ++  E+  
Subjt:  -------------------------HFSNRGNRYREKRSNEVYMVG------------------KLENGLY------RLLEEPQASTDSQMEIKGLEDAS

Query:  IVK--------IQRRLDDG-RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK--------------------------
         VK        ++  L DG  Q++           + WH++LGHP+ K+L  +L+ C V +  + +  FCEAC+                          
Subjt:  IVK--------IQRRLDDG-RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK--------------------------

Query:  ---------------------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKH
                                                                                   +E GI  ++SCPY S QNGR ERKH
Subjt:  ---------------------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKH

Query:  RHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCM
        RHI E GL LLAQAKMPL++WW+AF TAV+LINRLPSSV   KSP+ LL  ++PDY  LK FG ACYP L+PY   K +FHT +CVFLG +N+HKGY+C+
Subjt:  RHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCM

Query:  SKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINP-HIHPTPMPSELSSSTGLAS
        +  G I+ISRHV FNE  FP+ + F++     P+ T T       PS+  P H   TP+  E++ +  +++
Subjt:  SKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINP-HIHPTPMPSELSSSTGLAS

TrEMBL top hitse value%identityAlignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)1.4e-8130.22Show/hide
Query:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ
        +S+KLD+ N+ LWQ++VL +++  +L+G++ GK   P+  I    S       K  NPE++ W A DQ L+GWL NSMT  IATQ++  E +  LWD  Q
Subjt:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ

Query:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQA
           G  +RSQ  Y +     TRKG M M +Y   MK   D L++ G P+     I     GLD EY P+V  + +Q  ++W ++Q +LL+FE R ++L +
Subjt:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQA

Query:  LKTNISVNQASANLAEVLLTSDTQKSHYQTNNQ---QNRNF-SSNF---------------------------------------------SNQNRG---
        L TN+++N A+AN+A+    SD + + + +NN     N N+  SNF                                             +N  +G   
Subjt:  LKTNISVNQASANLAEVLLTSDTQKSHYQTNNQ---QNRNF-SSNF---------------------------------------------SNQNRG---

Query:  -------------------------HFSNRGNRYREKRSNEVYMVG------------------KLENGLY------RLLEEPQASTDSQMEIKGLEDAS
                                 H +++     E       +VG                   L + LY       LL   + + D+ + ++  E+  
Subjt:  -------------------------HFSNRGNRYREKRSNEVYMVG------------------KLENGLY------RLLEEPQASTDSQMEIKGLEDAS

Query:  IVK--------IQRRLDDG-RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK--------------------------
         VK        ++  L DG  Q++           + WH++LGHP+ K+L  +L+ C V +  + +  FCEAC+                          
Subjt:  IVK--------IQRRLDDG-RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK--------------------------

Query:  ---------------------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKH
                                                                                   +E GI  ++SCPY S QNGR ERKH
Subjt:  ---------------------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKH

Query:  RHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCM
        RHI E GL LLAQAKMPL++WW+AF TAV+LINRLPSSV   KSP+ LL  ++PDY  LK FG ACYP L+PY   K +FHT +CVFLG +N+HKGY+C+
Subjt:  RHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCM

Query:  SKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINP-HIHPTPMPSELSSSTGLAS
        +  G I+ISRHV FNE  FP+ + F++     P+ T T       PS+  P H   TP+  E++ +  +++
Subjt:  SKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINP-HIHPTPMPSELSSSTGLAS

A0A2Z6M732 Integrase catalytic domain-containing protein4.4e-8334.59Show/hide
Query:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ
        +S+KLD+ N+ LW+++VL +++  +L+G++ G    P+  +    +       K  NPEY+ W+A DQ L+GWL NSM  +IATQ++  E +K LWD  Q
Subjt:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ

Query:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQ-
           G  +RS+  Y +     TRKG M M +Y   MK   D L+M G P+     +     GLD E+ P+V  + +Q N++W ++Q +LL+FE R ++L  
Subjt:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQ-

Query:  --ALKTNISVNQASANLAEVLLTSDTQK-----------SHYQTNNQQN-----------RNFSSNFSNQNRGHFSNRGNRYREKRSNEVYMVGKLE---
          +L  N S N AS N     L   T++           + Y  NN+ N           +++   F +    H +++  + ++         GKL    
Subjt:  --ALKTNISVNQASANLAEVLLTSDTQK-----------SHYQTNNQQN-----------RNFSSNFSNQNRGHFSNRGNRYREKRSNEVYMVGKLE---

Query:  -NGLYRLLEEPQASTDSQM--EIKGLEDASIVKIQRRLDDGRQVNLV---SYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACKLE
         N      +EP     S +      L  ++I      +DD  +   +        T       K +    F    +++Q         G+    +   +E
Subjt:  -NGLYRLLEEPQASTDSQM--EIKGLEDASIVKIQRRLDDGRQVNLV---SYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACKLE

Query:  HGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQK
         GI  ++SCPY S QNGR ERKHRH+ E G+ LLAQAKMPLH+WW+AF T+V+LINRLPSSV   +SP+ LL  ++PDY  LK FG ACYPCL+PY   K
Subjt:  HGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQK

Query:  FEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPS-----TINPHIHPT
         +FH  KCVFLG +N+HKGY+C++    I++SR V FNE  FP+ + F+  +   P+ T T ++ +  PS     T +  I PT
Subjt:  FEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPS-----TINPHIHPT

A0A2Z6P4D5 Integrase catalytic domain-containing protein1.4e-8129.58Show/hide
Query:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ
        +S+KLD+ N+ LW+++VL +++  KL+G++ G T  P+  +            K  NP++  W+A DQ L+GWL NSM  +IATQ++  E +K LWD  Q
Subjt:  MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQ

Query:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDR---
           G  ++S+  Y +     TRKG M M EY   MK   D L++ G P+     +     GLD EY P+V  + +Q N++W ++Q +LL+FE R D+   
Subjt:  EYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDR---

Query:  LQALKTNISVNQASANLAEVLLTSDTQKSHYQTNN--------------------------------------QQNRNFSS-------------------
           L  N S N   AN  E        + +++ +N                                         RN+S+                   
Subjt:  LQALKTNISVNQASANLAEVLLTSDTQKSHYQTNN--------------------------------------QQNRNFSS-------------------

Query:  -------NFSNQNRGHFSNRGNRYREKRSNEVYMVG-------------KLEN-GLYRLLEEPQASTDSQMEIKGLEDASIV------------------
               + +N +  H +++   + E       MVG             KL N  L+ +L  PQ + +     K   D +I+                  
Subjt:  -------NFSNQNRGHFSNRGNRYREKRSNEVYMVG-------------KLEN-GLYRLLEEPQASTDSQMEIKGLEDASIV------------------

Query:  KIQRRLDDG-RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK------------------------------------
         ++ RL DG  Q++     +     + WH++LGHP+ K+L ++L+ C V I  + +  FCEAC+                                    
Subjt:  KIQRRLDDG-RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKSNGKPDFCEACK------------------------------------

Query:  -----------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKHRHIVETGLAL
                                                                         +E GI  ++SCPY S QNGR ERKHRH+ E GL L
Subjt:  -----------------------------------------------------------------LEHGIDIQISCPYASAQNGRIERKHRHIVETGLAL

Query:  LAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISR
        LAQAKMPL +WW+AF TAV+LINRLPSSV   +SP+ L+  ++PDY  LK FG ACYPCL+PY   K +FHT +CVF+G +N+HKGY+C++  G I++SR
Subjt:  LAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISR

Query:  HVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPS
        HV FNE  FP+   F+  + + P+ T T  S + LP+
Subjt:  HVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPS

A0A803NU85 Uncharacterized protein7.5e-8331.88Show/hide
Query:  SIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIP----NPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWD
        S+KLD+ NF LW+ +V  +++ ++LEG+L+G   AP   +   PSE    G   P    NPEY+ WL  DQLL+GWL                   +LW 
Subjt:  SIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIP----NPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWD

Query:  SIQEYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIR-NQNMTWSEIQLELLSFEQRQDR
        +++E YG  SR+  D  R  +Q TRKGT  M +Y +  + + D+L + G P   +  +S+V +GLD EY  IV +I   ++ TW ++Q  LLSF+ R +R
Subjt:  SIQEYYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIR-NQNMTWSEIQLELLSFEQRQDR

Query:  LQALKTNIS-VNQASANLAEVLLTSDTQK-----------------SHYQTNNQQNRNFSSNFSNQNRGHFSNRG---------------NRYREK----
        L A+ TN   +N  SAN A+    S+ Q+                 +H  T N +  +F        RG+ S                  NRY E     
Subjt:  LQALKTNIS-VNQASANLAEVLLTSDTQK-----------------SHYQTNNQQNRNFSSNFSNQNRGHFSNRG---------------NRYREK----

Query:  -------------------------------------RSNEVYMVGKLENG-------------------------------LYRLLEEPQAS-------
                                              S+   +  K E G                               L+ +L  P  S       
Subjt:  -------------------------------------RSNEVYMVGKLENG-------------------------------LYRLLEEPQAS-------

Query:  ---TDSQMEIKGLEDASIVK--------IQRRLDDG-----------------------RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPI
           +D+ + ++   D  +VK        +Q  L DG                             S+V      D+WH++LGHPS  +L+Q+L+L  V +
Subjt:  ---TDSQMEIKGLEDASIVK--------IQRRLDDG-----------------------RQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPI

Query:  KSNGKPDFCEACK---------------------------LEHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRL
          N    FC+AC+                           L+ GI    SCP+ S QNGR ERKHRHIVE GL L+AQA +PL +W DAF TAV+LINRL
Subjt:  KSNGKPDFCEACK---------------------------LEHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRL

Query:  PSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISRHVCFNEGDFPYQELFM-HHEPEPPI
        P++V+  +SP+  L  +QPDYK LKTFG AC+PCLR Y   KF+FH+ KCV LG + +HKGY+C+S  G IYISRHV FNE +FP++  F+ ++  E  +
Subjt:  PSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISRHVCFNEGDFPYQELFM-HHEPEPPI

Query:  STTTILSWLPLP
               W  LP
Subjt:  STTTILSWLPLP

A0A803PM38 Uncharacterized protein2.8e-8228.88Show/hide
Query:  SIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQE
        ++KLD+ NF LW+ +V  +++ ++L+G+L G  P P   +     +     +   NP ++ W+  DQLL+GWLY SMT  IA +VMG + + +LW +++E
Subjt:  SIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQE

Query:  YYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQAL
         +G  S+++ D  R  +Q  RKG + M +Y    +++ D L + G P      +S+V +GLD EY P+V +I  + + TW ++Q  LLS + + +RL + 
Subjt:  YYGVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQ-NMTWSEIQLELLSFEQRQDRLQAL

Query:  K-----TNISVNQASANLAEVLLTSDTQKSHYQTNNQQNRNFSSNFSNQNRG-----------------------HFSNR--------------------
              T + +N  SA+LA         + ++  NN+   + +   +N++RG                       H  NR                    
Subjt:  K-----TNISVNQASANLAEVLLTSDTQKSHYQTNNQQNRNFSSNFSNQNRG-----------------------HFSNR--------------------

Query:  ---------GNR--------------------------------------------------------YREKRSNEVYMVGKLENGLYRLLEEPQASTDS
                 GNR                                                         ++K + +V + GKL++GLY+   +   ST S
Subjt:  ---------GNR--------------------------------------------------------YREKRSNEVYMVGKLENGLYRLLEEPQASTDS

Query:  QMEIKGLEDASIVK--IQRRLDDGRQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKS-NGKPDFCEACKL--------------------
            + +   +     +   ++      + + +L + K D WH+RLGHPS ++L  +L   K+ +K+ N    FC+AC+L                    
Subjt:  QMEIKGLEDASIVK--IQRRLDDGRQVNLVSYVLTTCKMDMWHKRLGHPSFKILSQILQLCKVPIKS-NGKPDFCEACKL--------------------

Query:  ---------------------------------------------------------------------------------EHGIDIQISCPYASAQNGR
                                                                                         +HGI  Q  CP+ S QNGR
Subjt:  ---------------------------------------------------------------------------------EHGIDIQISCPYASAQNGR

Query:  IERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHK
         ERKHRHIVE GL LLAQA +P  +WWDAF TAV+LINRLP+ V+  K+PFE+L  QQPDYK LK FG +C+PCLR YQ+ KF+FH+ KCV LG ++ HK
Subjt:  IERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHK

Query:  GYRCMSKTGLIYISRHVCFNEGDFPYQELFMH-HEPEPPIS
        GY+C+S TG +YISR V FNE +FP++  F++ ++PE P+S
Subjt:  GYRCMSKTGLIYISRHVCFNEGDFPYQELFMH-HEPEPPIS

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.7e-1232.81Show/hide
Query:  SNGKPDFCEACKLEHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVI--DGKSPFELLKGQQPDYKGLKT
        SN    FC    ++ GI   ++ P+    NG  ER  R I E    +++ AK+    W +A  TA +LINR+PS  +    K+P+E+   ++P  K L+ 
Subjt:  SNGKPDFCEACKLEHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVI--DGKSPFELLKGQQPDYKGLKT

Query:  FGAACYPCLRPYQHQKFEFHTEKCVFLG
        FGA  Y  ++  Q  KF+  + K +F+G
Subjt:  FGAACYPCLRPYQHQKFEFHTEKCVFLG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-1328.71Show/hide
Query:  DFCEACKLEHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYP
        +F E C   HGI  + + P     NG  ER +R IVE   ++L  AK+P   W +A  TA +LINR PS  +  + P  +   ++  Y  LK FG   + 
Subjt:  DFCEACKLEHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYP

Query:  CLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYI-SRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINPHIHPTPMPSELSSSTG
         +   Q  K +  +  C+F+G  +   GYR         I SR V F E      E+    +    +    I +++ +PST N        P+   S+T 
Subjt:  CLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYI-SRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINPHIHPTPMPSELSSSTG

Query:  LASPPSHTP
          S     P
Subjt:  LASPPSHTP

P92512 Uncharacterized mitochondrial protein AtMg007105.1e-0435.29Show/hide
Query:  HRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACY
        +R I+E   ++L +  +P     DA +TAVH+IN+ PS+ I+   P E+     P Y  L+ FG   Y
Subjt:  HRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.9e-3642.64Show/hide
Query:  EHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQ
        +HGI    S P+    NG  ERKHRHIVETGL LL+ A +P  +W  AF  AV+LINRLP+ ++  +SPF+ L G  P+Y  L+ FG ACYP LRPY   
Subjt:  EHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQ

Query:  KFEFHTEKCVFLGCTNTHKGYRCMS-KTGLIYISRHVCFNEGDFPYQELFMHHEP------------EPPISTTTILSWLPLPSTINPHIHPTPMPS
        K +  + +CVFLG + T   Y C+  +T  +YISRHV F+E  FP+        P             P  +  T    LP PS  +PH   TP  S
Subjt:  KFEFHTEKCVFLGCTNTHKGYRCMS-KTGLIYISRHVCFNEGDFPYQELFMHHEP------------EPPISTTTILSWLPLPSTINPHIHPTPMPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.4e-1525.7Show/hide
Query:  KLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQEYY
        KL  TN+L+W   V  +   Y+L G L G T  P  TI    +          NP+Y  W   D+L+   +  +++  +   V     A  +W+++++ Y
Subjt:  KLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQEYY

Query:  GVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVI--RNQNMTWSEIQLELLSFEQRQDRLQALK
           S       R  L+Q  KGT  + +Y + +   FD L ++G PMD    +  V   L EEY P++  I  ++   T +EI   LL+ E +        
Subjt:  GVQSRSQEDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVI--RNQNMTWSEIQLELLSFEQRQDRLQALK

Query:  TNISVNQASANLAEVLLTSDTQKSHYQTNNQQNRNFSSNFSNQNRGHFS
            +  +SA +  +   + + ++   TNN  N N ++ + N+N  + S
Subjt:  TNISVNQASANLAEVLLTSDTQKSHYQTNNQQNRNFSSNFSNQNRGHFS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.6e-3741.15Show/hide
Query:  EHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQ
        +HGI    S P+    NG  ERKHRHIVE GL LL+ A +P  +W  AF  AV+LINRLP+ ++  +SPF+ L GQ P+Y+ LK FG ACYP LRPY   
Subjt:  EHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACYPCLRPYQHQ

Query:  KFEFHTEKCVFLGCTNTHKGYRCMS-KTGLIYISRHVCFNEGDFPY----------QELFMHHEPEPPISTT--TILSWLPLPSTINPHIHPTPMP----
        K E  +++C F+G + T   Y C+   TG +Y SRHV F+E  FP+          QE      P  P  TT  T    LP P  + PH+  +P P    
Subjt:  KFEFHTEKCVFLGCTNTHKGYRCMS-KTGLIYISRHVCFNEGDFPY----------QELFMHHEPEPPISTT--TILSWLPLPSTINPHIHPTPMP----

Query:  -----SELSS----STGLASPPSHTP
             +++SS    S+ ++SP S  P
Subjt:  -----SELSS----STGLASPPSHTP

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.6e-0535.29Show/hide
Query:  HRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACY
        +R I+E   ++L +  +P     DA +TAVH+IN+ PS+ I+   P E+     P Y  L+ FG   Y
Subjt:  HRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQQPDYKGLKTFGAACY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGATCAAGCTGGATCAAACGAATTTTCTGCTTTGGCAAAATATTGTGTTGCTCGTTCTCAAAAGCTATAAGCTCGAGGGGCATCTGTCAGGGAAAACTCCAGCTCC
AGACATGACTATCATTGTGCCACCATCAGAAGAAGATCCACTGGGGTTGAAAATACCCAATCCGGAATATGACCTTTGGCTAGCGGCAGATCAACTCCTTGTCGGGTGGT
TGTATAACTCGATGACTCCTGAAATTGCTACTCAAGTCATGGGACATGAGGAAGCAAAGAACCTGTGGGACTCCATTCAAGAATACTATGGCGTCCAATCTCGCTCTCAA
GAAGACTACAACAGGTTGATGTTACAACAAACTCGAAAAGGTACCATGATGATGTATGAATATTTTGAAACAATGAAGAAATACTTTGATAATCTTCAAATGGTTGGTTT
CCCGATGGATATGAGAAGCTTTATCTCTCACGTGACTGCTGGCTTGGATGAAGAATACACTCCTATAGTCTGTGTGATCAGAAACCAGAATATGACGTGGAGCGAAATCC
AACTTGAACTACTCTCCTTTGAGCAAAGACAAGACCGGTTGCAAGCCTTGAAGACAAACATCTCTGTCAATCAAGCTTCAGCAAACTTAGCTGAAGTTCTCCTCACAAGT
GACACACAGAAGTCACACTATCAGACCAATAACCAACAGAACAGGAATTTTTCTTCAAATTTTTCAAATCAAAATCGTGGTCACTTCTCCAACCGAGGCAACCGATACAG
GGAAAAGAGATCCAATGAAGTTTACATGGTTGGAAAGCTAGAGAATGGCCTATATAGACTTTTGGAAGAACCACAAGCCTCAACTGACAGCCAAATGGAAATAAAGGGAC
TTGAAGATGCATCTATAGTGAAGATCCAGAGAAGACTTGATGATGGAAGACAAGTGAACCTAGTTAGTTATGTTTTGACAACTTGTAAAATGGATATGTGGCACAAAAGA
TTAGGTCATCCATCATTTAAGATTTTGAGTCAAATACTCCAACTCTGTAAAGTTCCTATCAAAAGTAATGGAAAACCAGATTTTTGTGAGGCTTGCAAACTTGAACATGG
CATTGACATACAAATATCTTGTCCTTATGCATCGGCTCAAAATGGAAGAATTGAGAGAAAACATCGTCATATTGTCGAAACCGGGTTAGCCTTATTGGCTCAAGCTAAAA
TGCCTCTCCATCATTGGTGGGATGCCTTTCACACTGCTGTACATTTGATCAATAGGCTGCCATCTTCGGTTATTGATGGGAAATCTCCATTTGAACTATTAAAAGGGCAG
CAACCTGATTACAAAGGCCTCAAAACCTTTGGTGCAGCATGTTACCCTTGTCTTCGGCCTTATCAGCATCAGAAATTTGAGTTTCACACTGAGAAATGTGTGTTTTTAGG
CTGTACTAATACTCACAAAGGCTACCGGTGTATGTCTAAAACAGGGCTGATTTACATTTCCAGGCATGTATGTTTCAATGAGGGTGACTTTCCATACCAGGAACTTTTTA
TGCACCATGAACCCGAGCCTCCAATCAGCACAACAACCATCTTATCTTGGCTGCCACTGCCCTCCACCATTAACCCCCATATCCATCCCACACCTATGCCTTCTGAACTG
TCATCCTCCACTGGTCTAGCATCCCCTCCCAGCCACACTCCCCGCCCCCGCCTTTGCCTTCTCCTTCTGTTTCATCTCCCATCACTTCCCCATCGCCTTGTCCTCTCCCT
TCCCTCCGGCACACAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGATCAAGCTGGATCAAACGAATTTTCTGCTTTGGCAAAATATTGTGTTGCTCGTTCTCAAAAGCTATAAGCTCGAGGGGCATCTGTCAGGGAAAACTCCAGCTCC
AGACATGACTATCATTGTGCCACCATCAGAAGAAGATCCACTGGGGTTGAAAATACCCAATCCGGAATATGACCTTTGGCTAGCGGCAGATCAACTCCTTGTCGGGTGGT
TGTATAACTCGATGACTCCTGAAATTGCTACTCAAGTCATGGGACATGAGGAAGCAAAGAACCTGTGGGACTCCATTCAAGAATACTATGGCGTCCAATCTCGCTCTCAA
GAAGACTACAACAGGTTGATGTTACAACAAACTCGAAAAGGTACCATGATGATGTATGAATATTTTGAAACAATGAAGAAATACTTTGATAATCTTCAAATGGTTGGTTT
CCCGATGGATATGAGAAGCTTTATCTCTCACGTGACTGCTGGCTTGGATGAAGAATACACTCCTATAGTCTGTGTGATCAGAAACCAGAATATGACGTGGAGCGAAATCC
AACTTGAACTACTCTCCTTTGAGCAAAGACAAGACCGGTTGCAAGCCTTGAAGACAAACATCTCTGTCAATCAAGCTTCAGCAAACTTAGCTGAAGTTCTCCTCACAAGT
GACACACAGAAGTCACACTATCAGACCAATAACCAACAGAACAGGAATTTTTCTTCAAATTTTTCAAATCAAAATCGTGGTCACTTCTCCAACCGAGGCAACCGATACAG
GGAAAAGAGATCCAATGAAGTTTACATGGTTGGAAAGCTAGAGAATGGCCTATATAGACTTTTGGAAGAACCACAAGCCTCAACTGACAGCCAAATGGAAATAAAGGGAC
TTGAAGATGCATCTATAGTGAAGATCCAGAGAAGACTTGATGATGGAAGACAAGTGAACCTAGTTAGTTATGTTTTGACAACTTGTAAAATGGATATGTGGCACAAAAGA
TTAGGTCATCCATCATTTAAGATTTTGAGTCAAATACTCCAACTCTGTAAAGTTCCTATCAAAAGTAATGGAAAACCAGATTTTTGTGAGGCTTGCAAACTTGAACATGG
CATTGACATACAAATATCTTGTCCTTATGCATCGGCTCAAAATGGAAGAATTGAGAGAAAACATCGTCATATTGTCGAAACCGGGTTAGCCTTATTGGCTCAAGCTAAAA
TGCCTCTCCATCATTGGTGGGATGCCTTTCACACTGCTGTACATTTGATCAATAGGCTGCCATCTTCGGTTATTGATGGGAAATCTCCATTTGAACTATTAAAAGGGCAG
CAACCTGATTACAAAGGCCTCAAAACCTTTGGTGCAGCATGTTACCCTTGTCTTCGGCCTTATCAGCATCAGAAATTTGAGTTTCACACTGAGAAATGTGTGTTTTTAGG
CTGTACTAATACTCACAAAGGCTACCGGTGTATGTCTAAAACAGGGCTGATTTACATTTCCAGGCATGTATGTTTCAATGAGGGTGACTTTCCATACCAGGAACTTTTTA
TGCACCATGAACCCGAGCCTCCAATCAGCACAACAACCATCTTATCTTGGCTGCCACTGCCCTCCACCATTAACCCCCATATCCATCCCACACCTATGCCTTCTGAACTG
TCATCCTCCACTGGTCTAGCATCCCCTCCCAGCCACACTCCCCGCCCCCGCCTTTGCCTTCTCCTTCTGTTTCATCTCCCATCACTTCCCCATCGCCTTGTCCTCTCCCT
TCCCTCCGGCACACAGTAG
Protein sequenceShow/hide protein sequence
MSIKLDQTNFLLWQNIVLLVLKSYKLEGHLSGKTPAPDMTIIVPPSEEDPLGLKIPNPEYDLWLAADQLLVGWLYNSMTPEIATQVMGHEEAKNLWDSIQEYYGVQSRSQ
EDYNRLMLQQTRKGTMMMYEYFETMKKYFDNLQMVGFPMDMRSFISHVTAGLDEEYTPIVCVIRNQNMTWSEIQLELLSFEQRQDRLQALKTNISVNQASANLAEVLLTS
DTQKSHYQTNNQQNRNFSSNFSNQNRGHFSNRGNRYREKRSNEVYMVGKLENGLYRLLEEPQASTDSQMEIKGLEDASIVKIQRRLDDGRQVNLVSYVLTTCKMDMWHKR
LGHPSFKILSQILQLCKVPIKSNGKPDFCEACKLEHGIDIQISCPYASAQNGRIERKHRHIVETGLALLAQAKMPLHHWWDAFHTAVHLINRLPSSVIDGKSPFELLKGQ
QPDYKGLKTFGAACYPCLRPYQHQKFEFHTEKCVFLGCTNTHKGYRCMSKTGLIYISRHVCFNEGDFPYQELFMHHEPEPPISTTTILSWLPLPSTINPHIHPTPMPSEL
SSSTGLASPPSHTPRPRLCLLLLFHLPSLPHRLVLSLPSGTQ