; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026497 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026497
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:37847804..37856366
RNA-Seq ExpressionLag0026497
SyntenyLag0026497
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8492785.1 hypothetical protein CXB51_010233 [Gossypium anomalum]1.6e-11632.82Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVK-----------------------
        MA  +Y+I   D    F LW+ K++AVL Q     AL    K+P+TL++++K+  ++ A   L L+LSN +L+  +K                       
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVK-----------------------

Query:  ------------------------------------------------------------TALKYGRVSITTDAIISAIRTKELELQSLKKETS--EGLY
                                                                      + Y R S+T D +  ++ + +     + K  S  EGL 
Subjt:  ------------------------------------------------------------TALKYGRVSITTDAIISAIRTKELELQSLKKETS--EGLY

Query:  AKGKSKPK---------------ENKNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRK-NQYHRNKKNKQPEASVGENTIT--YSDSLATTGECSNQSAC
         +G+                   ++K +S+   +GK  CN+C K+GHIK +CY L+ K  +   N+K KQPE S GE  +   YSD        ++    
Subjt:  AKGKSKPK---------------ENKNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRK-NQYHRNKKNKQPEASVGENTIT--YSDSLATTGECSNQSAC

Query:  MEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK-------------------------------------------------DVEMVQSA
         E   W++DSGC+FHM+P++ WF+TY+    G+V MGNN +CK                                                  V  +   
Subjt:  MEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK-------------------------------------------------DVEMVQSA

Query:  LLVTDQGQSE----------------------------TDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEY
         LV  +GQ +                            T LWH RL H+S  G+  LSK+G+L       +NFCEHCV GK KR  FT+  H+ K  LEY
Subjt:  LLVTDQGQSE----------------------------TDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEY

Query:  IHSDLWGPASTNSLSGSR------------------------------TVRHTPQQNGVAERLNRTIMERV----------------------------S
        IHSDLWGP+   S +G +                              TVRHTPQQN VAER+NRTIME+V                             
Subjt:  IHSDLWGPASTNSLSGSR------------------------------TVRHTPQQNGVAERLNRTIMERV----------------------------S

Query:  HTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTI
          +IE  TP+E W G P     LKIFGC  + H N+GKL+PR+ KC+FLG+  GVKGY+LW P+ R+ + SRDV F E  M     N +    S ++   
Subjt:  HTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTI

Query:  EVEHNVHPQNS---ETDIEN--------TEARDRQRRNIVPPLRFSEADCISLALSVADSLNI-EEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWT
        +VEH ++ +++    T IEN        + A++R +R I PP +++EAD ++ AL+VA+ ++  +EP ++  A++  +S +W+ AM EEM SL +N TW 
Subjt:  EVEHNVHPQNS---ETDIEN--------TEARDRQRRNIVPPLRFSEADCISLALSVADSLNI-EEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWT

Query:  LKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTG
        L  LPKG K +  KW++K KE   GV +P++KARLVAKGY+Q  G+D+++VFSPVVK +SIR LL  V   +LEL+QLDVKT FLHG L E IYM+QP G
Subjt:  LKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTG

Query:  YIKSE
        +  SE
Subjt:  YIKSE

KAG8495931.1 hypothetical protein CXB51_009295 [Gossypium anomalum]1.6e-11633.12Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVK-----------------------
        MA  +Y+I   D    F LW+ K++AVL Q     AL    K+P+TL++++K+  ++ A   L L+LSN +L+  +K                       
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVK-----------------------

Query:  ------------------------------------------------------------TALKYGRVSITTDAIISAIRTKELELQSLKKETS--EGLY
                                                                      + Y R S+T D +  ++ + +     + K  S  EGL 
Subjt:  ------------------------------------------------------------TALKYGRVSITTDAIISAIRTKELELQSLKKETS--EGLY

Query:  AKGKS----------KPKEN-----KNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRK-NQYHRNKKNKQPEASVGENTIT--YSDSLATTGECSNQSAC
         +G+            P+ N     K +S+   +GK  CN+C K+GHIK +CY L+ K  +   N+K KQPE S GE  +   YSD         N S  
Subjt:  AKGKS----------KPKEN-----KNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRK-NQYHRNKKNKQPEASVGENTIT--YSDSLATTGECSNQSAC

Query:  MEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK--------------------DVEMV-----------------------------QSA
         E  +W++DSGC+FHM+P++ WF+TY+    G+V MGNN +CK                    DV+ V                                
Subjt:  MEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK--------------------DVEMV-----------------------------QSA

Query:  LLVTDQGQSE----------------------------TDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEY
         LV  +GQ +                            T LWH RL H+S  G+  LSK+G+L       +NFCEHCV GK KR  FT+  H+ K  LEY
Subjt:  LLVTDQGQSE----------------------------TDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEY

Query:  IHSDLWGPASTNSLSGSR----------------------------------------------TVRHTPQQNGVAERLNRTIMERV-------------
        IHSDLWGP+   S  G+                                               TVRHTPQQNGVAER+NRTIME+V             
Subjt:  IHSDLWGPASTNSLSGSR----------------------------------------------TVRHTPQQNGVAERLNRTIMERV-------------

Query:  ---------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFML
                          +IE  TP+E W G P     LKIF C  +VH N+GKL+PR+ KC+FLG+  GVKGY+LW P+ R+ + SRDV F E  M   
Subjt:  ---------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFML

Query:  QQNPADSSQSKEDTTIEVEHNVHPQNS---ETDIEN--------TEARDRQRRNIVPPLRFSEADCISLALSVADSLNI-EEPISFEAAVNGPNSREWIT
          N +    S ++   +VEH ++ +++    T IEN        + A++R +R I PP +++EAD ++ AL+VA+ ++  +EP ++  A++  +S +W+ 
Subjt:  QQNPADSSQSKEDTTIEVEHNVHPQNS---ETDIEN--------TEARDRQRRNIVPPLRFSEADCISLALSVADSLNI-EEPISFEAAVNGPNSREWIT

Query:  AMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTF
        AM EEM SL +N TW L  LPKG K +  KW++K KEG  GV +P++KARLVAKGY+Q  G+D+++VFSPVVK +SIR LL  V   +LEL+QLDVKT F
Subjt:  AMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTF

Query:  LHGYLTETIYMKQPTGYIKSE
        LHG L E IYM+QP G+  SE
Subjt:  LHGYLTETIYMKQPTGYIKSE

KAG8496287.1 hypothetical protein CXB51_007458 [Gossypium anomalum]8.7e-11833.08Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVK-----------------------
        MA  +Y+I   D    F LW+ K++AVL Q     AL    K+P+TL++++K+  ++ A   L L+LSN +L+  +K                       
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVK-----------------------

Query:  --------------------------------TALKYGRVSITTDAIISAIRTKELELQSLKKETS--EGLYAKGKSKPK---------------ENKNQ
                                         A++  R S+T D +  ++ + +     + K  S  EGL  +G+                   ++K +
Subjt:  --------------------------------TALKYGRVSITTDAIISAIRTKELELQSLKKETS--EGLYAKGKSKPK---------------ENKNQ

Query:  SEDKEKGKVRCNYCHKEGHIKRDCYSLKRK-NQYHRNKKNKQPEASVGENTIT--YSDSLATTGECSNQSACMEKFDWVIDSGCSFHMTPSKGWFSTYKE
        S+   +GK  CN+C K+GHIK +CY L+ K  +   N+K KQPE S GE  +   YSD        ++     E   W++DSGC+FHM+P++ WF+TY+ 
Subjt:  SEDKEKGKVRCNYCHKEGHIKRDCYSLKRK-NQYHRNKKNKQPEASVGENTIT--YSDSLATTGECSNQSACMEKFDWVIDSGCSFHMTPSKGWFSTYKE

Query:  WDGGMVYMGNNHTCK-------------------------------------------------DVEMVQSALLVTDQGQSE------------------
           G+V MGNN +CK                                                  V  +    LV  +GQ +                  
Subjt:  WDGGMVYMGNNHTCK-------------------------------------------------DVEMVQSALLVTDQGQSE------------------

Query:  ----------TDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGSR----------
                  T LWH RL H+S  G+  LSK+G+L       +NFCEHCV GK KR  FT+  H+ K  LEYIHSDLWGP+   S  G+           
Subjt:  ----------TDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGSR----------

Query:  --------------------------------------------------------------TVRHTPQQNGVAERLNRTIMERVSHTS-----------
                                                                      TVRHTPQQNGVAER+NRTIME+    S           
Subjt:  --------------------------------------------------------------TVRHTPQQNGVAERLNRTIMERVSHTS-----------

Query:  --IEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIE
          IE  TP+E W G P     LKIFGC  + H N+GKL+PR+ KC+FLG+  GVKGY+LW P+ R+ + SRDV F E  M     N +    S ++   +
Subjt:  --IEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIE

Query:  VEHNVHPQNS---ETDIEN--------TEARDRQRRNIVPPLRFSEADCISLALSVADSLNI-EEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTL
        VEH ++ +++    T IEN        + A++R +R I PP +++EAD ++ AL+VA+ ++  +EP ++  A++  +S +W+ AM EEM SL +N TW L
Subjt:  VEHNVHPQNS---ETDIEN--------TEARDRQRRNIVPPLRFSEADCISLALSVADSLNI-EEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTL

Query:  KPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGY
          LPKG K +  KW++K KEG  GV +P++KARLVAKGY+Q  G+D+++VFSPVVK +SIR LL  V   +LEL+QLDVKT FLHG L E IYM+QP G+
Subjt:  KPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGY

Query:  IKSE
          SE
Subjt:  IKSE

KAG8503608.1 hypothetical protein CXB51_001612 [Gossypium anomalum]5.6e-11732.93Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVK-----------------------
        MA  +Y+I   D    F LW+ K++AVL Q     AL    K+P+TL++++K+  ++ A   L L+LSN +L+  +K                       
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVK-----------------------

Query:  ------------------------------------------------------------TALKYGRVSITTDAIISAIRTKELELQSLKKETS--EGLY
                                                                      + Y R S+T D +  ++ + +     + K  S  EGL 
Subjt:  ------------------------------------------------------------TALKYGRVSITTDAIISAIRTKELELQSLKKETS--EGLY

Query:  AKGKSKPK---------------ENKNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRK-NQYHRNKKNKQPEASVGENTIT--YSDSLATTGECSNQSAC
         +G+                   ++K +S+   +GK  CN+C K+GHIK +CY L+ K  +   N+K KQPE S GE  +   YSD        ++    
Subjt:  AKGKSKPK---------------ENKNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRK-NQYHRNKKNKQPEASVGENTIT--YSDSLATTGECSNQSAC

Query:  MEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK--------------------DVEMVQSA---------LLVTDQGQSETD---LWHKR
         E   W++DSGC+FHM+P++ WF+TY+    G+V MGNN +CK                    DV  V             +V     S+ D   LWH R
Subjt:  MEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK--------------------DVEMVQSA---------LLVTDQGQSETD---LWHKR

Query:  LSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGSR---------------------------
        L H+S  G+  LSK+G+L       +NFCEHCV GK KR  FT+  H+ K  LEYIHSDLWGP+   S  G+                            
Subjt:  LSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGSR---------------------------

Query:  ---------------------------------------------TVRHTPQQNGVAERLNRTIMERV----------------------------SHTS
                                                     TVRHTPQQNGVAER+NRTIME+V                               +
Subjt:  ---------------------------------------------TVRHTPQQNGVAERLNRTIMERV----------------------------SHTS

Query:  IEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVE
        IE  TP+E W G P     LKIFGC  + H N+GKL+PR+ KC+FLG+  GVKGY+LW P+ R+ + SRDV F E  M     N +    S ++   +VE
Subjt:  IEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVE

Query:  HNVHPQNS---ETDIEN--------TEARDRQRRNIVPPLRFSEADCISLALSVADSLNI-EEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKP
        H ++ +++    T IEN        + A++R +R I PP +++EAD ++ AL+VA+ ++  +EP ++  A++  +S +W+ AM EEM SL +N TW L  
Subjt:  HNVHPQNS---ETDIEN--------TEARDRQRRNIVPPLRFSEADCISLALSVADSLNI-EEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKP

Query:  LPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGYIK
        LPKG K +  KW++K KEG  GV +P++KARL+AKGY+Q  G+D+++VFSPVVK +SIR LL  +   +LEL+QLDVKT FLHG L E IYM+QP G+  
Subjt:  LPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGYIK

Query:  SE
        SE
Subjt:  SE

RVW99173.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.1e-12034.66Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVKTA------LK-------------
        M  A++D+E+F GK DF LW+ K+RA+L QQ    AL     LP+T+ E  K  + + A+ A+IL+L + VLR+  K        LK             
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVKTA------LK-------------

Query:  -YGRVSITTDAIISAIRTKE---------LELQSLKKETS------------EGLYAKGK------------SKPKENKNQSEDKEKGKVRCNYCHKEGH
         + ++ + T  +   +  +E         L+L+++    S            +  Y   K             + K +K++S+ K K K +C  CHKEGH
Subjt:  -YGRVSITTDAIISAIRTKE---------LELQSLKKETS------------EGLYAKGK------------SKPKENKNQSEDKEKGKVRCNYCHKEGH

Query:  IKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK-------
         K+DC   +R+N     K   + +A+V    +   DS     E  N +      +W++DSGCSFHM P K WF  +KE DGG V +GNN  CK       
Subjt:  IKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK-------

Query:  -------------DVE------------------------------------------------------MVQSALLVTDQGQSETDLWHKRLSHISSKG
                     DV                                                       ++  A  V  +    T LWH+RL H+S KG
Subjt:  -------------DVE------------------------------------------------------MVQSALLVTDQGQSETDLWHKRLSHISSKG

Query:  LQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGS--------------------------------RTVR
        LQ L KQG+L      ++ FCEHCV GKA R  F K+ H  +  L+YIHSDLWGP+   S+ G+                                RTVR
Subjt:  LQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGS--------------------------------RTVR

Query:  HTPQQNGVAERLNRTIMERV----------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGF
        +TPQQNG+AER+NRTI+ER+                              ++++F TP+E+W GK    QHLK+FGC  +VH    KL+PRA KC+FLG+
Subjt:  HTPQQNGVAERLNRTIMERV----------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGF

Query:  TEGVKGYRLWHPKERR--CINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVEH-NVHPQNSETDIENTE---------------------ARDRQRRN
         +GVKGY+LW   + +  CI SRDVTF EQD  M +Q PA   +  +    EVEH  + P+ S+     T                       RDRQ+R 
Subjt:  TEGVKGYRLWHPKERR--CINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVEH-NVHPQNSETDIENTE---------------------ARDRQRRN

Query:  IVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKG
        + PP R+ + +  + ALSVA+ +   EP +++ A+N     +W+ A+ EEM SL +N TW L   PK  K + SKW++K K+G  G   PR+KARLVAKG
Subjt:  IVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKG

Query:  YTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMK
        ++Q+EG+DY+E+FSPVVK +SIR+LL+FV  ++LELDQLDVKT FLHG L E IYM+
Subjt:  YTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMK

TrEMBL top hitse value%identityAlignment
A0A2N9IUS6 Uncharacterized protein2.1e-15430.74Show/hide
Query:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTV
        +G L YFLG++A      LHL+Q KY +DLL R   +  KPISS  + G  LSL DG PL   TE+RS+VGALQY TLTR DI F VN++CQFLH PTT 
Subjt:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTV

Query:  HMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIG
        H   VKR+LRYL  T   G+   P  S  +  Y+DAD+A CPDDR ST GYC +LG N +SW++ KQ+ VSRSS E EYR ++  AAE++W++S+  ++G
Subjt:  HMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIG

Query:  ALSSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTVVSHLCSLAGDD--------
           S  P + CDN+S+  +A+NP+ H RTKH+EVD+H++R+K+ RK+L V ++ + DQVADIFTK L   RF  L  KL V S   SL G D        
Subjt:  ALSSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTVVSHLCSLAGDD--------

Query:  --------------------------------IIKFFGSKSHGAKINIIEDVSGVIKPGRHVSNSNS----------------GIRAQDQS---------
                                         I    + S  ++I+  + +   +K    + + +                  IR    S         
Subjt:  --------------------------------IIKFFGSKSHGAKINIIEDVSGVIKPGRHVSNSNS----------------GIRAQDQS---------

Query:  --RNQVLESRMA--------------------------------------VARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHA-------LTEPSK---
          R+  L+++ A                                        +Y+IEKF+G  +F LWK +I+AVL +   + A       +T+  K   
Subjt:  --RNQVLESRMA--------------------------------------VARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHA-------LTEPSK---

Query:  ----------------------------------------------------------LPTTLSEDDKETINQI--------------------------
                                                                  + +T+  D   T+N +                          
Subjt:  ----------------------------------------------------------LPTTLSEDDKETINQI--------------------------

Query:  AYGALILNLSNNVLRQEVKTALKYGRVSITTDAIISAIRTKELELQSLK-----KETSEGL-YAKGKSK---PKENKNQSEDKEKGK--VRCNYCHKEGH
        +Y  LI+NL+NN+   EV          +  D I +A+  +E   +S +      + +E L   +G+S    P  ++NQS  K + K  V+C++C K+GH
Subjt:  AYGALILNLSNNVLRQEVKTALKYGRVSITTDAIISAIRTKELELQSLK-----KETSEGL-YAKGKSK---PKENKNQSEDKEKGK--VRCNYCHKEGH

Query:  IKRDCYSLKRKNQYHRNKKNKQPEASVGENTI--TYSDSLATTGECSNQSACMEKFD--WVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHT-----
         KR+C+ LK+  +     K K PE+S  +  +  T  D      E +  +    +F   W++DSG ++HMTP + WF  Y+   GG V+MG++H      
Subjt:  IKRDCYSLKRKNQYHRNKKNKQPEASVGENTI--TYSDSLATTGECSNQSACMEKFD--WVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHT-----

Query:  ---------------------------------------CKD------VEMVQSALLV-----------------TDQGQSETDL----------WHKRL
                                               CK       +++V+ AL++                   +G++ T L          WH++L
Subjt:  ---------------------------------------CKD------VEMVQSALLV-----------------TDQGQSETDL----------WHKRL

Query:  SHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGSR----------------------------
         H+S +GL++L++Q +LP     ++ FCEHCV  K  R  F+ S   +K ILE IHSD+W  A   SL G++                            
Subjt:  SHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGSR----------------------------

Query:  --------------------------------------------TVRHTPQQNGVAERLNRTIMERV----------------------------SHTSI
                                                    TV +TPQQNGVAER+NRT++ER                               T+I
Subjt:  --------------------------------------------TVRHTPQQNGVAERLNRTIMERV----------------------------SHTSI

Query:  EFLTPEERWFGKPPKLQHLKIFGCVGFV---HQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIE
        +  TP E W GKP    HL  FG   +V    Q   KL P++ KC+FLG+ +GVKGYRLW P  R+ + SRDV F E D    ++N + S +  +  T++
Subjt:  EFLTPEERWFGKPPKLQHLKIFGCVGFV---HQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIE

Query:  VEHNVH---PQNSETDIENTEARDRQRRNIVPPLRFSEADCISLAL--------SVADSLNIE--EPISFEAAVNGPNSREWITAMNEEMRSLEENGTWT
        VE +     P +SE   E+ E    Q     P +R S  +    A         +VA  L  E  EP +F  A     +  W+ AM EE+ +L +N TW 
Subjt:  VEHNVH---PQNSETDIENTEARDRQRRNIVPPLRFSEADCISLAL--------SVADSLNIE--EPISFEAAVNGPNSREWITAMNEEMRSLEENGTWT

Query:  LKPLPKGCKPIASKWIYKVK-EGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPT
        L PLP+G K I +KW+YK+K +G D V   R++ARLV KG+ Q+EGID++E+FSPVV+LT++R++L+     +L L+QLDVKT FLHG L E IYM QP 
Subjt:  LKPLPKGCKPIASKWIYKVK-EGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPT

Query:  GY
        G+
Subjt:  GY

A0A438FEW0 Retrovirus-related Pol polyprotein from transposon TNT 1-943.7e-11433.86Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQ--------------------------
        M  A++D+EKF GK DF LW+ K+RA+L QQ    AL     LP+T+ E  K  + + A+ A+IL+L + VLR+                          
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQ--------------------------

Query:  ---------------------------------------------------------EVKTALKYGRVSITTDAIISAIRTKELELQ-SLKKETSEGLYA
                                                                  +K A+ YGR S+T D + S +  +EL  Q   K+E+ EGL  
Subjt:  ---------------------------------------------------------EVKTALKYGRVSITTDAIISAIRTKELELQ-SLKKETSEGLYA

Query:  KGKS-----KPKENKNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRK---------------NQYHRNKKNKQPEASVGEN-----------TITYSDSL
        +G+S     K K +K++S+ K K K +C  CHKEGH K+DC   ++                + Y   +     E   G N            I + D +
Subjt:  KGKS-----KPKENKNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRK---------------NQYHRNKKNKQPEASVGEN-----------TITYSDSL

Query:  ATTGECSNQSACMEK----FDWVIDSGCSFHMTP-----SKGWFSTYKEW--DGGMVYMGNNHTCKDVEMVQSALLVTDQGQSETDLWHKRLSHISSKGL
            E       +++       +  SG +F   P     ++G  +  K    +G    +G   T K      S +L  D G   T LWH+RL HIS +GL
Subjt:  ATTGECSNQSACMEK----FDWVIDSGCSFHMTP-----SKGWFSTYKEW--DGGMVYMGNNHTCKDVEMVQSALLVTDQGQSETDLWHKRLSHISSKGL

Query:  QMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGS-------------------------------------
        Q L KQG+L      ++ FCEHCV GKA R  F K+ H  +  L+YIHSDLWGP+   S+ G+                                     
Subjt:  QMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGS-------------------------------------

Query:  -------------RTVRHTPQQNGVAERLNRTIMERV----------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQ
                     RTVR+TPQQNG+AER+NRTI+ERV                              ++++F TP+E+W GK    QHLK+FGC  +VH 
Subjt:  -------------RTVRHTPQQNGVAERLNRTIMERV----------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQ

Query:  NHGKLKPRAEKCMFLGFTEGVKGYRLWHPKE--RRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVEH-NVHPQNS-ETDIENTE-----------
           KL+PRA KC+FLG+ +GVKGY+LW   +   +CI S DVTF EQD  M +Q PA   +  +    EVEH  + P+ S ET  + T+           
Subjt:  NHGKLKPRAEKCMFLGFTEGVKGYRLWHPKE--RRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVEH-NVHPQNS-ETDIENTE-----------

Query:  ---------ARDRQRRNIVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEG
                  RDRQ+R + PP R+ +A+  + ALSVA+ +   EP +++ A+N   + +W+ A+ EEM SL +N TW L   PK  K + SKW++K K+G
Subjt:  ---------ARDRQRRNIVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEG

Query:  IDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGYIKSEGLK
          G   PR+KARLVAKG++Q+EG+DY+E+FSPVVK +SIR+LL+FV  ++LELDQLDVKT FLHG L E IYM+ P G+   EG+K
Subjt:  IDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGYIKSEGLK

A0A438IR25 Retrovirus-related Pol polyprotein from transposon TNT 1-945.3e-12134.66Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVKTA------LK-------------
        M  A++D+E+F GK DF LW+ K+RA+L QQ    AL     LP+T+ E  K  + + A+ A+IL+L + VLR+  K        LK             
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVKTA------LK-------------

Query:  -YGRVSITTDAIISAIRTKE---------LELQSLKKETS------------EGLYAKGK------------SKPKENKNQSEDKEKGKVRCNYCHKEGH
         + ++ + T  +   +  +E         L+L+++    S            +  Y   K             + K +K++S+ K K K +C  CHKEGH
Subjt:  -YGRVSITTDAIISAIRTKE---------LELQSLKKETS------------EGLYAKGK------------SKPKENKNQSEDKEKGKVRCNYCHKEGH

Query:  IKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK-------
         K+DC   +R+N     K   + +A+V    +   DS     E  N +      +W++DSGCSFHM P K WF  +KE DGG V +GNN  CK       
Subjt:  IKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFDWVIDSGCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCK-------

Query:  -------------DVE------------------------------------------------------MVQSALLVTDQGQSETDLWHKRLSHISSKG
                     DV                                                       ++  A  V  +    T LWH+RL H+S KG
Subjt:  -------------DVE------------------------------------------------------MVQSALLVTDQGQSETDLWHKRLSHISSKG

Query:  LQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGS--------------------------------RTVR
        LQ L KQG+L      ++ FCEHCV GKA R  F K+ H  +  L+YIHSDLWGP+   S+ G+                                RTVR
Subjt:  LQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGS--------------------------------RTVR

Query:  HTPQQNGVAERLNRTIMERV----------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGF
        +TPQQNG+AER+NRTI+ER+                              ++++F TP+E+W GK    QHLK+FGC  +VH    KL+PRA KC+FLG+
Subjt:  HTPQQNGVAERLNRTIMERV----------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGF

Query:  TEGVKGYRLWHPKERR--CINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVEH-NVHPQNSETDIENTE---------------------ARDRQRRN
         +GVKGY+LW   + +  CI SRDVTF EQD  M +Q PA   +  +    EVEH  + P+ S+     T                       RDRQ+R 
Subjt:  TEGVKGYRLWHPKERR--CINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVEH-NVHPQNSETDIENTE---------------------ARDRQRRN

Query:  IVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKG
        + PP R+ + +  + ALSVA+ +   EP +++ A+N     +W+ A+ EEM SL +N TW L   PK  K + SKW++K K+G  G   PR+KARLVAKG
Subjt:  IVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKG

Query:  YTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMK
        ++Q+EG+DY+E+FSPVVK +SIR+LL+FV  ++LELDQLDVKT FLHG L E IYM+
Subjt:  YTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMK

A0A5A7UB25 Putative gag-pol polyprotein5.7e-11531.46Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLR---------------------------
        MA  R+++ KF+G GDF LW+ KIRA+L Q K    L E  +LP  ++E +K  ++++AY  ++L LS+ VLR                           
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLR---------------------------

Query:  --------------------------------------------------------QEVKTALKYGRVSITTDAIISAIRTKELELQSLKKETSEGLYAK
                                                                +EVK A+KYGR S+T   ++ A++T+ LE++  +K+  E L A+
Subjt:  --------------------------------------------------------QEVKTALKYGRVSITTDAIISAIRTKELELQSLKKETSEGLYAK

Query:  GKSKPKENKNQS---EDKEKGKVR-CNYCHKEGHIKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFD----WVIDSGC
        G+S+ K  K +      K KGK R C  CHKEGH K++C  L +  +   ++ N     +  E T  Y    A TG  S +   +   D    W++DSGC
Subjt:  GKSKPKENKNQS---EDKEKGKVR-CNYCHKEGHIKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFD----WVIDSGC

Query:  SFHMTPSKGWFSTYKEWDGGMVYMGNNHTCKDVEMVQSALLVTDQG-------------------------------QSE--------------------
        +FHMTP + + + +++ DGG V +G+N TC DV+   S  + T  G                               +SE                    
Subjt:  SFHMTPSKGWFSTYKEWDGGMVYMGNNHTCKDVEMVQSALLVTDQG-------------------------------QSE--------------------

Query:  ---------------------TD---LWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSL
                             TD   LWHKRL+H+S +GLQ LS+QG+L       + FCEHC++GK+ R  F K +H+ KGIL+Y+HSDLWGP    S+
Subjt:  ---------------------TD---LWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSL

Query:  SGSR------------------------------------------------------------------------TVRHTPQQNGVAERLNRTIMERV-
         GSR                                                                        TV +TPQQNG+AER NRTIMER  
Subjt:  SGSR------------------------------------------------------------------------TVRHTPQQNGVAERLNRTIMERV-

Query:  ---------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKE--RRCIN
                                     T++   TP+E W GK P L+HL++FGC  + H   GKL  RA KCMF+G+ +GVKGY+LW  ++   +CI 
Subjt:  ---------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKE--RRCIN

Query:  SRDVTFREQDMFM------------------------------------------LQQNPADSSQSKED-TTIEVEHNVHPQNSETDIENTE-ARDRQRR
        SRDVTF E +M                                             QQ+  D  QS+++   I+    +   +S  D++N +  RDR +R
Subjt:  SRDVTFREQDMFM------------------------------------------LQQNPADSSQSKED-TTIEVEHNVHPQNSETDIENTE-ARDRQRR

Query:  NIVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAK
            P+R+  AD ++ AL+ A      EP++FE A+   + ++W  AM EE+ SL +N TW+L P P   K I SKWIYK+K G  G  KPR+KARLVAK
Subjt:  NIVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAK

Query:  GYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGY
        GYTQ+EG+D+ E+FSPVV+ +SIR++LS  V  ++ ++Q+DV T FLHG L E IYM QP GY
Subjt:  GYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGY

A0A5D3DNU1 Putative gag-pol polyprotein9.7e-11531.46Show/hide
Query:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLR---------------------------
        MA  R+++ KF+G GDF LW+ KIRA+L Q K    L E  +LP  ++E +K  ++++AY  ++L LS+ VLR                           
Subjt:  MAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLR---------------------------

Query:  --------------------------------------------------------QEVKTALKYGRVSITTDAIISAIRTKELELQSLKKETSEGLYAK
                                                                +EVK A+KYGR S+T   ++ A++T+ LE++  +K+  E L A+
Subjt:  --------------------------------------------------------QEVKTALKYGRVSITTDAIISAIRTKELELQSLKKETSEGLYAK

Query:  GKSKPKENKNQS---EDKEKGKVR-CNYCHKEGHIKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFD----WVIDSGC
        G+S+ K  K +      K KGK R C  CHKEGH K++C  L +  +   ++ N     +  E T  Y    A TG  S +   +   D    W++DSGC
Subjt:  GKSKPKENKNQS---EDKEKGKVR-CNYCHKEGHIKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFD----WVIDSGC

Query:  SFHMTPSKGWFSTYKEWDGGMVYMGNNHTCKDVEMVQSALLVTDQG-------------------------------QSE--------------------
        +FHMTP + + + +++ DGG V +G+N TC DV+   S  + T  G                               +SE                    
Subjt:  SFHMTPSKGWFSTYKEWDGGMVYMGNNHTCKDVEMVQSALLVTDQG-------------------------------QSE--------------------

Query:  ---------------------TD---LWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSL
                             TD   LWHKRL+H+S +GLQ LS+QG+L       + FCEHC++GK+ R  F K +H+ KGIL+Y+HSDLWGP    S+
Subjt:  ---------------------TD---LWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSL

Query:  SGSR------------------------------------------------------------------------TVRHTPQQNGVAERLNRTIMERV-
         GSR                                                                        TV +TPQQNG+AER NRTIMER  
Subjt:  SGSR------------------------------------------------------------------------TVRHTPQQNGVAERLNRTIMERV-

Query:  ---------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKE--RRCIN
                                     T++   TP+E W GK P L+HL++FGC  + H   GKL  RA KCMF+G+ +GVKGY+LW  ++   +CI 
Subjt:  ---------------------------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKE--RRCIN

Query:  SRDVTFREQDMFM------------------------------------------LQQNPADSSQSKED-TTIEVEHNVHPQNSETDIENTE-ARDRQRR
        SRDVTF E +M                                             QQ+  D  QS+++   I+    +   +S  D++N +  RDR +R
Subjt:  SRDVTFREQDMFM------------------------------------------LQQNPADSSQSKED-TTIEVEHNVHPQNSETDIENTE-ARDRQRR

Query:  NIVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAK
            P+R+  AD ++ AL+ A      EP++FE A+   + ++W  AM EE+ SL +N TW+L P P   K I SKWIYK+K G  G  KPR+KARLVAK
Subjt:  NIVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAK

Query:  GYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGY
        GYTQ+EG+D+ E+FSPVV+ +SIR++LS  V  ++ ++Q+DV T FLHG L E IYM QP GY
Subjt:  GYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQPTGY

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.4e-4035.69Show/hide
Query:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTL-TRLDISFVVNKLCQFLHAPTT
        + E+ +F+GI+       ++LSQ  YV  +L +  +E+C  +S+ +   +   L + S  D  T  RSL+G L Y  L TR D++  VN L ++     +
Subjt:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTL-TRLDISFVVNKLCQFLHAPTT

Query:  VHMMAVKRLLRYLKGTSSMGILISPCSSF--AITCYTDADWASCPDDRRSTSGYC-TFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVL
             +KR+LRYLKGT  M ++     +F   I  Y D+DWA    DR+ST+GY       NL+ WN+ +Q  V+ SS E+EY  +  A  E  W++ +L
Subjt:  VHMMAVKRLLRYLKGTSSMGILISPCSSF--AITCYTDADWASCPDDRRSTSGYC-TFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVL

Query:  AEIGALSSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKL
          I      P  +  DN     IA NP  H R KHI++ +HF RE++    + + ++P+E+Q+ADIFTK L + RF  LR KL
Subjt:  AEIGALSSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.3e-7127.25Show/hide
Query:  LTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVKTALKYGRVSITTDAIISAIRTKELELQSLKKETSEGLYAKGKSKPKEN----------K
        +T+ + L   + E+DK         A++L  S       + T + +G+ +I    + SA+   E +++   +   + L  +G+ +  +           +
Subjt:  LTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVKTALKYGRVSITTDAIISAIRTKELELQSLKKETSEGLYAKGKSKPKEN----------K

Query:  NQSEDKEKGKVR-CNYCHKEGHIKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFDWVIDSGCSFHMTPSKGWFSTYKE
         +S+++ K +VR C  C++ GH KRDC + ++       +KN    A++ +N       +    EC + S    + +WV+D+  S H TP +  F  Y  
Subjt:  NQSEDKEKGKVR-CNYCHKEGHIKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFDWVIDSGCSFHMTPSKGWFSTYKE

Query:  WDGGMVYMGN----------------NHTC----KDV------------------------------EMVQSALLVTD----------------------
         D G V MGN                N  C    KDV                               + + +L++                        
Subjt:  WDGGMVYMGN----------------NHTC----KDV------------------------------EMVQSALLVTD----------------------

Query:  QGQSETDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGS----------------
        Q +   DLWHKR+ H+S KGLQ+L+K+ ++    G  +  C++C+ GK  R SF  S      IL+ ++SD+ GP    S+ G+                
Subjt:  QGQSETDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDLWGPASTNSLSGS----------------

Query:  --------------------------------------------------------RTVRHTPQQNGVAERLNRTIMERV--------------------
                                                                +TV  TPQ NGVAER+NRTI+E+V                    
Subjt:  --------------------------------------------------------RTVRHTPQQNGVAERLNRTIMERV--------------------

Query:  --------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVH---QNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDM-------
                    + F  PE  W  K     HLK+FGC  F H   +   KL  ++  C+F+G+ +   GYRLW P +++ I SRDV FRE ++       
Subjt:  --------SHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVH---QNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKERRCINSRDVTFREQDM-------

Query:  ----------FMLQQNPADSSQSKEDTTIEVEH------NVHPQNSETD--IENTEARDRQRRNIVPPLRFSEADCISLA-------LSVADSLNIEEPI
                  F+   + +++  S E TT EV         V  Q  + D  +E  E    Q      PLR SE   +          + ++D     EP 
Subjt:  ----------FMLQQNPADSSQSKEDTTIEVEH------NVHPQNSETD--IENTEARDRQRRNIVPPLRFSEADCISLA-------LSVADSLNIEEPI

Query:  SFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFV
        S +  ++ P   + + AM EEM SL++NGT+ L  LPKG +P+  KW++K+K+  D  L  R+KARLV KG+ Q++GID+ E+FSPVVK+TSIR +LS  
Subjt:  SFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFV

Query:  VQDNLELDQLDVKTTFLHGYLTETIYMKQPTGY
           +LE++QLDVKT FLHG L E IYM+QP G+
Subjt:  VQDNLELDQLDVKTTFLHGYLTETIYMKQPTGY

P92519 Uncharacterized mitochondrial protein AtMg008108.1e-4246.6Show/hide
Query:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTV
        +G + YFLGIQ       L LSQ KY   +L  AG+  CKP+S+ + + L  S+S     D  +++RS+VGALQY TLTR DIS+ VN +CQ +H PT  
Subjt:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTV

Query:  HMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAW
            +KR+LRY+KGT   G+ I   S   +  + D+DWA C   RRST+G+CTFLG N++SW++ +Q  VSRSS E+EYR ++  AAE+ W
Subjt:  HMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-6846.59Show/hide
Query:  ELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTVHM
        EL YFLGI+A +    LHLSQ++Y+LDLL R  + + KP+++ M+    LSL  G+ L   TEYR +VG+LQY   TR DIS+ VN+L QF+H PT  H+
Subjt:  ELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTVHM

Query:  MAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIGAL
         A+KR+LRYL GT + GI +   ++ ++  Y+DADWA   DD  ST+GY  +LG + +SW+S KQK V RSS E+EYR ++N ++E+ W+ S+L E+G  
Subjt:  MAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIGAL

Query:  SSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTV
         + PP++ CDN+ ATY+ ANP+ H R KHI +D+HFIR ++    L V  V + DQ+AD  TK L    F    +K+ V
Subjt:  SSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-6645.36Show/hide
Query:  ELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTVHM
        +L YFLGI+A +  + LHLSQ++Y LDLL R  + + KP+++ M+    L+L  G+ L   TEYR +VG+LQY   TR D+S+ VN+L Q++H PT  H 
Subjt:  ELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTVHM

Query:  MAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIGAL
         A+KR+LRYL GT   GI +   ++ ++  Y+DADWA   DD  ST+GY  +LG + +SW+S KQK V RSS E+EYR ++N ++E+ W+ S+L E+G  
Subjt:  MAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIGAL

Query:  SSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTVV
         S PP++ CDN+ ATY+ ANP+ H R KHI +D+HFIR ++    L V  V + DQ+AD  TK L    F     K+ V+
Subjt:  SSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTVV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-5742.76Show/hide
Query:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTV
        +G L YFLG++ A+    +++ Q+KY LDLL   GL  CKP S  M   +  S   G        YR L+G L Y  +TRLDISF VNKL QF  AP   
Subjt:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTV

Query:  HMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIG
        H  AV ++L Y+KGT   G+  S  +   +  ++DA + SC D RRST+GYC FLG++L+SW S KQ+VVS+SS E+EYR +S A  E+ W+     E+ 
Subjt:  HMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIG

Query:  ALSSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTVVSHLCSLAG
           S P LL CDN +A +IA N + H RTKHIE D H +RE+   +  L     + D+  D FT+ L SP    LR  +  +  +  LAG
Subjt:  ALSSAPPLLLCDNMSATYIAANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTVVSHLCSLAG

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.7e-1545.68Show/hide
Query:  YCTLTRLDISFVVNKLCQFLHAPTTVHMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFL
        Y T+TR D++F VN+L QF  A  T  M AV ++L Y+KGT   G+  S  S   +  + D+DWASCPD RRS +G+C+ +
Subjt:  YCTLTRLDISFVVNKLCQFLHAPTTVHMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFL

ATMG00300.1 Gag-Pol-related retrotransposon family protein7.1e-1740Show/hide
Query:  VYMGNNHTC-----KDVEMVQSALLVTDQGQSETDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDL
        +  GN H         VE  +S L  T   + ET LWH RL+H+S +G+++L K+G L      ++ FCE C+ GK  R +F+  QH+ K  L+Y+HSDL
Subjt:  VYMGNNHTC-----KDVEMVQSALLVTDQGQSETDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGILEYIHSDL

Query:  WGPAS
        WG  S
Subjt:  WGPAS

ATMG00810.1 DNA/RNA polymerases superfamily protein5.7e-4346.6Show/hide
Query:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTV
        +G + YFLGIQ       L LSQ KY   +L  AG+  CKP+S+ + + L  S+S     D  +++RS+VGALQY TLTR DIS+ VN +CQ +H PT  
Subjt:  MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTV

Query:  HMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAW
            +KR+LRY+KGT   G+ I   S   +  + D+DWA C   RRST+G+CTFLG N++SW++ +Q  VSRSS E+EYR ++  AAE+ W
Subjt:  HMMAVKRLLRYLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.3e-1543.36Show/hide
Query:  EEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRML
        +EP S   A+  P    W  AM EE+ +L  N TW L P P     +  KW++K K   DG L  R KARLVAKG+ Q EGI + E +SPVV+  +IR +
Subjt:  EEPISFEAAVNGPNSREWITAMNEEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRML

Query:  LSFVVQDNLELDQ
        L+  V   LE+ Q
Subjt:  LSFVVQDNLELDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGAACTCTCCTACTTTCTTGGCATTCAAGCTGCTAAAGATGGTAAGAATCTGCATTTATCTCAACAGAAGTATGTTCTGGATCTCCTAAAAAGAGCAGGTTTGGA
ATCATGCAAGCCCATTTCTTCTCTCATGTCTGTTGGTCTGCCGTTGTCCCTCTCTGATGGTTCACCACTGGACTCAGTCACTGAATATCGCAGCTTGGTTGGGGCACTTC
AGTATTGTACCCTCACAAGACTTGACATCAGTTTTGTAGTGAATAAACTTTGTCAGTTCCTCCATGCCCCTACCACGGTCCATATGATGGCAGTTAAGCGACTTCTGAGA
TACCTCAAAGGTACATCCTCCATGGGCATTTTGATTTCCCCTTGTTCTTCTTTTGCTATCACCTGTTATACAGATGCCGACTGGGCAAGCTGTCCAGACGACCGAAGGAG
TACAAGCGGTTACTGCACGTTTTTGGGATCAAATTTGGTTTCATGGAATTCAGCTAAGCAGAAGGTAGTCTCAAGGTCAAGCGTTGAGTCCGAGTATCGGGGTATGTCGA
ATGCAGCAGCTGAAATAGCTTGGGTCCAATCCGTTCTTGCTGAAATTGGTGCTCTCTCTTCTGCTCCTCCATTACTGCTCTGTGATAACATGTCTGCTACCTATATCGCT
GCCAACCCTATTATGCACCATCGCACTAAACACATCGAAGTTGACCACCATTTCATCAGAGAAAAGATAGCTCGAAAGCAACTTCTTGTGCGGTTTGTTCCCTCTGAAGA
CCAAGTAGCTGATATATTCACAAAGAGCTTGCATTCTCCTCGGTTTTTTGGTCTAAGAACCAAGCTCACAGTGGTTTCTCATCTGTGTTCGCTTGCGGGGGATGACATCA
TCAAGTTTTTCGGTTCAAAGTCACATGGAGCTAAAATTAACATAATTGAAGATGTTAGTGGTGTCATAAAGCCAGGAAGGCACGTGTCCAACTCTAACAGTGGTATCAGA
GCCCAAGATCAATCAAGAAACCAAGTTCTTGAGTCAAGAATGGCAGTGGCAAGGTATGACATCGAGAAGTTTGATGGGAAAGGAGATTTTGACCTATGGAAGGCCAAAAT
CAGAGCAGTTCTTGGACAACAAAAGGCAATTCATGCCCTTACCGAACCATCAAAGTTACCTACCACATTATCTGAAGACGACAAAGAGACAATAAACCAGATAGCCTATG
GAGCATTGATCTTGAACCTAAGCAACAATGTCCTACGACAAGAAGTAAAAACTGCATTGAAATATGGTAGGGTCTCAATTACTACAGATGCCATAATTTCAGCCATCAGA
ACAAAGGAATTGGAATTACAATCACTGAAAAAGGAGACTTCTGAAGGGCTTTATGCAAAGGGAAAATCCAAACCAAAAGAGAATAAGAATCAGTCTGAAGATAAAGAGAA
AGGCAAAGTTAGGTGTAACTATTGCCACAAAGAAGGCCATATTAAAAGAGACTGCTACTCTTTGAAAAGAAAGAACCAATATCACAGAAACAAGAAAAATAAACAGCCAG
AAGCCTCGGTTGGAGAGAATACAATTACATACTCGGATTCTCTTGCAACTACAGGAGAGTGCAGTAATCAATCAGCTTGTATGGAGAAGTTCGATTGGGTTATTGACTCA
GGTTGTTCTTTCCACATGACCCCATCCAAGGGTTGGTTCAGCACCTACAAGGAATGGGATGGAGGAATGGTATATATGGGGAACAACCACACTTGCAAGGATGTTGAAAT
GGTTCAATCGGCACTTCTAGTCACAGATCAAGGCCAATCAGAGACTGATTTGTGGCACAAGAGGCTGTCCCATATTAGCTCAAAGGGACTCCAAATGCTGTCCAAACAAG
GAATACTTCCTCAAGGTACGGGAGACAATATGAACTTTTGTGAACATTGTGTAGTTGGCAAGGCAAAAAGACAAAGCTTCACAAAGTCACAACACTCAGCAAAGGGAATT
CTGGAGTATATACATTCTGACCTTTGGGGACCAGCATCTACAAATTCACTAAGTGGTTCGAGGACCGTAAGGCACACTCCACAGCAAAATGGTGTGGCTGAAAGGCTTAA
CAGGACCATAATGGAGCGTGTGTCCCATACATCTATTGAATTCCTAACCCCTGAAGAAAGATGGTTTGGCAAGCCACCAAAGCTACAACATCTCAAGATATTTGGATGTG
TTGGCTTTGTGCATCAAAATCATGGAAAGCTCAAGCCGAGAGCTGAAAAATGCATGTTTTTGGGCTTTACAGAAGGGGTGAAAGGATACAGGTTGTGGCATCCAAAAGAG
AGGCGTTGTATAAACAGTAGGGATGTTACTTTTAGGGAGCAAGACATGTTTATGCTCCAACAAAACCCTGCAGACAGCTCTCAATCTAAGGAAGATACAACAATTGAGGT
GGAGCATAATGTGCATCCTCAAAATTCAGAGACAGACATTGAGAATACAGAAGCTAGAGACAGACAAAGGAGGAATATAGTTCCTCCATTAAGGTTCAGTGAGGCTGACT
GCATATCCTTGGCACTTAGTGTTGCAGATTCCTTAAACATCGAAGAACCTATCAGTTTTGAAGCCGCTGTAAATGGTCCTAACAGTCGAGAATGGATTACAGCCATGAAT
GAGGAAATGAGGTCCTTAGAAGAAAATGGAACATGGACACTCAAGCCTCTACCGAAAGGGTGTAAGCCCATTGCTTCAAAATGGATTTACAAAGTAAAAGAAGGCATAGA
TGGAGTCTTAAAACCTAGATTCAAGGCTAGGTTGGTGGCCAAAGGATACACACAGAGAGAAGGAATTGATTATTCAGAAGTATTCTCTCCAGTGGTGAAGCTAACTTCTA
TAAGGATGCTTCTCTCATTTGTAGTTCAAGATAACCTGGAGCTAGATCAGCTAGATGTAAAGACAACCTTTCTCCATGGTTATCTAACAGAGACAATCTATATGAAGCAG
CCAACCGGGTACATTAAAAGTGAAGGATTGAAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACCGTTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCGAACTCTCCTACTTTCTTGGCATTCAAGCTGCTAAAGATGGTAAGAATCTGCATTTATCTCAACAGAAGTATGTTCTGGATCTCCTAAAAAGAGCAGGTTTGGA
ATCATGCAAGCCCATTTCTTCTCTCATGTCTGTTGGTCTGCCGTTGTCCCTCTCTGATGGTTCACCACTGGACTCAGTCACTGAATATCGCAGCTTGGTTGGGGCACTTC
AGTATTGTACCCTCACAAGACTTGACATCAGTTTTGTAGTGAATAAACTTTGTCAGTTCCTCCATGCCCCTACCACGGTCCATATGATGGCAGTTAAGCGACTTCTGAGA
TACCTCAAAGGTACATCCTCCATGGGCATTTTGATTTCCCCTTGTTCTTCTTTTGCTATCACCTGTTATACAGATGCCGACTGGGCAAGCTGTCCAGACGACCGAAGGAG
TACAAGCGGTTACTGCACGTTTTTGGGATCAAATTTGGTTTCATGGAATTCAGCTAAGCAGAAGGTAGTCTCAAGGTCAAGCGTTGAGTCCGAGTATCGGGGTATGTCGA
ATGCAGCAGCTGAAATAGCTTGGGTCCAATCCGTTCTTGCTGAAATTGGTGCTCTCTCTTCTGCTCCTCCATTACTGCTCTGTGATAACATGTCTGCTACCTATATCGCT
GCCAACCCTATTATGCACCATCGCACTAAACACATCGAAGTTGACCACCATTTCATCAGAGAAAAGATAGCTCGAAAGCAACTTCTTGTGCGGTTTGTTCCCTCTGAAGA
CCAAGTAGCTGATATATTCACAAAGAGCTTGCATTCTCCTCGGTTTTTTGGTCTAAGAACCAAGCTCACAGTGGTTTCTCATCTGTGTTCGCTTGCGGGGGATGACATCA
TCAAGTTTTTCGGTTCAAAGTCACATGGAGCTAAAATTAACATAATTGAAGATGTTAGTGGTGTCATAAAGCCAGGAAGGCACGTGTCCAACTCTAACAGTGGTATCAGA
GCCCAAGATCAATCAAGAAACCAAGTTCTTGAGTCAAGAATGGCAGTGGCAAGGTATGACATCGAGAAGTTTGATGGGAAAGGAGATTTTGACCTATGGAAGGCCAAAAT
CAGAGCAGTTCTTGGACAACAAAAGGCAATTCATGCCCTTACCGAACCATCAAAGTTACCTACCACATTATCTGAAGACGACAAAGAGACAATAAACCAGATAGCCTATG
GAGCATTGATCTTGAACCTAAGCAACAATGTCCTACGACAAGAAGTAAAAACTGCATTGAAATATGGTAGGGTCTCAATTACTACAGATGCCATAATTTCAGCCATCAGA
ACAAAGGAATTGGAATTACAATCACTGAAAAAGGAGACTTCTGAAGGGCTTTATGCAAAGGGAAAATCCAAACCAAAAGAGAATAAGAATCAGTCTGAAGATAAAGAGAA
AGGCAAAGTTAGGTGTAACTATTGCCACAAAGAAGGCCATATTAAAAGAGACTGCTACTCTTTGAAAAGAAAGAACCAATATCACAGAAACAAGAAAAATAAACAGCCAG
AAGCCTCGGTTGGAGAGAATACAATTACATACTCGGATTCTCTTGCAACTACAGGAGAGTGCAGTAATCAATCAGCTTGTATGGAGAAGTTCGATTGGGTTATTGACTCA
GGTTGTTCTTTCCACATGACCCCATCCAAGGGTTGGTTCAGCACCTACAAGGAATGGGATGGAGGAATGGTATATATGGGGAACAACCACACTTGCAAGGATGTTGAAAT
GGTTCAATCGGCACTTCTAGTCACAGATCAAGGCCAATCAGAGACTGATTTGTGGCACAAGAGGCTGTCCCATATTAGCTCAAAGGGACTCCAAATGCTGTCCAAACAAG
GAATACTTCCTCAAGGTACGGGAGACAATATGAACTTTTGTGAACATTGTGTAGTTGGCAAGGCAAAAAGACAAAGCTTCACAAAGTCACAACACTCAGCAAAGGGAATT
CTGGAGTATATACATTCTGACCTTTGGGGACCAGCATCTACAAATTCACTAAGTGGTTCGAGGACCGTAAGGCACACTCCACAGCAAAATGGTGTGGCTGAAAGGCTTAA
CAGGACCATAATGGAGCGTGTGTCCCATACATCTATTGAATTCCTAACCCCTGAAGAAAGATGGTTTGGCAAGCCACCAAAGCTACAACATCTCAAGATATTTGGATGTG
TTGGCTTTGTGCATCAAAATCATGGAAAGCTCAAGCCGAGAGCTGAAAAATGCATGTTTTTGGGCTTTACAGAAGGGGTGAAAGGATACAGGTTGTGGCATCCAAAAGAG
AGGCGTTGTATAAACAGTAGGGATGTTACTTTTAGGGAGCAAGACATGTTTATGCTCCAACAAAACCCTGCAGACAGCTCTCAATCTAAGGAAGATACAACAATTGAGGT
GGAGCATAATGTGCATCCTCAAAATTCAGAGACAGACATTGAGAATACAGAAGCTAGAGACAGACAAAGGAGGAATATAGTTCCTCCATTAAGGTTCAGTGAGGCTGACT
GCATATCCTTGGCACTTAGTGTTGCAGATTCCTTAAACATCGAAGAACCTATCAGTTTTGAAGCCGCTGTAAATGGTCCTAACAGTCGAGAATGGATTACAGCCATGAAT
GAGGAAATGAGGTCCTTAGAAGAAAATGGAACATGGACACTCAAGCCTCTACCGAAAGGGTGTAAGCCCATTGCTTCAAAATGGATTTACAAAGTAAAAGAAGGCATAGA
TGGAGTCTTAAAACCTAGATTCAAGGCTAGGTTGGTGGCCAAAGGATACACACAGAGAGAAGGAATTGATTATTCAGAAGTATTCTCTCCAGTGGTGAAGCTAACTTCTA
TAAGGATGCTTCTCTCATTTGTAGTTCAAGATAACCTGGAGCTAGATCAGCTAGATGTAAAGACAACCTTTCTCCATGGTTATCTAACAGAGACAATCTATATGAAGCAG
CCAACCGGGTACATTAAAAGTGAAGGATTGAAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACCGTTCGTAA
Protein sequenceShow/hide protein sequence
MGELSYFLGIQAAKDGKNLHLSQQKYVLDLLKRAGLESCKPISSLMSVGLPLSLSDGSPLDSVTEYRSLVGALQYCTLTRLDISFVVNKLCQFLHAPTTVHMMAVKRLLR
YLKGTSSMGILISPCSSFAITCYTDADWASCPDDRRSTSGYCTFLGSNLVSWNSAKQKVVSRSSVESEYRGMSNAAAEIAWVQSVLAEIGALSSAPPLLLCDNMSATYIA
ANPIMHHRTKHIEVDHHFIREKIARKQLLVRFVPSEDQVADIFTKSLHSPRFFGLRTKLTVVSHLCSLAGDDIIKFFGSKSHGAKINIIEDVSGVIKPGRHVSNSNSGIR
AQDQSRNQVLESRMAVARYDIEKFDGKGDFDLWKAKIRAVLGQQKAIHALTEPSKLPTTLSEDDKETINQIAYGALILNLSNNVLRQEVKTALKYGRVSITTDAIISAIR
TKELELQSLKKETSEGLYAKGKSKPKENKNQSEDKEKGKVRCNYCHKEGHIKRDCYSLKRKNQYHRNKKNKQPEASVGENTITYSDSLATTGECSNQSACMEKFDWVIDS
GCSFHMTPSKGWFSTYKEWDGGMVYMGNNHTCKDVEMVQSALLVTDQGQSETDLWHKRLSHISSKGLQMLSKQGILPQGTGDNMNFCEHCVVGKAKRQSFTKSQHSAKGI
LEYIHSDLWGPASTNSLSGSRTVRHTPQQNGVAERLNRTIMERVSHTSIEFLTPEERWFGKPPKLQHLKIFGCVGFVHQNHGKLKPRAEKCMFLGFTEGVKGYRLWHPKE
RRCINSRDVTFREQDMFMLQQNPADSSQSKEDTTIEVEHNVHPQNSETDIENTEARDRQRRNIVPPLRFSEADCISLALSVADSLNIEEPISFEAAVNGPNSREWITAMN
EEMRSLEENGTWTLKPLPKGCKPIASKWIYKVKEGIDGVLKPRFKARLVAKGYTQREGIDYSEVFSPVVKLTSIRMLLSFVVQDNLELDQLDVKTTFLHGYLTETIYMKQ
PTGYIKSEGLKFPIPLQRKQLDRS