; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026532 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026532
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:38537172..38539546
RNA-Seq ExpressionLag0026532
SyntenyLag0026532
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]1.0e-8534.51Show/hide
Query:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD
        +KKI+AIL Q    K I D + LP  +T+ EK +M E AY T++L LS+ VLR V +  T  ++W KL  LYLTK LPNK Y++E+FF YKMD SK + +
Subjt:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD

Query:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK
        NLD+F+K+  +   IG+K+ DEN+A ILLNSL + YREVK A+KYG + +T   ++ A++T  LE+   KKE  + E L ++G+S++     K +    K
Subjt:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK

Query:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------
        +     +C  CHK+GH KKNC         KS++   +EA+V  G NS   +D   +++   +  E L  +  D+                         
Subjt:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------

Query:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV
                            G      HD + R   NVR+VP LKRNLISLG LD  GC  +   G  +V K S   L G   +GLYV++   +  +A +
Subjt:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV

Query:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------
         +        LWHKR++H+S +GLQ L++QG+L      EL FCEHC++G                                                  
Subjt:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------

Query:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET
                                                      F++FCK  GI RH TV YTPQQNG+AER NRTIMER RCLL++A LP KFW E 
Subjt:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET

Query:  SSYTVYTLNRCPHSSINLLTPEE
        +    Y +NR P +++NL TP+E
Subjt:  SSYTVYTLNRCPHSSINLLTPEE

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]2.8e-8634.67Show/hide
Query:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD
        +KKI+AIL Q    K I D + LP  +T+ EK +M E AY T++L LS+ VLR V +  T  ++W KL  LYLTK L NK Y++E+FF YKMD SKS+ +
Subjt:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD

Query:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK
        NLD+F+K+  +   IG+K+ DEN+A ILLNSL + YREVK A+KYGR+ +T   ++ A++T  LE+   KKE  + E L ++G+S++     K +    K
Subjt:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK

Query:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------
        +     +C  CHK+GH KKNC         KS++   +EA+V  G NS   +D   +++   +  E L  +  D+                         
Subjt:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------

Query:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV
                            G      HD + R   NVR+VP LKRNLISLG LD  GC  +   G  +V K S   L G   +GLYV++   +  +A +
Subjt:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV

Query:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------
         +        LWHKR++H+S +GLQ L++QG+L      EL FCEHC++G                                                  
Subjt:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------

Query:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET
                                                      F++FCK  GI RH TV YTPQQNG+AER NRTIMER RCLL++A LP KFW E 
Subjt:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET

Query:  SSYTVYTLNRCPHSSINLLTPEE
        +    Y +NR P +++NL TP+E
Subjt:  SSYTVYTLNRCPHSSINLLTPEE

KAA0054988.1 hypothetical protein E6C27_scaffold43052G001360 [Cucumis melo var. makuwa]1.0e-8036.98Show/hide
Query:  KKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTDN
        K+I AILG Q ALKA +DPKELP T+T+ E++ ++E AY TL++N+++NVLRQV+++  A+                                       
Subjt:  KKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTDN

Query:  LDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDKA
         ++FKKL++ F   G+KL  E+EA IL+NS+ D Y+EVK+ALKYGRE IT + +++A+++ ELEL+T+ K  + +E LF KGK+      NKNQ      
Subjt:  LDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDKA

Query:  GKSKIRCNFCHKDGHLKKNC---------------------FFLKRKQNQKSKKGKPAE-----ASVGENSLTYSDALATSDRSSKQNESLGET-RLDLG
         K  ++C  CHK GH K+NC                      F + +  Q+  + +  E       VG  +  Y++ L T+++ + + ++  E   LD G
Subjt:  GKSKIRCNFCHKDGHLKKNC---------------------FFLKRKQNQKSKKGKPAE-----ASVGENSLTYSDALATSDRSSKQNESLGET-RLDLG

Query:  LWMFFPHDSIKRNVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGL
                    + RHVP LKRNLISLGMLD +GC      G  +V +  + +L   K+  LY +K+    + AL+   +   E +LWH+R+SHIS KGL
Subjt:  LWMFFPHDSIKRNVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGL

Query:  QVLAKQGILPQGVCDELKFCEHCVLGFDKFCK----EH----------------------GILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPE
          L K G++       L FCEHC+ G  K  K    EH                      G  RH+TV YTPQQNGVAER+NRT+MERVRC++S+A + E
Subjt:  QVLAKQGILPQGVCDELKFCEHCVLGFDKFCK----EH----------------------GILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPE

Query:  KFWAETSSYTVYTLNRCPHSSINLLTPEER
         FWAE  +   YT+ R    SI++ TPEER
Subjt:  KFWAETSSYTVYTLNRCPHSSINLLTPEER

PKU72844.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]1.6e-8133.74Show/hide
Query:  KIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTDNL
        K++AIL QQ   KA+    ELP+TM+  EK ++Q+ A+ +++L L++ VLR+V    T  ++W KL ELY  K LPN+ YL+E+FF YKMD +KS+ DNL
Subjt:  KIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTDNL

Query:  DDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNG------KDNKNQG
        D+F KL  + + +  K+EDE++A ILLNSL  + R  K  LKYGRE IT D++ +A+ +  L+++  +K  S  EGL  +G+S++ G      K      
Subjt:  DDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNG------KDNKNQG

Query:  DDDKAGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASVGENSLTYSDALATSDRSSKQNESLGETRL-DLGLWMFFPHDSIKRNVRHVPTLK
           K     ++C  C+K GH+++ C     ++N K K     +A++   +   +D L  SD     N++     +  + + M   H  I ++VRHVP LK
Subjt:  DDDKAGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASVGENSLTYSDALATSDRSSKQNESLGETRL-DLGLWMFFPHDSIKRNVRHVPTLK

Query:  RNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCE
        RNLISLG LD  G  +R   G   + K +  ++ G+K NGLYV++ A ++    V    +L +  LWH+R+ H+S++GL  L KQG+       ++ FCE
Subjt:  RNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCE

Query:  HCVLG-----------------------------------------------------------------------------------------------
         C++G                                                                                               
Subjt:  HCVLG-----------------------------------------------------------------------------------------------

Query:  -FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEE
         F+KFC + GI+RHKTV +TPQQNG+AER+NRT+++RVRCLL  + L + FW E  S   Y +NR P S+IN  TP+E
Subjt:  -FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEE

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.9e-8734.83Show/hide
Query:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD
        +KKI+AIL Q    K I D + LP  +T+ EK +M E AY T++L LS+ VLR V +  T  ++W KL  LYLTK LPNK Y++E+FF YKMD SKS+ +
Subjt:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD

Query:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK
        NLD+F+K+  +   IG+K+ DEN+A ILLNSL + YREVK A+KYGR+ +T   ++ A++T  LE+   KKE  + E L ++G+S++     K +    K
Subjt:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK

Query:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------
        +     +C  CHK+GH KKNC         KS++   +EA+V  G NS   +D   +++   +  E L  +  D+                         
Subjt:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------

Query:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV
                            G      HD + R   NVR+VP LKRNLISLG LD  GC  +   G  +V K S   L G   +GLYV++   +  +A +
Subjt:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV

Query:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------
         +        LWHKR++H+S +GLQ L++QG+L      EL FCEHC++G                                                  
Subjt:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------

Query:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET
                                                      F++FCK  GI RH TV YTPQQNG+AER NRTIMER RCLL++A LP KFW E 
Subjt:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET

Query:  SSYTVYTLNRCPHSSINLLTPEE
        +    Y +NR P +++NL TP+E
Subjt:  SSYTVYTLNRCPHSSINLLTPEE

TrEMBL top hitse value%identityAlignment
A0A2I0WB13 Retrovirus-related Pol polyprotein from transposon TNT 1-947.6e-8233.74Show/hide
Query:  KIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTDNL
        K++AIL QQ   KA+    ELP+TM+  EK ++Q+ A+ +++L L++ VLR+V    T  ++W KL ELY  K LPN+ YL+E+FF YKMD +KS+ DNL
Subjt:  KIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTDNL

Query:  DDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNG------KDNKNQG
        D+F KL  + + +  K+EDE++A ILLNSL  + R  K  LKYGRE IT D++ +A+ +  L+++  +K  S  EGL  +G+S++ G      K      
Subjt:  DDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNG------KDNKNQG

Query:  DDDKAGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASVGENSLTYSDALATSDRSSKQNESLGETRL-DLGLWMFFPHDSIKRNVRHVPTLK
           K     ++C  C+K GH+++ C     ++N K K     +A++   +   +D L  SD     N++     +  + + M   H  I ++VRHVP LK
Subjt:  DDDKAGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASVGENSLTYSDALATSDRSSKQNESLGETRL-DLGLWMFFPHDSIKRNVRHVPTLK

Query:  RNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCE
        RNLISLG LD  G  +R   G   + K +  ++ G+K NGLYV++ A ++    V    +L +  LWH+R+ H+S++GL  L KQG+       ++ FCE
Subjt:  RNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCE

Query:  HCVLG-----------------------------------------------------------------------------------------------
         C++G                                                                                               
Subjt:  HCVLG-----------------------------------------------------------------------------------------------

Query:  -FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEE
         F+KFC + GI+RHKTV +TPQQNG+AER+NRT+++RVRCLL  + L + FW E  S   Y +NR P S+IN  TP+E
Subjt:  -FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEE

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class5.1e-8634.51Show/hide
Query:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD
        +KKI+AIL Q    K I D + LP  +T+ EK +M E AY T++L LS+ VLR V +  T  ++W KL  LYLTK LPNK Y++E+FF YKMD SK + +
Subjt:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD

Query:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK
        NLD+F+K+  +   IG+K+ DEN+A ILLNSL + YREVK A+KYG + +T   ++ A++T  LE+   KKE  + E L ++G+S++     K +    K
Subjt:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK

Query:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------
        +     +C  CHK+GH KKNC         KS++   +EA+V  G NS   +D   +++   +  E L  +  D+                         
Subjt:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------

Query:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV
                            G      HD + R   NVR+VP LKRNLISLG LD  GC  +   G  +V K S   L G   +GLYV++   +  +A +
Subjt:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV

Query:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------
         +        LWHKR++H+S +GLQ L++QG+L      EL FCEHC++G                                                  
Subjt:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------

Query:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET
                                                      F++FCK  GI RH TV YTPQQNG+AER NRTIMER RCLL++A LP KFW E 
Subjt:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET

Query:  SSYTVYTLNRCPHSSINLLTPEE
        +    Y +NR P +++NL TP+E
Subjt:  SSYTVYTLNRCPHSSINLLTPEE

A0A5A7UB25 Putative gag-pol polyprotein1.3e-8634.67Show/hide
Query:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD
        +KKI+AIL Q    K I D + LP  +T+ EK +M E AY T++L LS+ VLR V +  T  ++W KL  LYLTK L NK Y++E+FF YKMD SKS+ +
Subjt:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD

Query:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK
        NLD+F+K+  +   IG+K+ DEN+A ILLNSL + YREVK A+KYGR+ +T   ++ A++T  LE+   KKE  + E L ++G+S++     K +    K
Subjt:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK

Query:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------
        +     +C  CHK+GH KKNC         KS++   +EA+V  G NS   +D   +++   +  E L  +  D+                         
Subjt:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------

Query:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV
                            G      HD + R   NVR+VP LKRNLISLG LD  GC  +   G  +V K S   L G   +GLYV++   +  +A +
Subjt:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV

Query:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------
         +        LWHKR++H+S +GLQ L++QG+L      EL FCEHC++G                                                  
Subjt:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------

Query:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET
                                                      F++FCK  GI RH TV YTPQQNG+AER NRTIMER RCLL++A LP KFW E 
Subjt:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET

Query:  SSYTVYTLNRCPHSSINLLTPEE
        +    Y +NR P +++NL TP+E
Subjt:  SSYTVYTLNRCPHSSINLLTPEE

A0A5A7UJ23 Integrase catalytic domain-containing protein4.9e-8136.98Show/hide
Query:  KKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTDN
        K+I AILG Q ALKA +DPKELP T+T+ E++ ++E AY TL++N+++NVLRQV+++  A+                                       
Subjt:  KKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTDN

Query:  LDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDKA
         ++FKKL++ F   G+KL  E+EA IL+NS+ D Y+EVK+ALKYGRE IT + +++A+++ ELEL+T+ K  + +E LF KGK+      NKNQ      
Subjt:  LDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDKA

Query:  GKSKIRCNFCHKDGHLKKNC---------------------FFLKRKQNQKSKKGKPAE-----ASVGENSLTYSDALATSDRSSKQNESLGET-RLDLG
         K  ++C  CHK GH K+NC                      F + +  Q+  + +  E       VG  +  Y++ L T+++ + + ++  E   LD G
Subjt:  GKSKIRCNFCHKDGHLKKNC---------------------FFLKRKQNQKSKKGKPAE-----ASVGENSLTYSDALATSDRSSKQNESLGET-RLDLG

Query:  LWMFFPHDSIKRNVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGL
                    + RHVP LKRNLISLGMLD +GC      G  +V +  + +L   K+  LY +K+    + AL+   +   E +LWH+R+SHIS KGL
Subjt:  LWMFFPHDSIKRNVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGL

Query:  QVLAKQGILPQGVCDELKFCEHCVLGFDKFCK----EH----------------------GILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPE
          L K G++       L FCEHC+ G  K  K    EH                      G  RH+TV YTPQQNGVAER+NRT+MERVRC++S+A + E
Subjt:  QVLAKQGILPQGVCDELKFCEHCVLGFDKFCK----EH----------------------GILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPE

Query:  KFWAETSSYTVYTLNRCPHSSINLLTPEER
         FWAE  +   YT+ R    SI++ TPEER
Subjt:  KFWAETSSYTVYTLNRCPHSSINLLTPEER

A0A5D3DNU1 Putative gag-pol polyprotein9.2e-8834.83Show/hide
Query:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD
        +KKI+AIL Q    K I D + LP  +T+ EK +M E AY T++L LS+ VLR V +  T  ++W KL  LYLTK LPNK Y++E+FF YKMD SKS+ +
Subjt:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD

Query:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK
        NLD+F+K+  +   IG+K+ DEN+A ILLNSL + YREVK A+KYGR+ +T   ++ A++T  LE+   KKE  + E L ++G+S++     K +    K
Subjt:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK

Query:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------
        +     +C  CHK+GH KKNC         KS++   +EA+V  G NS   +D   +++   +  E L  +  D+                         
Subjt:  AGKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASV--GENSLTYSDALATSDRSSKQNESLGETRLDL-------------------------

Query:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV
                            G      HD + R   NVR+VP LKRNLISLG LD  GC  +   G  +V K S   L G   +GLYV++   +  +A +
Subjt:  --------------------GLWMFFPHDSIKR---NVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALV

Query:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------
         +        LWHKR++H+S +GLQ L++QG+L      EL FCEHC++G                                                  
Subjt:  VTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG--------------------------------------------------

Query:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET
                                                      F++FCK  GI RH TV YTPQQNG+AER NRTIMER RCLL++A LP KFW E 
Subjt:  ----------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAET

Query:  SSYTVYTLNRCPHSSINLLTPEE
        +    Y +NR P +++NL TP+E
Subjt:  SSYTVYTLNRCPHSSINLLTPEE

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.9e-1921.36Show/hide
Query:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD
        K +I+A+L +Q+ LK +     L      D     +  A  T++  LS++ L       TA +I   L  +Y  K L ++  LR+R  + K+    S+  
Subjt:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD

Query:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALK-YGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDN--KNQGD
        +   F +L SE    G K+E+ ++   LL +L   Y  +  A++    E +T   + + +   E++++    + S          +    K+N  KN+  
Subjt:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALK-YGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDN--KNQGD

Query:  DDKA-----GKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASVGENSLTYSDALATSDRSSKQNESLGETRLDLGLWMFFPHDS--IKRNVRH
          K       K K++C+ C ++GH+KK+CF  KR  N K+K+ +    +   + + +   +     ++   ++ G   LD G      +D      +V  
Subjt:  DDKA-----GKSKIRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASVGENSLTYSDALATSDRSSKQNESLGETRLDLGLWMFFPHDS--IKRNVRH

Query:  VPTLK--------------------RNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVL--------VGVKMNGLYVIKDAEMIQTALVVTNDSLT-----
        VP LK                    RN   + + D + C+    G    V +  +  +        V +  NGL V+K++ M+    V+   + +     
Subjt:  VPTLK--------------------RNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVL--------VGVKMNGLYVIKDAEMIQTALVVTNDSLT-----

Query:  --EGDLWHKRISHISN-KGLQVLAKQGILPQGVCDEL----KFCEHCVLG--------------------------------------------------
             LWH+R  HIS+ K L++  K     Q + + L    + CE C+ G                                                  
Subjt:  --EGDLWHKRISHISN-KGLQVLAKQGILPQGVCDEL----KFCEHCVLG--------------------------------------------------

Query:  ------------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWA
                                                          +FC + GI  H TV +TPQ NGV+ER+ RTI E+ R ++S A L + FW 
Subjt:  ------------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWA

Query:  ETSSYTVYTLNRCPHSSI
        E      Y +NR P  ++
Subjt:  ETSSYTVYTLNRCPHSSI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-4825.48Show/hide
Query:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD
        +++++ +L QQ   K +    + P TM  ++  ++ E A   + L+LS++V+  ++D+DTA  IWT+L  LY++K L NK YL+++ +   M    +   
Subjt:  KKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPSKSMTD

Query:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK
        +L+ F  L ++   +G K+E+E++A +LLNSL  +Y  +   + +G+  I   D+ SA+   E   + +KK  +  + L ++G+ +   + + N G    
Subjt:  NLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDK

Query:  AGKSKIR-------CNFCHKDGHLKKNCFFLKRKQNQKS-KKGKPAEASVGENS----LTYSDALATSDRSSKQNE------------------------
         GKSK R       C  C++ GH K++C   ++ + + S +K     A++ +N+    L  ++       S  ++E                        
Subjt:  AGKSKIR-------CNFCHKDGHLKKNCFFLKRKQNQKS-KKGKPAEASVGENS----LTYSDALATSDRSSKQNE------------------------

Query:  -----SLGETRL-------DLGLWMFFPHDSIKRNVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVT
              +G T         D+ +        + ++VRHVP L+ NLIS   LD  G E       + + K S  +  GV    LY   +AE+ Q  L   
Subjt:  -----SLGETRL-------DLGLWMFFPHDSIKRNVRHVPTLKRNLISLGMLDSIGCEYRGFGGTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVT

Query:  NDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG----------------------------------------------------
         D ++  DLWHKR+ H+S KGLQ+LAK+ ++       +K C++C+ G                                                    
Subjt:  NDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG----------------------------------------------------

Query:  --------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSS
                                                    F+++C  HGI   KTV  TPQ NGVAER+NRTI+E+VR +L  A LP+ FW E   
Subjt:  --------------------------------------------FDKFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSS

Query:  YTVYTLNRCPHSSINLLTPE
           Y +NR P   +    PE
Subjt:  YTVYTLNRCPHSSINLLTPE

P93293 Uncharacterized mitochondrial protein AtMg003006.3e-0934.88Show/hide
Query:  GTFEVMKDSKTVLVGVKMNGLYVIK-DAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG
        G  +V+K  +T+L G + + LY+++   E  ++ L  T  +  E  LWH R++H+S +G+++L K+G L       LKFCE C+ G
Subjt:  GTFEVMKDSKTVLVGVKMNGLYVIK-DAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-0632.89Show/hide
Query:  KFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEER
        ++  +HGI    +  +TP+ NG++ER +R I+E    LLS A +P+ +W    +  VY +NR P   + L +P ++
Subjt:  KFCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEER

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.7e-0734.67Show/hide
Query:  FCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEER
        +  +HGI    +  +TP+ NG++ER +R I+E    LLS A +P+ +W    S  VY +NR P   + L +P ++
Subjt:  FCKEHGILRHKTVRYTPQQNGVAERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEER

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein4.5e-1034.88Show/hide
Query:  GTFEVMKDSKTVLVGVKMNGLYVIK-DAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG
        G  +V+K  +T+L G + + LY+++   E  ++ L  T  +  E  LWH R++H+S +G+++L K+G L       LKFCE C+ G
Subjt:  GTFEVMKDSKTVLVGVKMNGLYVIK-DAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.3e-0438.78Show/hide
Query:  LNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEE
        +NRTI+E+VR +L +  LP+ F A+ ++  V+ +N+ P ++IN   P+E
Subjt:  LNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATTGAGAAATTTGATGGCAAGGGAGATTTTGATCTGTGGAAAGAAAAAGATCAAAGCCATTCTTGGACAACAAAATGCGTTAAAGGCTATTCAAGACCCAAAAGA
ACTCCCAACCACGATGACTCAAGACGAAAAGGACAATATGCAAGAAGCGGCATATGGAACATTGGTTTTGAACCTTAGCAACAATGTTCTTAGACAAGTGTTGGATCAAG
ACACTGCCTACAAGATTTGGACGAAGCTACATGAGTTATATCTCACTAAAGATCTTCCCAACAAAGCATATTTGAGGGAAAGATTTTTTACATACAAGATGGATCCAAGT
AAGTCTATGACTGATAATCTTGATGATTTTAAGAAGCTTTCGTCTGAATTCAAGACTATTGGAGATAAGCTTGAAGATGAAAATGAGGCTTTCATCTTATTGAATTCACT
ACTAGACAATTACAGAGAAGTTAAAGTAGCCTTGAAGTATGGAAGAGAGAAAATAACCACCGATGACATTGTATCTGCAGTTCGAACAGGAGAGTTGGAACTTCAAACTC
AAAAGAAAGAGATTTCTAATTCTGAGGGTCTCTTTTCTAAGGGGAAAAGCAAACAAAACGGGAAAGACAACAAGAACCAAGGTGATGATGACAAAGCTGGGAAGTCGAAA
ATACGGTGTAATTTCTGTCATAAAGATGGACATCTCAAGAAAAACTGCTTCTTTCTTAAAAGAAAGCAGAATCAGAAGAGCAAGAAGGGTAAACCAGCTGAAGCATCCGT
GGGAGAGAATTCTCTCACTTATTCTGATGCCTTAGCAACTTCTGATCGGTCATCTAAGCAAAATGAAAGCCTTGGAGAAACAAGATTGGATCTTGGATTATGGATGTTCT
TTCCACATGACTCCATCAAAAGAAATGTGAGACATGTTCCTACGTTAAAAAGGAACTTGATATCTTTGGGAATGTTGGATTCCATTGGCTGCGAATACAGAGGATTTGGA
GGAACTTTTGAAGTCATGAAAGATTCCAAAACGGTGCTGGTTGGTGTGAAAATGAATGGTCTTTATGTAATTAAAGACGCAGAGATGATTCAAACAGCCTTGGTAGTCAC
AAATGACAGTTTGACGGAAGGAGATTTATGGCACAAACGTATCTCTCATATAAGCAACAAAGGGCTACAAGTTCTTGCCAAGCAAGGAATTTTACCTCAAGGGGTGTGTG
ATGAATTGAAATTCTGTGAACATTGCGTTCTCGGATTTGATAAATTTTGTAAAGAACATGGAATATTGCGTCACAAAACTGTTAGATACACTCCTCAACAGAATGGGGTG
GCAGAAAGACTCAATCGAACCATCATGGAAAGGGTAAGATGTTTACTTTCTGATGCAATGCTTCCTGAGAAATTCTGGGCAGAAACATCTTCATATACAGTGTATACGTT
GAACAGATGTCCTCATTCCTCTATCAATCTGTTAACTCCAGAAGAAAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGGATACGACCCGCTTTGTATATTGATA
CAAACGTAGTGATCCAACGCGTTCATGTGGTAGACATGCGAGTGGGGGTATCCTGTGCAATGAGTTTGCACAAAGACCGGACCGCGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGACATTGAGAAATTTGATGGCAAGGGAGATTTTGATCTGTGGAAAGAAAAAGATCAAAGCCATTCTTGGACAACAAAATGCGTTAAAGGCTATTCAAGACCCAAAAGA
ACTCCCAACCACGATGACTCAAGACGAAAAGGACAATATGCAAGAAGCGGCATATGGAACATTGGTTTTGAACCTTAGCAACAATGTTCTTAGACAAGTGTTGGATCAAG
ACACTGCCTACAAGATTTGGACGAAGCTACATGAGTTATATCTCACTAAAGATCTTCCCAACAAAGCATATTTGAGGGAAAGATTTTTTACATACAAGATGGATCCAAGT
AAGTCTATGACTGATAATCTTGATGATTTTAAGAAGCTTTCGTCTGAATTCAAGACTATTGGAGATAAGCTTGAAGATGAAAATGAGGCTTTCATCTTATTGAATTCACT
ACTAGACAATTACAGAGAAGTTAAAGTAGCCTTGAAGTATGGAAGAGAGAAAATAACCACCGATGACATTGTATCTGCAGTTCGAACAGGAGAGTTGGAACTTCAAACTC
AAAAGAAAGAGATTTCTAATTCTGAGGGTCTCTTTTCTAAGGGGAAAAGCAAACAAAACGGGAAAGACAACAAGAACCAAGGTGATGATGACAAAGCTGGGAAGTCGAAA
ATACGGTGTAATTTCTGTCATAAAGATGGACATCTCAAGAAAAACTGCTTCTTTCTTAAAAGAAAGCAGAATCAGAAGAGCAAGAAGGGTAAACCAGCTGAAGCATCCGT
GGGAGAGAATTCTCTCACTTATTCTGATGCCTTAGCAACTTCTGATCGGTCATCTAAGCAAAATGAAAGCCTTGGAGAAACAAGATTGGATCTTGGATTATGGATGTTCT
TTCCACATGACTCCATCAAAAGAAATGTGAGACATGTTCCTACGTTAAAAAGGAACTTGATATCTTTGGGAATGTTGGATTCCATTGGCTGCGAATACAGAGGATTTGGA
GGAACTTTTGAAGTCATGAAAGATTCCAAAACGGTGCTGGTTGGTGTGAAAATGAATGGTCTTTATGTAATTAAAGACGCAGAGATGATTCAAACAGCCTTGGTAGTCAC
AAATGACAGTTTGACGGAAGGAGATTTATGGCACAAACGTATCTCTCATATAAGCAACAAAGGGCTACAAGTTCTTGCCAAGCAAGGAATTTTACCTCAAGGGGTGTGTG
ATGAATTGAAATTCTGTGAACATTGCGTTCTCGGATTTGATAAATTTTGTAAAGAACATGGAATATTGCGTCACAAAACTGTTAGATACACTCCTCAACAGAATGGGGTG
GCAGAAAGACTCAATCGAACCATCATGGAAAGGGTAAGATGTTTACTTTCTGATGCAATGCTTCCTGAGAAATTCTGGGCAGAAACATCTTCATATACAGTGTATACGTT
GAACAGATGTCCTCATTCCTCTATCAATCTGTTAACTCCAGAAGAAAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGGATACGACCCGCTTTGTATATTGATA
CAAACGTAGTGATCCAACGCGTTCATGTGGTAGACATGCGAGTGGGGGTATCCTGTGCAATGAGTTTGCACAAAGACCGGACCGCGAAATAG
Protein sequenceShow/hide protein sequence
MTLRNLMAREILICGKKKIKAILGQQNALKAIQDPKELPTTMTQDEKDNMQEAAYGTLVLNLSNNVLRQVLDQDTAYKIWTKLHELYLTKDLPNKAYLRERFFTYKMDPS
KSMTDNLDDFKKLSSEFKTIGDKLEDENEAFILLNSLLDNYREVKVALKYGREKITTDDIVSAVRTGELELQTQKKEISNSEGLFSKGKSKQNGKDNKNQGDDDKAGKSK
IRCNFCHKDGHLKKNCFFLKRKQNQKSKKGKPAEASVGENSLTYSDALATSDRSSKQNESLGETRLDLGLWMFFPHDSIKRNVRHVPTLKRNLISLGMLDSIGCEYRGFG
GTFEVMKDSKTVLVGVKMNGLYVIKDAEMIQTALVVTNDSLTEGDLWHKRISHISNKGLQVLAKQGILPQGVCDELKFCEHCVLGFDKFCKEHGILRHKTVRYTPQQNGV
AERLNRTIMERVRCLLSDAMLPEKFWAETSSYTVYTLNRCPHSSINLLTPEERDKAGYLILVTLWIRPALYIDTNVVIQRVHVVDMRVGVSCAMSLHKDRTAK