; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0026882 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0026882
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Genome locationchr12:4154446..4164439
RNA-Seq ExpressionIVF0026882
SyntenyIVF0026882
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037397.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.63e-18845.44Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS
        MKASKL +QG+WGILASVVDTREPEV LS E +VR +PD F +EL   PP +EIDF I+LEPGT PISR  YRMA   L ELKVQL+KL DKGFIR SVS
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS

Query:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------
        PWGAP+LF KKKDG+MRL IDYRELNKVTVKNRYPL RID+LFDQLQGATV SKIDLR+ YHQLRIRD+ IPK AFR  YGHYEFI++            
Subjt:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------

Query:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKND-------------DAAVLKLQPQW------SQLSNFF
             VFKDFLD+FVIVFIDDIL+YSKTE +HEEHLHQVLETL+AN+LYAKFSKC FWLNK               +   ++    W      S++ +F 
Subjt:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKND-------------DAAVLKLQPQW------SQLSNFF

Query:  ELH--------------------------------CEALWSLINQQFHPNSEHCAADDENALKEFVPSSSSS--CHERKQGQKALDVDNGQIRKGMILEW
         L                                 CE+ +  + Q+          D   +   +  +S     C   +QG KA  V +   RK      
Subjt:  ELH--------------------------------CEALWSLINQQFHPNSEHCAADDENALKEFVPSSSSS--CHERKQGQKALDVDNGQIRKGMILEW

Query:  FNTKSA-----YRQEEID--------------------------EIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSP
          TK A     + + EI                           ++   +   K RL E GQ E+FS SS+DGL FE +L V   + VK ELLT+AHSSP
Subjt:  FNTKSA-----YRQEEID--------------------------EIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSP

Query:  FSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQV------------------------------------------------------------
        F+MH  STK+YQ+L+ VYWW+NMKREV DFVSRCLV QQV                                                            
Subjt:  FSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQV------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------GKRRMLGPELVQTTNKAIQ
                                                                                         G++RMLG ELVQT N AIQ
Subjt:  ---------------------------------------------------------------------------------GKRRMLGPELVQTTNKAIQ

Query:  KIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKA----------KSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVK
         IRARMLTAQS+QKSY+D RCKDLEF +GDMVFLKVAP KGVLRFEKKGK           +  KY+A+  H+VDFEPLQI+ENLSYEEQ V+IL REVK
Subjt:  KIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKA----------KSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVK

Query:  LLRNRGISLVKVLWRNHKADEAT
         LR R ISLVKVLWRNH  +EAT
Subjt:  LLRNRGISLVKVLWRNHKADEAT

KAA0056751.1 Retrotransposon protein [Cucumis melo var. makuwa]1.18e-19591.88Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMALAELKVQLQKLQDKGFIRSSVSPWG
        MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMALAELKVQLQKLQDKGFIRSSVSPWG
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMALAELKVQLQKLQDKGFIRSSVSPWG

Query:  APILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVFKDFLDTFVIVFID
        APILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQ                         VFKDFLDTFVIVFID
Subjt:  APILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVFKDFLDTFVIVFID

Query:  DILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNSEHCAADDENALKEFVPSSSS
        DILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNSEHCAADDENALKEFVPSSSS
Subjt:  DILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNSEHCAADDENALKEFVPSSSS

Query:  SCHERKQG
        SCHERKQG
Subjt:  SCHERKQG

KAA0061127.1 pol protein [Cucumis melo var. makuwa]4.08e-19544.69Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS
        MKASKL +QG+WGILASVVDTR PEV LS E +V+ +PD F +EL   PP +EIDFAI+LEP T PISRAPYRMA   L E+KVQLQ+L DKGFIR SVS
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS

Query:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------
        PWGAP+LF KKKD +MRL IDYRELNKVTVKNRYPL RID+LFDQLQGATV SKIDLR+SYHQLRIRD+ IPK AFR  YGHYEFI+M            
Subjt:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------

Query:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNS
             VFKDFLD+F IVFIDDIL+YSK E EHEEHL QVLETLRAN+LYAKF+KC FWL K   + V K++        +    CE+ +  + Q+     
Subjt:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNS

Query:  EHCAAD--------------------------------------------DENALKEFVPSSSSSCHERKQGQKALDVDNGQI----RKGMILEWFNTKS
             D                                            D  + K F      +  +R+  +   D D   +    +  ++ +  + K 
Subjt:  EHCAAD--------------------------------------------DENALKEFVPSSSSSCHERKQGQKALDVDNGQI----RKGMILEWFNTKS

Query:  AY----------------RQE---EIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRV
        A+                R E    + E+   +   KR L E GQ+E+FS SS+DGL FE +LYV   + VK ELLT+AHSSPF+MH GST MYQ+L+ V
Subjt:  AY----------------RQE---EIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRV

Query:  YWWQNMKREVADFVSRCLVYQQV-----------------------------------------------------------------------------
        YWW+NMKREVADFV+RCLV QQV                                                                             
Subjt:  YWWQNMKREVADFVSRCLVYQQV-----------------------------------------------------------------------------

Query:  --------------------------------------------------GKRRMLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDM
                                                          G++RMLGP+LVQTTN AIQKIRAR+LT Q++QKSY+D R KDLEF +GD 
Subjt:  --------------------------------------------------GKRRMLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDM

Query:  VFLKVAPTKGVLRFEKKGKAKST-------------------------------------KYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRN
        VFLKVAP KGVLRFEKKGK                                         +Y+A+P H+VDFEPLQI+ENLSYEEQ VEIL REVK LRN
Subjt:  VFLKVAPTKGVLRFEKKGKAKST-------------------------------------KYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRN

Query:  RGISLVKVLWRNHKADEAT
        R ISLVKVLW+NH  +EAT
Subjt:  RGISLVKVLWRNHKADEAT

TYJ97514.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.63e-18845.44Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS
        MKASKL +QG+WGILASVVDTREPEV LS E +VR +PD F +EL   PP +EIDF I+LEPGT PISR  YRMA   L ELKVQL+KL DKGFIR SVS
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS

Query:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------
        PWGAP+LF KKKDG+MRL IDYRELNKVTVKNRYPL RID+LFDQLQGATV SKIDLR+ YHQLRIRD+ IPK AFR  YGHYEFI++            
Subjt:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------

Query:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKND-------------DAAVLKLQPQW------SQLSNFF
             VFKDFLD+FVIVFIDDIL+YSKTE +HEEHLHQVLETL+AN+LYAKFSKC FWLNK               +   ++    W      S++ +F 
Subjt:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKND-------------DAAVLKLQPQW------SQLSNFF

Query:  ELH--------------------------------CEALWSLINQQFHPNSEHCAADDENALKEFVPSSSSS--CHERKQGQKALDVDNGQIRKGMILEW
         L                                 CE+ +  + Q+          D   +   +  +S     C   +QG KA  V +   RK      
Subjt:  ELH--------------------------------CEALWSLINQQFHPNSEHCAADDENALKEFVPSSSSS--CHERKQGQKALDVDNGQIRKGMILEW

Query:  FNTKSA-----YRQEEID--------------------------EIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSP
          TK A     + + EI                           ++   +   K RL E GQ E+FS SS+DGL FE +L V   + VK ELLT+AHSSP
Subjt:  FNTKSA-----YRQEEID--------------------------EIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSP

Query:  FSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQV------------------------------------------------------------
        F+MH  STK+YQ+L+ VYWW+NMKREV DFVSRCLV QQV                                                            
Subjt:  FSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQV------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------GKRRMLGPELVQTTNKAIQ
                                                                                         G++RMLG ELVQT N AIQ
Subjt:  ---------------------------------------------------------------------------------GKRRMLGPELVQTTNKAIQ

Query:  KIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKA----------KSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVK
         IRARMLTAQS+QKSY+D RCKDLEF +GDMVFLKVAP KGVLRFEKKGK           +  KY+A+  H+VDFEPLQI+ENLSYEEQ V+IL REVK
Subjt:  KIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKA----------KSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVK

Query:  LLRNRGISLVKVLWRNHKADEAT
         LR R ISLVKVLWRNH  +EAT
Subjt:  LLRNRGISLVKVLWRNHKADEAT

TYK03797.1 pol protein [Cucumis melo var. makuwa]1.36e-19645.19Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS
        MKASKL +QG+WGILASVVDTR PEV LS E +V+ +PD F +EL   PP +EIDFAI+LEP T PISRAPYRMA   L E+KVQLQ+L DKGFIR SVS
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS

Query:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------
        PWGAP+LF KKKD +MRL IDYRELNKVTVKNRYPL RID+LFDQLQGATV SKIDLR+SYHQLRIRD+ IPK AFR  YGHYEFI+M            
Subjt:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------

Query:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNS
             VFKDFLD+F IVFIDDIL+YSK E EHEEHL QVLETLRAN+LYAKF+KC FWL K   + V K++        +    CE+ +  + Q+     
Subjt:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNS

Query:  EHCAAD--------------------------------------------DENALKEFVPSSSSSCHERKQGQKALDVDNGQI----RKGMILEWFNTKS
             D                                            D  + K F      +  +R+  +   D D   +    +  ++ +  + K 
Subjt:  EHCAAD--------------------------------------------DENALKEFVPSSSSSCHERKQGQKALDVDNGQI----RKGMILEWFNTKS

Query:  AY----------------RQE---EIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRV
        A+                R E    + E+   +   KR L E GQ+E+FS SS+DGL FE +LYV   + VK ELLT+AHSSPF+MH GST MYQ+L+ V
Subjt:  AY----------------RQE---EIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRV

Query:  YWWQNMKREVADFVSRCLVYQQV-----------------------------------------------------------------------------
        YWW+NMKREVADFV+RCLV QQV                                                                             
Subjt:  YWWQNMKREVADFVSRCLVYQQV-----------------------------------------------------------------------------

Query:  -----------------------------------------GKRRMLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTK
                                                 G++RMLGP+LVQTTN AIQKIRAR+LT Q++QKSY+D R KDLEF +GD VFLKVAP K
Subjt:  -----------------------------------------GKRRMLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTK

Query:  GVLRFEKKGKAKST-------------------------------------KYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRNRGISLVKVL
        GVLRFEKKGK                                         +Y+A+P H+VDFEPLQI+ENLSYEEQ VEIL REVK LRNR ISLVKVL
Subjt:  GVLRFEKKGKAKST-------------------------------------KYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRNRGISLVKVL

Query:  WRNHKADEAT
        W+NH  +EAT
Subjt:  WRNHKADEAT

TrEMBL top hitse value%identityAlignment
A0A5A7T758 Ty3-gypsy retrotransposon protein2.1e-15945.44Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS
        MKASKL +QG+WGILASVVDTREPEV LS E +VR +PD F +EL   PP +EIDF I+LEPGT PISR  YRMA   L ELKVQL+KL DKGFIR SVS
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS

Query:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------
        PWGAP+LF KKKDG+MRL IDYRELNKVTVKNRYPL RID+LFDQLQGATV SKIDLR+ YHQLRIRD+ IPK AFR  YGHYEFI++            
Subjt:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------

Query:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKND-------------DAAVLKLQPQW------SQLSNFF
             VFKDFLD+FVIVFIDDIL+YSKTE +HEEHLHQVLETL+AN+LYAKFSKC FWLNK               +   ++    W      S++ +F 
Subjt:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKND-------------DAAVLKLQPQW------SQLSNFF

Query:  EL--------------------------------HCEALWSLINQQFHPNSEHCAADDENALKEFVPSS--SSSCHERKQGQKALDVDNGQIRKGMILEW
         L                                 CE+ +  + Q+          D   +   +  +S     C   +QG KA  V +   RK      
Subjt:  EL--------------------------------HCEALWSLINQQFHPNSEHCAADDENALKEFVPSS--SSSCHERKQGQKALDVDNGQIRKGMILEW

Query:  FNTKSA-----YRQEEI--------------------------DEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSP
          TK A     + + EI                           ++   +   K RL E GQ E+FS SS+DGL FE +L V   + VK ELLT+AHSSP
Subjt:  FNTKSA-----YRQEEI--------------------------DEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSP

Query:  FSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQ-------------------------------------------------------------
        F+MH  STK+YQ+L+ VYWW+NMKREV DFVSRCLV QQ                                                             
Subjt:  FSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQ-------------------------------------------------------------

Query:  --------------------------------------------------------------------------------VGKRRMLGPELVQTTNKAIQ
                                                                                        VG++RMLG ELVQT N AIQ
Subjt:  --------------------------------------------------------------------------------VGKRRMLGPELVQTTNKAIQ

Query:  KIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKA----------KSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVK
         IRARMLTAQS+QKSY+D RCKDLEF +GDMVFLKVAP KGVLRFEKKGK           +  KY+A+  H+VDFEPLQI+ENLSYEEQ V+IL REVK
Subjt:  KIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKA----------KSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVK

Query:  LLRNRGISLVKVLWRNHKADEAT
         LR R ISLVKVLWRNH  +EAT
Subjt:  LLRNRGISLVKVLWRNHKADEAT

A0A5A7UTC6 Retrotransposon protein1.1e-15591.88Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMALAELKVQLQKLQDKGFIRSSVSPWG
        MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMALAELKVQLQKLQDKGFIRSSVSPWG
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMALAELKVQLQKLQDKGFIRSSVSPWG

Query:  APILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVFKDFLDTFVIVFID
        APILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQ                         VFKDFLDTFVIVFID
Subjt:  APILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVFKDFLDTFVIVFID

Query:  DILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNSEHCAADDENALKEFVPSSSS
        DILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNSEHCAADDENALKEFVPSSSS
Subjt:  DILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNSEHCAADDENALKEFVPSSSS

Query:  SCHERKQG
        SCHERKQG
Subjt:  SCHERKQG

A0A5A7V2M6 Pol protein6.6e-16144.69Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS
        MKASKL +QG+WGILASVVDTR PEV LS E +V+ +PD F +EL   PP +EIDFAI+LEP T PISRAPYRMA   L E+KVQLQ+L DKGFIR SVS
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS

Query:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------
        PWGAP+LF KKKD +MRL IDYRELNKVTVKNRYPL RID+LFDQLQGATV SKIDLR+SYHQLRIRD+ IPK AFR  YGHYEFI+M            
Subjt:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------

Query:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNS
             VFKDFLD+F IVFIDDIL+YSK E EHEEHL QVLETLRAN+LYAKF+KC FWL K   + V K++        +    CE+ +  + Q+     
Subjt:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNS

Query:  EHCAAD--------------------------------------------DENALKEFVPSSSSSCHERKQGQKALDVDNGQI----RKGMILEWFNTKS
             D                                            D  + K F      +  +R+  +   D D   +    +  ++ +  + K 
Subjt:  EHCAAD--------------------------------------------DENALKEFVPSSSSSCHERKQGQKALDVDNGQI----RKGMILEWFNTKS

Query:  AY----------------RQE---EIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRV
        A+                R E    + E+   +   KR L E GQ+E+FS SS+DGL FE +LYV   + VK ELLT+AHSSPF+MH GST MYQ+L+ V
Subjt:  AY----------------RQE---EIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRV

Query:  YWWQNMKREVADFVSRCLVYQQ------------------------------------------------------------------------------
        YWW+NMKREVADFV+RCLV QQ                                                                              
Subjt:  YWWQNMKREVADFVSRCLVYQQ------------------------------------------------------------------------------

Query:  -------------------------------------------------VGKRRMLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDM
                                                         VG++RMLGP+LVQTTN AIQKIRAR+LT Q++QKSY+D R KDLEF +GD 
Subjt:  -------------------------------------------------VGKRRMLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDM

Query:  VFLKVAPTKGVLRFEKKGKAKS-------------------------------------TKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRN
        VFLKVAP KGVLRFEKKGK                                         +Y+A+P H+VDFEPLQI+ENLSYEEQ VEIL REVK LRN
Subjt:  VFLKVAPTKGVLRFEKKGKAKS-------------------------------------TKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRN

Query:  RGISLVKVLWRNHKADEAT
        R ISLVKVLW+NH  +EAT
Subjt:  RGISLVKVLWRNHKADEAT

A0A5D3BEN0 Ty3-gypsy retrotransposon protein2.1e-15945.44Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS
        MKASKL +QG+WGILASVVDTREPEV LS E +VR +PD F +EL   PP +EIDF I+LEPGT PISR  YRMA   L ELKVQL+KL DKGFIR SVS
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS

Query:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------
        PWGAP+LF KKKDG+MRL IDYRELNKVTVKNRYPL RID+LFDQLQGATV SKIDLR+ YHQLRIRD+ IPK AFR  YGHYEFI++            
Subjt:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------

Query:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKND-------------DAAVLKLQPQW------SQLSNFF
             VFKDFLD+FVIVFIDDIL+YSKTE +HEEHLHQVLETL+AN+LYAKFSKC FWLNK               +   ++    W      S++ +F 
Subjt:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKND-------------DAAVLKLQPQW------SQLSNFF

Query:  EL--------------------------------HCEALWSLINQQFHPNSEHCAADDENALKEFVPSS--SSSCHERKQGQKALDVDNGQIRKGMILEW
         L                                 CE+ +  + Q+          D   +   +  +S     C   +QG KA  V +   RK      
Subjt:  EL--------------------------------HCEALWSLINQQFHPNSEHCAADDENALKEFVPSS--SSSCHERKQGQKALDVDNGQIRKGMILEW

Query:  FNTKSA-----YRQEEI--------------------------DEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSP
          TK A     + + EI                           ++   +   K RL E GQ E+FS SS+DGL FE +L V   + VK ELLT+AHSSP
Subjt:  FNTKSA-----YRQEEI--------------------------DEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSP

Query:  FSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQ-------------------------------------------------------------
        F+MH  STK+YQ+L+ VYWW+NMKREV DFVSRCLV QQ                                                             
Subjt:  FSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQ-------------------------------------------------------------

Query:  --------------------------------------------------------------------------------VGKRRMLGPELVQTTNKAIQ
                                                                                        VG++RMLG ELVQT N AIQ
Subjt:  --------------------------------------------------------------------------------VGKRRMLGPELVQTTNKAIQ

Query:  KIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKA----------KSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVK
         IRARMLTAQS+QKSY+D RCKDLEF +GDMVFLKVAP KGVLRFEKKGK           +  KY+A+  H+VDFEPLQI+ENLSYEEQ V+IL REVK
Subjt:  KIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKA----------KSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVK

Query:  LLRNRGISLVKVLWRNHKADEAT
         LR R ISLVKVLWRNH  +EAT
Subjt:  LLRNRGISLVKVLWRNHKADEAT

A0A5D3BWB9 Pol protein5.9e-16245.19Show/hide
Query:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS
        MKASKL +QG+WGILASVVDTR PEV LS E +V+ +PD F +EL   PP +EIDFAI+LEP T PISRAPYRMA   L E+KVQLQ+L DKGFIR SVS
Subjt:  MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVS

Query:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------
        PWGAP+LF KKKD +MRL IDYRELNKVTVKNRYPL RID+LFDQLQGATV SKIDLR+SYHQLRIRD+ IPK AFR  YGHYEFI+M            
Subjt:  PWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIM------------

Query:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNS
             VFKDFLD+F IVFIDDIL+YSK E EHEEHL QVLETLRAN+LYAKF+KC FWL K   + V K++        +    CE+ +  + Q+     
Subjt:  -----VFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNS

Query:  EHCAAD--------------------------------------------DENALKEFVPSSSSSCHERKQGQKALDVDNGQI----RKGMILEWFNTKS
             D                                            D  + K F      +  +R+  +   D D   +    +  ++ +  + K 
Subjt:  EHCAAD--------------------------------------------DENALKEFVPSSSSSCHERKQGQKALDVDNGQI----RKGMILEWFNTKS

Query:  AY----------------RQE---EIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRV
        A+                R E    + E+   +   KR L E GQ+E+FS SS+DGL FE +LYV   + VK ELLT+AHSSPF+MH GST MYQ+L+ V
Subjt:  AY----------------RQE---EIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRV

Query:  YWWQNMKREVADFVSRCLVYQQ------------------------------------------------------------------------------
        YWW+NMKREVADFV+RCLV QQ                                                                              
Subjt:  YWWQNMKREVADFVSRCLVYQQ------------------------------------------------------------------------------

Query:  ----------------------------------------VGKRRMLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTK
                                                VG++RMLGP+LVQTTN AIQKIRAR+LT Q++QKSY+D R KDLEF +GD VFLKVAP K
Subjt:  ----------------------------------------VGKRRMLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTK

Query:  GVLRFEKKGKAKS-------------------------------------TKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRNRGISLVKVL
        GVLRFEKKGK                                         +Y+A+P H+VDFEPLQI+ENLSYEEQ VEIL REVK LRNR ISLVKVL
Subjt:  GVLRFEKKGKAKS-------------------------------------TKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRNRGISLVKVL

Query:  WRNHKADEAT
        W+NH  +EAT
Subjt:  WRNHKADEAT

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.5e-2432.21Show/hide
Query:  PHKEIDFAIDL--EPGTVPISRAPYRMA-LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGA
        P K ++F ++L  E   +PI   P     +  +  ++ +    G IR S +    P++F  KK+G +R+ +DY+ LNK    N YPL  I+ L  ++QG+
Subjt:  PHKEIDFAIDL--EPGTVPISRAPYRMA-LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGA

Query:  TVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMV---------FKDFLDTF--------VIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLY
        T+ +K+DL+++YH +R+R     K AFR   G +E+++M          F+ F++T         V+ ++DDIL++SK+E EH +H+  VL+ L+   L 
Subjt:  TVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMV---------FKDFLDTF--------VIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLY

Query:  AKFSKCVF
           +KC F
Subjt:  AKFSKCVF

P0CT35 Transposon Tf2-2 polyprotein1.5e-2432.21Show/hide
Query:  PHKEIDFAIDL--EPGTVPISRAPYRMA-LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGA
        P K ++F ++L  E   +PI   P     +  +  ++ +    G IR S +    P++F  KK+G +R+ +DY+ LNK    N YPL  I+ L  ++QG+
Subjt:  PHKEIDFAIDL--EPGTVPISRAPYRMA-LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGA

Query:  TVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMV---------FKDFLDTF--------VIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLY
        T+ +K+DL+++YH +R+R     K AFR   G +E+++M          F+ F++T         V+ ++DDIL++SK+E EH +H+  VL+ L+   L 
Subjt:  TVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMV---------FKDFLDTF--------VIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLY

Query:  AKFSKCVF
           +KC F
Subjt:  AKFSKCVF

P0CT41 Transposon Tf2-12 polyprotein1.5e-2432.21Show/hide
Query:  PHKEIDFAIDL--EPGTVPISRAPYRMA-LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGA
        P K ++F ++L  E   +PI   P     +  +  ++ +    G IR S +    P++F  KK+G +R+ +DY+ LNK    N YPL  I+ L  ++QG+
Subjt:  PHKEIDFAIDL--EPGTVPISRAPYRMA-LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGA

Query:  TVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMV---------FKDFLDTF--------VIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLY
        T+ +K+DL+++YH +R+R     K AFR   G +E+++M          F+ F++T         V+ ++DDIL++SK+E EH +H+  VL+ L+   L 
Subjt:  TVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMV---------FKDFLDTF--------VIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLY

Query:  AKFSKCVF
           +KC F
Subjt:  AKFSKCVF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.0e-2636.87Show/hide
Query:  IDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDL
        I+++PG       PY +      E+   +QKL D  FI  S SP  +P++   KKDG  RL +DYR LNK T+ + +PL RIDNL  ++  A + + +DL
Subjt:  IDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDL

Query:  RTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVF----------KDFLDT-----FVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVF
         + YHQ+ +      K AF    G YE+ +M F          +   DT     FV V++DDIL++S++  EH +HL  VLE L+   L  K  KC F
Subjt:  RTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVF----------KDFLDT-----FVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVF

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-2629.77Show/hide
Query:  IDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDL
        I+++PG       PY +      E+   +QKL D  FI  S SP  +P++   KKDG  RL +DYR LNK T+ + +PL RIDNL  ++  A + + +DL
Subjt:  IDLEPGTVPISRAPYRMA---LAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKDGAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDL

Query:  RTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVF----------KDFLDT-----FVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWL
         + YHQ+ +      K AF    G YE+ +M F          +   DT     FV V++DDIL++S++  EH +HL  VLE L+   L  K  KC F  
Subjt:  RTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVF----------KDFLDT-----FVIVFIDDILVYSKTEVEHEEHLHQVLETLRANKLYAKFSKCVFWL

Query:  NKND----DAAVLKLQPQWSQLSNFFEL-------HCEALWSLIN--QQFHPNSEHCAADDENALKEFVPSSSSSCHERKQGQKALDVDNGQIRKGMILE
         + +       + K+ P   + +   +          +    +IN  ++F PN    A      ++ F+   S    ++    KA+D     +    +L 
Subjt:  NKND----DAAVLKLQPQWSQLSNFFEL-------HCEALWSLIN--QQFHPNSEHCAADDENALKEFVPSSSSSCHERKQGQKALDVDNGQIRKGMILE

Query:  WFNTKSAYR
         FN K+ YR
Subjt:  WFNTKSAYR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCTAGTAAGCTACCTAACCAGGGTTCTTGGGGCATCTTGGCTAGCGTAGTAGACACAAGAGAGCCTGAAGTTTTCTTATCATTTGAATCGATGGTAAGGGTATT
CCCTGATGATTTTTCTAATGAGCTTTTAGAATTTCCGCCTCACAAAGAGATTGATTTTGCTATTGATTTAGAGCCAGGTACTGTTCCCATATCAAGAGCTCCATATAGAA
TGGCCTTAGCCGAGCTGAAGGTTCAGTTACAGAAATTGCAGGACAAGGGTTTCATTCGATCTAGTGTGTCACCTTGGGGAGCACCAATATTGTTTGCAAAGAAGAAAGAT
GGGGCGATGCGCCTTTACATTGATTATAGAGAGCTTAATAAGGTGACAGTAAAGAACAGATATCCGCTACTCAGAATCGATAATTTGTTTGATCAGTTACAAGGAGCCAC
TGTATTATCTAAGATTGACTTACGTACAAGCTATCACCAGTTGAGAATTAGAGACAATGGTATTCCTAAGGCTGCTTTTCGTTACACATATGGGCATTACGAGTTTATTA
TAATGGTGTTTAAGGATTTCTTAGACACTTTTGTGATAGTTTTTATTGATGATATCTTGGTTTACTCCAAGACAGAGGTTGAGCATGAGGAGCATTTGCATCAAGTTTTA
GAGACCCTTCGAGCCAATAAACTATATGCTAAGTTTTCTAAGTGTGTTTTCTGGCTGAATAAGAATGATGATGCGGCGGTGCTCAAATTGCAGCCACAATGGTCACAACT
CTCAAACTTTTTCGAATTGCATTGTGAAGCTCTTTGGAGTTTGATTAACCAACAGTTCCATCCGAATTCTGAGCATTGTGCTGCGGATGACGAAAATGCATTGAAGGAAT
TCGTTCCTAGCTCTTCTTCTAGTTGCCATGAGAGGAAGCAAGGGCAAAAAGCACTTGATGTTGACAATGGTCAAATTCGAAAGGGAATGATCCTCGAATGGTTCAATACC
AAAAGTGCATATAGGCAAGAGGAGATTGACGAGATTCGAACCAAATGGGCAGCTTTTAAGCGTCGCCTAAAAGAAGCAGGGCAAACTGAGGAGTTCTCCACATCCTCTAA
TGATGGACTTATGTTTGAGAGACAGTTATACGTGTCAGCATACAATATAGTTAAGATTGAGCTTTTGACTAAGGCACATAGTTCCCCATTTTCTATGCATCATGGCAGTA
CAAAGATGTACCAGAACTTGAAGCGAGTTTACTGGTGGCAAAATATGAAGAGAGAAGTGGCAGACTTCGTTAGTAGATGTTTGGTCTACCAGCAGGTTGGTAAGCGAAGA
ATGTTAGGTCCTGAGCTAGTTCAAACCACCAATAAAGCTATACAGAAGATTAGAGCTCGTATGTTGACAGCACAGAGCAAACAGAAGAGTTACTCTGATGAACGGTGTAA
GGATCTCGAGTTTAACATAGGAGACATGGTTTTTCTAAAGGTAGCACCTACAAAAGGTGTTTTGAGATTTGAGAAGAAGGGAAAAGCTAAGTCCACTAAGTATATAGCAA
ACCCGATGCATATAGTTGACTTTGAACCATTGCAAATTAATGAAAATTTGAGCTATGAGGAGCAACTCGTTGAGATTTTGGTAAGAGAGGTCAAACTGCTCCGTAATAGA
GGAATTTCATTGGTTAAAGTTCTTTGGCGAAACCACAAAGCTGATGAGGCCACATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCTAGTAAGCTACCTAACCAGGGTTCTTGGGGCATCTTGGCTAGCGTAGTAGACACAAGAGAGCCTGAAGTTTTCTTATCATTTGAATCGATGGTAAGGGTATT
CCCTGATGATTTTTCTAATGAGCTTTTAGAATTTCCGCCTCACAAAGAGATTGATTTTGCTATTGATTTAGAGCCAGGTACTGTTCCCATATCAAGAGCTCCATATAGAA
TGGCCTTAGCCGAGCTGAAGGTTCAGTTACAGAAATTGCAGGACAAGGGTTTCATTCGATCTAGTGTGTCACCTTGGGGAGCACCAATATTGTTTGCAAAGAAGAAAGAT
GGGGCGATGCGCCTTTACATTGATTATAGAGAGCTTAATAAGGTGACAGTAAAGAACAGATATCCGCTACTCAGAATCGATAATTTGTTTGATCAGTTACAAGGAGCCAC
TGTATTATCTAAGATTGACTTACGTACAAGCTATCACCAGTTGAGAATTAGAGACAATGGTATTCCTAAGGCTGCTTTTCGTTACACATATGGGCATTACGAGTTTATTA
TAATGGTGTTTAAGGATTTCTTAGACACTTTTGTGATAGTTTTTATTGATGATATCTTGGTTTACTCCAAGACAGAGGTTGAGCATGAGGAGCATTTGCATCAAGTTTTA
GAGACCCTTCGAGCCAATAAACTATATGCTAAGTTTTCTAAGTGTGTTTTCTGGCTGAATAAGAATGATGATGCGGCGGTGCTCAAATTGCAGCCACAATGGTCACAACT
CTCAAACTTTTTCGAATTGCATTGTGAAGCTCTTTGGAGTTTGATTAACCAACAGTTCCATCCGAATTCTGAGCATTGTGCTGCGGATGACGAAAATGCATTGAAGGAAT
TCGTTCCTAGCTCTTCTTCTAGTTGCCATGAGAGGAAGCAAGGGCAAAAAGCACTTGATGTTGACAATGGTCAAATTCGAAAGGGAATGATCCTCGAATGGTTCAATACC
AAAAGTGCATATAGGCAAGAGGAGATTGACGAGATTCGAACCAAATGGGCAGCTTTTAAGCGTCGCCTAAAAGAAGCAGGGCAAACTGAGGAGTTCTCCACATCCTCTAA
TGATGGACTTATGTTTGAGAGACAGTTATACGTGTCAGCATACAATATAGTTAAGATTGAGCTTTTGACTAAGGCACATAGTTCCCCATTTTCTATGCATCATGGCAGTA
CAAAGATGTACCAGAACTTGAAGCGAGTTTACTGGTGGCAAAATATGAAGAGAGAAGTGGCAGACTTCGTTAGTAGATGTTTGGTCTACCAGCAGGTTGGTAAGCGAAGA
ATGTTAGGTCCTGAGCTAGTTCAAACCACCAATAAAGCTATACAGAAGATTAGAGCTCGTATGTTGACAGCACAGAGCAAACAGAAGAGTTACTCTGATGAACGGTGTAA
GGATCTCGAGTTTAACATAGGAGACATGGTTTTTCTAAAGGTAGCACCTACAAAAGGTGTTTTGAGATTTGAGAAGAAGGGAAAAGCTAAGTCCACTAAGTATATAGCAA
ACCCGATGCATATAGTTGACTTTGAACCATTGCAAATTAATGAAAATTTGAGCTATGAGGAGCAACTCGTTGAGATTTTGGTAAGAGAGGTCAAACTGCTCCGTAATAGA
GGAATTTCATTGGTTAAAGTTCTTTGGCGAAACCACAAAGCTGATGAGGCCACATAG
Protein sequenceShow/hide protein sequence
MKASKLPNQGSWGILASVVDTREPEVFLSFESMVRVFPDDFSNELLEFPPHKEIDFAIDLEPGTVPISRAPYRMALAELKVQLQKLQDKGFIRSSVSPWGAPILFAKKKD
GAMRLYIDYRELNKVTVKNRYPLLRIDNLFDQLQGATVLSKIDLRTSYHQLRIRDNGIPKAAFRYTYGHYEFIIMVFKDFLDTFVIVFIDDILVYSKTEVEHEEHLHQVL
ETLRANKLYAKFSKCVFWLNKNDDAAVLKLQPQWSQLSNFFELHCEALWSLINQQFHPNSEHCAADDENALKEFVPSSSSSCHERKQGQKALDVDNGQIRKGMILEWFNT
KSAYRQEEIDEIRTKWAAFKRRLKEAGQTEEFSTSSNDGLMFERQLYVSAYNIVKIELLTKAHSSPFSMHHGSTKMYQNLKRVYWWQNMKREVADFVSRCLVYQQVGKRR
MLGPELVQTTNKAIQKIRARMLTAQSKQKSYSDERCKDLEFNIGDMVFLKVAPTKGVLRFEKKGKAKSTKYIANPMHIVDFEPLQINENLSYEEQLVEILVREVKLLRNR
GISLVKVLWRNHKADEAT