; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0095541 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0095541
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr04:9143801..9145099
RNA-Seq ExpressionCmc04g0095541
SyntenyCmc04g0095541
Gene Ontology termsGO:0009231 - riboflavin biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004491 - methylmalonate-semialdehyde dehydrogenase (acylating) activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008686 - 3,4-dihydroxy-2-butanone-4-phosphate synthase activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KYP36562.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]3.5e-16967.64Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGH---K
        MSTLT +KFDGSRT+HEH++EMTN+ ARLK++ M VNENFLV FILNSL +EYGPF M+YNT+KDKWN+HEL SML+QEE RLK    HS + + H   +
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGH---K

Query:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT
        GAGKK  KK+DKG    LK+ ++S PI KK    + C FC K  H+QKDC KRK WFE K K NA VCFESNLTEVP+NTWWIDSGCT H+SNTMQGF T
Subjt:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT

Query:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA
         QT +PNE+F+FMGNRVKVPVEAVGTYRL L+T H+L+L +T YV S+SRNL+SLSKLD  GY F FGN CFSLFK+N  IG+G              + 
Subjt:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA

Query:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE
        E+LLTLHHN+GTKR   NE  A+LWH+RLGHIS+ER++R IKNEILP+LDFTDL ICVD IK KQTK T  K A RS+QLLEI+HTDICGPFDV SFG E
Subjt:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE

Query:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK
        KYFITFIDD+ RYGY+YLLHEKSQA+DAL++++NEVERQLD+KVK
Subjt:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK

KYP78985.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]7.4e-15966.98Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAG
        MSTLT +KFDGSRT+HEH++EMTN+ ARLK++ M V+ENFLV FILNSL SEYGPF MNYNT+KDKWN+HEL SML+QEE RLK    HS + + ++G  
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAG

Query:  KKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQT
        KK  KK+ KG  G LK+ +SS  I KK    DKC FC K  H+QKDC KRK WFE K ++NA VCFESNLTEVP+NTWWIDSGCT H+SN MQGFLTT+T
Subjt:  KKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQT

Query:  TNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FAESL
         +PNE+F+FMGNRVKV VEAVGTY L LDT  +L+LFDTFYV SISRNL+SLSKLD +GY F+FGN CFSL+K+   IGSG              +A++L
Subjt:  TNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FAESL

Query:  LTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYF
        +TLHHNVGTKR   NE  AYLWHKRLGHI KERI+R +KNEIL +LDFTDL +CVD IK KQTK T  K A RS+QLLEII+ DICGPF V SF  EKYF
Subjt:  LTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYF

Query:  ITFIDDFLRYGYIYLLHEKSQAIDALKVFI
        ITFIDDF RYGY+YLLHEKSQA++A   FI
Subjt:  ITFIDDFLRYGYIYLLHEKSQAIDALKVFI

RYE18822.1 hypothetical protein EOP45_13565, partial [Sphingobacteriaceae bacterium]2.5e-18372.62Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAG
        M  LT +KFDGSRT+HEH++EM N+ ARLKTM MEVNENFLVTFILNSL  EYG FH++YNTLKDKW++HELQSMLIQEEARLKKS  HSANL+GHKGA 
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAG

Query:  KKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQT
        KKP  K  KG  G  K+ +SS  IHKK Q  D CRFC K  H+QKDC KRKTWFE K K +A VCFESN  EVPYNTWW+DSGCT H+SNTMQGFLTTQT
Subjt:  KKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQT

Query:  TNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FAESL
         + NE+FI MGNR KV VEA+GTYRL LDT H+L+LF TFYV S+SRNL+S+SKLD +GY F FGN CFSLFKQN+F+GSG              FAE+L
Subjt:  TNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FAESL

Query:  LTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYF
        LT+HHNVGTKRG +NES AYLWHKRLGHISKERI+R +KN+ILP+LDFTDLG+CV+ IK K T+QT+ K A RSSQLLEIIHTDICGPFDVPS GGE+YF
Subjt:  LTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYF

Query:  ITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK
        ITFIDDF RYGY+YLLHEKSQ++D L+VF+NEVERQLDRKVK
Subjt:  ITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK

RZC12927.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine soja]7.1e-17067.86Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHK---
        MSTLT +KFDGSRT+HEH++EMTN+ ARLKT+ M VNENFLV FILNSL SEYGPF M+YNT+KDKWN+HEL SML+QEE RLK    HS + + H+   
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHK---

Query:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT
        GAGK   KK+DKG  G LK+K     I KK    + C FC K  H+QKDC KRK+WFE K + NALV FESNLTEVP+NTWWIDSGCT H+SNTMQGFLT
Subjt:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT

Query:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA
         QT +PNE+F+FMGNRVK PVEAVGTYRL LDT H+L+L +T YV S+SRNL+SLSKLD +GY F FGN CFSLFK N  IG+G              + 
Subjt:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA

Query:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE
        E++LTLHHNVGTKR   NE  A+LWHKRLGHIS+ERI+R IKNEILPDLDFTDL ICVD IK KQTK T  K A RS+QLLEI+HTDICGPFDV SFG E
Subjt:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE

Query:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK
        +YFITFIDD+ RYGY+YLLHEKSQA++AL++++NEVERQLDRKVK ++
Subjt:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK

RZC25410.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]2.9e-17168.3Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHK---
        MSTLT +KFDGSRT+HEH++EMTN+ ARLKT+ M VNENFLV FILNSL SEYGPF M+YNT+KDKWN+HEL SML+QEE RLK    HS + + H+   
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHK---

Query:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT
        GAGKK  KK+DKG  G LK+K     I KK    + C FC K  H+QKDC KRK+WFE K + NALVCFESNLTEVP+NTWWIDSGCT H+SNTMQGFLT
Subjt:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT

Query:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA
         QT +PNE+F+FMGNRVK PVEAVGTYRL LDT H+L+L +T YV S+SRNL+SLSKLD +GY F FGN CFSLFK N  IG+G              + 
Subjt:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA

Query:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE
        E++LTLHHNVGTKR   NE  A+LWHKRLGHIS ERI+R IKNEILPDLDFTDL ICVD IK KQTK T  K A RS+QLLEI+HTDICGPFDV SFG E
Subjt:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE

Query:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK
        +YFITFIDD+ RYGY+YLLHEKSQA++AL++++NEVERQLDRKVK ++
Subjt:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK

TrEMBL top hitse value%identityAlignment
A0A151R237 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-16967.64Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGH---K
        MSTLT +KFDGSRT+HEH++EMTN+ ARLK++ M VNENFLV FILNSL +EYGPF M+YNT+KDKWN+HEL SML+QEE RLK    HS + + H   +
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGH---K

Query:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT
        GAGKK  KK+DKG    LK+ ++S PI KK    + C FC K  H+QKDC KRK WFE K K NA VCFESNLTEVP+NTWWIDSGCT H+SNTMQGF T
Subjt:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT

Query:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA
         QT +PNE+F+FMGNRVKVPVEAVGTYRL L+T H+L+L +T YV S+SRNL+SLSKLD  GY F FGN CFSLFK+N  IG+G              + 
Subjt:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA

Query:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE
        E+LLTLHHN+GTKR   NE  A+LWH+RLGHIS+ER++R IKNEILP+LDFTDL ICVD IK KQTK T  K A RS+QLLEI+HTDICGPFDV SFG E
Subjt:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE

Query:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK
        KYFITFIDD+ RYGY+YLLHEKSQA+DAL++++NEVERQLD+KVK
Subjt:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK

A0A151UI64 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-15966.98Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAG
        MSTLT +KFDGSRT+HEH++EMTN+ ARLK++ M V+ENFLV FILNSL SEYGPF MNYNT+KDKWN+HEL SML+QEE RLK    HS + + ++G  
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAG

Query:  KKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQT
        KK  KK+ KG  G LK+ +SS  I KK    DKC FC K  H+QKDC KRK WFE K ++NA VCFESNLTEVP+NTWWIDSGCT H+SN MQGFLTT+T
Subjt:  KKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQT

Query:  TNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FAESL
         +PNE+F+FMGNRVKV VEAVGTY L LDT  +L+LFDTFYV SISRNL+SLSKLD +GY F+FGN CFSL+K+   IGSG              +A++L
Subjt:  TNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FAESL

Query:  LTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYF
        +TLHHNVGTKR   NE  AYLWHKRLGHI KERI+R +KNEIL +LDFTDL +CVD IK KQTK T  K A RS+QLLEII+ DICGPF V SF  EKYF
Subjt:  LTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYF

Query:  ITFIDDFLRYGYIYLLHEKSQAIDALKVFI
        ITFIDDF RYGY+YLLHEKSQA++A   FI
Subjt:  ITFIDDFLRYGYIYLLHEKSQAIDALKVFI

A0A445KPR8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A3.4e-17067.86Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHK---
        MSTLT +KFDGSRT+HEH++EMTN+ ARLKT+ M VNENFLV FILNSL SEYGPF M+YNT+KDKWN+HEL SML+QEE RLK    HS + + H+   
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHK---

Query:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT
        GAGK   KK+DKG  G LK+K     I KK    + C FC K  H+QKDC KRK+WFE K + NALV FESNLTEVP+NTWWIDSGCT H+SNTMQGFLT
Subjt:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT

Query:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA
         QT +PNE+F+FMGNRVK PVEAVGTYRL LDT H+L+L +T YV S+SRNL+SLSKLD +GY F FGN CFSLFK N  IG+G              + 
Subjt:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA

Query:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE
        E++LTLHHNVGTKR   NE  A+LWHKRLGHIS+ERI+R IKNEILPDLDFTDL ICVD IK KQTK T  K A RS+QLLEI+HTDICGPFDV SFG E
Subjt:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE

Query:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK
        +YFITFIDD+ RYGY+YLLHEKSQA++AL++++NEVERQLDRKVK ++
Subjt:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK

A0A445LQ30 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-17168.3Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHK---
        MSTLT +KFDGSRT+HEH++EMTN+ ARLKT+ M VNENFLV FILNSL SEYGPF M+YNT+KDKWN+HEL SML+QEE RLK    HS + + H+   
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHK---

Query:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT
        GAGKK  KK+DKG  G LK+K     I KK    + C FC K  H+QKDC KRK+WFE K + NALVCFESNLTEVP+NTWWIDSGCT H+SNTMQGFLT
Subjt:  GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLT

Query:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA
         QT +PNE+F+FMGNRVK PVEAVGTYRL LDT H+L+L +T YV S+SRNL+SLSKLD +GY F FGN CFSLFK N  IG+G              + 
Subjt:  TQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FA

Query:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE
        E++LTLHHNVGTKR   NE  A+LWHKRLGHIS ERI+R IKNEILPDLDFTDL ICVD IK KQTK T  K A RS+QLLEI+HTDICGPFDV SFG E
Subjt:  ESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGE

Query:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK
        +YFITFIDD+ RYGY+YLLHEKSQA++AL++++NEVERQLDRKVK ++
Subjt:  KYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK

A0A4Q3EHL3 Uncharacterized protein (Fragment)1.2e-18372.62Show/hide
Query:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAG
        M  LT +KFDGSRT+HEH++EM N+ ARLKTM MEVNENFLVTFILNSL  EYG FH++YNTLKDKW++HELQSMLIQEEARLKKS  HSANL+GHKGA 
Subjt:  MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAG

Query:  KKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQT
        KKP  K  KG  G  K+ +SS  IHKK Q  D CRFC K  H+QKDC KRKTWFE K K +A VCFESN  EVPYNTWW+DSGCT H+SNTMQGFLTTQT
Subjt:  KKPGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQT

Query:  TNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FAESL
         + NE+FI MGNR KV VEA+GTYRL LDT H+L+LF TFYV S+SRNL+S+SKLD +GY F FGN CFSLFKQN+F+GSG              FAE+L
Subjt:  TNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSG--------------FAESL

Query:  LTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYF
        LT+HHNVGTKRG +NES AYLWHKRLGHISKERI+R +KN+ILP+LDFTDLG+CV+ IK K T+QT+ K A RSSQLLEIIHTDICGPFDVPS GGE+YF
Subjt:  LTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYF

Query:  ITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK
        ITFIDDF RYGY+YLLHEKSQ++D L+VF+NEVERQLDRKVK
Subjt:  ITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.9e-1821.9Show/hide
Query:  LTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTL-KDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAGKK
        L ++K     ++  H      L++ L     ++ E   ++ +L +L S Y        TL ++   +  +++ L+ +E ++K    H+        A   
Subjt:  LTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTL-KDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAGKK

Query:  PGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYN------------ALVCFESNLTEVPYNTWWI-DSGCTIHIS
              K N  + +V +         + K KC  C +  H +KDC   K    NK K N            A +  E N T V  N  ++ DSG + H+ 
Subjt:  PGKKNDKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYN------------ALVCFESNLTEVPYNTWWI-DSGCTIHIS

Query:  NTMQGFLTTQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFI--GSGFAESLL
        N    +  +    P  +         +     G  RL  D  H + L D  +    + NL+S+ +L  +G   +F     ++ K  + +   SG   ++ 
Subjt:  NTMQGFLTTQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFI--GSGFAESLL

Query:  TLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPD---LDFTDLG--ICVDYIKRKQTKQTVN--KEAIRSSQLLEIIHTDICGPFDVPSF
         ++    +   +   +   LWH+R GHIS  ++    +  +  D   L+  +L   IC   +  KQ +      K+     + L ++H+D+CGP    + 
Subjt:  TLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPD---LDFTDLG--ICVDYIKRKQTKQTVN--KEAIRSSQLLEIIHTDICGPFDVPSF

Query:  GGEKYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLKI
          + YF+ F+D F  Y   YL+  KS      + F+ + E   + KV  L I
Subjt:  GGEKYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLKI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-2725.23Show/hide
Query:  HILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAGKKPGKKNDKGNHGQLKV
        H+     L+ +L  + +++ E      +LNSL S Y          K    + ++ S L+  E +++K   +    +  +G G+    +    N+G+   
Subjt:  HILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAGKKPGKKNDKGNHGQLKV

Query:  KQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLK-RKTWFENKVKYN-----ALVCFESNLT------------EVPYNTWWIDSGCTIHISNTMQGFLTTQ
        +  S     K ++++ C  CN+  H+++DC   RK   E   + N     A+V    N+               P + W +D+  + H +      L  +
Subjt:  KQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLK-RKTWFENKVKYN-----ALVCFESNLT------------EVPYNTWWIDSGCTIHISNTMQGFLTTQ

Query:  TTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSGFAESLLTLHHNVGTKRGQ
            +   + MGN     +  +G   +  +    L L D  +V  +  NLIS   LD  GY   F N+ + L K ++ I  G A   L    N    +G+
Subjt:  TTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSGFAESLLTLHHNVGTKRGQ

Query:  TN----ESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFLR
         N    E    LWHKR+GH+S++ ++   K  ++     T +  C   +  KQ + +    + R   +L+++++D+CGP ++ S GG KYF+TFIDD  R
Subjt:  TN----ESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFLR

Query:  YGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK
          ++Y+L  K Q     + F   VER+  RK+K L+
Subjt:  YGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLK

Q12491 Transposon Ty2-B Gag-Pol polyprotein7.5e-0525.12Show/hide
Query:  NRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSGFAESLLTLHHNVGTKRGQTNESLAYLWHKR
        N  K  ++A+ T  +  D      L +    +  +RN +  S         K G+  F    +   I S    S LT+  N   K    N+    L H+ 
Subjt:  NRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSGFAESLLTLHHNVGTKRGQTNESLAYLWHKR

Query:  LGHISKERIKRSIKNEILPDLDFTDLG-------ICVDYIKRKQTKQTVNK----EAIRSSQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFLRYGYIY
        LGH +   I++S+K   +  L  +D+         C D +  K TK    K    +   S +  + +HTDI GP          YFI+F D+  R+ ++Y
Subjt:  LGHISKERIKRSIKNEILPDLDFTDLG-------ICVDYIKRKQTKQTVNK----EAIRSSQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFLRYGYIY

Query:  LLHEKSQ
         LH++ +
Subjt:  LLHEKSQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.4e-1622.57Show/hide
Query:  GSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARL-----KKSIIHSANLMGHKGA------
        G++TI +++  +     +L  +   ++ +  V  +L +L  EY P             + E+   L+  E+++        I  +AN + H+        
Subjt:  GSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARL-----KKSIIHSANLMGHKGA------

Query:  --GKKPGKKNDKGNHGQLKV-KQSSAPIH-KKGQIK---DKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCF-----ESNLT-EVPY--NTWWIDSGC
          G +  + +++ N+   K  +QSS   H    Q K    KC+ C    H  K C + + +  +         F      +NL    PY  N W +DSG 
Subjt:  --GKKPGKKNDKGNHGQLKV-KQSSAPIH-KKGQIK---DKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCF-----ESNLT-EVPY--NTWWIDSGC

Query:  TIHISNTMQGFLTTQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKL-DTSGYYFKFGNECFSLFKQNIFI-----
        T HI++        Q     +  + + +   +P+   G+  L+  +R  LNL +  YV +I +NLIS+ +L + +G   +F    F +   N  +     
Subjt:  TIHISNTMQGFLTTQTTNPNERFIFMGNRVKVPVEAVGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKL-DTSGYYFKFGNECFSLFKQNIFI-----

Query:  --GSGFAESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGI-CVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPF
               E  +     V      ++++    WH RLGH +   +   I N  L  L+ +   + C D +  K  K   ++  I S++ LE I++D+    
Subjt:  --GSGFAESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILPDLDFTDLGI-CVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPF

Query:  DVPSFGGEKYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKV
         + S    +Y++ F+D F RY ++Y L +KSQ  +    F N +E +   ++
Subjt:  DVPSFGGEKYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.0e-1523.39Show/hide
Query:  ILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARL-----KKSIIHSANLMGHKGAGKKPGKKN--DKGNHGQLKVKQSSAPIHKKGQIKD-----
        +L +L  +Y P            ++ E+   LI  E++L      + +  +AN++ H+       + N  D  N+     + +S      G   D     
Subjt:  ILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARL-----KKSIIHSANLMGHKGAGKKPGKKN--DKGNHGQLKVKQSSAPIHKKGQIKD-----

Query:  ----KCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCF-----ESNL-TEVPY--NTWWIDSGCTIHISNTMQGFLTTQTTNPNERFIFMGNRVKVPVEA
            +C+ C+   H  K C +   +     +  +   F      +NL    PY  N W +DSG T HI++        Q     +  + + +   +P+  
Subjt:  ----KCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCF-----ESNL-TEVPY--NTWWIDSGCTIHISNTMQGFLTTQTTNPNERFIFMGNRVKVPVEA

Query:  VGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKL-DTSGYYFKFGNECFSLFKQNIFI-------GSGFAESLLTLHHNVGTKRGQTNESLAYLWHKRL
         G+  L   +R  L+L    YV +I +NLIS+ +L +T+    +F    F +   N  +            E  +     V       +++    WH RL
Subjt:  VGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKL-DTSGYYFKFGNECFSLFKQNIFI-------GSGFAESLLTLHHNVGTKRGQTNESLAYLWHKRL

Query:  GHISKERIKRSIKNEILPDLDFT-DLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFLRYGYIYLLHEKSQAIDA
        GH S   +   I N  LP L+ +  L  C D    K  K   +   I SS+ LE I++D+     + S    +Y++ F+D F RY ++Y L +KSQ  D 
Subjt:  GHISKERIKRSIKNEILPDLDFT-DLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFLRYGYIYLLHEKSQAIDA

Query:  LKVFINEVERQLDRKVKNL
          +F + VE +   ++  L
Subjt:  LKVFINEVERQLDRKVKNL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGACGAACTTGGTAGCAAGGTTAAAGACCATGAGAATGGAAGTTAA
TGAGAATTTTTTGGTAACGTTTATCCTTAATTCCTTACATTCAGAGTATGGTCCATTTCACATGAACTATAACACTCTGAAAGATAAATGGAATATGCATGAATTACAAA
GTATGCTCATTCAAGAGGAAGCGAGGCTTAAGAAATCAATAATTCACTCTGCCAATCTCATGGGTCACAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGACAAGGGC
AATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACTAGAACACTATCAGAAAGATTG
TCTAAAACGTAAGACATGGTTCGAGAATAAAGTTAAGTATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTT
GTACCATTCATATTTCCAATACGATGCAGGGATTCCTTACGACCCAAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTCCAGTTGAAGCT
GTGGGAACCTATCGTTTAACTTTAGATACTAGACATTATTTAAACCTTTTTGATACCTTTTATGTTTCTTCTATTTCTCGTAATTTGATTTCCTTATCAAAACTTGATAC
TTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAAAACATTTTTATTGGTTCTGGTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTG
GTACTAAACGTGGTCAAACTAATGAATCGTTAGCTTACTTGTGGCATAAACGTTTAGGTCACATATCCAAAGAAAGAATTAAAAGATCGATAAAGAATGAAATTCTTCCA
GATTTGGATTTTACTGACCTTGGAATTTGTGTGGATTATATTAAAAGAAAACAAACAAAACAGACAGTTAATAAAGAAGCCATAAGAAGCTCACAACTTCTTGAAATTAT
ACACACTGATATTTGTGGGCCTTTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGATGATTTCTTACGTTATGGTTATATCTATTTATTGCATG
AGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAAGGTGAAAAATCTTAAGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGACGAACTTGGTAGCAAGGTTAAAGACCATGAGAATGGAAGTTAA
TGAGAATTTTTTGGTAACGTTTATCCTTAATTCCTTACATTCAGAGTATGGTCCATTTCACATGAACTATAACACTCTGAAAGATAAATGGAATATGCATGAATTACAAA
GTATGCTCATTCAAGAGGAAGCGAGGCTTAAGAAATCAATAATTCACTCTGCCAATCTCATGGGTCACAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGACAAGGGC
AATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACTAGAACACTATCAGAAAGATTG
TCTAAAACGTAAGACATGGTTCGAGAATAAAGTTAAGTATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTT
GTACCATTCATATTTCCAATACGATGCAGGGATTCCTTACGACCCAAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTCCAGTTGAAGCT
GTGGGAACCTATCGTTTAACTTTAGATACTAGACATTATTTAAACCTTTTTGATACCTTTTATGTTTCTTCTATTTCTCGTAATTTGATTTCCTTATCAAAACTTGATAC
TTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAAAACATTTTTATTGGTTCTGGTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTG
GTACTAAACGTGGTCAAACTAATGAATCGTTAGCTTACTTGTGGCATAAACGTTTAGGTCACATATCCAAAGAAAGAATTAAAAGATCGATAAAGAATGAAATTCTTCCA
GATTTGGATTTTACTGACCTTGGAATTTGTGTGGATTATATTAAAAGAAAACAAACAAAACAGACAGTTAATAAAGAAGCCATAAGAAGCTCACAACTTCTTGAAATTAT
ACACACTGATATTTGTGGGCCTTTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGATGATTTCTTACGTTATGGTTATATCTATTTATTGCATG
AGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAAGGTGAAAAATCTTAAGATCTGA
Protein sequenceShow/hide protein sequence
MSTLTNIKFDGSRTIHEHILEMTNLVARLKTMRMEVNENFLVTFILNSLHSEYGPFHMNYNTLKDKWNMHELQSMLIQEEARLKKSIIHSANLMGHKGAGKKPGKKNDKG
NHGQLKVKQSSAPIHKKGQIKDKCRFCNKLEHYQKDCLKRKTWFENKVKYNALVCFESNLTEVPYNTWWIDSGCTIHISNTMQGFLTTQTTNPNERFIFMGNRVKVPVEA
VGTYRLTLDTRHYLNLFDTFYVSSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSGFAESLLTLHHNVGTKRGQTNESLAYLWHKRLGHISKERIKRSIKNEILP
DLDFTDLGICVDYIKRKQTKQTVNKEAIRSSQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFLRYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKNLKI