CuGenDBv2

Cmc06g0169151 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc06g0169151
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr06:23520089..23521357
RNA-Seq Expression: Cmc06g0169151
Synteny: Cmc06g0169151
Gene Ontology terms:
  GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR041588 - Integrase zinc-binding domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily
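
The genome location in the record above can be split into a sequence ID, a start, and an end. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

# Hypothetical helper: splits a "seqid:start..end" location string.
# Assumption: coordinates are 1-based and inclusive.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Return (seqid, start, end, length) for a 'seqid:start..end' string."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match["start"]), int(match["end"])
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    # Inclusive span, so both endpoints are counted.
    return match["seqid"], start, end, end - start + 1


seqid, start, end, length = parse_location("CMiso1.1chr06:23520089..23521357")
print(seqid, start, end, length)
```

Under the inclusive-coordinate assumption, the span is 1269 bp.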


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.0e-237; % identity: 95.02)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPL ELLKK+VHWNWDP+CQ AFDGLK+A+MEGPLLGI DVTKPFEVETDASDYALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPIAYESRKLNA ERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLT KQARWQEFLA+FDFEFEHKKGSSNQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHL+GSEIGGSVRDTLREFLQKDHAAQNVMNL KAGKTRQFWVEEDLLVTKGNRLYVPRAG LRKKLLYECH+TLWAGHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRP ESVSMDFI HLPKV DFEAILVIIDRFSKYATFIP TKQCSAE TAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLW VPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

KAA0060348.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 4.7e-243; % identity: 97.63)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFL LANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGI DVTKPFEVETDASDYALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPIAYE RKLN  ERRYTVSEKEMLAVVHCLRAWRQYLL SSFVVKTDNSATCHFFTQPKLT KQARWQEFLADFDFEFEHKKGSSNQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLW GHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFI HLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLWEVPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

KAA0067557.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.0e-237; % identity: 95.02)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPL ELLKK+VHWNWDP+CQ AFDGLK+A+MEGPLLGI DVTKPFEVETDASDYALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPIAYESRKLNA ERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLT KQARWQEFLA+FDFEFEHKKGSSNQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHL+GSEIGGSVRDTLREFLQKDHAAQNVMNL KAGKTRQFWVEEDLLVTKGNRLYVPRAG LRKKLLYECH+TLWAGHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRP ESVSMDFI HLPKV DFEAILVIIDRFSKYATFIP TKQCSAE TAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLW VPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

TYK01597.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.1e-239; % identity: 95.73)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPL ELLKK+VHWNWDP+CQAAFDGLK+AMMEGPLLGI DVTKPFEVETDAS+YALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPI YESRKLN  ERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLT KQARWQEFLA+FDFEFEHKKGS+NQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECH+TLWAGHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRP ESVSMDFI HLPKV DFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLW VPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

TYK07954.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 3.5e-238; % identity: 95.26)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPL ELLKK+VHWNWDP+CQAAFDGLK+A+MEGPLLGI DVTKPFEVETDASDYALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPIAYESRKLNA ERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLT KQARWQEFLA+FDFEFEHKKGSSNQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHL+GSEIGGSVRDTLREFLQKDHAAQNVMNL KAGKTRQFWVEEDLLVTKGNRLYVPRAG LRKKLLYECH+TLWAGHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRP ESVSMDFI HLPKV DFEAILVIIDRFSKYATFIP TKQCSAE TAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLW VPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UYM0 Reverse transcriptase (e value: 2.3e-243; % identity: 97.63)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFL LANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGI DVTKPFEVETDASDYALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPIAYE RKLN  ERRYTVSEKEMLAVVHCLRAWRQYLL SSFVVKTDNSATCHFFTQPKLT KQARWQEFLADFDFEFEHKKGSSNQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLW GHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFI HLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLWEVPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

A0A5D3BQE4 Reverse transcriptase (e value: 5.3e-240; % identity: 95.73)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPL ELLKK+VHWNWDP+CQAAFDGLK+AMMEGPLLGI DVTKPFEVETDAS+YALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPI YESRKLN  ERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLT KQARWQEFLA+FDFEFEHKKGS+NQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECH+TLWAGHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRP ESVSMDFI HLPKV DFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLW VPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

A0A5D3BRZ6 Reverse transcriptase (e value: 4.9e-238; % identity: 95.02)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPL ELLKK+VHWNWDP+CQ AFDGLK+A+MEGPLLGI DVTKPFEVETDASDYALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPIAYESRKLNA ERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLT KQARWQEFLA+FDFEFEHKKGSSNQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHL+GSEIGGSVRDTLREFLQKDHAAQNVMNL KAGKTRQFWVEEDLLVTKGNRLYVPRAG LRKKLLYECH+TLWAGHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRP ESVSMDFI HLPKV DFEAILVIIDRFSKYATFIP TKQCSAE TAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLW VPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

A0A5D3C4R1 Reverse transcriptase (e value: 4.9e-238; % identity: 95.02)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPL ELLKK+VHWNWDP+CQ AFDGLK+A+MEGPLLGI DVTKPFEVETDASDYALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPIAYESRKLNA ERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLT KQARWQEFLA+FDFEFEHKKGSSNQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHL+GSEIGGSVRDTLREFLQKDHAAQNVMNL KAGKTRQFWVEEDLLVTKGNRLYVPRAG LRKKLLYECH+TLWAGHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRP ESVSMDFI HLPKV DFEAILVIIDRFSKYATFIP TKQCSAE TAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLW VPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

A0A5D3C9P8 Reverse transcriptase (e value: 1.7e-238; % identity: 95.26)
Query:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG
        MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPL ELLKK+VHWNWDP+CQAAFDGLK+A+MEGPLLGI DVTKPFEVETDASDYALGG
Subjt:  MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGG

Query:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS
        VLLQNGHPIAYESRKLNA ERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLT KQARWQEFLA+FDFEFEHKKGSSNQAADALS
Subjt:  VLLQNGHPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALS

Query:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL
        RKQEHA+ICLLAHL+GSEIGGSVRDTLREFLQKDHAAQNVMNL KAGKTRQFWVEEDLLVTKGNRLYVPRAG LRKKLLYECH+TLWAGHPGWQRTYALL
Subjt:  RKQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALL

Query:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV
        KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRP ESVSMDFI HLPKV DFEAILVIIDRFSKYATFIP TKQCSAE TAQLFFKHV
Subjt:  KKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHV

Query:  VKLWEVPTSIVSDRDGRFIGSF
        VKLW VPTSIVSDRDGRFIGSF
Subjt:  VKLWEVPTSIVSDRDGRFIGSF

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e value: 1.1e-64; % identity: 33.26)
Query:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN
        I  +  W  PK+  ELR FLG  NY R+F+   S+   PL  LLKK+V W W P    A + +K+ ++  P+L   D +K   +ETDASD A+G VL Q 
Subjt:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN

Query:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA
              +P+ Y S K++  +  Y+VS+KEMLA++  L+ WR YL  +   F + TD+       T   +P+   + ARWQ FL DF+FE  ++ GS+N  
Subjt:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA

Query:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT
        ADALSR            E  SI  +  +    I    ++ +      D    N++N           +++ LL+   +++ +P    L + ++ + H  
Subjt:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT

Query:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK
            HPG +    ++ + + W  +R  + +Y + C  CQ +K    K  G L P+P   RP ES+SMDFI  LP+ S + A+ V++DRFSK A  +P TK
Subjt:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK

Query:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF
          +AE TA++F + V+  +  P  I++D D  F
Subjt:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF

P0CT35 Transposon Tf2-2 polyprotein (e value: 1.1e-64; % identity: 33.26)
Query:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN
        I  +  W  PK+  ELR FLG  NY R+F+   S+   PL  LLKK+V W W P    A + +K+ ++  P+L   D +K   +ETDASD A+G VL Q 
Subjt:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN

Query:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA
              +P+ Y S K++  +  Y+VS+KEMLA++  L+ WR YL  +   F + TD+       T   +P+   + ARWQ FL DF+FE  ++ GS+N  
Subjt:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA

Query:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT
        ADALSR            E  SI  +  +    I    ++ +      D    N++N           +++ LL+   +++ +P    L + ++ + H  
Subjt:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT

Query:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK
            HPG +    ++ + + W  +R  + +Y + C  CQ +K    K  G L P+P   RP ES+SMDFI  LP+ S + A+ V++DRFSK A  +P TK
Subjt:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK

Query:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF
          +AE TA++F + V+  +  P  I++D D  F
Subjt:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF

P0CT36 Transposon Tf2-3 polyprotein (e value: 1.1e-64; % identity: 33.26)
Query:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN
        I  +  W  PK+  ELR FLG  NY R+F+   S+   PL  LLKK+V W W P    A + +K+ ++  P+L   D +K   +ETDASD A+G VL Q 
Subjt:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN

Query:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA
              +P+ Y S K++  +  Y+VS+KEMLA++  L+ WR YL  +   F + TD+       T   +P+   + ARWQ FL DF+FE  ++ GS+N  
Subjt:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA

Query:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT
        ADALSR            E  SI  +  +    I    ++ +      D    N++N           +++ LL+   +++ +P    L + ++ + H  
Subjt:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT

Query:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK
            HPG +    ++ + + W  +R  + +Y + C  CQ +K    K  G L P+P   RP ES+SMDFI  LP+ S + A+ V++DRFSK A  +P TK
Subjt:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK

Query:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF
          +AE TA++F + V+  +  P  I++D D  F
Subjt:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF

P0CT41 Transposon Tf2-12 polyprotein (e value: 1.1e-64; % identity: 33.26)
Query:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN
        I  +  W  PK+  ELR FLG  NY R+F+   S+   PL  LLKK+V W W P    A + +K+ ++  P+L   D +K   +ETDASD A+G VL Q 
Subjt:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN

Query:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA
              +P+ Y S K++  +  Y+VS+KEMLA++  L+ WR YL  +   F + TD+       T   +P+   + ARWQ FL DF+FE  ++ GS+N  
Subjt:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA

Query:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT
        ADALSR            E  SI  +  +    I    ++ +      D    N++N           +++ LL+   +++ +P    L + ++ + H  
Subjt:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT

Query:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK
            HPG +    ++ + + W  +R  + +Y + C  CQ +K    K  G L P+P   RP ES+SMDFI  LP+ S + A+ V++DRFSK A  +P TK
Subjt:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK

Query:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF
          +AE TA++F + V+  +  P  I++D D  F
Subjt:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF

Q9UR07 Transposon Tf2-11 polyprotein (e value: 1.1e-64; % identity: 33.26)
Query:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN
        I  +  W  PK+  ELR FLG  NY R+F+   S+   PL  LLKK+V W W P    A + +K+ ++  P+L   D +K   +ETDASD A+G VL Q 
Subjt:  IAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQN

Query:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA
              +P+ Y S K++  +  Y+VS+KEMLA++  L+ WR YL  +   F + TD+       T   +P+   + ARWQ FL DF+FE  ++ GS+N  
Subjt:  G-----HPIAYESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGS--SFVVKTDNSATCHFFT---QPKLTWKQARWQEFLADFDFEFEHKKGSSNQA

Query:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT
        ADALSR            E  SI  +  +    I    ++ +      D    N++N           +++ LL+   +++ +P    L + ++ + H  
Subjt:  ADALSR----------KQEHASICLLAHLRGSEIGGSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNT

Query:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK
            HPG +    ++ + + W  +R  + +Y + C  CQ +K    K  G L P+P   RP ES+SMDFI  LP+ S + A+ V++DRFSK A  +P TK
Subjt:  LWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEKVKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTK

Query:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF
          +AE TA++F + V+  +  P  I++D D  F
Subjt:  QCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRF

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 3.4e-13; % identity: 43.02)
Query:  EEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPF
        +  K+ A+  W  PK+ +ELR FLGL  YYRRFV+ + K   PL ELLKK     W      AF  LK A+   P+L + D+  PF
Subjt:  EEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPF
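
The percent identity reported for each hit can be related to the middle line of its alignment. The sketch below assumes the usual BLAST convention, where a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `midline_stats` is hypothetical. The example uses only the last segment of the first GenBank alignment, so its percentage applies to that segment and not to the whole hit.

```python
def midline_stats(midline, aligned_length):
    """Count identities and positives in a BLAST-style alignment middle line.

    Assumption: letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps.
    """
    # Trailing spaces are often stripped from text dumps, so pad them back.
    padded = midline.ljust(aligned_length)
    identical = sum(ch.isalpha() for ch in padded)
    positives = identical + padded.count("+")
    percent_identity = 100.0 * identical / aligned_length
    return identical, positives, percent_identity


# Last segment of the alignment against KAA0037220.1.
query = "VKLWEVPTSIVSDRDGRFIGSF"
midline = "VKLW VPTSIVSDRDGRFIGSF"
identical, positives, pct = midline_stats(midline, len(query))
print(identical, positives, round(pct, 2))  # 21 21 95.45
```

To reproduce a hit's overall figure, the same counts would be summed over all segments of that alignment before dividing by the total aligned length.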


Sequences
CDS sequence
ATGGAAGAAGGGAAGATTGCTGCGATACGCGACTGGGCAATGCCGAAATCAGTCTCGGAGTTACGCTCCTTTCTCGGGCTGGCAAATTACTATCGTCGATTTGTCGAGGG
ATTCTCCAAACGAGCAAGCCCACTGATTGAACTACTGAAAAAAGAAGTTCACTGGAATTGGGACCCCGACTGCCAAGCCGCCTTCGACGGCCTAAAAGAAGCCATGATGG
AAGGGCCACTTCTAGGGATTGAGGATGTGACCAAACCTTTCGAAGTTGAGACAGATGCGTCTGATTATGCATTGGGGGGTGTGCTCCTACAGAATGGGCACCCGATTGCA
TACGAAAGTCGAAAATTGAATGCAGGCGAAAGGAGGTATACTGTGTCCGAAAAAGAAATGCTTGCAGTAGTACATTGTTTGAGGGCCTGGAGACAATACCTACTAGGTTC
GTCGTTTGTAGTGAAGACGGACAACAGTGCAACCTGCCACTTCTTTACCCAGCCAAAGTTGACTTGGAAACAAGCAAGATGGCAGGAATTTCTGGCCGATTTCGACTTCG
AATTTGAACACAAGAAGGGGTCGAGCAACCAAGCTGCTGATGCTCTAAGTCGAAAACAAGAACATGCATCCATATGCCTGTTAGCTCACCTCCGGGGGAGCGAGATTGGC
GGGTCGGTTAGAGACACCTTGAGAGAGTTCCTACAGAAAGATCATGCCGCTCAAAATGTCATGAATTTAGTGAAGGCGGGCAAAACACGACAGTTTTGGGTCGAGGAAGA
CTTGTTAGTCACAAAGGGCAATCGACTATATGTTCCAAGAGCAGGGGACTTAAGGAAGAAATTGTTGTATGAGTGTCACAACACTCTATGGGCTGGCCATCCCGGATGGC
AGCGGACGTACGCCCTGTTGAAGAAGGGCTACTTTTGGCCGAATATGAGAGATGATGTAATGCAGTACACTAAGACGTGTCTCATCTGCCAACAAGATAAAGTAGAGAAA
GTGAAGGTTGCTGGACTTCTCGACCCTCTACCGGTTCCAACAAGACCTTTGGAGAGTGTCTCTATGGACTTCATCATCCATCTTCCTAAGGTAAGCGACTTTGAAGCCAT
CTTAGTCATCATTGATCGTTTTTCAAAGTACGCCACCTTCATCCCCACCACCAAGCAGTGTTCAGCAGAAATGACAGCTCAATTGTTCTTTAAGCATGTTGTTAAGTTGT
GGGAAGTCCCGACAAGTATAGTGAGTGACAGGGATGGTAGATTCATTGGCTCCTTCTAG
mRNA sequence
ATGGAAGAAGGGAAGATTGCTGCGATACGCGACTGGGCAATGCCGAAATCAGTCTCGGAGTTACGCTCCTTTCTCGGGCTGGCAAATTACTATCGTCGATTTGTCGAGGG
ATTCTCCAAACGAGCAAGCCCACTGATTGAACTACTGAAAAAAGAAGTTCACTGGAATTGGGACCCCGACTGCCAAGCCGCCTTCGACGGCCTAAAAGAAGCCATGATGG
AAGGGCCACTTCTAGGGATTGAGGATGTGACCAAACCTTTCGAAGTTGAGACAGATGCGTCTGATTATGCATTGGGGGGTGTGCTCCTACAGAATGGGCACCCGATTGCA
TACGAAAGTCGAAAATTGAATGCAGGCGAAAGGAGGTATACTGTGTCCGAAAAAGAAATGCTTGCAGTAGTACATTGTTTGAGGGCCTGGAGACAATACCTACTAGGTTC
GTCGTTTGTAGTGAAGACGGACAACAGTGCAACCTGCCACTTCTTTACCCAGCCAAAGTTGACTTGGAAACAAGCAAGATGGCAGGAATTTCTGGCCGATTTCGACTTCG
AATTTGAACACAAGAAGGGGTCGAGCAACCAAGCTGCTGATGCTCTAAGTCGAAAACAAGAACATGCATCCATATGCCTGTTAGCTCACCTCCGGGGGAGCGAGATTGGC
GGGTCGGTTAGAGACACCTTGAGAGAGTTCCTACAGAAAGATCATGCCGCTCAAAATGTCATGAATTTAGTGAAGGCGGGCAAAACACGACAGTTTTGGGTCGAGGAAGA
CTTGTTAGTCACAAAGGGCAATCGACTATATGTTCCAAGAGCAGGGGACTTAAGGAAGAAATTGTTGTATGAGTGTCACAACACTCTATGGGCTGGCCATCCCGGATGGC
AGCGGACGTACGCCCTGTTGAAGAAGGGCTACTTTTGGCCGAATATGAGAGATGATGTAATGCAGTACACTAAGACGTGTCTCATCTGCCAACAAGATAAAGTAGAGAAA
GTGAAGGTTGCTGGACTTCTCGACCCTCTACCGGTTCCAACAAGACCTTTGGAGAGTGTCTCTATGGACTTCATCATCCATCTTCCTAAGGTAAGCGACTTTGAAGCCAT
CTTAGTCATCATTGATCGTTTTTCAAAGTACGCCACCTTCATCCCCACCACCAAGCAGTGTTCAGCAGAAATGACAGCTCAATTGTTCTTTAAGCATGTTGTTAAGTTGT
GGGAAGTCCCGACAAGTATAGTGAGTGACAGGGATGGTAGATTCATTGGCTCCTTCTAG
Protein sequence
MEEGKIAAIRDWAMPKSVSELRSFLGLANYYRRFVEGFSKRASPLIELLKKEVHWNWDPDCQAAFDGLKEAMMEGPLLGIEDVTKPFEVETDASDYALGGVLLQNGHPIA
YESRKLNAGERRYTVSEKEMLAVVHCLRAWRQYLLGSSFVVKTDNSATCHFFTQPKLTWKQARWQEFLADFDFEFEHKKGSSNQAADALSRKQEHASICLLAHLRGSEIG
GSVRDTLREFLQKDHAAQNVMNLVKAGKTRQFWVEEDLLVTKGNRLYVPRAGDLRKKLLYECHNTLWAGHPGWQRTYALLKKGYFWPNMRDDVMQYTKTCLICQQDKVEK
VKVAGLLDPLPVPTRPLESVSMDFIIHLPKVSDFEAILVIIDRFSKYATFIPTTKQCSAEMTAQLFFKHVVKLWEVPTSIVSDRDGRFIGSF
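
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code and that the CDS starts in frame. The `translate` function and the codon table are written out here for illustration and are not taken from the database. Only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon is not part of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGAAGAAGGGAAGATTGCTGCGATACGC"))  # MEEGKIAAIR
print(translate("GGCTCCTTCTAG"))                    # GSF
```

Both fragments agree with the start (`MEEGKIAAIR`) and end (`GSF`) of the protein sequence listed above.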