; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036405 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036405
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr3:45897932..45907374
RNA-Seq ExpressionLag0036405
SyntenyLag0036405
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]4.9e-13755.89Show/hide
Query:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA
        GS   +   K  ++  KA I+  E         ND        L E   DV++V+M +  A++  MAEM+  IN L+K ++E+D +IA LK Q++ +  A
Subjt:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA

Query:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE
        ESSQT                               QLQDMITN IRAQYGGP+Q S +YSKPYTK IDNLR P+GYQPPKFQQFDGKGNPKQH+AHFVE
Subjt:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE

Query:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC
        TCENA                           PES++SWE+LE+EFLNRFYSTRRTV M ELTNTKQRKGE V++YIN WRA+SLDCKDRLTELS+VEMC
Subjt:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC

Query:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL
         QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESMVVN T  K S       ++K   +    LTL
Subjt:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL

Query:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI
        KERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA+E +IELDLEEVAQ+N A +
Subjt:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]1.4e-13655.69Show/hide
Query:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA
        GS   +   K  ++  KA I+  E         ND        L E   DV++V+M +  A++  MAEM+  IN L+K ++E+D +IA LK Q++ +  A
Subjt:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA

Query:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE
        ESSQT                               QLQDMIT+ IRAQYGGP+Q S +YSKPYTK IDNLR P+GYQPPKFQQFDGKGNPKQH+AHFVE
Subjt:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE

Query:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC
        TCENA                           PES++SWE+LE+EFLNRFYSTRRTV M ELTNTKQRKGE V++YIN WRA+SLDCKDRLTELS+VEMC
Subjt:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC

Query:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL
         QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESMVVN T  K S       ++K   +    LTL
Subjt:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL

Query:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI
        KERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA+E +IELDLEEVAQ+N A +
Subjt:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]1.9e-13655.69Show/hide
Query:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA
        GS   +   K  ++  KA I+  E         ND        L E   DV++V+M +  A++  MAEM+  IN L+K ++E+D +IA LK Q++ +  A
Subjt:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA

Query:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE
        ESSQT                               QLQDMIT+ IRAQYGGP+Q S +YSKPYTK IDNLR P+GYQPPKFQQFDGKGNPKQH+AHFVE
Subjt:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE

Query:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC
        TCENA                           PES++SWE+LE+EFLNRFYSTRRTV M ELTNTKQRKGE V++YIN WRA+SLDCKDRLTELS+VEMC
Subjt:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC

Query:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL
         QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESMVVN T  K S       ++K   +    LTL
Subjt:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL

Query:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI
        KERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA+E +IELDLEEVAQ+N A +
Subjt:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]6.4e-13755.69Show/hide
Query:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA
        GS   +   K  ++  KA I+  E         ND        L E   DV++V+M +  A++  MAEM+  IN L+K ++E+D +IA LK Q++ +  A
Subjt:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA

Query:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE
        ESSQT                               QLQDMIT+ IRAQYGGP+Q S +YSKPYTK IDNLR P+GYQPPKFQQFDGKGNPKQH+AHFVE
Subjt:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE

Query:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC
        TCENA                           PES++SWE+LE+EFLNRFYSTRRTV M ELTNTKQRKGE V++YIN WRA+SLDCKDRLTELS+VEMC
Subjt:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC

Query:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL
         QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T++ESMVVN T  K S       ++K   +    LTL
Subjt:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL

Query:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI
        KERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA+E +IELDLEEVAQ+N A +
Subjt:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]4.9e-13755.89Show/hide
Query:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA
        GS   +   K  ++  KA I+  E         ND        L E   DV++V+M +  A++  MAEM+  IN L+K ++E+D +IA LK Q++ +  A
Subjt:  GSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIA

Query:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE
        ESSQT                               QLQDMITN IRAQYGGP+Q S +YSKPYTK IDNLR P+GYQPPKFQQFDGKGNPKQH+AHFVE
Subjt:  ESSQT-------------------------------QLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVE

Query:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC
        TCENA                           PES++SWE+LE+EFLNRFYSTRRTV M ELTNTKQRKGE V++YIN WRA+SLDCKDRLTELS+VEMC
Subjt:  TCENA--------------------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMC

Query:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL
         QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESMVVN T  K S       ++K   +    LTL
Subjt:  IQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRN-------DEETIEESMVVNITLPKLSS------KEKRQTNGVHHLTL

Query:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI
        KERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA+E +IELDLEEVAQ+N A +
Subjt:  KERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7TST6 Ty3-gypsy retrotransposon protein8.2e-13053.86Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIAESSQ----------
        K  I+  E   +   Y++ K      P  E  P S++++V++T     + RMAE+++ +N L+K +EE+D +IA LK  IE++  AESS           
Subjt:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIAESSQ----------

Query:  --------------------TQLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------
                             QLQ+MI + I+ QYGGP Q   LYSKPYTK IDNLR   GYQPPKFQQFDGKGNPKQH+AHF+ETCE A          
Subjt:  --------------------TQLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------

Query:  ----------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
                         PES+D+WE+LER+FLNRFYSTRR V M ELTNT+Q+KGELV++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  ----------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSK-------EKRQTNGVHHLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSI +RE +D L+P  R +    ++T       I+ESMVV+ T  K  SK        K   N     TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSK-------EKRQTNGVHHLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATIK
        + DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD++EVAQ+N   I+
Subjt:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATIK

A0A5A7TZU9 Ribonuclease H1.3e-13257.66Show/hide
Query:  PDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIAESSQT------------------------------QLQDMITNCIR
        P  ++++V++T+    ++RMAE+++ +N L+KA+EE+D +IA LK  IE++  AESS T                              QLQ+MI N I+
Subjt:  PDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIAESSQT------------------------------QLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------VPESVDSWEELEREFL
         QYGGP Q   LYSKPYTK IDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                           PES+DSWE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------VPESVDSWEELEREFL

Query:  NRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNM
        NRFYSTRR V M ELT TKQRKGE V++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N D+L+P +
Subjt:  NRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNM

Query:  RKEGRNDEET-------IEESMVVNITLPKLSSK----EKRQTNG-VHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+MVV+ T  KL SK    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESMVVNITLPKLSSK----EKRQTNG-VHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI
        KYHRVI HPVE+CFVLK+LILKLA + KIEL+L++VAQ+N A +
Subjt:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATI

A0A5A7UXF0 Ty3-gypsy retrotransposon protein2.8e-13054.28Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYNPLFEPDS--DVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIAESSQ----------
        K  I+  E   +   Y++ K      P  E  S  ++++V++T     + RMAE+++ +N L+K +EE+D +IA LK  IE++  AESS           
Subjt:  KASIIASEETTLQGAYTNDKFLVKYNPLFEPDS--DVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIAESSQ----------

Query:  --------------------TQLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------
                             QLQ+MI + I+ QYGGP Q   LYSKPYTK IDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE A          
Subjt:  --------------------TQLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------

Query:  ----------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
                         PES+D+WE+LER+FLNRFYSTRR V M ELTNT+Q+KGELV+NYIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  ----------------VPESVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEKRQTNGVHH-------LTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESMVV+ T  K  SK K       H        TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEKRQTNGVHH-------LTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATIK
        + DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD++EVAQ+N   I+
Subjt:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATIK

A0A5A7VI06 Reverse transcriptase1.4e-12980.53Show/hide
Query:  ELYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTP
        +LYVDRIVSQHGV MSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHP TDGQSERTIQTLEDMLRACVL FKG+WDTHLSLMEFAYNN+Y+SSI MT 
Subjt:  ELYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTP

Query:  FEALYGRPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE-----------------------------------IIE
        FEALYGRPCRTPVCWN VGERKLVGPEL+QVTSDNIKLI+ENLKIAQDRQKSYADKRRRDLE                                   II+
Subjt:  FEALYGRPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE-----------------------------------IIE

Query:  RVGPAAYRLELPVELARIHDVFHVSMLRKYIPDPSHVLQVQPIELKEDLSYKEEVVQILDRKEQVLRNKTIPLVKVLWRHHGVEEATWESEDQM-RSYPT
        RVG AAYRLELP ELARIHDVFHVSMLRKYI DPSHV QVQPIELKEDLSY+EE VQILDRKEQVLRNKTI L++VLWRHHG+EEATWESEDQM RSYPT
Subjt:  RVGPAAYRLELPVELARIHDVFHVSMLRKYIPDPSHVLQVQPIELKEDLSYKEEVVQILDRKEQVLRNKTIPLVKVLWRHHGVEEATWESEDQM-RSYPT

Query:  LFT
        LFT
Subjt:  LFT

A0A5D3D4X3 Ty3-gypsy retrotransposon protein8.2e-13055.65Show/hide
Query:  KYNPLFE------PDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIAESSQ-----------------------------
        K+N L E      P  ++++V++T+    ++RMAE+++ +N L+K +EE+D +IA LK  IE++  AESS                              
Subjt:  KYNPLFE------PDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQIENQHIAESSQ-----------------------------

Query:  -TQLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------VPE
          QLQ+MI + I+ QYGGP Q   LYSKPYTK IDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                           PE
Subjt:  -TQLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------VPE

Query:  SVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI
        S+D+WE+LER+FLNRFYSTRR V M ELTNT+Q+KGELV++YIN WRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI
Subjt:  SVDSWEELEREFLNRFYSTRRTVIMFELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSI

Query:  ASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEK-----RQTNG--VHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKC
        A+R  +D L+P  R +    ++T       I+ESMVV+ T  K  SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+C
Subjt:  ASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEK-----RQTNG--VHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKC

Query:  KRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATIK
        KRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD++EVAQ+N   I+
Subjt:  KRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATIK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.5e-1927.95Show/hide
Query:  LYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPF
        ++  R+++  G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+SL++ +YNN+  S+  MTPF
Subjt:  LYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPF

Query:  EALYG-RPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE---------------------------------IIERV
        E ++   P  +P+      ++     E  Q T    + ++E+L     + K Y D + +++E                                 ++++ 
Subjt:  EALYG-RPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE---------------------------------IIERV

Query:  GPAAYRLELPVELARI-HDVFHVSMLRKY
        GP  Y L+LP  +  +    FHVS L KY
Subjt:  GPAAYRLELPVELARI-HDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein1.5e-1927.95Show/hide
Query:  LYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPF
        ++  R+++  G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+SL++ +YNN+  S+  MTPF
Subjt:  LYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPF

Query:  EALYG-RPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE---------------------------------IIERV
        E ++   P  +P+      ++     E  Q T    + ++E+L     + K Y D + +++E                                 ++++ 
Subjt:  EALYG-RPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE---------------------------------IIERV

Query:  GPAAYRLELPVELARI-HDVFHVSMLRKY
        GP  Y L+LP  +  +    FHVS L KY
Subjt:  GPAAYRLELPVELARI-HDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein1.5e-1927.95Show/hide
Query:  LYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPF
        ++  R+++  G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+SL++ +YNN+  S+  MTPF
Subjt:  LYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPF

Query:  EALYG-RPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE---------------------------------IIERV
        E ++   P  +P+      ++     E  Q T    + ++E+L     + K Y D + +++E                                 ++++ 
Subjt:  EALYG-RPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE---------------------------------IIERV

Query:  GPAAYRLELPVELARI-HDVFHVSMLRKY
        GP  Y L+LP  +  +    FHVS L KY
Subjt:  GPAAYRLELPVELARI-HDVFHVSMLRKY

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.7e-2029.37Show/hide
Query:  IVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPFEALYG
        I S HG P +I SDRD R T+  +  + K +G K   S+A HPQTDGQSERTIQTL  +LRA       +W  +L  +EF YN++   ++G +PFE   G
Subjt:  IVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPFEALYG

Query:  RPCRTPVCW--NIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDL--------------------------------EIIERVGPAAY
            TP     + V  R     EL +         +E L+ AQ   ++  ++RR+ L                                 +++++   AY
Subjt:  RPCRTPVCW--NIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDL--------------------------------EIIERVGPAAY

Query:  RLELPVELARIHDVFHVSMLRKYIPDPSHVLQVQPIELKEDLSYKEEVVQIL
         L+L     + H V +V  L+K++  P    + +PI   E +    EV  ++
Subjt:  RLELPVELARIHDVFHVSMLRKYIPDPSHVLQVQPIELKEDLSYKEEVVQIL

Q9UR07 Transposon Tf2-11 polyprotein1.5e-1927.95Show/hide
Query:  LYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPF
        ++  R+++  G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+SL++ +YNN+  S+  MTPF
Subjt:  LYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTAFHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPF

Query:  EALYG-RPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE---------------------------------IIERV
        E ++   P  +P+      ++     E  Q T    + ++E+L     + K Y D + +++E                                 ++++ 
Subjt:  EALYG-RPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADKRRRDLE---------------------------------IIERV

Query:  GPAAYRLELPVELARI-HDVFHVSMLRKY
        GP  Y L+LP  +  +    FHVS L KY
Subjt:  GPAAYRLELPVELARI-HDVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTCAAGACTTCTTCGATGGCCGCTGTCGAGATCAAGTCTTACATGGGTTCAACTGCCCATCGTTGCTTCAATAAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACCAATGACAAATTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCCTGATGA
CTGAGACAAGAGCTATGGATGAAAGAATGGCTGAGATGCAAGAGCACATCAACAACTTGATAAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAAGTGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACACAACTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTC
CAAACCTTATACTAAGATGATTGATAACTTGAGAACGCCAATCGGGTATCAACCACCAAAATTTCAGCAGTTTGATGGAAAGGGCAATCCTAAACAACACATTGCCCACT
TCGTTGAGACATGCGAGAACGCTGTACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTATCATGTTC
GAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCATTGGAGAGCCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCTGT
TGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGCGCCCATGATATGGAGCTAAGTA
TTGCTAGTCGAGAAAACCAAGACATTCTCCTCCCTAACATGAGAAAAGAAGGAAGAAACGATGAAGAGACTATAGAAGAATCTATGGTCGTCAACATAACCCTTCCCAAG
TTGTCTTCGAAAGAAAAGAGACAAACAAATGGAGTGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATATATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGA
ACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGGCCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATC
CAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGAAGAAGTAGCTCAATCAAATCTTGCTACAATCAAA
GGAAAGAGCAAGCATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGATCTGATCAAGACCATGAC
AAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGTTTGAAGGTTCCCACATTGCGCTGTT
GTGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGCTGCGCTGCTACCTTCCTCCAAGTTCGAAGGTTTTCATGCGGTTTGTTGCAGTTCCTTCTCTCCAAGTTCAAAA
GGTTCTCATGCATTCGGCTACAGTTCATTTCCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGTTGCAGTTCCTTCTCTCCAAGTTCGAAGGTGTTCTCATGCGTGCCGC
TGCAGTTCCTTCTCTCCAAATTCGAAGTTCCTTCCTCCTAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTAATTCTCTCCAAGTTCAAAAGGTTCTCACGCATTCGGC
TACAGTTCATTTCCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGTTCCTTCCTCCCAAATTCGAAGGTTCTCACGACGCTC
CGCTGCAGTTCCTTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCGCTGCAGTTCCTTCGCTTTCGCTGCAGTTCCTTCTCTCCGAGTTTGAAGGTTCTCATGACGTT
TCGTTGCAGTTCCTTCCTCCCAAATTCGAAGGTTCTCACGACGCTCCGCTGTAGTTCCTTCTCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCTTCTCT
CCAAGTTCGAAGGTGTTCTCGTGCGCTTCGCTGCAGTTCCTTCCTCCCAAATTCGAAGGTTCTCTCACGCGCTTCGTTACAGTTCCTTCTCTCCAAGTATGAAGGTTCTC
TCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGTTGTAGTCCTTCTCTCCAAGTATGAAGGTTCGCC
ACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGGCCTCGTTCCACTCCATCTTCAAATGTTGGCAGTTGACG
GCGTCCGCTTCGCTTCATCTTCAAAAAAATTGACTGTTGATAACTTTACTTCATATTCAAAAGTTGACGGTGATGAAGTCACTGCAAGTGAATCTGATGACGACCGTTGT
AGGCGAGTCGAGTCTGGTGACCACCCTTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCC
AATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCAATAAAATGGGGACTGGGTCTAGCAGGAGTGC
ATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGACCCAAGCCCAATAAA
GCCCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGTTCTCCTCCATTTCAAGGGTTCAGAAATTCTACACTCTCA
CAAAGACAAGAGTTTAGAGTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGACATTAAACCCAACGAGCTATACGTCGACAGGATTGTGAGTC
AGCATGGAGTGCCAATGTCCATAGTTTCAGATAGGGATCCAAGGTTTACTTCTAAGTTTTGGCCTAGTGTGCAGAAAGCAATGGGAACAAAGTTGAAGTTCAGTACAGCG
TTCCATCCCCAGACAGATGGTCAGTCAGAAAGGACCATCCAGACCTTAGAGGACATGTTGAGAGCATGTGTCCTTAATTTTAAGGGAAGTTGGGATACCCACTTATCACT
TATGGAGTTCGCTTATAATAACAGCTACAAATCTAGTATCGGCATGACACCATTCGAGGCTTTGTATGGCAGACCATGCAGGACTCCTGTGTGCTGGAACATAGTGGGAG
AGCGAAAGCTAGTTGGTCCTGAGTTGTTACAAGTTACGTCAGACAATATTAAACTGATTAGAGAGAACCTGAAAATAGCTCAAGATCGACAGAAGAGCTATGCAGATAAG
CGACGAAGAGACTTAGAGATAATAGAACGAGTTGGACCAGCAGCCTATAGACTTGAGTTGCCAGTGGAACTCGCTCGAATACATGATGTTTTTCATGTGTCCATGTTGAG
GAAATATATTCCAGATCCATCCCATGTGTTGCAAGTGCAACCGATTGAGCTGAAAGAAGACTTGAGTTATAAAGAGGAAGTGGTTCAGATCCTCGACAGAAAGGAGCAAG
TTTTGAGGAACAAAACGATCCCACTCGTGAAAGTTCTTTGGAGACATCACGGAGTGGAGGAGGCAACTTGGGAGTCAGAAGATCAAATGAGGAGTTACCCGACACTCTTC
ACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTCAAGACTTCTTCGATGGCCGCTGTCGAGATCAAGTCTTACATGGGTTCAACTGCCCATCGTTGCTTCAATAAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACCAATGACAAATTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCCTGATGA
CTGAGACAAGAGCTATGGATGAAAGAATGGCTGAGATGCAAGAGCACATCAACAACTTGATAAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAAGTGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACACAACTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTC
CAAACCTTATACTAAGATGATTGATAACTTGAGAACGCCAATCGGGTATCAACCACCAAAATTTCAGCAGTTTGATGGAAAGGGCAATCCTAAACAACACATTGCCCACT
TCGTTGAGACATGCGAGAACGCTGTACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTATCATGTTC
GAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCATTGGAGAGCCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCTGT
TGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGCGCCCATGATATGGAGCTAAGTA
TTGCTAGTCGAGAAAACCAAGACATTCTCCTCCCTAACATGAGAAAAGAAGGAAGAAACGATGAAGAGACTATAGAAGAATCTATGGTCGTCAACATAACCCTTCCCAAG
TTGTCTTCGAAAGAAAAGAGACAAACAAATGGAGTGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATATATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGA
ACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGGCCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATC
CAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGAAGAAGTAGCTCAATCAAATCTTGCTACAATCAAA
GGAAAGAGCAAGCATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGATCTGATCAAGACCATGAC
AAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGTTTGAAGGTTCCCACATTGCGCTGTT
GTGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGCTGCGCTGCTACCTTCCTCCAAGTTCGAAGGTTTTCATGCGGTTTGTTGCAGTTCCTTCTCTCCAAGTTCAAAA
GGTTCTCATGCATTCGGCTACAGTTCATTTCCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGTTGCAGTTCCTTCTCTCCAAGTTCGAAGGTGTTCTCATGCGTGCCGC
TGCAGTTCCTTCTCTCCAAATTCGAAGTTCCTTCCTCCTAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTAATTCTCTCCAAGTTCAAAAGGTTCTCACGCATTCGGC
TACAGTTCATTTCCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGTTCCTTCCTCCCAAATTCGAAGGTTCTCACGACGCTC
CGCTGCAGTTCCTTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCGCTGCAGTTCCTTCGCTTTCGCTGCAGTTCCTTCTCTCCGAGTTTGAAGGTTCTCATGACGTT
TCGTTGCAGTTCCTTCCTCCCAAATTCGAAGGTTCTCACGACGCTCCGCTGTAGTTCCTTCTCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCTTCTCT
CCAAGTTCGAAGGTGTTCTCGTGCGCTTCGCTGCAGTTCCTTCCTCCCAAATTCGAAGGTTCTCTCACGCGCTTCGTTACAGTTCCTTCTCTCCAAGTATGAAGGTTCTC
TCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGTTGTAGTCCTTCTCTCCAAGTATGAAGGTTCGCC
ACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGGCCTCGTTCCACTCCATCTTCAAATGTTGGCAGTTGACG
GCGTCCGCTTCGCTTCATCTTCAAAAAAATTGACTGTTGATAACTTTACTTCATATTCAAAAGTTGACGGTGATGAAGTCACTGCAAGTGAATCTGATGACGACCGTTGT
AGGCGAGTCGAGTCTGGTGACCACCCTTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCC
AATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCAATAAAATGGGGACTGGGTCTAGCAGGAGTGC
ATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGACCCAAGCCCAATAAA
GCCCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGTTCTCCTCCATTTCAAGGGTTCAGAAATTCTACACTCTCA
CAAAGACAAGAGTTTAGAGTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGACATTAAACCCAACGAGCTATACGTCGACAGGATTGTGAGTC
AGCATGGAGTGCCAATGTCCATAGTTTCAGATAGGGATCCAAGGTTTACTTCTAAGTTTTGGCCTAGTGTGCAGAAAGCAATGGGAACAAAGTTGAAGTTCAGTACAGCG
TTCCATCCCCAGACAGATGGTCAGTCAGAAAGGACCATCCAGACCTTAGAGGACATGTTGAGAGCATGTGTCCTTAATTTTAAGGGAAGTTGGGATACCCACTTATCACT
TATGGAGTTCGCTTATAATAACAGCTACAAATCTAGTATCGGCATGACACCATTCGAGGCTTTGTATGGCAGACCATGCAGGACTCCTGTGTGCTGGAACATAGTGGGAG
AGCGAAAGCTAGTTGGTCCTGAGTTGTTACAAGTTACGTCAGACAATATTAAACTGATTAGAGAGAACCTGAAAATAGCTCAAGATCGACAGAAGAGCTATGCAGATAAG
CGACGAAGAGACTTAGAGATAATAGAACGAGTTGGACCAGCAGCCTATAGACTTGAGTTGCCAGTGGAACTCGCTCGAATACATGATGTTTTTCATGTGTCCATGTTGAG
GAAATATATTCCAGATCCATCCCATGTGTTGCAAGTGCAACCGATTGAGCTGAAAGAAGACTTGAGTTATAAAGAGGAAGTGGTTCAGATCCTCGACAGAAAGGAGCAAG
TTTTGAGGAACAAAACGATCCCACTCGTGAAAGTTCTTTGGAGACATCACGGAGTGGAGGAGGCAACTTGGGAGTCAGAAGATCAAATGAGGAGTTACCCGACACTCTTC
ACCTAA
Protein sequenceShow/hide protein sequence
MSFKTSSMAAVEIKSYMGSTAHRCFNKLRLQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDVVTVLMTETRAMDERMAEMQEHINNLIKAIEEKDSQIAQLKCQ
IENQHIAESSQTQLQDMITNCIRAQYGGPTQDSLLYSKPYTKMIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAVPESVDSWEELEREFLNRFYSTRRTVIMF
ELTNTKQRKGELVVNYINHWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEETIEESMVVNITLPK
LSSKEKRQTNGVHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLEEVAQSNLATIK
GKSKHQRKKDPKKLQPKRKRSKKFSQPRQPDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSLKVPTLRCCAASFSKFEGSDAALLPSSKFEGFHAVCCSSFSPSSK
GSHAFGYSSFPPNLKVFSRAPLQFLLSKFEGVLMRAAAVPSLQIRSSFLLVRRFSCALLQLILSKFKRFSRIRLQFISSKFEGVLTRAAAVPSLQVRSSFLPNSKVLTTL
RCSSFSLQIRRFSRAPLQFLRFRCSSFSPSLKVLMTFRCSSFLPNSKVLTTLRCSSFSPNLKVFSRAPLQFLLSKFEGVLVRFAAVPSSQIRRFSHALRYSSFSPSMKVL
SSKSKVLTLLHYSSFLQVRRFSRCFVVVLLSKYEGSPLRFSFSKFEGSPLLLFKCLAAVDGLVPLHLQMLAVDGVRFASSSKKLTVDNFTSYSKVDGDEVTASESDDDRC
RRVESGDHPCRLLRSPNKMGTGLAGVHEGESGYSDHPIKWGLGLAGVHEGESGDYPCRLLRSPIKWGLGLAGVHEGESGYSDHPIKWGLGLAGVHEGESGDYPCRPKPNK
AQAQVVRPKSHQGPSSENSINRGVLLHFKGSEILHSHKDKSLEFQSSQAEPENSERLHQDIKPNELYVDRIVSQHGVPMSIVSDRDPRFTSKFWPSVQKAMGTKLKFSTA
FHPQTDGQSERTIQTLEDMLRACVLNFKGSWDTHLSLMEFAYNNSYKSSIGMTPFEALYGRPCRTPVCWNIVGERKLVGPELLQVTSDNIKLIRENLKIAQDRQKSYADK
RRRDLEIIERVGPAAYRLELPVELARIHDVFHVSMLRKYIPDPSHVLQVQPIELKEDLSYKEEVVQILDRKEQVLRNKTIPLVKVLWRHHGVEEATWESEDQMRSYPTLF
T