; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G22590 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G22590
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr1:18154479..18156662
RNA-Seq ExpressionCSPI01G22590
SyntenyCSPI01G22590
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0050896 - response to stimulus (biological process)
GO:0000166 - nucleotide binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043826.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]4.1e-13966.98Show/hide
Query:  PVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKE-----DMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDAC
        PVVRHSSIRLILSI VHFDMFIEQMDVTTTFLHGELE+VIYMAQPKGYEVKGKE     DMVCRLHKS+YGL+QSPRQWYIRFDTFILKQG         
Subjt:  PVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKE-----DMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDAC

Query:  VYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKL
                 T  +++L                 LKK  +S +      +LKRILGMDVKRD+EKGLLT+SQESYVIKLLEKYNMS  KAVSTPLASHF+L
Subjt:  VYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKL

Query:  SSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMS------MISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALSTTESEYISLG-
        SSSQCPV +QER+EMSNIPYCNAVGSIMYLMICTRPDLG+AMS        SRDCDKS LL+GFTD DY ADLDKRS        + L    +EYISLG 
Subjt:  SSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMS------MISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALSTTESEYISLG-

Query:  ---EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH---SSCTTVNDARLR
           EAVWLKRIVGELLSQEFIPIIHCDSQ+AIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVH  ENLSDMLTK L AH   ++C         
Subjt:  ---EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH---SSCTTVNDARLR

Query:  HYSIVQRGSLGLSWHFVDGIVGLVVVD
         ++   R +      F  G+ G + VD
Subjt:  HYSIVQRGSLGLSWHFVDGIVGLVVVD

KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]2.8e-18078.82Show/hide
Query:  RLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDACVYWKLSQRGTFIY
        RLILSI VHFDMFIEQMDVTTTFLHGELE+VIYMAQPKGYEVKGKEDMVCRLHKS+YGL+QSPRQWYI FDTFILKQGFHRNSYDACVYWK SQ+GT+IY
Subjt:  RLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDACVYWKLSQRGTFIY

Query:  LLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIEQERL
        LLLYVDDMILVSKDYA IC+LKKQLS+EFEMKDLG+LKRILGMDVKRDREKGLLT+SQESYVIKLLEKYNMS  KAVSTPLASHF+LSSSQCPV +QER+
Subjt:  LLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIEQERL

Query:  EMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGFTDVDYDADLDKR-----------
        EMSNIPYCNAVGSIMYLMICTRPDLG+AMSMI                                SRDCDKS LL+GFTD DY ADLDKR           
Subjt:  EMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGFTDVDYDADLDKR-----------

Query:  ----SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDE
            SWKV LQPVVALSTTESEYISLG    EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVH  E
Subjt:  ----SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDE

Query:  NLSDMLTKVLPAHSSCTTVNDARLR
        NLSDMLTK L AH    T    + R
Subjt:  NLSDMLTKVLPAHSSCTTVNDARLR

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]7.7e-19481.42Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        GYTQKEGVDF+EIFSPVVRHSSIRLILSI VHFDMFIEQMDVTT FLHGELE+VIYMAQPKGYEVKGKEDMVCRLHKS+YGL+QSPRQWYIRFDTFILKQ
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV
        GFHRNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+EFEMKDLG+LKRILGMDVKRD+EKGLLT+SQESYVIKLLEKYNMS  KAV
Subjt:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV

Query:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGF
        STPLASHF+LSSSQCPV +QER+EMSNIPYCNAVGSIMYLMICTRPDLG+AMSMI                                SRDCDKS LL+GF
Subjt:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGF

Query:  TDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
        TD DY ADLDKR               SWKV LQPVVALSTTESEYISLG    EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
Subjt:  TDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV

Query:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH
        KFHYIRNVIAQKDVELVKVH  ENLSDMLTK L AH
Subjt:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa]1.2e-19481.65Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        GYTQKEGVDF+EIFSPVVRHSSIRLILSI VHFDMFIEQMDVTT FLHGELE+VIYMAQPKGYEVKGKEDMVCRLHKS+YGL+QSPRQWYIRFDTFILKQ
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV
        GFHRNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+EFEMKDLG+LKRILGMDVKRD+EKGLLT+SQESYVIKLLEKYNMSG KAV
Subjt:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV

Query:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGF
        STPLASHF+LSSSQCPV +QER+EMSNIPYCNAVGSIMYLMICTRPDLG+AMSMI                                SRDCDKS LL+GF
Subjt:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGF

Query:  TDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
        TD DY ADLDKR               SWKV LQPVVALSTTESEYISLG    EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
Subjt:  TDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV

Query:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH
        KFHYIRNVIAQKDVELVKVH  ENLSDMLTK L AH
Subjt:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]4.8e-18885.64Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        GYTQKEGVDF+EIFSPVVRHSSIRLILSI VHFDMFIEQMDVTT FLHGELE+VIYMAQPKGYEVKGKEDMVCRLHKS+YGL+QSPRQWYIRFDTFILKQ
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV
        GFHRNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+EFEMKDLG+LKRILGMDVKRD+EKGLLT+SQESYVIKLLEKYNMS  KAV
Subjt:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV

Query:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMS------MISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALST
        STPLASHF+LSSSQCPV +QER+EMSNIPYCNAVGSIMYLMICTRPDLG+AMS        SRDCDKS LL+GFTD DY ADLDKRS        + L  
Subjt:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMS------MISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALST

Query:  TESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAHSS
          +EYISLG    EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVH  ENLSDMLTK L AH S
Subjt:  TESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAHSS

TrEMBL top hitse value%identityAlignment
A0A5A7TP18 Putative gag-pol polyprotein2.0e-13966.98Show/hide
Query:  PVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKE-----DMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDAC
        PVVRHSSIRLILSI VHFDMFIEQMDVTTTFLHGELE+VIYMAQPKGYEVKGKE     DMVCRLHKS+YGL+QSPRQWYIRFDTFILKQG         
Subjt:  PVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKE-----DMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDAC

Query:  VYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKL
                 T  +++L                 LKK  +S +      +LKRILGMDVKRD+EKGLLT+SQESYVIKLLEKYNMS  KAVSTPLASHF+L
Subjt:  VYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKL

Query:  SSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMS------MISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALSTTESEYISLG-
        SSSQCPV +QER+EMSNIPYCNAVGSIMYLMICTRPDLG+AMS        SRDCDKS LL+GFTD DY ADLDKRS        + L    +EYISLG 
Subjt:  SSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMS------MISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALSTTESEYISLG-

Query:  ---EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH---SSCTTVNDARLR
           EAVWLKRIVGELLSQEFIPIIHCDSQ+AIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVH  ENLSDMLTK L AH   ++C         
Subjt:  ---EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH---SSCTTVNDARLR

Query:  HYSIVQRGSLGLSWHFVDGIVGLVVVD
         ++   R +      F  G+ G + VD
Subjt:  HYSIVQRGSLGLSWHFVDGIVGLVVVD

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class1.4e-18078.82Show/hide
Query:  RLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDACVYWKLSQRGTFIY
        RLILSI VHFDMFIEQMDVTTTFLHGELE+VIYMAQPKGYEVKGKEDMVCRLHKS+YGL+QSPRQWYI FDTFILKQGFHRNSYDACVYWK SQ+GT+IY
Subjt:  RLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDACVYWKLSQRGTFIY

Query:  LLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIEQERL
        LLLYVDDMILVSKDYA IC+LKKQLS+EFEMKDLG+LKRILGMDVKRDREKGLLT+SQESYVIKLLEKYNMS  KAVSTPLASHF+LSSSQCPV +QER+
Subjt:  LLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIEQERL

Query:  EMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGFTDVDYDADLDKR-----------
        EMSNIPYCNAVGSIMYLMICTRPDLG+AMSMI                                SRDCDKS LL+GFTD DY ADLDKR           
Subjt:  EMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGFTDVDYDADLDKR-----------

Query:  ----SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDE
            SWKV LQPVVALSTTESEYISLG    EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVH  E
Subjt:  ----SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDE

Query:  NLSDMLTKVLPAHSSCTTVNDARLR
        NLSDMLTK L AH    T    + R
Subjt:  NLSDMLTKVLPAHSSCTTVNDARLR

A0A5A7UB25 Putative gag-pol polyprotein3.7e-19481.42Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        GYTQKEGVDF+EIFSPVVRHSSIRLILSI VHFDMFIEQMDVTT FLHGELE+VIYMAQPKGYEVKGKEDMVCRLHKS+YGL+QSPRQWYIRFDTFILKQ
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV
        GFHRNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+EFEMKDLG+LKRILGMDVKRD+EKGLLT+SQESYVIKLLEKYNMS  KAV
Subjt:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV

Query:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGF
        STPLASHF+LSSSQCPV +QER+EMSNIPYCNAVGSIMYLMICTRPDLG+AMSMI                                SRDCDKS LL+GF
Subjt:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGF

Query:  TDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
        TD DY ADLDKR               SWKV LQPVVALSTTESEYISLG    EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
Subjt:  TDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV

Query:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH
        KFHYIRNVIAQKDVELVKVH  ENLSDMLTK L AH
Subjt:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH

A0A5D3CTV2 Putative polyprotein5.7e-19581.65Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        GYTQKEGVDF+EIFSPVVRHSSIRLILSI VHFDMFIEQMDVTT FLHGELE+VIYMAQPKGYEVKGKEDMVCRLHKS+YGL+QSPRQWYIRFDTFILKQ
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV
        GFHRNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+EFEMKDLG+LKRILGMDVKRD+EKGLLT+SQESYVIKLLEKYNMSG KAV
Subjt:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV

Query:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGF
        STPLASHF+LSSSQCPV +QER+EMSNIPYCNAVGSIMYLMICTRPDLG+AMSMI                                SRDCDKS LL+GF
Subjt:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMI--------------------------------SRDCDKSVLLKGF

Query:  TDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
        TD DY ADLDKR               SWKV LQPVVALSTTESEYISLG    EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
Subjt:  TDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV

Query:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH
        KFHYIRNVIAQKDVELVKVH  ENLSDMLTK L AH
Subjt:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAH

A0A5D3DNU1 Putative gag-pol polyprotein2.3e-18885.64Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        GYTQKEGVDF+EIFSPVVRHSSIRLILSI VHFDMFIEQMDVTT FLHGELE+VIYMAQPKGYEVKGKEDMVCRLHKS+YGL+QSPRQWYIRFDTFILKQ
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV
        GFHRNSYDACVYWK SQ+GT+IYLLLYVDDMILVSKDYAEIC+LKKQLS+EFEMKDLG+LKRILGMDVKRD+EKGLLT+SQESYVIKLLEKYNMS  KAV
Subjt:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV

Query:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMS------MISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALST
        STPLASHF+LSSSQCPV +QER+EMSNIPYCNAVGSIMYLMICTRPDLG+AMS        SRDCDKS LL+GFTD DY ADLDKRS        + L  
Subjt:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMS------MISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALST

Query:  TESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAHSS
          +EYISLG    EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVH  ENLSDMLTK L AH S
Subjt:  TESEYISLG----EAVWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAHSS

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.0e-6334.62Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        G+TQK  +D+ E F+PV R SS R ILS+V+ +++ + QMDV T FL+G L++ IYM  P+G  +    D VC+L+K+IYGL+Q+ R W+  F+  + + 
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRGTF---IYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGC
         F  +S D C+Y  +  +G     IY+LLYVDD+++ + D   +   K+ L  +F M DL ++K  +G+ ++   +K  + +SQ +YV K+L K+NM  C
Subjt:  GFHRNSYDACVYWKLSQRGTF---IYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGC

Query:  KAVSTPLAS--HFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR----------------------DCDKSVLLK-------
         AVSTPL S  +++L +S          E  N P  + +G +MY+M+CTRPDL  A++++SR                        D  ++ K       
Subjt:  KAVSTPLAS--HFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR----------------------DCDKSVLLK-------

Query:  ---GFTDVDY-DADLDKRS---------------WKVTLQPVVALSTTESEYISLGEAV----WLKRIVGELLSQEFIPI-IHCDSQSAIHLAKNPSHHE
           G+ D D+  +++D++S               W    Q  VA S+TE+EY++L EAV    WLK ++  +  +   PI I+ D+Q  I +A NPS H+
Subjt:  ---GFTDVDY-DADLDKRS---------------WKVTLQPVVALSTTESEYISLGEAV----WLKRIVGELLSQEFIPI-IHCDSQSAIHLAKNPSHHE

Query:  RSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPA
        R+KHID+K+H+ R  +    + L  +  +  L+D+ TK LPA
Subjt:  RSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.3e-10746.17Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        G+ QK+G+DF+EIFSPVV+ +SIR ILS+    D+ +EQ+DV T FLHG+LE+ IYM QP+G+EV GK+ MVC+L+KS+YGL+Q+PRQWY++FD+F+  Q
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV
         + +   D CVY+K      FI LLLYVDDM++V KD   I +LK  LS  F+MKDLG  ++ILGM + R+R    L +SQE Y+ ++LE++NM   K V
Subjt:  GFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAV

Query:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR-------------------------DC----DKSVLLKGFTDV
        STPLA H KLS   CP   +E+  M+ +PY +AVGS+MY M+CTRPD+ HA+ ++SR                         DC        +LKG+TD 
Subjt:  STPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR-------------------------DC----DKSVLLKGFTDV

Query:  DYDADLDKR---------------SWKVTLQPVVALSTTESEYISLGEA----VWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFH
        D   D+D R               SW+  LQ  VALSTTE+EYI+  E     +WLKR + EL   +   +++CDSQSAI L+KN  +H R+KHIDV++H
Subjt:  DYDADLDKR---------------SWKVTLQPVVALSTTESEYISLGEA----VWLKRIVGELLSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDVKFH

Query:  YIRNVIAQKDVELVKVHYDENLSDMLTKVLP
        +IR ++  + ++++K+  +EN +DMLTKV+P
Subjt:  YIRNVIAQKDVELVKVHYDENLSDMLTKVLP

P25600 Putative transposon Ty5-1 protein YCL074W3.0e-2328.9Show/hide
Query:  MDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYA
        MDV T FL+  +++ IY+ QP G+  +   D V  L+  +YGL+Q+P  W    +  + K GF R+  +  +Y++ +  G  IY+ +YVDD+++ +    
Subjt:  MDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYA

Query:  EICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMY
           ++K++L+  + MKDLGK+ + LG+++ +    G +T+S + Y+ K   +  ++  K   TPL +   L  +  P ++         PY + VG +++
Subjt:  EICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMY

Query:  LMICTRPDLGHAMSMISR
             RPD+ + +S++SR
Subjt:  LMICTRPDLGHAMSMISR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-5132.1Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        GY Q+ G+D+ E FSPV++ +SIR++L + V     I Q+DV   FL G L   +YM+QP G+  K + + VC+L K++YGL+Q+PR WY+    ++L  
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRG-TFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKA
        GF  +  D  ++  + QRG + +Y+L+YVDD+++   D   +      LS  F +KD  +L   LG++ KR      L +SQ  Y++ LL + NM   K 
Subjt:  GFHRNSYDACVYWKLSQRG-TFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKA

Query:  VSTPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR----------DCDKSVL--------------------LKGFT
        V+TP+A   KLS      +           Y   VGS+ YL   TRPD+ +A++ +S+             K +L                    L  ++
Subjt:  VSTPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR----------DCDKSVL--------------------LKGFT

Query:  DVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISL----GEAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV
        D D+  D D                 SW    Q  V  S+TE+EY S+     E  W+  ++ EL +     P+I+CD+  A +L  NP  H R KHI +
Subjt:  DVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISL----GEAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNPSHHERSKHIDV

Query:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVL
         +H+IRN +    + +V V   + L+D LTK L
Subjt:  KFHYIRNVIAQKDVELVKVHYDENLSDMLTKVL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.6e-5433.26Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ
        GY Q+ G+D+ E FSPV++ +SIR++L + V     I Q+DV   FL G L   +YM+QP G+  K + D VCRL K+IYGL+Q+PR WY+   T++L  
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQ

Query:  GFHRNSYDACVYWKLSQRG-TFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKA
        GF  +  D  ++  + QRG + IY+L+YVDD+++   D   +      LS  F +K+   L   LG++ KR  +   L +SQ  Y + LL + NM   K 
Subjt:  GFHRNSYDACVYWKLSQRG-TFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKA

Query:  VSTPLASHFKL---SSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR----------------------DCDKSVLLK--------
        V+TP+A+  KL   S ++ P             Y   VGS+ YL   TRPDL +A++ +S+                        D  + LK        
Subjt:  VSTPLASHFKL---SSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR----------------------DCDKSVLLK--------

Query:  GFTDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISL----GEAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNPSHHERSKH
         ++D D+  D D                 SW    Q  V  S+TE+EY S+     E  W+  ++ EL +     P+I+CD+  A +L  NP  H R KH
Subjt:  GFTDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISL----GEAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNPSHHERSKH

Query:  IDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVL
        I + +H+IRN +    + +V V   + L+D LTK L
Subjt:  IDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.9e-5031.54Show/hide
Query:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDM----VCRLHKSIYGLRQSPRQWYIRFDTF
        GYTQ+EG+DF E FSPV + +S++LIL+I   ++  + Q+D++  FL+G+L++ IYM  P GY  +  + +    VC L KSIYGL+Q+ RQW+++F   
Subjt:  GYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDM----VCRLHKSIYGLRQSPRQWYIRFDTF

Query:  ILKQGFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSG
        ++  GF ++  D   + K++    F+ +L+YVDD+I+ S + A + +LK QL S F+++DLG LK  LG+++ R      + + Q  Y + LL++  + G
Subjt:  ILKQGFHRNSYDACVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSG

Query:  CKAVSTPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR------------------------------DCDKSVLLK
        CK  S P+      S+         +       Y   +G +MYL I TR D+  A++ +S+                                   + L+
Subjt:  CKAVSTPLASHFKLSSSQCPVIEQERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR------------------------------DCDKSVLLK

Query:  GFTDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNPSHHERSKH
         F+D  + +  D R               SWK   Q VV+ S+ E+EY +L     E +WL +   EL L      ++ CD+ +AIH+A N   HER+KH
Subjt:  GFTDVDYDADLDKR---------------SWKVTLQPVVALSTTESEYISLG----EAVWLKRIVGEL-LSQEFIPIIHCDSQSAIHLAKNPSHHERSKH

Query:  IDVKFHYIR
        I+   H +R
Subjt:  IDVKFHYIR

ATMG00810.1 DNA/RNA polymerases superfamily protein4.9e-1330.53Show/hide
Query:  IYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIEQE
        +YLLLYVDD++L       +  L  QLSS F MKDLG +   LG+ +K     GL  +SQ  Y  ++L    M  CK +STPL    KL+SS       +
Subjt:  IYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIEQE

Query:  RLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR----------DCDKSVL--------------------LKGFTDVDYDADLDKR-----------
          +  +I     VG++ YL + TRPD+ +A++++ +          D  K VL                    ++ F D D+      R           
Subjt:  RLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISR----------DCDKSVL--------------------LKGFTDVDYDADLDKR-----------

Query:  ----SWKVTLQPVVALSTTESEYISL
            SW    QP V+ S+TE+EY +L
Subjt:  ----SWKVTLQPVVALSTTESEYISL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTACACTCAGAAGGAGGGAGTTGACTTTAATGAGATTTTTTCTCCGGTGGTGAGACATTCGTCCATTAGATTAATTTTATCTATTGTTGTTCACTTTGATATGTT
CATTGAACAGATGGACGTCACCACAACATTTCTTCATGGAGAACTGGAGAAGGTGATTTACATGGCTCAACCTAAAGGCTATGAGGTGAAGGGTAAGGAAGACATGGTTT
GTCGTCTTCACAAGTCCATCTATGGACTTAGACAATCTCCAAGACAGTGGTATATCAGGTTTGATACTTTCATTCTAAAGCAGGGGTTTCACAGGAACTCATATGATGCT
TGTGTTTACTGGAAACTATCTCAGAGAGGTACATTTATCTATCTACTGTTGTATGTAGATGATATGATACTAGTGTCTAAGGATTATGCTGAAATCTGTCAACTCAAGAA
ACAACTAAGTAGTGAGTTTGAAATGAAAGATTTAGGTAAGCTAAAAAGGATCCTAGGCATGGATGTGAAAAGAGATAGGGAAAAAGGTTTGTTAACCGTTTCGCAGGAGA
GTTATGTGATTAAACTGCTTGAAAAGTATAATATGTCTGGTTGCAAGGCAGTTTCTACACCCTTAGCATCTCATTTTAAACTTTCTTCATCTCAATGTCCTGTTATTGAG
CAAGAAAGGTTAGAGATGTCTAATATTCCATATTGTAATGCTGTTGGAAGTATTATGTACCTGATGATTTGTACTAGGCCTGACTTGGGTCATGCTATGAGTATGATAAG
TAGGGATTGTGATAAGTCAGTATTGTTGAAAGGCTTCACAGATGTAGACTATGATGCAGATCTTGATAAAAGAAGTTGGAAAGTTACCCTACAACCAGTTGTTGCTTTGT
CGACTACTGAGTCAGAATATATTTCTCTTGGTGAAGCAGTGTGGTTGAAAAGAATTGTTGGTGAGTTGTTATCGCAAGAGTTTATTCCTATCATCCATTGTGATAGCCAG
AGTGCTATTCATCTTGCGAAGAATCCATCTCATCATGAGCGGTCTAAGCATATCGATGTCAAATTTCATTATATCAGAAATGTTATTGCTCAGAAAGATGTTGAACTGGT
CAAAGTTCATTACGATGAGAATTTGTCAGATATGTTAACCAAAGTTCTTCCAGCTCATAGTTCCTGCACAACGGTGAACGACGCTCGTTTGCGGCATTACTCCATTGTTC
AACGTGGGTCTCTTGGTCTCTCTTGGCATTTTGTGGACGGTATAGTTGGTTTGGTTGTGGTTGATTACAGATTACTTATTGTGCTGAACTCTCTCGAACATCCTTATATG
CTATTCAATCATGCGCTTAGCTTTTTGAAGCTCGAGTCTTTGTTGCTCAATGAGAACTACATTAGCTTGTTCGATCGTCGTAGAGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTACACTCAGAAGGAGGGAGTTGACTTTAATGAGATTTTTTCTCCGGTGGTGAGACATTCGTCCATTAGATTAATTTTATCTATTGTTGTTCACTTTGATATGTT
CATTGAACAGATGGACGTCACCACAACATTTCTTCATGGAGAACTGGAGAAGGTGATTTACATGGCTCAACCTAAAGGCTATGAGGTGAAGGGTAAGGAAGACATGGTTT
GTCGTCTTCACAAGTCCATCTATGGACTTAGACAATCTCCAAGACAGTGGTATATCAGGTTTGATACTTTCATTCTAAAGCAGGGGTTTCACAGGAACTCATATGATGCT
TGTGTTTACTGGAAACTATCTCAGAGAGGTACATTTATCTATCTACTGTTGTATGTAGATGATATGATACTAGTGTCTAAGGATTATGCTGAAATCTGTCAACTCAAGAA
ACAACTAAGTAGTGAGTTTGAAATGAAAGATTTAGGTAAGCTAAAAAGGATCCTAGGCATGGATGTGAAAAGAGATAGGGAAAAAGGTTTGTTAACCGTTTCGCAGGAGA
GTTATGTGATTAAACTGCTTGAAAAGTATAATATGTCTGGTTGCAAGGCAGTTTCTACACCCTTAGCATCTCATTTTAAACTTTCTTCATCTCAATGTCCTGTTATTGAG
CAAGAAAGGTTAGAGATGTCTAATATTCCATATTGTAATGCTGTTGGAAGTATTATGTACCTGATGATTTGTACTAGGCCTGACTTGGGTCATGCTATGAGTATGATAAG
TAGGGATTGTGATAAGTCAGTATTGTTGAAAGGCTTCACAGATGTAGACTATGATGCAGATCTTGATAAAAGAAGTTGGAAAGTTACCCTACAACCAGTTGTTGCTTTGT
CGACTACTGAGTCAGAATATATTTCTCTTGGTGAAGCAGTGTGGTTGAAAAGAATTGTTGGTGAGTTGTTATCGCAAGAGTTTATTCCTATCATCCATTGTGATAGCCAG
AGTGCTATTCATCTTGCGAAGAATCCATCTCATCATGAGCGGTCTAAGCATATCGATGTCAAATTTCATTATATCAGAAATGTTATTGCTCAGAAAGATGTTGAACTGGT
CAAAGTTCATTACGATGAGAATTTGTCAGATATGTTAACCAAAGTTCTTCCAGCTCATAGTTCCTGCACAACGGTGAACGACGCTCGTTTGCGGCATTACTCCATTGTTC
AACGTGGGTCTCTTGGTCTCTCTTGGCATTTTGTGGACGGTATAGTTGGTTTGGTTGTGGTTGATTACAGATTACTTATTGTGCTGAACTCTCTCGAACATCCTTATATG
CTATTCAATCATGCGCTTAGCTTTTTGAAGCTCGAGTCTTTGTTGCTCAATGAGAACTACATTAGCTTGTTCGATCGTCGTAGAGTGTGA
Protein sequenceShow/hide protein sequence
MGYTQKEGVDFNEIFSPVVRHSSIRLILSIVVHFDMFIEQMDVTTTFLHGELEKVIYMAQPKGYEVKGKEDMVCRLHKSIYGLRQSPRQWYIRFDTFILKQGFHRNSYDA
CVYWKLSQRGTFIYLLLYVDDMILVSKDYAEICQLKKQLSSEFEMKDLGKLKRILGMDVKRDREKGLLTVSQESYVIKLLEKYNMSGCKAVSTPLASHFKLSSSQCPVIE
QERLEMSNIPYCNAVGSIMYLMICTRPDLGHAMSMISRDCDKSVLLKGFTDVDYDADLDKRSWKVTLQPVVALSTTESEYISLGEAVWLKRIVGELLSQEFIPIIHCDSQ
SAIHLAKNPSHHERSKHIDVKFHYIRNVIAQKDVELVKVHYDENLSDMLTKVLPAHSSCTTVNDARLRHYSIVQRGSLGLSWHFVDGIVGLVVVDYRLLIVLNSLEHPYM
LFNHALSFLKLESLLLNENYISLFDRRRV