CuGenDBv2

Lag0036663 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036663
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: chr2:259857..261542
RNA-Seq Expression: Lag0036663
Synteny: Lag0036663
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 8.9e-164 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W++KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 2.0e-163 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W+ KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.2e-163 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W++KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 8.9e-164 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W++KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 8.9e-164 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W++KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein | e value: 4.3e-164 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W++KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

A0A5A7U2S1 Ty3/gypsy retrotransposon protein | e value: 9.6e-164 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W+ KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

A0A5A7U6J3 Ty3/gypsy retrotransposon protein | e value: 5.6e-164 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W++KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

A0A5D3CXB1 Ty3/gypsy retrotransposon protein | e value: 4.3e-164 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W++KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

A0A5D3DI73 Ty3/gypsy retrotransposon protein | e value: 4.3e-164 | % identity: 54.21
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK
        L K G +KWTEE   AFE+LK+AMMTLPVLA  DF+ PF +E DASG  +GAVL Q+++P+A+FS  LS+  +A+ +YERELMAVV A+ RWRPYLLGRK
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRK

Query:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR
        F V+ DQR+LK LLEQRVIQP+YQ+W++KLLGY+  +     L  K                +  P              DP L  I+S ++    +   
Subjt:  FLVRKDQRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTK--------------RLMLYP-------------NDPNLSSIISRLQNDPDDTSR

Query:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT
        ++  +GILK+KGRLV+SK S+LIPTI+HTYHDSV GGH GFLRTY+R+  +LYW+ MK +V++Y EEC++CQ++K+ A SP GLL PL++P  IW DI+ 
Subjt:  FSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITT

Query:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN
        DF EGLP+S G   +L                             +EVVRLHGFP SI+SDRDK+F+ HFW+E+FK+ GTKL RS+SYHP TDG T++VN
Subjt:  DFFEGLPRSEGGGQVL-----------------------------QEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVN

Query:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR
        K VE +LRCFC E+ + W QWL W EYWYNTT+H +IGITPF+AVYG  PPPLI YGE    NSTL+ QL +RD+ L  LKE +R+AQER KKF   KRR
Subjt:  KCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTKKFVVRKRR

Query:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF
        ++E++ GD VFLK+RPYRQT+L KKRNEKLSP  F
Subjt:  EIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVF

SwissProt top hits | e value | % identity | Alignment
P0CT34 Transposon Tf2-1 polyprotein | e value: 1.9e-23 | % identity: 24.32
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQ-----RPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPY
        L K  R+KWT   + A E +KQ +++ PVL H DFS+  ++E DAS   +GAVLSQ        P+ ++S  +S      S+ ++E++A++ ++  WR Y
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQ-----RPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPY

Query:  LLG--RKFLVRKDQRAL------------------------------------KHLLE--QRVI---QPEYQRWVSKLLGYALTLSTDQALRTKRLMLYP
        L      F +  D R L                                     H+ +   R++   +P  +      + +   +S     + + +  Y 
Subjt:  LLG--RKFLVRKDQRAL------------------------------------KHLLE--QRVI---QPEYQRWVSKLLGYALTLSTDQALRTKRLMLYP

Query:  NDPNLSSIISRLQNDPDDTSRFSLQRGIL-KYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQA
        ND  L ++++    D        L+ G+L   K ++++   + L  TI+  YH+     H G       +     W+ ++  ++ Y++ C  CQ +K++ 
Subjt:  NDPNLSSIISRLQNDPDDTSRFSLQRGIL-KYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQA

Query:  ASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEG
          P G LQP+    R WE ++ DF   LP S G
Subjt:  ASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEG

P0CT41 Transposon Tf2-12 polyprotein | e value: 1.9e-23 | % identity: 24.32
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQ-----RPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPY
        L K  R+KWT   + A E +KQ +++ PVL H DFS+  ++E DAS   +GAVLSQ        P+ ++S  +S      S+ ++E++A++ ++  WR Y
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQ-----RPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPY

Query:  LLG--RKFLVRKDQRAL------------------------------------KHLLE--QRVI---QPEYQRWVSKLLGYALTLSTDQALRTKRLMLYP
        L      F +  D R L                                     H+ +   R++   +P  +      + +   +S     + + +  Y 
Subjt:  LLG--RKFLVRKDQRAL------------------------------------KHLLE--QRVI---QPEYQRWVSKLLGYALTLSTDQALRTKRLMLYP

Query:  NDPNLSSIISRLQNDPDDTSRFSLQRGIL-KYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQA
        ND  L ++++    D        L+ G+L   K ++++   + L  TI+  YH+     H G       +     W+ ++  ++ Y++ C  CQ +K++ 
Subjt:  NDPNLSSIISRLQNDPDDTSRFSLQRGIL-KYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQA

Query:  ASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEG
          P G LQP+    R WE ++ DF   LP S G
Subjt:  ASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 5.8e-25 | % identity: 26.25
Query:  YALTLSTDQALRTKR-LMLYPNDPNLSSIISRLQN------DPDDTSRF-----------------SLQRGILKYKGRLVISKTSSLIPTILHTYHDSVL
        Y +T  T + + T+     Y +DP  S+++  ++        P+D S F                 SL+  ++ Y+ RLV+         ++  YHD  L
Subjt:  YALTLSTDQALRTKR-LMLYPNDPNLSSIISRLQN------DPDDTSRF-----------------SLQRGILKYKGRLVISKTSSLIPTILHTYHDSVL

Query:  -GGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEGGGQVLQEVV--------------
         GGH G   T  +++   YW +++ ++ +Y+  C+ CQ  K+      GLLQPL +    W DI+ DF  GLP +     ++  VV              
Subjt:  -GGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEGGGQVLQEVV--------------

Query:  ----------------RLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVNKCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTF
                          HGFP +I SDRD       + EL K  G K   S++ HP TDG ++   + +   LR + S   + W  +L   E+ YN+T 
Subjt:  ----------------RLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVNKCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTF

Query:  HITIGITPFEAVYG-IPPPPLISYGECSISNSTLESQLIERDIALDV-LKEKMRLAQERTKKFVVRKRREIEYEEGDMVFL
          T+G +PFE   G +P  P I   +   + S    +L +   AL +  KE++  AQ   +    ++R+ +    GD V +
Subjt:  HITIGITPFEAVYG-IPPPPLISYGECSISNSTLESQLIERDIALDV-LKEKMRLAQERTKKFVVRKRREIEYEEGDMVFL

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 4.5e-25 | % identity: 26.25
Query:  YALTLSTDQALRTKR-LMLYPNDPNLSSIISRLQN------DPDDTSRF-----------------SLQRGILKYKGRLVISKTSSLIPTILHTYHDSVL
        Y +T  T + + T+     Y +DP  S+++  ++        P+D S F                 SL+  ++ Y+ RLV+         ++  YHD  L
Subjt:  YALTLSTDQALRTKR-LMLYPNDPNLSSIISRLQN------DPDDTSRF-----------------SLQRGILKYKGRLVISKTSSLIPTILHTYHDSVL

Query:  -GGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEGGGQVLQEVV--------------
         GGH G   T  +++   YW +++ ++ +Y+  C+ CQ  K+      GLLQPL +    W DI+ DF  GLP +     ++  VV              
Subjt:  -GGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEGGGQVLQEVV--------------

Query:  ----------------RLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVNKCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTF
                          HGFP +I SDRD       + EL K  G K   S++ HP TDG ++   + +   LR + S   + W  +L   E+ YN+T 
Subjt:  ----------------RLHGFPGSIISDRDKVFVRHFWTELFKVHGTKLKRSTSYHPHTDGHTKIVNKCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTF

Query:  HITIGITPFEAVYG-IPPPPLISYGECSISNSTLESQLIERDIALDV-LKEKMRLAQERTKKFVVRKRREIEYEEGDMVFL
          T+G +PFE   G +P  P I   +   + S    +L +   AL +  KE++  AQ   +    ++R+ +    GD V +
Subjt:  HITIGITPFEAVYG-IPPPPLISYGECSISNSTLESQLIERDIALDV-LKEKMRLAQERTKKFVVRKRREIEYEEGDMVFL

Q9UR07 Transposon Tf2-11 polyprotein | e value: 1.9e-23 | % identity: 24.32
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQ-----RPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPY
        L K  R+KWT   + A E +KQ +++ PVL H DFS+  ++E DAS   +GAVLSQ        P+ ++S  +S      S+ ++E++A++ ++  WR Y
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQ-----RPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPY

Query:  LLG--RKFLVRKDQRAL------------------------------------KHLLE--QRVI---QPEYQRWVSKLLGYALTLSTDQALRTKRLMLYP
        L      F +  D R L                                     H+ +   R++   +P  +      + +   +S     + + +  Y 
Subjt:  LLG--RKFLVRKDQRAL------------------------------------KHLLE--QRVI---QPEYQRWVSKLLGYALTLSTDQALRTKRLMLYP

Query:  NDPNLSSIISRLQNDPDDTSRFSLQRGIL-KYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQA
        ND  L ++++    D        L+ G+L   K ++++   + L  TI+  YH+     H G       +     W+ ++  ++ Y++ C  CQ +K++ 
Subjt:  NDPNLSSIISRLQNDPDDTSRFSLQRGIL-KYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLRTYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQA

Query:  ASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEG
          P G LQP+    R WE ++ DF   LP S G
Subjt:  ASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEG

Arabidopsis top hits | e value | % identity | Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein | e value: 5.8e-04 | % identity: 55
Query:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFI
        L K    KWTE A+LAF+ LK A+ TLPVLA  D   PF+
Subjt:  LNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFI


Sequences
CDS sequence
ATGGAAGAACATTTGAACAAAGGGGGAAGATTCAAATGGACAGAGGAGGCCTCGTTGGCTTTTGAACAACTCAAACAAGCCATGATGACTCTTCCTGTCTTAGCTCATGC
AGATTTCAGTCAACCATTTATAGTAGAAAAAGACGCCTCAGGGACGAGATTGGGGGCAGTTTTATCTCAGAACCAACGACCGATTGCTTTCTTCAGCCATACCTTGTCAT
CTCCGACACAAGCTAAGTCTATTTACGAGAGAGAATTGATGGCTGTGGTAATGGCAATTTCGCGGTGGAGACCTTATTTGCTGGGTAGAAAATTTTTGGTTCGGAAAGAT
CAAAGGGCCTTGAAGCATCTCCTTGAACAGAGGGTAATTCAGCCTGAATACCAGCGATGGGTTTCCAAGCTGTTGGGGTATGCTTTGACATTGAGTACAGACCAGGCCTT
GAGAACAAAGCGGTTGATGCTTTATCCCAATGATCCCAACCTCAGTTCCATTATCAGTCGGCTACAAAACGATCCGGATGACACCAGTAGATTCTCATTGCAGCGGGGCA
TTCTGAAGTATAAGGGAAGGTTAGTGATCTCGAAAACCTCTTCTCTAATTCCTACTATCTTACATACCTATCACGACTCTGTTCTTGGGGGTCACTTGGGTTTTCTGAGA
ACATATAGGAGGTTGACCGTTGATTTATACTGGGAGCGAATGAAATCGAATGTTAAAAGATACATGGAAGAATGTCTTGTTTGTCAGCAACATAAGACCCAGGCAGCTTC
TCCGGTAGGGTTACTCCAACCATTGGATGTGCCAACCCGAATATGGGAAGACATCACAACGGATTTCTTCGAAGGCTTACCACGATCAGAGGGTGGCGGCCAAGTTCTTC
AAGAGGTGGTCCGATTACATGGTTTCCCTGGTTCGATAATTTCTGATAGGGACAAGGTTTTTGTCCGTCATTTCTGGACTGAACTATTTAAAGTACACGGAACAAAACTG
AAGCGCAGCACATCGTATCATCCACATACGGATGGGCATACAAAGATTGTGAATAAGTGTGTTGAGCATTTCTTACGGTGCTTCTGTTCGGAACGACGAAAACAATGGTG
TCAATGGTTAGGATGGGAGGAGTACTGGTACAACACTACGTTCCATATAACGATTGGCATCACTCCATTCGAAGCAGTGTACGGTATACCACCACCACCTCTGATTTCGT
ATGGTGAGTGTAGCATAAGCAATTCCACACTGGAATCACAGCTGATTGAACGCGATATTGCTCTGGATGTGTTAAAAGAGAAGATGAGATTAGCCCAAGAAAGAACGAAG
AAATTTGTTGTCCGTAAACGGAGAGAGATCGAATATGAGGAAGGGGACATGGTTTTTCTGAAAGTTAGGCCATATCGACAGACGACATTAGCTAAGAAGAGGAATGAAAA
GCTCTCGCCCACTGTCTTCACATTTGAAATATGCATATTTGGGGGAGCAAGAGACATTACCCATTATAATCTCTTCAAAATTGCTCGCTGA
mRNA sequence
ATGGAAGAACATTTGAACAAAGGGGGAAGATTCAAATGGACAGAGGAGGCCTCGTTGGCTTTTGAACAACTCAAACAAGCCATGATGACTCTTCCTGTCTTAGCTCATGC
AGATTTCAGTCAACCATTTATAGTAGAAAAAGACGCCTCAGGGACGAGATTGGGGGCAGTTTTATCTCAGAACCAACGACCGATTGCTTTCTTCAGCCATACCTTGTCAT
CTCCGACACAAGCTAAGTCTATTTACGAGAGAGAATTGATGGCTGTGGTAATGGCAATTTCGCGGTGGAGACCTTATTTGCTGGGTAGAAAATTTTTGGTTCGGAAAGAT
CAAAGGGCCTTGAAGCATCTCCTTGAACAGAGGGTAATTCAGCCTGAATACCAGCGATGGGTTTCCAAGCTGTTGGGGTATGCTTTGACATTGAGTACAGACCAGGCCTT
GAGAACAAAGCGGTTGATGCTTTATCCCAATGATCCCAACCTCAGTTCCATTATCAGTCGGCTACAAAACGATCCGGATGACACCAGTAGATTCTCATTGCAGCGGGGCA
TTCTGAAGTATAAGGGAAGGTTAGTGATCTCGAAAACCTCTTCTCTAATTCCTACTATCTTACATACCTATCACGACTCTGTTCTTGGGGGTCACTTGGGTTTTCTGAGA
ACATATAGGAGGTTGACCGTTGATTTATACTGGGAGCGAATGAAATCGAATGTTAAAAGATACATGGAAGAATGTCTTGTTTGTCAGCAACATAAGACCCAGGCAGCTTC
TCCGGTAGGGTTACTCCAACCATTGGATGTGCCAACCCGAATATGGGAAGACATCACAACGGATTTCTTCGAAGGCTTACCACGATCAGAGGGTGGCGGCCAAGTTCTTC
AAGAGGTGGTCCGATTACATGGTTTCCCTGGTTCGATAATTTCTGATAGGGACAAGGTTTTTGTCCGTCATTTCTGGACTGAACTATTTAAAGTACACGGAACAAAACTG
AAGCGCAGCACATCGTATCATCCACATACGGATGGGCATACAAAGATTGTGAATAAGTGTGTTGAGCATTTCTTACGGTGCTTCTGTTCGGAACGACGAAAACAATGGTG
TCAATGGTTAGGATGGGAGGAGTACTGGTACAACACTACGTTCCATATAACGATTGGCATCACTCCATTCGAAGCAGTGTACGGTATACCACCACCACCTCTGATTTCGT
ATGGTGAGTGTAGCATAAGCAATTCCACACTGGAATCACAGCTGATTGAACGCGATATTGCTCTGGATGTGTTAAAAGAGAAGATGAGATTAGCCCAAGAAAGAACGAAG
AAATTTGTTGTCCGTAAACGGAGAGAGATCGAATATGAGGAAGGGGACATGGTTTTTCTGAAAGTTAGGCCATATCGACAGACGACATTAGCTAAGAAGAGGAATGAAAA
GCTCTCGCCCACTGTCTTCACATTTGAAATATGCATATTTGGGGGAGCAAGAGACATTACCCATTATAATCTCTTCAAAATTGCTCGCTGA
Protein sequence
MEEHLNKGGRFKWTEEASLAFEQLKQAMMTLPVLAHADFSQPFIVEKDASGTRLGAVLSQNQRPIAFFSHTLSSPTQAKSIYERELMAVVMAISRWRPYLLGRKFLVRKD
QRALKHLLEQRVIQPEYQRWVSKLLGYALTLSTDQALRTKRLMLYPNDPNLSSIISRLQNDPDDTSRFSLQRGILKYKGRLVISKTSSLIPTILHTYHDSVLGGHLGFLR
TYRRLTVDLYWERMKSNVKRYMEECLVCQQHKTQAASPVGLLQPLDVPTRIWEDITTDFFEGLPRSEGGGQVLQEVVRLHGFPGSIISDRDKVFVRHFWTELFKVHGTKL
KRSTSYHPHTDGHTKIVNKCVEHFLRCFCSERRKQWCQWLGWEEYWYNTTFHITIGITPFEAVYGIPPPPLISYGECSISNSTLESQLIERDIALDVLKEKMRLAQERTK
KFVVRKRREIEYEEGDMVFLKVRPYRQTTLAKKRNEKLSPTVFTFEICIFGGARDITHYNLFKIAR