; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0086991 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0086991
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr04:98342..100027
RNA-Seq ExpressionCmc04g0086991
SyntenyCmc04g0086991
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH01581.1 transposable element gene [Prunus dulcis]7.4e-11949.54Show/hide
Query:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV
        + HD  WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ 
Subjt:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV

Query:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS
        + ++IGEG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+
Subjt:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS

Query:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL
        P ES++ Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR L
Subjt:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL

Query:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE
        LEKTRAL+LQ NVPK+FWS  +LTATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P 
Subjt:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE

Query:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
          KL++SRDV F E +P+F    D        L F FP
Subjt:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

BBH02949.1 hypothetical protein Prudu_013672 [Prunus dulcis]2.8e-11849.32Show/hide
Query:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV
        + HD  WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ 
Subjt:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV

Query:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS
        + ++IGEG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+
Subjt:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS

Query:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL
        P ES++ Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR L
Subjt:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL

Query:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE
        LEKTRAL+LQ NVPK+FWS  +L ATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P 
Subjt:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE

Query:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
          KL++SRDV F E +P+F    D        L F FP
Subjt:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

BBH03633.1 Seven transmembrane MLO family protein [Prunus dulcis]2.8e-11849.32Show/hide
Query:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV
        + HD  WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ 
Subjt:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV

Query:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS
        + ++IGEG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+
Subjt:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS

Query:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL
        P ES++ Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR L
Subjt:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL

Query:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE
        LEKTRAL+LQ NVPK+FWS  +L ATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P 
Subjt:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE

Query:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
          KL++SRDV F E +P+F    D        L F FP
Subjt:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

BBH08924.1 Seven transmembrane MLO family protein [Prunus dulcis]5.7e-11949.77Show/hide
Query:  WIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIG
        WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ + ++IG
Subjt:  WIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIG

Query:  EGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYN
        EG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+P ES++
Subjt:  EGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYN

Query:  HYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRA
         Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR LLEKTRA
Subjt:  HYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRA

Query:  LLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYI
        L+LQ NVPK+FWS  +LTATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P   KL++
Subjt:  LLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYI

Query:  SRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
        SRDV F E +P+F    D        L F FP
Subjt:  SRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

BBN67583.1 Seven transmembrane MLO family protein [Prunus dulcis]2.8e-11849.32Show/hide
Query:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV
        + HD  WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ 
Subjt:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV

Query:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS
        + ++IGEG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+
Subjt:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS

Query:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL
        P ES++ Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR L
Subjt:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL

Query:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE
        LEKTRAL+LQ NVPK+FWS  +L ATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P 
Subjt:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE

Query:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
          KL++SRDV F E +P+F    D        L F FP
Subjt:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

TrEMBL top hitse value%identityAlignment
A0A4Y1RBJ3 Transposable element protein3.6e-11949.54Show/hide
Query:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV
        + HD  WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ 
Subjt:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV

Query:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS
        + ++IGEG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+
Subjt:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS

Query:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL
        P ES++ Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR L
Subjt:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL

Query:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE
        LEKTRAL+LQ NVPK+FWS  +LTATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P 
Subjt:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE

Query:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
          KL++SRDV F E +P+F    D        L F FP
Subjt:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

A0A4Y1RFA9 Integrase catalytic domain-containing protein1.4e-11849.32Show/hide
Query:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV
        + HD  WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ 
Subjt:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV

Query:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS
        + ++IGEG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+
Subjt:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS

Query:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL
        P ES++ Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR L
Subjt:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL

Query:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE
        LEKTRAL+LQ NVPK+FWS  +L ATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P 
Subjt:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE

Query:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
          KL++SRDV F E +P+F    D        L F FP
Subjt:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

A0A4Y1RIV4 Seven transmembrane MLO family protein1.4e-11849.32Show/hide
Query:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV
        + HD  WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ 
Subjt:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV

Query:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS
        + ++IGEG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+
Subjt:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS

Query:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL
        P ES++ Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR L
Subjt:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL

Query:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE
        LEKTRAL+LQ NVPK+FWS  +L ATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P 
Subjt:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE

Query:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
          KL++SRDV F E +P+F    D        L F FP
Subjt:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

A0A4Y1RX42 Seven transmembrane MLO family protein2.7e-11949.77Show/hide
Query:  WIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIG
        WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ + ++IG
Subjt:  WIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIG

Query:  EGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYN
        EG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+P ES++
Subjt:  EGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYN

Query:  HYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRA
         Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR LLEKTRA
Subjt:  HYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRA

Query:  LLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYI
        L+LQ NVPK+FWS  +LTATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P   KL++
Subjt:  LLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYI

Query:  SRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
        SRDV F E +P+F    D        L F FP
Subjt:  SRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

A0A5H2XGM8 Seven transmembrane MLO family protein1.4e-11849.32Show/hide
Query:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV
        + HD  WIIDSGAT HM+   +   N  +   P  V+ ANG    + G G + + +  V +  LY+P F   LLSV +I   LNC  IF P  V+FQD+ 
Subjt:  NKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPV-NEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIV

Query:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS
        + ++IGEG   NGLYY  +N    K F  S N +   L H R  HPS+ VL+ LF      S  C+ C  +K TRLPF +SI++  K F+++HSDVWGP+
Subjt:  SGEMIGEGILRNGLYYLQQN---NKCFVSSKNTDRGHLLHLRFGHPSDQVLNRLF-HYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPS

Query:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL
        P ES++ Y+YYVTF+DDFS+ TW+YLLK K+EV   F+ F N + N +++Q+ I RSDNGTEY +K  TN+   HGI+HQT+C  TPQQNG++ERKNR L
Subjt:  PEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHL

Query:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE
        LEKTRAL+LQ NVPK+FWS  +L ATY+INRLPS  L++ SP E+++ +KI+L H+R+FGCTC+ +I+   +DKLD  ++K +F+GYSSTQKGYKC++P 
Subjt:  LEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPE

Query:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP
          KL++SRDV F E +P+F    D        L F FP
Subjt:  QNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFP

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-4730.31Show/hide
Query:  KLSNNSIYLNKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGT--GTISVFNK---PVNEVLYLPDFHSNLLSVNKIVKDLNCAV
        +++N S+  N     +++DSGA+ H+    + + + +    P  +  A  G+  I+ T  G + + N     + +VL+  +   NL+SV ++ ++   ++
Subjt:  KLSNNSIYLNKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGT--GTISVFNK---PVNEVLYLPDFHSNLLSVNKIVKDLNCAV

Query:  IFLPEKVIFQDIVSGEMIGEGILRNGLYYLQQNNKC-----------FVSSKNTDRGHLLHLRFGHPSDQVLNRLFHYNYDSFS------------CDTC
         F             +  G  I +NGL  ++ +               +++K+ +   L H RFGH SD  L  +   N  S              C+ C
Subjt:  IFLPEKVIFQDIVSGEMIGEGILRNGLYYLQQNNKC-----------FVSSKNTDRGHLLHLRFGHPSDQVLNRLFHYNYDSFS------------CDTC

Query:  RFAKQTRLPFP--TSITKVEKCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNK
           KQ RLPF      T +++   ++HSDV GP    + +   Y+V F+D F+     YL+K K++VFS FQ+F       +N +V     DNG EY++ 
Subjt:  RFAKQTRLPFP--TSITKVEKCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNK

Query:  EFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNL--NNLSPLEILKGRKIDLDHIRVFGCTCF
        E   F  + GI +  T  HTPQ NGVSER  R + EK R ++    + K FW +A+LTATY+INR+PS  L  ++ +P E+   +K  L H+RVFG T +
Subjt:  EFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNL--NNLSPLEILKGRKIDLDHIRVFGCTCF

Query:  VYIKRKD-KLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFPSLDDEENPSASSSGEIMRMNGTIQKTD
        V+IK K  K D  S K+IF+GY     G+K +D    K  ++RDVV  E     T   ++ A    T+ FL  S + E     + S +I++   T    +
Subjt:  VYIKRKD-KLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFPSLDDEENPSASSSGEIMRMNGTIQKTD

Query:  KKKKEKIQSDDDQHEQGN
         K+ + IQ   D  E  N
Subjt:  KKKKEKIQSDDDQHEQGN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-6532.35Show/hide
Query:  IYLNKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNK-----PVNEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEK
        ++L+  +S W++D+ A+HH +   + F   + + +   V   N   +KI G G I +         + +V ++PD   NL+S   + +D      F  +K
Subjt:  IYLNKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNK-----PVNEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEK

Query:  VIFQDIVSGEMIGEGILRNGLYYLQQNNKCFVSSKNTDRGH----LLHLRFGHPSDQVLNRLFHYNYDSFS-------CDTCRFAKQTRLPFPTSITKVE
          ++      +I +G+ R  LY  + N +      N  +      L H R GH S++ L  L   +  S++       CD C F KQ R+ F TS  +  
Subjt:  VIFQDIVSGEMIGEGILRNGLYYLQQNNKCFVSSKNTDRGH----LLHLRFGHPSDQVLNRLFHYNYDSFS-------CDTCRFAKQTRLPFPTSITKVE

Query:  KCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHT
           DL++SDV GP   ES    KY+VTFIDD S+  WVY+LKTK++VF  FQ+F   +  +   ++K  RSDNG EY ++EF  +   HGI H+ T   T
Subjt:  KCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHT

Query:  PQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYI--KRKDKLDKNSVKTIFLG
        PQ NGV+ER NR ++EK R++L    +PK FW +A+ TA Y+INR PS  L    P  +   +++   H++VFGC  F ++  +++ KLD  S+  IF+G
Subjt:  PQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYI--KRKDKLDKNSVKTIFLG

Query:  YSSTQKGYKCFDPEQNKLYISRDVVFREHE----------------PFFTPTQDTTAATPSTLQFLFPSLDDEENPSASSSGEIMRMNGTIQKTDKKKKE
        Y   + GY+ +DP + K+  SRDVVFRE E                P F  T  +T+  P++ +    S  DE +      GE++     + +  ++ + 
Subjt:  YSSTQKGYKCFDPEQNKLYISRDVVFREHE----------------PFFTPTQDTTAATPSTLQFLFPSLDDEENPSASSSGEIMRMNGTIQKTDKKKKE

Query:  KIQSDDDQHE
          Q  ++QH+
Subjt:  KIQSDDDQHE

P47024 Transposon Ty4-J Gag-Pol polyprotein5.9e-1826.06Show/hide
Query:  HLRFGHPS-DQVLNRLFHYNYD----------SFSCDTCRFAKQT-RLPFPTSITKVEKCFDLIHS---DVWGPSPEESYNHYKYYVTFIDDFSK--TTW
        H R GH    Q+ N + H +Y+           F C TC+ +K T R  +  S+       +   S   D++GP    + +  +Y +  +D+ ++   T 
Subjt:  HLRFGHPS-DQVLNRLFHYNYD----------SFSCDTCRFAKQT-RLPFPTSITKVEKCFDLIHS---DVWGPSPEESYNHYKYYVTFIDDFSK--TTW

Query:  VYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAIL
         +  K    + +  ++   ++  Q++ +V+   SD GTE+ N +   +F   GI H  T T     NG +ER  R ++     LL Q+N+  KFW  A+ 
Subjt:  VYLLKTKNEVFSCFQEFFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAIL

Query:  TATYIINRLPSPNLNNLSPLEILKGRKID--LDHIRVFGCTCFVYIKRKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYIS
        +AT I N L   +   L PL+ +  + +   L     FG    ++     KL  + + +I L       GYK F P +NK+  S
Subjt:  TATYIINRLPSPNLNNLSPLEILKGRKID--LDHIRVFGCTCFVYIKRKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.6e-7034.62Show/hide
Query:  GPSGPNSDQLMQLVNQLNQLLQPRQQNSGLSELKLSNNSIYLNKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNK-
        G S     QL   ++ +N    P           L+  S Y     +NW++DSGATHH++   NN            V  A+G    I  TG+ S+  K 
Subjt:  GPSGPNSDQLMQLVNQLNQLLQPRQQNSGLSELKLSNNSIYLNKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISVFNK-

Query:  -PVN--EVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIGEGILRNGLY----YLQQNNKCFVSSKNTDRGHLLHLRFGHPSDQVLN--
         P+N   +LY+P+ H NL+SV ++      +V F P     +D+ +G  + +G  ++ LY       Q    F S  +       H R GHP+  +LN  
Subjt:  -PVN--EVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIGEGILRNGLY----YLQQNNKCFVSSKNTDRGHLLHLRFGHPSDQVLN--

Query:  ------RLFHYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQ
               + + ++   SC  C   K  ++PF  S     +  + I+SDVW  SP  S+++Y+YYV F+D F++ TW+Y LK K++V   F  F N + N+
Subjt:  ------RLFHYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQ

Query:  YNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILK
        +  ++  F SDNG E+V      +F QHGI H T+  HTP+ NG+SERK+RH++E    LL   ++PK +W  A   A Y+INRLP+P L   SP + L 
Subjt:  YNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILK

Query:  GRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYISRDVVFREH
        G   + D +RVFGC C+ +++   + KLD  S + +FLGYS TQ  Y C   + ++LYISR V F E+
Subjt:  GRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYISRDVVFREH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.2e-6432.83Show/hide
Query:  GPSGPNSDQLMQLVNQLNQLLQPRQQNSGLSELKLSNNSIYLNKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISV----
        G S     QL Q  +  NQ               L+ NS Y   + +NW++DSGATHH++   NN            V  A+G    I  TG+ S+    
Subjt:  GPSGPNSDQLMQLVNQLNQLLQPRQQNSGLSELKLSNNSIYLNKHDSNWIIDSGATHHMSCSPNNFLNLITSNEPQFVTTANGGQTKIFGTGTISV----

Query:  FNKPVNEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIGEGILRNGLY----YLQQNNKCFVSSKNTDRGHLLHLRFGHPSDQVLNR-
         +  +N+VLY+P+ H NL+SV ++      +V F P     +D+ +G  + +G  ++ LY       Q    F S  +       H R GHPS  +LN  
Subjt:  FNKPVNEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIGEGILRNGLY----YLQQNNKCFVSSKNTDRGHLLHLRFGHPSDQVLNR-

Query:  -------LFHYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQ
               + + ++   SC  C   K  ++PF  S     K  + I+SDVW  SP  S ++Y+YYV F+D F++ TW+Y LK K++V   F  F + + N+
Subjt:  -------LFHYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQEFFNFITNQ

Query:  YNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILK
        +  ++    SDNG E+V     ++  QHGI H T+  HTP+ NG+SERK+RH++E    LL   +VPK +W  A   A Y+INRLP+P L   SP + L 
Subjt:  YNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILK

Query:  GRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYISRDVVFREH-EPFFT-------------------PTQDTTA
        G+  + + ++VFGC C+ +++   + KL+  S +  F+GYS TQ  Y C      +LY SR V F E   PF T                   P+  T  
Subjt:  GRKIDLDHIRVFGCTCFVYIK--RKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYISRDVVFREH-EPFFT-------------------PTQDTTA

Query:  ATPSTL---QFLFPSLDDEENPSASSS
         TP  L     L P LD    P +S S
Subjt:  ATPSTL---QFLFPSLDDEENPSASSS

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.2e-0733.33Show/hide
Query:  NRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYI---KRKDKLDKNSVKTIFL
        NR ++EK R++L +  +PK F +DA  TA +IIN+ PS  +N   P E+         ++R FGC  +++    K K +  K   K  +L
Subjt:  NRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLEILKGRKIDLDHIRVFGCTCFVYI---KRKDKLDKNSVKTIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGGATGAACCCTAATCCCTCTCAAGGGCCAGCTGATCACGGGCCACCAGGCTTCTATGGAACGACTGCTGCTGGGCCAGTGCAAGACCCAATCTCCTTT
GTCACGCCTGCTGGGCCTTCCGGACCTAATTCGGATCAACTCATGCAGCTGGTTAACCAATTAAATCAACTACTTCAGCCGAGGCAACAGAACTCAGGTTTATCT
GAACTCAAACTTTCAAATAATTCAATATACTTGAATAAGCATGACTCAAATTGGATTATTGATTCGGGAGCAACGCATCACATGTCATGTAGTCCAAACAATTTT
TTAAACTTGATAACCTCCAATGAACCACAATTTGTCACAACTGCTAATGGTGGTCAGACCAAAATTTTCGGTACCGGAACCATTTCTGTTTTTAACAAACCAGTC
AATGAGGTTTTGTATCTACCTGATTTTCACTCTAATTTATTATCTGTTAATAAGATTGTCAAAGATCTTAATTGTGCTGTAATATTCTTACCAGAAAAAGTGATT
TTTCAGGACATAGTCTCAGGGGAGATGATTGGTGAAGGAATTCTTAGGAACGGGCTATATTATCTTCAACAAAATAATAAATGCTTTGTATCAAGTAAAAACACT
GATCGTGGACATTTATTGCATTTAAGATTTGGCCATCCATCTGATCAGGTTTTGAATAGACTTTTTCATTATAATTATGATTCCTTTAGTTGTGATACTTGTAGA
TTTGCAAAACAAACTCGTTTACCTTTTCCTACTTCCATAACTAAAGTAGAAAAATGTTTTGATTTAATTCATTCTGATGTTTGGGGACCTTCTCCCGAAGAATCA
TACAATCATTATAAATACTATGTTACATTCATTGATGATTTTTCAAAAACTACTTGGGTATATCTTTTAAAAACCAAAAATGAAGTCTTCTCATGCTTTCAAGAA
TTTTTTAATTTTATTACCAACCAATACAATGCTCAAGTCAAAATTTTTCGATCTGACAATGGTACTGAGTATGTGAATAAGGAATTCACCAATTTTTTCAAACAA
CATGGTATTCTTCATCAAACAACATGTACTCATACACCACAACAAAATGGAGTTTCTGAAAGAAAAAACAGACATCTTCTTGAAAAAACAAGAGCTTTACTACTT
CAGAATAATGTTCCAAAAAAATTCTGGTCAGATGCAATTCTAACTGCTACTTATATCATAAATAGATTACCAAGCCCAAATCTCAATAATTTAAGTCCTCTTGAA
ATTCTCAAAGGAAGAAAAATCGACTTAGATCATATTAGAGTATTTGGATGCACCTGCTTTGTATATATAAAACGAAAGGACAAACTAGATAAAAACTCTGTGAAA
ACTATTTTTCTTGGCTACTCCTCAACCCAAAAGGGATACAAGTGCTTTGATCCCGAACAAAATAAACTGTATATTTCCAGGGATGTAGTTTTCAGAGAGCATGAA
CCATTCTTCACGCCTACACAAGACACCACCGCTGCAACACCAAGCACTCTGCAATTCCTCTTTCCTTCCCTTGACGATGAAGAAAATCCTTCCGCATCTTCTTCA
GGGGAGATTATGAGGATGAACGGAACAATACAGAAGACAGACAAGAAGAAGAAGGAGAAGATACAATCAGACGACGATCAACACGAACAAGGCAACCTTCAACAA
GGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGGGATGAACCCTAATCCCTCTCAAGGGCCAGCTGATCACGGGCCACCAGGCTTCTATGGAACGACTGCTGCTGGGCCAGTGCAAGACCCAATCTCCTTT
GTCACGCCTGCTGGGCCTTCCGGACCTAATTCGGATCAACTCATGCAGCTGGTTAACCAATTAAATCAACTACTTCAGCCGAGGCAACAGAACTCAGGTTTATCT
GAACTCAAACTTTCAAATAATTCAATATACTTGAATAAGCATGACTCAAATTGGATTATTGATTCGGGAGCAACGCATCACATGTCATGTAGTCCAAACAATTTT
TTAAACTTGATAACCTCCAATGAACCACAATTTGTCACAACTGCTAATGGTGGTCAGACCAAAATTTTCGGTACCGGAACCATTTCTGTTTTTAACAAACCAGTC
AATGAGGTTTTGTATCTACCTGATTTTCACTCTAATTTATTATCTGTTAATAAGATTGTCAAAGATCTTAATTGTGCTGTAATATTCTTACCAGAAAAAGTGATT
TTTCAGGACATAGTCTCAGGGGAGATGATTGGTGAAGGAATTCTTAGGAACGGGCTATATTATCTTCAACAAAATAATAAATGCTTTGTATCAAGTAAAAACACT
GATCGTGGACATTTATTGCATTTAAGATTTGGCCATCCATCTGATCAGGTTTTGAATAGACTTTTTCATTATAATTATGATTCCTTTAGTTGTGATACTTGTAGA
TTTGCAAAACAAACTCGTTTACCTTTTCCTACTTCCATAACTAAAGTAGAAAAATGTTTTGATTTAATTCATTCTGATGTTTGGGGACCTTCTCCCGAAGAATCA
TACAATCATTATAAATACTATGTTACATTCATTGATGATTTTTCAAAAACTACTTGGGTATATCTTTTAAAAACCAAAAATGAAGTCTTCTCATGCTTTCAAGAA
TTTTTTAATTTTATTACCAACCAATACAATGCTCAAGTCAAAATTTTTCGATCTGACAATGGTACTGAGTATGTGAATAAGGAATTCACCAATTTTTTCAAACAA
CATGGTATTCTTCATCAAACAACATGTACTCATACACCACAACAAAATGGAGTTTCTGAAAGAAAAAACAGACATCTTCTTGAAAAAACAAGAGCTTTACTACTT
CAGAATAATGTTCCAAAAAAATTCTGGTCAGATGCAATTCTAACTGCTACTTATATCATAAATAGATTACCAAGCCCAAATCTCAATAATTTAAGTCCTCTTGAA
ATTCTCAAAGGAAGAAAAATCGACTTAGATCATATTAGAGTATTTGGATGCACCTGCTTTGTATATATAAAACGAAAGGACAAACTAGATAAAAACTCTGTGAAA
ACTATTTTTCTTGGCTACTCCTCAACCCAAAAGGGATACAAGTGCTTTGATCCCGAACAAAATAAACTGTATATTTCCAGGGATGTAGTTTTCAGAGAGCATGAA
CCATTCTTCACGCCTACACAAGACACCACCGCTGCAACACCAAGCACTCTGCAATTCCTCTTTCCTTCCCTTGACGATGAAGAAAATCCTTCCGCATCTTCTTCA
GGGGAGATTATGAGGATGAACGGAACAATACAGAAGACAGACAAGAAGAAGAAGGAGAAGATACAATCAGACGACGATCAACACGAACAAGGCAACCTTCAACAA
GGTTAA
Protein sequenceShow/hide protein sequence
MNGMNPNPSQGPADHGPPGFYGTTAAGPVQDPISFVTPAGPSGPNSDQLMQLVNQLNQLLQPRQQNSGLSELKLSNNSIYLNKHDSNWIIDSGATHHMSCSPNNF
LNLITSNEPQFVTTANGGQTKIFGTGTISVFNKPVNEVLYLPDFHSNLLSVNKIVKDLNCAVIFLPEKVIFQDIVSGEMIGEGILRNGLYYLQQNNKCFVSSKNT
DRGHLLHLRFGHPSDQVLNRLFHYNYDSFSCDTCRFAKQTRLPFPTSITKVEKCFDLIHSDVWGPSPEESYNHYKYYVTFIDDFSKTTWVYLLKTKNEVFSCFQE
FFNFITNQYNAQVKIFRSDNGTEYVNKEFTNFFKQHGILHQTTCTHTPQQNGVSERKNRHLLEKTRALLLQNNVPKKFWSDAILTATYIINRLPSPNLNNLSPLE
ILKGRKIDLDHIRVFGCTCFVYIKRKDKLDKNSVKTIFLGYSSTQKGYKCFDPEQNKLYISRDVVFREHEPFFTPTQDTTAATPSTLQFLFPSLDDEENPSASSS
GEIMRMNGTIQKTDKKKKEKIQSDDDQHEQGNLQQG