; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027766 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027766
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon protein
Genome locationchr8:4707731..4715312
RNA-Seq ExpressionLag0027766
SyntenyLag0027766
Gene Ontology termsNA
InterPro domainsIPR009027 - Ribosomal protein L9/RNase H1, N-terminal
IPR011320 - Ribonuclease H1, N-terminal
IPR024752 - Myb/SANT-like domain
IPR027806 - Harbinger transposase-derived nuclease domain
IPR037056 - Ribonuclease H1, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo]1.2e-12746.19Show/hide
Query:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV
        LLR  +GL  TEIVDVEEMV MFLH++AHD KNRVI+++F RSGETVSRHFN  L  VLRLY+ L+K+P P+T++C D RWK FENCLGALDGTY+KV+V
Subjt:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV

Query:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRG------------------
         A DRP +RTRKGEI  NVLGV   KG+F++V++GW+GS ADSR+LRDAIS+ NGL V KGYYYLCDAGYPNA+G LAPY+G                  
Subjt:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRG------------------

Query:  ---------------ERTVGDTS-------GESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSE-PVPLDGENITFIQSSTEWMQKRDDLA
                       ER  G          G+SYYP + QCR I AC LLHNLI REM     +++ D G S        E+I +I+++ EW Q RDDLA
Subjt:  ---------------ERTVGDTS-------GESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSE-PVPLDGENITFIQSSTEWMQKRDDLA

Query:  TGCST----RGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLK
        T   T    RGG+              C                     +ELV  GGW+ DNGTFR GYLA+L RM+ +K++ C + +T+VID ++++LK
Subjt:  TGCST----RGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLK

Query:  WQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASD-------DACDQNVQAEHDCY------VPAPP
          + AI+EMLG  C+GFGWNDE KCI AE+E++D WV+S  AAKGLLN  FP+Y++L +VFG+D A+        D         +D +         PP
Subjt:  WQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASD-------DACDQNVQAEHDCY------VPAPP

Query:  DINLVADIDIEDVPITPTSRPT--NAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP
          +   DI  +DV  +  SR +    GSS S KR R S     ++ +  ++     Q  +IA+WP
Subjt:  DINLVADIDIEDVPITPTSRPT--NAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP

ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]3.2e-13344.91Show/hide
Query:  TTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHVSAA
        T +GL  TE+VDVEEMV MFLHI+AHD K+RVI+R+F RSGET+SRHFN  L  V+RL++ LLKKP+P+   C D RW+WFENCLGALDGTY+KV+V A+
Subjt:  TTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHVSAA

Query:  DRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER----------TVGDTS---
        DR RYRTRKGE+  NVLGV   KG+F++V++GW+GS ADSR+LRDA+SRPN L V KGYYYL D GYPNA+G LAPYRG+R              TS   
Subjt:  DRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER----------TVGDTS---

Query:  ---------------------------GESYYPARTQCRIITACCLLHNLITREM---GLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWMQKRDDLAT
                                   G+SYYP   QCR I ACCLLHNLI REM    ++  +DE D   S       ++I +I++S EW Q RD+LA 
Subjt:  ---------------------------GESYYPARTQCRIITACCLLHNLITREM---GLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWMQKRDDLAT

Query:  GCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSA
                           ++  + +  KH WT+++EA LVE LVELV+ GGWR DNGTFR GYL +L RM+  K+    I + S ID +++ +K  + A
Subjt:  GCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSA

Query:  ISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASD-----------------DACDQNVQAEHDCYVPAPPD
        ++EM G  C+GFGWNDE KCI AE+EV+D W  SH AAKGLLNKSF HY++L++VFGKD A+                  DA   +   + D      P 
Subjt:  ISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASD-----------------DACDQNVQAEHDCYVPAPPD

Query:  INLVADIDIEDVPITPTSRPTNAGS-SQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP--EALDAREFKRWDTVREMLLAQHELSDDERVALMCI
        +N+  D    D+  T T+R +   + S   KR R  +  ++ DI+R ++     Q  +IA+WP  +  DA +  R + VR  L A  EL+  +R  LM I
Subjt:  INLVADIDIEDVPITPTSRPTNAGS-SQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP--EALDAREFKRWDTVREMLLAQHELSDDERVALMCI

Query:  LFAK-PKMTNMMSVPSNLR
        L      M   + VP +++
Subjt:  LFAK-PKMTNMMSVPSNLR

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]7.5e-12743.66Show/hide
Query:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDG
        C++I   LLRT +GL  TE+VDVEEMV MFLHI+AHD KNRVI+R+F RSGET+SRHFN  L  V+RL+D LLKKP+P+   C D RW+WFENCLGALDG
Subjt:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDG

Query:  TYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER---------
        TY+KV+V A+DR RYRTRKGE+  NVLGV   KG+F++V++GW+GS ADSR+LRDA+SRPN L V KGYYYL DAGYPNA+G LAPYRG+R         
Subjt:  TYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER---------

Query:  -TVGDTS------------------------------GESYYPARTQCRIITACCLLHNLITREM-GLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWM
             TS                              G+SY+P   QC  I ACCLLHNLI REM   D+                 +NI  + SS+   
Subjt:  -TVGDTS------------------------------GESYYPARTQCRIITACCLLHNLITREM-GLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWM

Query:  QKRDDLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVR
                         +P                 KH WT+++EA     LVELV+ GGWR DNGTFR GYL +L RM+  K+  C I + S ID +++
Subjt:  QKRDDLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVR

Query:  SLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACD------QNVQAEHDCY-VPAPPD--
         +K  + A++EM G  C+GFGWNDE KCI AE+EV+D W  SH AAKGLLNKSF HY++L++VFGKD A+    +       N    +D +   A PD  
Subjt:  SLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACD------QNVQAEHDCY-VPAPPD--

Query:  ----INLVADIDIEDVPITPTSRPTNAGS-SQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP--EALDAREFKRWDTVREMLLAQHELSDDERVA
             +L  ++  +D+  T T+R +   + S   KR R  +  ++ DI+R ++     Q  +IA+WP  +  DA + ++   + + L A  EL+  +R  
Subjt:  ----INLVADIDIEDVPITPTSRPTNAGS-SQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP--EALDAREFKRWDTVREMLLAQHELSDDERVA

Query:  LMCILFAK-PKMTNMMSVPSNLR
        LM IL      M   + VP N++
Subjt:  LMCILFAK-PKMTNMMSVPSNLR

KAA0036474.1 retrotransposon protein [Cucumis melo var. makuwa]2.3e-11552.78Show/hide
Query:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV
        LLR  +GL  TEIVDVEEMV MFLHI AHD KNRVI+R+F RSGETVSRHFN  L  VLRLY+ L+K+P P+T++C D RWK FENCLGALDGTY+KV+V
Subjt:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV

Query:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER-TVGDTSGESYYPART
         A DRP +RTRKGEI  NVLGV   KG+F++V++GWKGS ADSR+LRDAISR NGL V KGYYYLCDAGYPNA+G LAPYRG+R  + +  G +  P   
Subjt:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER-TVGDTSGESYYPART

Query:  QCRIITACCLLHNLITREMGLDVGL-------------------DEGDIGRSEPVPLDGENITFIQSSTEWMQKRDDLATGCSTRGGNIIPDPVSTYNYL
        +           N+I R  G+  G                    DEGD   +       E+I +I+++ EW Q RDDLA          I   +ST N  
Subjt:  QCRIITACCLLHNLITREMGLDVGL-------------------DEGDIGRSEPVPLDGENITFIQSSTEWMQKRDDLATGCSTRGGNIIPDPVSTYNYL

Query:  LCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSAISEMLGQGCNGFGWNDEFKC
             +  +H+WTR++E  LVE L+ELV  GGW+ DNGTFR GYLA+L RM+ +K+  C + +T+VID ++++LK  + AI+EM G  C+GFGWNDE KC
Subjt:  LCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSAISEMLGQGCNGFGWNDEFKC

Query:  IQAEREVYDAWVK
        I AE+E++D WV+
Subjt:  IQAEREVYDAWVK

KAA0062747.1 retrotransposon protein [Cucumis melo var. makuwa]1.3e-11041.24Show/hide
Query:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFE---NCLGA
        C++I   LLRTT+GLV TE++DVEEMV MFLHI+AH  KNR+I+R+F RSGETVSRHFN  L    RL+D LLKKP+P+T SC D RWKWFE   NCL +
Subjt:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFE---NCLGA

Query:  LDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER-----T
         +GTY+KV+VSA DRPRYRTRKGE+  NVLG    KG+F+FV+ GW+GS ADSR+LRDAISR NGL V KGYYYLCDAGYPNA+G LAPYRGER      
Subjt:  LDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER-----T

Query:  VGDTS-----------------------------------GESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSEPVPLDGENITFIQSSTE
         G+++                                   G+SYYP   QCR I ACCLLHNLI REM     +D+ D G S      G+ I +I++S E
Subjt:  VGDTS-----------------------------------GESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSEPVPLDGENITFIQSSTE

Query:  WMQKRDDLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRK
        W + RD LA         +  D      + +C   + +  +  ++K+  L+ +     H G      G F +                            
Subjt:  WMQKRDDLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRK

Query:  VRSLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACDQNVQAEHDCYVPAP-PDINLVAD
                    EM G  C+GFGWN+EF+CI AER+++D+WVKSH A KGLL+KSFP+Y+DL++VFGKD A+    +  V    +  VP    D   + D
Subjt:  VRSLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACDQNVQAEHDCYVPAP-PDINLVAD

Query:  IDIEDVPITPTSRPTNAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWPEALDAREFKRWDTVREMLLAQHELSDDERVALMCILF-AKPKMT
           ED+P            SQ    S         +++R  +     Q   IADW +   A E +    V + L    EL    R  LM ILF +   + 
Subjt:  IDIEDVPITPTSRPTNAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWPEALDAREFKRWDTVREMLLAQHELSDDERVALMCILF-AKPKMT

Query:  NMMSVPSNLRL
          +S+P+ L+L
Subjt:  NMMSVPSNLRL

TrEMBL top hitse value%identityAlignment
A0A5A7SWD8 Retrotransposon protein3.6e-12743.66Show/hide
Query:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDG
        C++I   LLRT +GL  TE+VDVEEMV MFLHI+AHD KNRVI+R+F RSGET+SRHFN  L  V+RL+D LLKKP+P+   C D RW+WFENCLGALDG
Subjt:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDG

Query:  TYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER---------
        TY+KV+V A+DR RYRTRKGE+  NVLGV   KG+F++V++GW+GS ADSR+LRDA+SRPN L V KGYYYL DAGYPNA+G LAPYRG+R         
Subjt:  TYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER---------

Query:  -TVGDTS------------------------------GESYYPARTQCRIITACCLLHNLITREM-GLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWM
             TS                              G+SY+P   QC  I ACCLLHNLI REM   D+                 +NI  + SS+   
Subjt:  -TVGDTS------------------------------GESYYPARTQCRIITACCLLHNLITREM-GLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWM

Query:  QKRDDLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVR
                         +P                 KH WT+++EA     LVELV+ GGWR DNGTFR GYL +L RM+  K+  C I + S ID +++
Subjt:  QKRDDLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVR

Query:  SLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACD------QNVQAEHDCY-VPAPPD--
         +K  + A++EM G  C+GFGWNDE KCI AE+EV+D W  SH AAKGLLNKSF HY++L++VFGKD A+    +       N    +D +   A PD  
Subjt:  SLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACD------QNVQAEHDCY-VPAPPD--

Query:  ----INLVADIDIEDVPITPTSRPTNAGS-SQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP--EALDAREFKRWDTVREMLLAQHELSDDERVA
             +L  ++  +D+  T T+R +   + S   KR R  +  ++ DI+R ++     Q  +IA+WP  +  DA + ++   + + L A  EL+  +R  
Subjt:  ----INLVADIDIEDVPITPTSRPTNAGS-SQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP--EALDAREFKRWDTVREMLLAQHELSDDERVA

Query:  LMCILFAK-PKMTNMMSVPSNLR
        LM IL      M   + VP N++
Subjt:  LMCILFAK-PKMTNMMSVPSNLR

A0A5A7SYW1 Retrotransposon protein1.1e-11552.78Show/hide
Query:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV
        LLR  +GL  TEIVDVEEMV MFLHI AHD KNRVI+R+F RSGETVSRHFN  L  VLRLY+ L+K+P P+T++C D RWK FENCLGALDGTY+KV+V
Subjt:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV

Query:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER-TVGDTSGESYYPART
         A DRP +RTRKGEI  NVLGV   KG+F++V++GWKGS ADSR+LRDAISR NGL V KGYYYLCDAGYPNA+G LAPYRG+R  + +  G +  P   
Subjt:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER-TVGDTSGESYYPART

Query:  QCRIITACCLLHNLITREMGLDVGL-------------------DEGDIGRSEPVPLDGENITFIQSSTEWMQKRDDLATGCSTRGGNIIPDPVSTYNYL
        +           N+I R  G+  G                    DEGD   +       E+I +I+++ EW Q RDDLA          I   +ST N  
Subjt:  QCRIITACCLLHNLITREMGLDVGL-------------------DEGDIGRSEPVPLDGENITFIQSSTEWMQKRDDLATGCSTRGGNIIPDPVSTYNYL

Query:  LCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSAISEMLGQGCNGFGWNDEFKC
             +  +H+WTR++E  LVE L+ELV  GGW+ DNGTFR GYLA+L RM+ +K+  C + +T+VID ++++LK  + AI+EM G  C+GFGWNDE KC
Subjt:  LCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSAISEMLGQGCNGFGWNDEFKC

Query:  IQAEREVYDAWVK
        I AE+E++D WV+
Subjt:  IQAEREVYDAWVK

A0A5D3DG22 Retrotransposon protein6.3e-11141.24Show/hide
Query:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFE---NCLGA
        C++I   LLRTT+GLV TE++DVEEMV MFLHI+AH  KNR+I+R+F RSGETVSRHFN  L    RL+D LLKKP+P+T SC D RWKWFE   NCL +
Subjt:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFE---NCLGA

Query:  LDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER-----T
         +GTY+KV+VSA DRPRYRTRKGE+  NVLG    KG+F+FV+ GW+GS ADSR+LRDAISR NGL V KGYYYLCDAGYPNA+G LAPYRGER      
Subjt:  LDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER-----T

Query:  VGDTS-----------------------------------GESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSEPVPLDGENITFIQSSTE
         G+++                                   G+SYYP   QCR I ACCLLHNLI REM     +D+ D G S      G+ I +I++S E
Subjt:  VGDTS-----------------------------------GESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSEPVPLDGENITFIQSSTE

Query:  WMQKRDDLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRK
        W + RD LA         +  D      + +C   + +  +  ++K+  L+ +     H G      G F +                            
Subjt:  WMQKRDDLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRK

Query:  VRSLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACDQNVQAEHDCYVPAP-PDINLVAD
                    EM G  C+GFGWN+EF+CI AER+++D+WVKSH A KGLL+KSFP+Y+DL++VFGKD A+    +  V    +  VP    D   + D
Subjt:  VRSLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACDQNVQAEHDCYVPAP-PDINLVAD

Query:  IDIEDVPITPTSRPTNAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWPEALDAREFKRWDTVREMLLAQHELSDDERVALMCILF-AKPKMT
           ED+P            SQ    S         +++R  +     Q   IADW +   A E +    V + L    EL    R  LM ILF +   + 
Subjt:  IDIEDVPITPTSRPTNAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWPEALDAREFKRWDTVREMLLAQHELSDDERVALMCILF-AKPKMT

Query:  NMMSVPSNLRL
          +S+P+ L+L
Subjt:  NMMSVPSNLRL

E5GBB2 Retrotransposon protein5.6e-12846.19Show/hide
Query:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV
        LLR  +GL  TEIVDVEEMV MFLH++AHD KNRVI+++F RSGETVSRHFN  L  VLRLY+ L+K+P P+T++C D RWK FENCLGALDGTY+KV+V
Subjt:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV

Query:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRG------------------
         A DRP +RTRKGEI  NVLGV   KG+F++V++GW+GS ADSR+LRDAIS+ NGL V KGYYYLCDAGYPNA+G LAPY+G                  
Subjt:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRG------------------

Query:  ---------------ERTVGDTS-------GESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSE-PVPLDGENITFIQSSTEWMQKRDDLA
                       ER  G          G+SYYP + QCR I AC LLHNLI REM     +++ D G S        E+I +I+++ EW Q RDDLA
Subjt:  ---------------ERTVGDTS-------GESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSE-PVPLDGENITFIQSSTEWMQKRDDLA

Query:  TGCST----RGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLK
        T   T    RGG+              C                     +ELV  GGW+ DNGTFR GYLA+L RM+ +K++ C + +T+VID ++++LK
Subjt:  TGCST----RGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLK

Query:  WQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASD-------DACDQNVQAEHDCY------VPAPP
          + AI+EMLG  C+GFGWNDE KCI AE+E++D WV+S  AAKGLLN  FP+Y++L +VFG+D A+        D         +D +         PP
Subjt:  WQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASD-------DACDQNVQAEHDCY------VPAPP

Query:  DINLVADIDIEDVPITPTSRPT--NAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP
          +   DI  +DV  +  SR +    GSS S KR R S     ++ +  ++     Q  +IA+WP
Subjt:  DINLVADIDIEDVPITPTSRPT--NAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP

E5GCB5 Retrotransposon protein1.5e-13344.91Show/hide
Query:  TTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHVSAA
        T +GL  TE+VDVEEMV MFLHI+AHD K+RVI+R+F RSGET+SRHFN  L  V+RL++ LLKKP+P+   C D RW+WFENCLGALDGTY+KV+V A+
Subjt:  TTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHVSAA

Query:  DRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER----------TVGDTS---
        DR RYRTRKGE+  NVLGV   KG+F++V++GW+GS ADSR+LRDA+SRPN L V KGYYYL D GYPNA+G LAPYRG+R              TS   
Subjt:  DRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER----------TVGDTS---

Query:  ---------------------------GESYYPARTQCRIITACCLLHNLITREM---GLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWMQKRDDLAT
                                   G+SYYP   QCR I ACCLLHNLI REM    ++  +DE D   S       ++I +I++S EW Q RD+LA 
Subjt:  ---------------------------GESYYPARTQCRIITACCLLHNLITREM---GLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWMQKRDDLAT

Query:  GCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSA
                           ++  + +  KH WT+++EA LVE LVELV+ GGWR DNGTFR GYL +L RM+  K+    I + S ID +++ +K  + A
Subjt:  GCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSA

Query:  ISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASD-----------------DACDQNVQAEHDCYVPAPPD
        ++EM G  C+GFGWNDE KCI AE+EV+D W  SH AAKGLLNKSF HY++L++VFGKD A+                  DA   +   + D      P 
Subjt:  ISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASD-----------------DACDQNVQAEHDCYVPAPPD

Query:  INLVADIDIEDVPITPTSRPTNAGS-SQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP--EALDAREFKRWDTVREMLLAQHELSDDERVALMCI
        +N+  D    D+  T T+R +   + S   KR R  +  ++ DI+R ++     Q  +IA+WP  +  DA +  R + VR  L A  EL+  +R  LM I
Subjt:  INLVADIDIEDVPITPTSRPTNAGS-SQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWP--EALDAREFKRWDTVREMLLAQHELSDDERVALMCI

Query:  LFAK-PKMTNMMSVPSNLR
        L      M   + VP +++
Subjt:  LFAK-PKMTNMMSVPSNLR

SwissProt top hitse value%identityAlignment
Q04740 Ribonuclease H2.2e-0435.21Show/hide
Query:  FYVVFAGRNLGIYTSWVECHRQVNQFKGALRKSYPTFEEAEYAFRHYVAGTRGEPNLVDEHDSCRNLHVGG
        FY V  GR  GIY +W EC  QV+ + GA+ K + ++E+A+           G+PN    + S  + H GG
Subjt:  FYVVFAGRNLGIYTSWVECHRQVNQFKGALRKSYPTFEEAEYAFRHYVAGTRGEPNLVDEHDSCRNLHVGG

Q07762 Ribonuclease H3.8e-0452.5Show/hide
Query:  FYVVFAGRNLGIYTSWVECHRQVNQFKGALRKSYPTFEEA
        FYVV  GR  GIY++W +C  QV  F GA+ KS+ T  EA
Subjt:  FYVVFAGRNLGIYTSWVECHRQVNQFKGALRKSYPTFEEA

Q9KEI9 Ribonuclease H8.7e-0946.34Show/hide
Query:  MGALKFYVVFAGRNLGIYTSWVECHRQVNQFKGALRKSYPTFEEAEYAFRHYVAGTRGEPNLVDEHDSCRNLHVGGYGTVGN
        M   K+YVV+ GR  GIYTSW  C  QV  + GA  KSYP+ EEAE AFR    G    P L  E     +L V   G+ GN
Subjt:  MGALKFYVVFAGRNLGIYTSWVECHRQVNQFKGALRKSYPTFEEAEYAFRHYVAGTRGEPNLVDEHDSCRNLHVGGYGTVGN

Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein5.9e-2131.66Show/hide
Query:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPE-------PITTSCQDGRWKWFEN
        C++    +L+T   L  T  + +EE V MFL I  H+   R +  +F R+ ETV R F   L+    L    ++ P        P         W +F  
Subjt:  CYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPE-------PITTSCQDGRWKWFEN

Query:  CLGALDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGL-IVSKGYYYLCDAGYPNADGLLAPYRGER
         +GA+DGT+V V V    +  Y  R      N++ +   K  F ++ +G  GS  D+ VL+ A    +   +     YYL D+GYPN  GLLAPYR  R
Subjt:  CLGALDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGL-IVSKGYYYLCDAGYPNADGLLAPYRGER

AT4G02210.1 unknown protein7.0e-0621.76Show/hide
Query:  VIDRKVRSLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACDQNVQAEHDCYV-----PA
        V+  + +SL+ Q++AI  +L    +GF W++E + + A+  V+  ++K+H  A+  + +  P+Y+DL  + G         D  ++ E++C+V       
Subjt:  VIDRKVRSLKWQYSAISEMLGQGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACDQNVQAEHDCYV-----PA

Query:  PPDINLVADIDIEDVPITPTSRPTNAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWPEALDAREFKRWDTVREMLLAQHELSDDE
          +          D+ I+     +N+     + +       +   I  +   + ETQ   I D  EA+ A      D   E++L   +L +D+
Subjt:  PPDINLVADIDIEDVPITPTSRPTNAGSSQSRKRSRASYEAEALDIMRQSVAMQETQFTKIADWPEALDAREFKRWDTVREMLLAQHELSDDE

AT5G28950.1 unknown protein1.6e-1343.06Show/hide
Query:  WKWFENCLGALDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISR
        + +F++C+GA+D T++   VS    P +R RKG+I  N+L   +   EF++V+SGW+GS  DS+VL DA++R
Subjt:  WKWFENCLGALDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISR

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.1e-0646.43Show/hide
Query:  FIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER
        FI+V+SGW+GS  DSRVL DA+ +          +YL D G+ N    LAP+RG R
Subjt:  FIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGER

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.1e-3031.45Show/hide
Query:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV
        LL+T   L  T  + +E  + +FL II H+ + R ++  F  SGET+SRHFN  L+ V+ +     +      T   D    +F++C+G +D  ++ V V
Subjt:  LLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVIRRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHV

Query:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRG------------------
           ++  +R   G +  NVL   S    F +V++GW+GS +D +VL  A++R N L V +G YY+ D  YPN  G +APY G                  
Subjt:  SAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVLRDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRG------------------

Query:  --ERTVGDTSG---ESY--------YPARTQCRIITACCLLHNLITRE
           R +  T G   E +        YP +TQ +++ A C LHN +  E
Subjt:  --ERTVGDTSG---ESY--------YPARTQCRIITACCLLHNLITRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTTCTTCCTTCATATTTTCTTCGGATCTCATCTGGTATTTGTGTTCTTCCTCATTTTTTCTTCGGATCTGAACTTCCTACTTCGTCCCGTCACGTCTTCGGGTCT
TGGATCAAATGCATGCCATCTCCTTGTTCATGAATCTCGTTTTAGAAACCCACCCATGTCGCTTCAGCTTGTCGTGGGTGTCGACGATAGGAATCGGTATCCGCTTGGTA
GATTGGAGCACGGTGAGGGGTGGGACGAAGGTTTACTCCAAAAAACACCGTTATATTCGAGGACTATGGGGGCATTGAAATTCTATGTCGTGTTTGCTGGTCGCAACCTA
GGAATTTACACCTCATGGGTTGAATGCCACAGACAAGTTAACCAATTTAAGGGAGCACTACGCAAGTCGTATCCAACATTTGAAGAAGCAGAGTATGCATTTAGGCATTA
CGTTGCGGGCACCCGGGGCGAACCAAACCTCGTTGACGAACATGATTCGTGTCGTAACCTTCATGTAGGGGGCTATGGAACCGTAGGAAATTGTTATAGTATTAGTCAGA
TTCTGTTGAGAACGACGTCCGGGTTGGTAGGAACAGAAATCGTAGACGTGGAAGAGATGGTTGTGATGTTCTTGCACATCATTGCTCATGATGCTAAGAATCGAGTCATT
AGAAGACAGTTTGCACGGTCGGGTGAAACTGTTTCTCGGCACTTCAACGCGACTTTGAGTGTCGTACTACGATTGTACGACGTTCTACTTAAGAAACCTGAACCGATCAC
GACTTCTTGCCAAGATGGGAGGTGGAAATGGTTTGAGAATTGTTTAGGGGCATTGGACGGTACGTACGTAAAGGTCCATGTTAGTGCAGCTGATCGACCAAGGTATAGGA
CGCGGAAGGGTGAGATTGATGCAAATGTATTGGGCGTTGTGTCACCAAAAGGTGAATTCATTTTTGTTATGTCGGGATGGAAAGGTTCGGGTGCTGATTCTCGTGTACTC
AGAGATGCTATATCACGCCCCAATGGACTAATAGTGTCGAAGGGTTACTATTACCTCTGTGACGCTGGGTACCCAAACGCAGATGGTTTATTGGCACCTTATAGAGGAGA
GCGGACGGTGGGCGATACTTCAGGGGAATCCTACTACCCTGCTCGGACCCAGTGTCGAATTATAACAGCGTGCTGTTTACTCCACAACCTTATCACCCGAGAGATGGGTC
TGGATGTTGGGTTAGATGAAGGTGATATTGGTCGATCTGAACCCGTACCTCTAGATGGTGAGAACATAACCTTCATTCAAAGCTCCACTGAATGGATGCAAAAGCGAGAT
GACCTAGCAACAGGATGTTCAACGCGTGGGGGCAACATAATCCCTGATCCAGTTTCGACATACAACTATCTGCTATGTTGTGCAGATAAACAACTGAAACACATCTGGAC
GAGGCAGAAAGAGGCAAGGTTGGTGGAATCCCTCGTGGAGCTCGTCCACGAAGGTGGATGGAGAGGGGACAACGGGACCTTCAGGGTCGGATACCTAGCCCGACTGAAGC
GAATGATAAAAGATAAAATGACGACCTGCACCATAGAGTCAACGTCCGTAATAGACCGCAAGGTGCGGTCCTTGAAATGGCAATACAGTGCCATCTCAGAGATGTTGGGT
CAGGGTTGCAATGGATTCGGTTGGAATGATGAGTTTAAATGCATCCAGGCTGAGAGAGAGGTATATGATGCATGGGTGAAGTCACACTCGGCCGCGAAAGGGCTGCTGAA
CAAGTCATTTCCTCATTACGAGGATCTTGCTTTCGTGTTCGGCAAAGACATGGCGAGTGACGATGCGTGTGATCAGAATGTCCAGGCGGAACATGATTGTTATGTCCCCG
CTCCTCCGGACATTAATCTGGTCGCGGACATCGACATCGAGGACGTCCCCATCACACCGACAAGCCGACCAACCAATGCAGGGTCCTCCCAGAGTCGAAAGCGGAGCAGA
GCATCATATGAAGCTGAAGCCCTTGATATTATGAGGCAGTCAGTGGCTATGCAGGAGACACAGTTCACTAAGATCGCTGACTGGCCGGAAGCCCTAGATGCGCGAGAGTT
CAAGAGGTGGGACACGGTCAGAGAGATGCTCCTGGCGCAGCACGAGCTATCGGACGATGAGAGAGTTGCTCTAATGTGCATCCTCTTCGCCAAACCGAAGATGACAAATA
TGATGTCTGTGCCATCGAACCTCAGGCTTTGCTTTCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTTCTTCCTTCATATTTTCTTCGGATCTCATCTGGTATTTGTGTTCTTCCTCATTTTTTCTTCGGATCTGAACTTCCTACTTCGTCCCGTCACGTCTTCGGGTCT
TGGATCAAATGCATGCCATCTCCTTGTTCATGAATCTCGTTTTAGAAACCCACCCATGTCGCTTCAGCTTGTCGTGGGTGTCGACGATAGGAATCGGTATCCGCTTGGTA
GATTGGAGCACGGTGAGGGGTGGGACGAAGGTTTACTCCAAAAAACACCGTTATATTCGAGGACTATGGGGGCATTGAAATTCTATGTCGTGTTTGCTGGTCGCAACCTA
GGAATTTACACCTCATGGGTTGAATGCCACAGACAAGTTAACCAATTTAAGGGAGCACTACGCAAGTCGTATCCAACATTTGAAGAAGCAGAGTATGCATTTAGGCATTA
CGTTGCGGGCACCCGGGGCGAACCAAACCTCGTTGACGAACATGATTCGTGTCGTAACCTTCATGTAGGGGGCTATGGAACCGTAGGAAATTGTTATAGTATTAGTCAGA
TTCTGTTGAGAACGACGTCCGGGTTGGTAGGAACAGAAATCGTAGACGTGGAAGAGATGGTTGTGATGTTCTTGCACATCATTGCTCATGATGCTAAGAATCGAGTCATT
AGAAGACAGTTTGCACGGTCGGGTGAAACTGTTTCTCGGCACTTCAACGCGACTTTGAGTGTCGTACTACGATTGTACGACGTTCTACTTAAGAAACCTGAACCGATCAC
GACTTCTTGCCAAGATGGGAGGTGGAAATGGTTTGAGAATTGTTTAGGGGCATTGGACGGTACGTACGTAAAGGTCCATGTTAGTGCAGCTGATCGACCAAGGTATAGGA
CGCGGAAGGGTGAGATTGATGCAAATGTATTGGGCGTTGTGTCACCAAAAGGTGAATTCATTTTTGTTATGTCGGGATGGAAAGGTTCGGGTGCTGATTCTCGTGTACTC
AGAGATGCTATATCACGCCCCAATGGACTAATAGTGTCGAAGGGTTACTATTACCTCTGTGACGCTGGGTACCCAAACGCAGATGGTTTATTGGCACCTTATAGAGGAGA
GCGGACGGTGGGCGATACTTCAGGGGAATCCTACTACCCTGCTCGGACCCAGTGTCGAATTATAACAGCGTGCTGTTTACTCCACAACCTTATCACCCGAGAGATGGGTC
TGGATGTTGGGTTAGATGAAGGTGATATTGGTCGATCTGAACCCGTACCTCTAGATGGTGAGAACATAACCTTCATTCAAAGCTCCACTGAATGGATGCAAAAGCGAGAT
GACCTAGCAACAGGATGTTCAACGCGTGGGGGCAACATAATCCCTGATCCAGTTTCGACATACAACTATCTGCTATGTTGTGCAGATAAACAACTGAAACACATCTGGAC
GAGGCAGAAAGAGGCAAGGTTGGTGGAATCCCTCGTGGAGCTCGTCCACGAAGGTGGATGGAGAGGGGACAACGGGACCTTCAGGGTCGGATACCTAGCCCGACTGAAGC
GAATGATAAAAGATAAAATGACGACCTGCACCATAGAGTCAACGTCCGTAATAGACCGCAAGGTGCGGTCCTTGAAATGGCAATACAGTGCCATCTCAGAGATGTTGGGT
CAGGGTTGCAATGGATTCGGTTGGAATGATGAGTTTAAATGCATCCAGGCTGAGAGAGAGGTATATGATGCATGGGTGAAGTCACACTCGGCCGCGAAAGGGCTGCTGAA
CAAGTCATTTCCTCATTACGAGGATCTTGCTTTCGTGTTCGGCAAAGACATGGCGAGTGACGATGCGTGTGATCAGAATGTCCAGGCGGAACATGATTGTTATGTCCCCG
CTCCTCCGGACATTAATCTGGTCGCGGACATCGACATCGAGGACGTCCCCATCACACCGACAAGCCGACCAACCAATGCAGGGTCCTCCCAGAGTCGAAAGCGGAGCAGA
GCATCATATGAAGCTGAAGCCCTTGATATTATGAGGCAGTCAGTGGCTATGCAGGAGACACAGTTCACTAAGATCGCTGACTGGCCGGAAGCCCTAGATGCGCGAGAGTT
CAAGAGGTGGGACACGGTCAGAGAGATGCTCCTGGCGCAGCACGAGCTATCGGACGATGAGAGAGTTGCTCTAATGTGCATCCTCTTCGCCAAACCGAAGATGACAAATA
TGATGTCTGTGCCATCGAACCTCAGGCTTTGCTTTCTATGA
Protein sequenceShow/hide protein sequence
MVFFLHIFFGSHLVFVFFLIFSSDLNFLLRPVTSSGLGSNACHLLVHESRFRNPPMSLQLVVGVDDRNRYPLGRLEHGEGWDEGLLQKTPLYSRTMGALKFYVVFAGRNL
GIYTSWVECHRQVNQFKGALRKSYPTFEEAEYAFRHYVAGTRGEPNLVDEHDSCRNLHVGGYGTVGNCYSISQILLRTTSGLVGTEIVDVEEMVVMFLHIIAHDAKNRVI
RRQFARSGETVSRHFNATLSVVLRLYDVLLKKPEPITTSCQDGRWKWFENCLGALDGTYVKVHVSAADRPRYRTRKGEIDANVLGVVSPKGEFIFVMSGWKGSGADSRVL
RDAISRPNGLIVSKGYYYLCDAGYPNADGLLAPYRGERTVGDTSGESYYPARTQCRIITACCLLHNLITREMGLDVGLDEGDIGRSEPVPLDGENITFIQSSTEWMQKRD
DLATGCSTRGGNIIPDPVSTYNYLLCCADKQLKHIWTRQKEARLVESLVELVHEGGWRGDNGTFRVGYLARLKRMIKDKMTTCTIESTSVIDRKVRSLKWQYSAISEMLG
QGCNGFGWNDEFKCIQAEREVYDAWVKSHSAAKGLLNKSFPHYEDLAFVFGKDMASDDACDQNVQAEHDCYVPAPPDINLVADIDIEDVPITPTSRPTNAGSSQSRKRSR
ASYEAEALDIMRQSVAMQETQFTKIADWPEALDAREFKRWDTVREMLLAQHELSDDERVALMCILFAKPKMTNMMSVPSNLRLCFL