; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019578 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019578
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr5:43468958..43472322
RNA-Seq ExpressionLag0019578
SyntenyLag0019578
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.7e-17954.55Show/hide
Query:  EPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNC
        E   +VV+VMM +  T E  M EM+  IN LMK +EE+D +IA LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N 
Subjt:  EPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNC

Query:  IRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELERE
        IRAQYGGP Q S +YSKPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGN F+                   
Subjt:  IRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELERE

Query:  FLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLP
               TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA++  +D L+ 
Subjt:  FLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLP

Query:  NMRKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDD
          R +     +T       + ESM+V  T  KS SK K   +  +H        TL+ERQKK+Y FPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDD
Subjt:  NMRKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDD

Query:  PKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNK--------HQRKN-------DP------KKLQPKRKRSKKFPQPQQ-
        P YCKYHRVI H VE+CFVLK+LI KLA+E KIELD+DEVAQ+N   +   +          QRK+       +P      +K      ++K+ P   + 
Subjt:  PKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNK--------HQRKN-------DP------KKLQPKRKRSKKFPQPQQ-

Query:  ---LVTLNKSFSKNFHKK--EKKNFATSYCIDV-------EEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVS
           +  L +SF ++  ++  E     T+  ++V       EE+DNS + +Q+ SVFD IKP TTR SVFQR+SMA  +EENQC   T+ + SAF+RLS+S
Subjt:  ---LVTLNKSFSKNFHKK--EKKNFATSYCIDV-------EEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVS

Query:  TSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC
         SKK +PST  FDRLK+T+DQ +R+M  L+ K F E N+D K+ S +PSRMKRK SV INTEGSL VKP  II TNP ++G ++  DE N SC
Subjt:  TSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.2e-17550.2Show/hide
Query:  EPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNC
        E   +VV+VMM +  T E  M EM+  IN LMK  EE+D +IA LK Q++     ESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N 
Subjt:  EPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNC

Query:  IRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELERE
        IRAQYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDLEPE           
Subjt:  IRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELERE

Query:  FLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLP
                                GE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+ 
Subjt:  FLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLP

Query:  NMRKEGRND--------KETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVD
          ++  +N+           + ESM+V  T  KS SK K   +  ++        TL+ERQKK+Y FPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVD
Subjt:  NMRKEGRND--------KETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVD

Query:  DPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN----------LDTI-----------------------------------------
        DP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N          L +I                                         
Subjt:  DPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN----------LDTI-----------------------------------------

Query:  -----------------------------KGNNKHQRKNDPKKL----QPKRKRSKKFPQPQQLVTLNKSFSKNFHKKEKKNFA------TSYCIDV---
                                     KGN  H++K    K     +P + + K F QP++ + L +   ++F +   +         T+  ++V   
Subjt:  -----------------------------KGNNKHQRKNDPKKL----QPKRKRSKKFPQPQQLVTLNKSFSKNFHKKEKKNFA------TSYCIDV---

Query:  ----EEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDE
            EEVDNS + +Q+  VF RIKP T R SVFQR+SMA  EEENQC  ST+ R SAF+RLS+ST KK +PSTS FDRLK+ +DQ +R+M +L+ K F E
Subjt:  ----EEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDE

Query:  VNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC
         N+D K+ S +PSRMKRK SV INTEGSL VKP  II TNP ++G ++  DE N SC
Subjt:  VNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.1e-16969.32Show/hide
Query:  PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T+  T+E+RM E+++ +N LMKA+EE+D +IA LK  IE++  AESS T  +KN +KGK I+Q+ QPQ S SIASLS+QQLQ+MI N I+
Subjt:  PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGN FDWYTDLEPESIDSWE+LER+FL
Subjt:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA++ N DLL+P +
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDKET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC
        RKE +  K T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+Y FPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDKET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN
        KYHRVI HPVE+CFVLK+LILKLA + KIEL+LD+VAQ+N
Subjt:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]8.0e-1531.97Show/hide
Query:  DMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIK-GNNKHQRKNDPKKLQPKRKRSK
        ++L++   A L ++ K    +++EK D   Y    R +     + + L      +AK G      D   ++ L ++K  + + +     KKLQ   K+  
Subjt:  DMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIK-GNNKHQRKNDPKKLQPKRKRSK

Query:  KFPQPQQLVTLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGE----QKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVST
          P  +  +    S       K K   A +  I VEE  +SE+G+    Q+ SVFDRI     RPSVFQR+S +  ++ NQ S  + TR SAFQRL+ S 
Subjt:  KFPQPQQLVTLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGE----QKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVST

Query:  SK----KSQPST--SVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEA
         K       P+T  S F RL V+  + ++K           V  D++++S+ PSRMKRK  V +NTEGSLKVK + ++ T P    P+ + D A
Subjt:  SK----KSQPST--SVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEA

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.2e-20558.11Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYDPLFE--PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y++ K      P  E  P  ++++VM+T   T+E RM E+++ +N LMK +EE+D +IA LK  IE++  AESS    VKN DKG
Subjt:  KASIIASEETTLQGAYTNDKFLVKYDPLFE--PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKG

Query:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ
        K ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQ
Subjt:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ

Query:  FVRTLKGNTFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
        FVRTLKGN  DWY DLEPESID+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  FVRTLKGNTFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDAD
        KPRTFEELATRAHDMELSIA++  +D L+P  R +     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+Y FPD+D
Subjt:  KPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDAD

Query:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNKHQRKNDPKKLQPKRK--
        + DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N   I+  +   +  D   LQ +R   
Subjt:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNKHQRKNDPKKLQPKRK--

Query:  ------RSKKFPQPQQLVTLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQR
              RS     P++++ +    + +  + +  N+ +S     +EV+NS +  Q+ SVFDRIKP TTR SVFQR+S+A  EEENQC    +TR S  +R
Subjt:  ------RSKKFPQPQQLVTLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQR

Query:  LSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC
        LS+ST KK +PSTS FDRLK+T+DQ +R+M + + K F E N+D K+ S +PSRMKRK  V INTEGSL VKP  II TNPT++G +Q   E N SC
Subjt:  LSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.4e-19254.38Show/hide
Query:  PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T+E RM E+++ +N LMK +EE+D +IA LK  IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGN FD Y DLEPESID+WE+LER+FL
Subjt:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI ++  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDKET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+Y F D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP 
Subjt:  RKEGRNDKET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-----------------------------------LDTIKGNNK--------------
        YCKYHRVI HPVE+CFVLK+LILKLA+E KIEL++DEVAQ+N                                   + TI   NK              
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-----------------------------------LDTIKGNNK--------------

Query:  -HQRKNDPKKLQ-----------------PKRKRSKK-------------FPQPQQLVTLNKSFSKNF---HKKEKKNFATSYCIDV----------EEV
          Q+   P  +Q                  K +R+KK             F Q ++ +TL +   ++F   H +E     T +   +          +EV
Subjt:  -HQRKNDPKKLQ-----------------PKRKRSKK-------------FPQPQQLVTLNKSFSKNF---HKKEKKNFATSYCIDV----------EEV

Query:  DNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKL
        +N  +  Q+ SVFDRIKP TTR SVFQR+SMA  EEENQC    +TR S F+RLS+S SKK++PSTS FDRLK+T+DQ +R+M +L+ K F E N+D K+
Subjt:  DNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKL

Query:  QSSIPSRMKRKFSVLINTE
         S +PSRMKRK  V INT+
Subjt:  QSSIPSRMKRKFSVLINTE

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein3.2e-17954.55Show/hide
Query:  EPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNC
        E   +VV+VMM +  T E  M EM+  IN LMK +EE+D +IA LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N 
Subjt:  EPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNC

Query:  IRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELERE
        IRAQYGGP Q S +YSKPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGN F+                   
Subjt:  IRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELERE

Query:  FLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLP
               TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA++  +D L+ 
Subjt:  FLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLP

Query:  NMRKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDD
          R +     +T       + ESM+V  T  KS SK K   +  +H        TL+ERQKK+Y FPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDD
Subjt:  NMRKEGRNDKET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDD

Query:  PKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNK--------HQRKN-------DP------KKLQPKRKRSKKFPQPQQ-
        P YCKYHRVI H VE+CFVLK+LI KLA+E KIELD+DEVAQ+N   +   +          QRK+       +P      +K      ++K+ P   + 
Subjt:  PKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNK--------HQRKN-------DP------KKLQPKRKRSKKFPQPQQ-

Query:  ---LVTLNKSFSKNFHKK--EKKNFATSYCIDV-------EEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVS
           +  L +SF ++  ++  E     T+  ++V       EE+DNS + +Q+ SVFD IKP TTR SVFQR+SMA  +EENQC   T+ + SAF+RLS+S
Subjt:  ---LVTLNKSFSKNFHKK--EKKNFATSYCIDV-------EEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVS

Query:  TSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC
         SKK +PST  FDRLK+T+DQ +R+M  L+ K F E N+D K+ S +PSRMKRK SV INTEGSL VKP  II TNP ++G ++  DE N SC
Subjt:  TSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC

A0A5A7SUW1 Retrotransposon gag protein5.7e-17650.2Show/hide
Query:  EPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNC
        E   +VV+VMM +  T E  M EM+  IN LMK  EE+D +IA LK Q++     ESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N 
Subjt:  EPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNC

Query:  IRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELERE
        IRAQYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDLEPE           
Subjt:  IRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELERE

Query:  FLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLP
                                GE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+ 
Subjt:  FLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLP

Query:  NMRKEGRND--------KETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVD
          ++  +N+           + ESM+V  T  KS SK K   +  ++        TL+ERQKK+Y FPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVD
Subjt:  NMRKEGRND--------KETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVD

Query:  DPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN----------LDTI-----------------------------------------
        DP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N          L +I                                         
Subjt:  DPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN----------LDTI-----------------------------------------

Query:  -----------------------------KGNNKHQRKNDPKKL----QPKRKRSKKFPQPQQLVTLNKSFSKNFHKKEKKNFA------TSYCIDV---
                                     KGN  H++K    K     +P + + K F QP++ + L +   ++F +   +         T+  ++V   
Subjt:  -----------------------------KGNNKHQRKNDPKKL----QPKRKRSKKFPQPQQLVTLNKSFSKNFHKKEKKNFA------TSYCIDV---

Query:  ----EEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDE
            EEVDNS + +Q+  VF RIKP T R SVFQR+SMA  EEENQC  ST+ R SAF+RLS+ST KK +PSTS FDRLK+ +DQ +R+M +L+ K F E
Subjt:  ----EEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDE

Query:  VNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC
         N+D K+ S +PSRMKRK SV INTEGSL VKP  II TNP ++G ++  DE N SC
Subjt:  VNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC

A0A5A7TZU9 Ribonuclease H1.0e-16969.32Show/hide
Query:  PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T+  T+E+RM E+++ +N LMKA+EE+D +IA LK  IE++  AESS T  +KN +KGK I+Q+ QPQ S SIASLS+QQLQ+MI N I+
Subjt:  PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGN FDWYTDLEPESIDSWE+LER+FL
Subjt:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA++ N DLL+P +
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDKET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC
        RKE +  K T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+Y FPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDKET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN
        KYHRVI HPVE+CFVLK+LILKLA + KIEL+LD+VAQ+N
Subjt:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN

A0A5A7TZU9 Ribonuclease H3.9e-1531.97Show/hide
Query:  DMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIK-GNNKHQRKNDPKKLQPKRKRSK
        ++L++   A L ++ K    +++EK D   Y    R +     + + L      +AK G      D   ++ L ++K  + + +     KKLQ   K+  
Subjt:  DMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIK-GNNKHQRKNDPKKLQPKRKRSK

Query:  KFPQPQQLVTLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGE----QKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVST
          P  +  +    S       K K   A +  I VEE  +SE+G+    Q+ SVFDRI     RPSVFQR+S +  ++ NQ S  + TR SAFQRL+ S 
Subjt:  KFPQPQQLVTLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGE----QKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVST

Query:  SK----KSQPST--SVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEA
         K       P+T  S F RL V+  + ++K           V  D++++S+ PSRMKRK  V +NTEGSLKVK + ++ T P    P+ + D A
Subjt:  SK----KSQPST--SVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEA

A0A5A7URH1 Ty3-gypsy retrotransposon protein4.5e-20558.11Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYDPLFE--PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y++ K      P  E  P  ++++VM+T   T+E RM E+++ +N LMK +EE+D +IA LK  IE++  AESS    VKN DKG
Subjt:  KASIIASEETTLQGAYTNDKFLVKYDPLFE--PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKG

Query:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ
        K ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQ
Subjt:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ

Query:  FVRTLKGNTFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
        FVRTLKGN  DWY DLEPESID+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  FVRTLKGNTFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDAD
        KPRTFEELATRAHDMELSIA++  +D L+P  R +     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+Y FPD+D
Subjt:  KPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDKET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDAD

Query:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNKHQRKNDPKKLQPKRK--
        + DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N   I+  +   +  D   LQ +R   
Subjt:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNKHQRKNDPKKLQPKRK--

Query:  ------RSKKFPQPQQLVTLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQR
              RS     P++++ +    + +  + +  N+ +S     +EV+NS +  Q+ SVFDRIKP TTR SVFQR+S+A  EEENQC    +TR S  +R
Subjt:  ------RSKKFPQPQQLVTLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQR

Query:  LSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC
        LS+ST KK +PSTS FDRLK+T+DQ +R+M + + K F E N+D K+ S +PSRMKRK  V INTEGSL VKP  II TNPT++G +Q   E N SC
Subjt:  LSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSC

A0A5D3BX77 Retrotransposon gag protein1.1e-19254.38Show/hide
Query:  PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T+E RM E+++ +N LMK +EE+D +IA LK  IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGN FD Y DLEPESID+WE+LER+FL
Subjt:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI ++  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDKET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+Y F D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP 
Subjt:  RKEGRNDKET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-----------------------------------LDTIKGNNK--------------
        YCKYHRVI HPVE+CFVLK+LILKLA+E KIEL++DEVAQ+N                                   + TI   NK              
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-----------------------------------LDTIKGNNK--------------

Query:  -HQRKNDPKKLQ-----------------PKRKRSKK-------------FPQPQQLVTLNKSFSKNF---HKKEKKNFATSYCIDV----------EEV
          Q+   P  +Q                  K +R+KK             F Q ++ +TL +   ++F   H +E     T +   +          +EV
Subjt:  -HQRKNDPKKLQ-----------------PKRKRSKK-------------FPQPQQLVTLNKSFSKNF---HKKEKKNFATSYCIDV----------EEV

Query:  DNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKL
        +N  +  Q+ SVFDRIKP TTR SVFQR+SMA  EEENQC    +TR S F+RLS+S SKK++PSTS FDRLK+T+DQ +R+M +L+ K F E N+D K+
Subjt:  DNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKL

Query:  QSSIPSRMKRKFSVLINTE
         S +PSRMKRK  V INT+
Subjt:  QSSIPSRMKRKFSVLINTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTCAAGACTTCTTTGATGACCGCTGTCAAGAACAAGTCTTACATCAGTTCTACTGCCCATTGTTGCTTCAATGAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACCAACGACAAGTTTCTTGTTAAGTACGACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTACGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAATTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAAGTGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGAATCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAACAGTTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGTGAGAAC
GCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATACGTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCCATGAGCCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTCGCACCTTTGAAGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCAAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGG
AAGGAACGATAAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAGAAGAAAATCTATCATTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTGATTTTAAAGCTGGCTAAGGA
AGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGATACAATCAAAGGAAATAACAAACATCAAAGAAAGAATGATCCTAAGAAACTTCAACCCAAGA
GGAAGAGAAGTAAAAAGTTTCCTCAACCTCAACAACTGGTGACGTTGAATAAATCCTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACTTTGCGACTTCGTACTGC
ATCGACGTAGAAGAAGTTGACAATTCTGAGAAGGGTGAACAAAAGATCTCCGTCTTCGATCGCATCAAGCCTCAAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTAT
GGCCGCGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTG
TTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAACGACAAGAAGCTTCAAAGTAGCATC
CCGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGGTGAAACCAAATCTCATTATCTTGACCAATCCTACAAGTCAAGGACCTGATCA
AGACCATGATGAAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCA
GTTCCTTCCTCACAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGT
TCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCC
TTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTC
ACATCGCTTCGTTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCC
TCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGT
CGCTTTGCTGCAGTTCCTTCTTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTC
GCTGCAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAA
TTCGAAGGTTTCGAAGGTTCTCACGCGCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTCAAGACTTCTTTGATGACCGCTGTCAAGAACAAGTCTTACATCAGTTCTACTGCCCATTGTTGCTTCAATGAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACCAACGACAAGTTTCTTGTTAAGTACGACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTACGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAATTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAAGTGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGAATCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAACAGTTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGTGAGAAC
GCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATACGTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCCATGAGCCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTCGCACCTTTGAAGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCAAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGG
AAGGAACGATAAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAGAAGAAAATCTATCATTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTGATTTTAAAGCTGGCTAAGGA
AGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGATACAATCAAAGGAAATAACAAACATCAAAGAAAGAATGATCCTAAGAAACTTCAACCCAAGA
GGAAGAGAAGTAAAAAGTTTCCTCAACCTCAACAACTGGTGACGTTGAATAAATCCTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACTTTGCGACTTCGTACTGC
ATCGACGTAGAAGAAGTTGACAATTCTGAGAAGGGTGAACAAAAGATCTCCGTCTTCGATCGCATCAAGCCTCAAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTAT
GGCCGCGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTG
TTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAACGACAAGAAGCTTCAAAGTAGCATC
CCGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGGTGAAACCAAATCTCATTATCTTGACCAATCCTACAAGTCAAGGACCTGATCA
AGACCATGATGAAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCA
GTTCCTTCCTCACAGTTCGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGT
TCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCC
TTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTC
ACATCGCTTCGTTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCC
TCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGT
CGCTTTGCTGCAGTTCCTTCTTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTC
GCTGCAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAA
TTCGAAGGTTTCGAAGGTTCTCACGCGCTGTGA
Protein sequenceShow/hide protein sequence
MSFKTSLMTAVKNKSYISSTAHCCFNELRLQEDKASIIASEETTLQGAYTNDKFLVKYDPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSQIAQLKCQ
IENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCEN
AGTRGDLLVKQFVRTLKGNTFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKG
IKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDKETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPE
EMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLDTIKGNNKHQRKNDPKKLQPKRKRSKKFPQPQQLVTLNKSFSKNFHKKEKKNFATSYC
IDVEEVDNSEKGEQKISVFDRIKPQTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSI
PSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDEANGSCFLQVRRFPRASLQFLPPKFEGSHAFRCSSFLTVRRFSRASLQFLPPSSKVLTRFAAVPSSKFKG
SHALRCSSFLQVRRFSRASLQFLPPSLKVLTRFVAVPSSQFEGSHALRCSSFPQVRRFSRRFAAVPSSKFEGSHIASLRSFLQVRRFSRASLCNSFPKFEGSHALRAVPS
SKFEGSHALRCSSFLKVRRFSRASLQFLPPKFEGSHVALLQFLLPSLKVLTSLRFALRFVAVPSSKFEGSHTLRCSFFSLSSKVLTRFDAVPSSLSSKVLTRFAAVPSSK
FEGFEGSHAL