; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022221 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022221
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr7:21413974..21420153
RNA-Seq ExpressionLag0022221
SyntenyLag0022221
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.5e-17555.27Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLY
        T + TMA+M+  I+ LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGG  Q S +Y
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLY

Query:  SKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMF
        SKPYTKRIDNLRMP+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM 
Subjt:  SKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMF

Query:  ELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET---
        ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA+R  +D L+   R +     +T   
Subjt:  ELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET---

Query:  ----IEESMVVNTTLPKSSSKEKRQTNRAHHS-------TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVE
            + ESM+V  T  KS SK K   ++ +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE
Subjt:  ----IEESMVVNTTLPKSSSKEKRQTNRAHHS-------TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVE

Query:  RCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTF
        +CFVLK+LI KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF +  
Subjt:  RCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTF

Query:  HKKKKENLA---------TSYYINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRL
         ++  E  A          + Y + EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EENQC   T  + SAF+RLS+S SKK R ST  FDRL
Subjt:  HKKKKENLA---------TSYYINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRL

Query:  KVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDE
        K+T+DQ +R+M  L+ K F E N+D K+ S +PSRMKRK SV INTEGSL VKP  II TNP ++G ++  DE
Subjt:  KVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDE

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.3e-17150.07Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLY
        T + TM +M+  I+ LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGG  Q S +Y
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLY

Query:  SKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMF
        SK YTKRIDNLRMP+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                          
Subjt:  SKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMF

Query:  ELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND------
                 GE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+      
Subjt:  ELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND------

Query:  --EETIEESMVVNTTLPKSSSKEKRQTNRAHHS-------TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPV
             + ESM+V  T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPV
Subjt:  --EETIEESMVVNTTLPKSSSKEKRQTNRAHHS-------TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPV

Query:  ERCFVLKDLILKLAMERKIELDLDEVAQSN----------------------------------------------------------------------
        E+CFVLK+LILKLA E KIELD+DEVAQ+N                                                                      
Subjt:  ERCFVLKDLILKLAMERKIELDLDEVAQSN----------------------------------------------------------------------

Query:  --------------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKKKENL-------------ATSYYINVEEVDNSKKSEQ
                         I  K K +R K   K +P + + K F QP++ + L +   ++F +   E +               + Y + EEVDNS + +Q
Subjt:  --------------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKKKENL-------------ATSYYINVEEVDNSKKSEQ

Query:  RTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRM
        RT VF RIKP T R SVF R+SMA  EEENQC  ST  R SAF+RLS+ST KK R STS FDRLK+ +DQ +R+M +L+ K F E N+D K+ S +PSRM
Subjt:  RTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRM

Query:  KRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDE
        KRK SV INTEGSL VKP  II TNP ++G ++  DE
Subjt:  KRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDE

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]1.3e-16669.77Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS
        T++  MA++++ ++ LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK I+Q+ QPQ S SIASLS+QQLQ+MI N I+ QYGG  Q   LYS
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS

Query:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE
        KPYTKRIDN+RMP GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE+LER+FLNRFYSTRR VSM E
Subjt:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE

Query:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----
        LT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +RKE +  + T    
Subjt:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----

Query:  ---IEESMVVNTTLPKSSSKEKRQTNR-----AHHSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCF
            +E+MVV+TT  K  SKEK+   R         TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YCKYHRVI HPVE+CF
Subjt:  ---IEESMVVNTTLPKSSSKEKRQTNR-----AHHSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCF

Query:  VLKDLILKLAMERKIELDLDEVAQSNLATI
        VLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  VLKDLILKLAMERKIELDLDEVAQSNLATI

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]1.4e-1436.65Show/hide
Query:  KLQPKRKRSKK--FSQPQQLVMLNKSFSKTFH--KKKKENLATSYYINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMST
        +L P +K+ +K  +S P     +    S+      K K  +A + +I VEE  +S++ +    QR+SVFDRI     RPSVF R+S +  ++ NQ S  +
Subjt:  KLQPKRKRSKK--FSQPQQLVMLNKSFSKTFH--KKKKENLATSYYINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMST

Query:  STRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQG
        STR SAFQRL+ S  K +S S T     S F RL V+  + ++K           V  D++++S+ PSRMKRK  V +NTEGSLKVK + ++ T P    
Subjt:  STRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQG

Query:  PDQDHDEVPSLQVRGSLHCTL
        P+ + D      V G  H T+
Subjt:  PDQDHDEVPSLQVRGSLHCTL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-20061.21Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS
        T++  MA++++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LY 
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS

Query:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE
        KPYTKRIDNLRMP GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPESID+WE+LER+FLNRFYSTR  VSM E
Subjt:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE

Query:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----
        LTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T    
Subjt:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----

Query:  ---IEESMVVNTTLPKSSSKEKR-QTNRAH------HSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVER
           I+ESMVV+ T  KS SK K  +  R H       STLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+
Subjt:  ---IEESMVVNTTLPKSSSKEKR-QTNRAH------HSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVER

Query:  CFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYYINVEE
        CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +  +        + Y + +E
Subjt:  CFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYYINVEE

Query:  VDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKK
        V+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     TR S  +RLS+ST KK R STS FDRLK+T+DQ +R+M + + K F E N+D K
Subjt:  VDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKK

Query:  LQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQ
        + S +PSRMKRK  V INTEGSL VKP  II TNPT++G +Q
Subjt:  LQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQ

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.6e-18853.07Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS
        T++  MA++++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LYS
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS

Query:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE
        KPYTKRIDNLRMP GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPESID+WE+LER+FLNRFYSTRR VSM E
Subjt:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE

Query:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----
        LTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T    
Subjt:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----

Query:  ---IEESMVVNTTLPKSSSKEKR-QTNRAH------HSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVER
           I+ESMVV+ T  KS SK K  +  R H       STLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+
Subjt:  ---IEESMVVNTTLPKSSSKEKR-QTNRAH------HSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVER

Query:  CFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD---------------PKKLQ--
        CFVLK+LILKLA E+KIEL++DEVAQ+N                                   + TI  ++K    KD               P  +Q  
Subjt:  CFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD---------------PKKLQ--

Query:  ---------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTFHKKKKENL-------------ATSYYINVEEVDNSKKSEQRTSVFD
                        K +R+KK             F Q ++ + L +   ++F +   E +               + Y + +EV+N  +  QRTSVFD
Subjt:  ---------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTFHKKKKENL-------------ATSYYINVEEVDNSKKSEQRTSVFD

Query:  RIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSV
        RIKP TTR SVF R+SMA  EEENQC     TR S F+RLS+S SKK+R STS FDRLK+T+DQ +R+M +L+ K F E N+D K+ S +PSRMKRK  V
Subjt:  RIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSV

Query:  LINTEGSLKVKPNLIILTNPTSQGPDQDHDEVPSLQVRGSLHCTLLRC
         INT    K K   I++ +   Q  D           R  LH  LL C
Subjt:  LINTEGSLKVKPNLIILTNPTSQGPDQDHDEVPSLQVRGSLHCTLLRC

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein7.4e-17655.27Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLY
        T + TMA+M+  I+ LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGG  Q S +Y
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLY

Query:  SKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMF
        SKPYTKRIDNLRMP+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM 
Subjt:  SKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMF

Query:  ELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET---
        ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA+R  +D L+   R +     +T   
Subjt:  ELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET---

Query:  ----IEESMVVNTTLPKSSSKEKRQTNRAHHS-------TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVE
            + ESM+V  T  KS SK K   ++ +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE
Subjt:  ----IEESMVVNTTLPKSSSKEKRQTNRAHHS-------TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVE

Query:  RCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTF
        +CFVLK+LI KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF +  
Subjt:  RCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTF

Query:  HKKKKENLA---------TSYYINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRL
         ++  E  A          + Y + EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EENQC   T  + SAF+RLS+S SKK R ST  FDRL
Subjt:  HKKKKENLA---------TSYYINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRL

Query:  KVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDE
        K+T+DQ +R+M  L+ K F E N+D K+ S +PSRMKRK SV INTEGSL VKP  II TNP ++G ++  DE
Subjt:  KVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDE

A0A5A7SUW1 Retrotransposon gag protein1.1e-17150.07Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLY
        T + TM +M+  I+ LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGG  Q S +Y
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLY

Query:  SKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMF
        SK YTKRIDNLRMP+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                          
Subjt:  SKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMF

Query:  ELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND------
                 GE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+      
Subjt:  ELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND------

Query:  --EETIEESMVVNTTLPKSSSKEKRQTNRAHHS-------TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPV
             + ESM+V  T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPV
Subjt:  --EETIEESMVVNTTLPKSSSKEKRQTNRAHHS-------TLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPV

Query:  ERCFVLKDLILKLAMERKIELDLDEVAQSN----------------------------------------------------------------------
        E+CFVLK+LILKLA E KIELD+DEVAQ+N                                                                      
Subjt:  ERCFVLKDLILKLAMERKIELDLDEVAQSN----------------------------------------------------------------------

Query:  --------------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKKKENL-------------ATSYYINVEEVDNSKKSEQ
                         I  K K +R K   K +P + + K F QP++ + L +   ++F +   E +               + Y + EEVDNS + +Q
Subjt:  --------------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKKKENL-------------ATSYYINVEEVDNSKKSEQ

Query:  RTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRM
        RT VF RIKP T R SVF R+SMA  EEENQC  ST  R SAF+RLS+ST KK R STS FDRLK+ +DQ +R+M +L+ K F E N+D K+ S +PSRM
Subjt:  RTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRM

Query:  KRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDE
        KRK SV INTEGSL VKP  II TNP ++G ++  DE
Subjt:  KRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQDHDE

A0A5A7TZU9 Ribonuclease H6.3e-16769.77Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS
        T++  MA++++ ++ LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK I+Q+ QPQ S SIASLS+QQLQ+MI N I+ QYGG  Q   LYS
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS

Query:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE
        KPYTKRIDN+RMP GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWE+LER+FLNRFYSTRR VSM E
Subjt:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE

Query:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----
        LT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +RKE +  + T    
Subjt:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----

Query:  ---IEESMVVNTTLPKSSSKEKRQTNR-----AHHSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCF
            +E+MVV+TT  K  SKEK+   R         TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YCKYHRVI HPVE+CF
Subjt:  ---IEESMVVNTTLPKSSSKEKRQTNR-----AHHSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCF

Query:  VLKDLILKLAMERKIELDLDEVAQSNLATI
        VLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  VLKDLILKLAMERKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H6.6e-1536.65Show/hide
Query:  KLQPKRKRSKK--FSQPQQLVMLNKSFSKTFH--KKKKENLATSYYINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMST
        +L P +K+ +K  +S P     +    S+      K K  +A + +I VEE  +S++ +    QR+SVFDRI     RPSVF R+S +  ++ NQ S  +
Subjt:  KLQPKRKRSKK--FSQPQQLVMLNKSFSKTFH--KKKKENLATSYYINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMST

Query:  STRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQG
        STR SAFQRL+ S  K +S S T     S F RL V+  + ++K           V  D++++S+ PSRMKRK  V +NTEGSLKVK + ++ T P    
Subjt:  STRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQG

Query:  PDQDHDEVPSLQVRGSLHCTL
        P+ + D      V G  H T+
Subjt:  PDQDHDEVPSLQVRGSLHCTL

A0A5A7URH1 Ty3-gypsy retrotransposon protein8.7e-20161.21Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS
        T++  MA++++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LY 
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS

Query:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE
        KPYTKRIDNLRMP GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPESID+WE+LER+FLNRFYSTR  VSM E
Subjt:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE

Query:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----
        LTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T    
Subjt:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----

Query:  ---IEESMVVNTTLPKSSSKEKR-QTNRAH------HSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVER
           I+ESMVV+ T  KS SK K  +  R H       STLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+
Subjt:  ---IEESMVVNTTLPKSSSKEKR-QTNRAH------HSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVER

Query:  CFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYYINVEE
        CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +  +        + Y + +E
Subjt:  CFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYYINVEE

Query:  VDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKK
        V+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     TR S  +RLS+ST KK R STS FDRLK+T+DQ +R+M + + K F E N+D K
Subjt:  VDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKK

Query:  LQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQ
        + S +PSRMKRK  V INTEGSL VKP  II TNPT++G +Q
Subjt:  LQSSIPSRMKRKFSVLINTEGSLKVKPNLIILTNPTSQGPDQ

A0A5D3BX77 Retrotransposon gag protein2.2e-18853.07Show/hide
Query:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS
        T++  MA++++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LYS
Subjt:  TNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYS

Query:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE
        KPYTKRIDNLRMP GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPESID+WE+LER+FLNRFYSTRR VSM E
Subjt:  KPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFE

Query:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----
        LTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T    
Subjt:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET----

Query:  ---IEESMVVNTTLPKSSSKEKR-QTNRAH------HSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVER
           I+ESMVV+ T  KS SK K  +  R H       STLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+
Subjt:  ---IEESMVVNTTLPKSSSKEKR-QTNRAH------HSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVER

Query:  CFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD---------------PKKLQ--
        CFVLK+LILKLA E+KIEL++DEVAQ+N                                   + TI  ++K    KD               P  +Q  
Subjt:  CFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD---------------PKKLQ--

Query:  ---------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTFHKKKKENL-------------ATSYYINVEEVDNSKKSEQRTSVFD
                        K +R+KK             F Q ++ + L +   ++F +   E +               + Y + +EV+N  +  QRTSVFD
Subjt:  ---------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTFHKKKKENL-------------ATSYYINVEEVDNSKKSEQRTSVFD

Query:  RIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSV
        RIKP TTR SVF R+SMA  EEENQC     TR S F+RLS+S SKK+R STS FDRLK+T+DQ +R+M +L+ K F E N+D K+ S +PSRMKRK  V
Subjt:  RIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSV

Query:  LINTEGSLKVKPNLIILTNPTSQGPDQDHDEVPSLQVRGSLHCTLLRC
         INT    K K   I++ +   Q  D           R  LH  LL C
Subjt:  LINTEGSLKVKPNLIILTNPTSQGPDQDHDEVPSLQVRGSLHCTLLRC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTACCCATTGTTGCCTCAGTGAACTGACGTTGCAAGAAGATAAAGCTTCTAT
CGCTGGAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAAACTATGGCTAAGATGCAGGAGCATATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATT
CTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGAT
CAACCACAGTGTTCTGCTTCGATCGCTTCACTATCCATCCAACAACTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACTTACTCAAGATTCCCTCTT
GTATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATGCCAATCGGGTATCAGCCACCAAAATTTCAACAGTTTGATGGAAAGGGCAATCCTAAACAACATATTG
CCCACTTCGTTGAGACATGTGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCGTTTGACTGGTACACTGATCTAGAA
CCTGAGTCAATAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAGCGAAA
AGGTGAACTCGTTGTTAACTATATAAATCGTTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGTATTCAAGGCATGCACT
GGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTT
CTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAAC
TAATAGAGCGCATCACTCAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCACAACTGATAA
AACTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTAGAAAGATGTTTCGTCCTAAAG
GACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAA
GGATCCTAAGAAACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAA
AAGAGAACCTTGCGACTTCCTACTACATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGT
CCTTCGGTATTCCATAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATC
GAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACA
ACGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGGTGAAACCAAATCTCATTATCTTGACC
AATCCTACAAGTCAAGGACCTGATCAAGACCATGATGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGG
TTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTT
GTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTT
TCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCT
TCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCA
AGTTCAAAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAG
TTTGAAGGTTCTCATATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCATTCCTCCAAATTCGAA
GGTTTCGAAGGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGC
GCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAG
GTTCTCAGGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCC
TTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCATTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTT
TTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATG
TTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGACTGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTA
CAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCATGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTG
ACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGATCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGATTGGTCTAGACAGGCCGAT
CATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACTGACCGATCAAGAAGATCAACAAGT
CAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAACAGACCGATCATCCAAGAAGATCGACAAGGCAACCGATCGAACAGA
TCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCTGATCATCCAAGAAGATCAACAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTACCCATTGTTGCCTCAGTGAACTGACGTTGCAAGAAGATAAAGCTTCTAT
CGCTGGAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAAACTATGGCTAAGATGCAGGAGCATATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATT
CTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGAT
CAACCACAGTGTTCTGCTTCGATCGCTTCACTATCCATCCAACAACTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACTTACTCAAGATTCCCTCTT
GTATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATGCCAATCGGGTATCAGCCACCAAAATTTCAACAGTTTGATGGAAAGGGCAATCCTAAACAACATATTG
CCCACTTCGTTGAGACATGTGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCGTTTGACTGGTACACTGATCTAGAA
CCTGAGTCAATAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAGCGAAA
AGGTGAACTCGTTGTTAACTATATAAATCGTTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGTATTCAAGGCATGCACT
GGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTT
CTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAAC
TAATAGAGCGCATCACTCAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCACAACTGATAA
AACTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTAGAAAGATGTTTCGTCCTAAAG
GACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAA
GGATCCTAAGAAACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTCTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAA
AAGAGAACCTTGCGACTTCCTACTACATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGT
CCTTCGGTATTCCATAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATC
GAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACA
ACGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGGTGAAACCAAATCTCATTATCTTGACC
AATCCTACAAGTCAAGGACCTGATCAAGACCATGATGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGG
TTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTT
GTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTT
TCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCT
TCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCA
AGTTCAAAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAG
TTTGAAGGTTCTCATATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCATTCCTCCAAATTCGAA
GGTTTCGAAGGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGC
GCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAG
GTTCTCAGGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCC
TTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCATTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTT
TTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATG
TTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGACTGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTA
CAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCATGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTG
ACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGATCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGATTGGTCTAGACAGGCCGAT
CATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACTGACCGATCAAGAAGATCAACAAGT
CAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAACAGACCGATCATCCAAGAAGATCGACAAGGCAACCGATCGAACAGA
TCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCTGATCATCCAAGAAGATCAACAAGCTAA
Protein sequenceShow/hide protein sequence
MSFKTSSMVAVMNKSYMGSTTHCCLSELTLQEDKASIAGGQETTLQGAYTNDKTMAKMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDD
QPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLE
PESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDL
LLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNRAHHSTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLK
DLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYYINVEEVDNSKKSEQRTSVFDRIKPPTTR
PSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNNDKKLQSSIPSRMKRKFSVLINTEGSLKVKPNLIILT
NPTSQGPDQDHDEVPSLQVRGSLHCTLLRCSFSKFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHAALLEFLLPKFEGSHALRCSSFF
SRSKVLTRCVAVLSPQVRRFTHFAAVPSPKFEGSHALRSAIPSPKFEGSHALRCSSFPLSLKVLTRFAAVPSSKFKVPSSQFKGSHALRCSSFPQVRRFSRRFAAVPSSK
FEGSHIASLRAALRCSSFLQVRRFSLASLQFIPPNSKVSKVLKRCDSLQFLPPSSKVLMRFVAPSSKFEGSLTRCCSSFLQVRRFPHALRSLLLQVRRRLSPLFLPPSSK
VLRRFVATFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTAPSPSSKALLSTAHSPSSKVLLSTPLFEGSPLRFSFSKFEGSPLLLFKCLAAVDVLVPLHLQM
LAVDGVRFASSSKIDCGEITASEKLMTTVVTTPAGNYSHQSDWSRQTGGEITASEKLMTTMVTTPAGNYSHQSDWSRQVVKSLQVKLMTIVVTTPAGNYSHQSDWSRQAD
HPRDQQANRPIKKINKSAGRSSKRSTSQLTDQEDQQVSRPIIQEINKPTDRSRRSTSQQTDHPRRSTRQPIEQIIKPTGRSKRSSSQQADHPRRSTS