; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018795 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018795
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr5:34585643..34590790
RNA-Seq ExpressionLag0018795
SyntenyLag0018795
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.5e-17451.42Show/hide
Query:  KTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHID
        K +S  S ++ SYIG  T         +++   ++  +        P    +++ NPLF            E   +VV+VMM +  T E  M EM+  I+
Subjt:  KTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHID

Query:  TLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTP
         LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR P
Subjt:  TLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
        +GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM ELTNT QRKGE V+
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTT
        +YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA++  +D L+   R +     +T       + ESM+V  T
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTT

Query:  LPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAK
          KS SK K   +  +H        TL+ERQKK+YPFPD+D+  MLEQL+E QLI+LP+CKRPE++ KVDDP YCKYH+VI H VE+CFVLK+LI KLA+
Subjt:  LPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAK

Query:  EGKIELDLDEVAQSNLATIKGKS--------------------------KHQRKKNPKKLQPKRKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLA--TS
        E KIELD+DEVAQ+N   +   S                          + Q+K      Q K + S+   + + +  L +SF +   ++  E  A  T+
Subjt:  EGKIELDLDEVAQSNLATIKGKS--------------------------KHQRKKNPKKLQPKRKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLA--TS

Query:  YCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNN
          + V       EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EENQC   T  Q SAF+RLS+S SKK+R ST  FDRLK+TNDQ +R+M  
Subjt:  YCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL
        L+ K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.8e-17047.38Show/hide
Query:  KTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHID
        K +S  S  + SYIG  T         +++   ++           P    +++ NPLF            E   +VV+VMM +  T E  M EM+  I+
Subjt:  KTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHID

Query:  TLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTP
         LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR P
Subjt:  TLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
        +GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                                   GE V+
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNT
        +YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNT

Query:  TLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLA
        T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+  MLEQL+E QLI+LP+CKRPE+  KVDDP YCKYH+VI HPVE+CFVLK+LILKLA
Subjt:  TLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLA

Query:  KEGKIELDLDEVAQSNLATI--------------------------------------------------------------------------------
        +E KIELD+DEVAQ+N   +                                                                                
Subjt:  KEGKIELDLDEVAQSNLATI--------------------------------------------------------------------------------

Query:  KGKSKHQRK--KNPKKLQPK--RKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLA------TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTR
        KG   H++K  +N K  +PK  + + K F QP++ + L +   ++F +   E +       T+  + V       EEVDNS + +QRT VF RIKP T R
Subjt:  KGKSKHQRK--KNPKKLQPK--RKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLA------TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTR

Query:  PSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL
         SVF R+SMA  EEENQC  ST  + SAF+RLS+ST KK+R STS FDRLK+ NDQ +R+M +L+ K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  PSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-19660.6Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFL
         QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPESID+WE+LER+FL
Subjt:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA++  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPK
        R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+  MLEQLLE QLI+LP+CKRPE+  KVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPK

Query:  YCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKNPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKAFHKKT
        YCKYH+VI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + K+   LQ +R         RS     P++++ +      A H  +
Subjt:  YCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKNPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKAFHKKT

Query:  KENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNN
           +  +Y  + +EV+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     T+ S  +RLS+ST KK+R STS FDRLK+TNDQ +R+M +
Subjt:  KENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL
         + K F E N D+K+HS +PSRMKRK  V INTEGSL
Subjt:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.3e-19351.53Show/hide
Query:  SYIGS--TTHCCFNELTLQEDTTSIVAGQETTLQ-GAYPNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIE
        SYIG    TH C  E+   ED       +    Q    P    ++K NP  +             P  ++++VM+T   T E RM E+++ ++ LMK +E
Subjt:  SYIGS--TTHCCFNELTLQEDTTSIVAGQETTLQ-GAYPNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIE

Query:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKF
        E+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKF
Subjt:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPESID+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKE
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI ++  +D L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKE

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+  MLEQLLE QLI+LPKCKRP++ EKVDDP YCKYH+VI HPVE+CFVLK+LILKLA+E KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDL

Query:  DEVAQSNLATI----------------------------------------------------------------------------KGKSKHQRK--KN
        DEVAQ+N   I                                                                            KG   H++K  +N
Subjt:  DEVAQSNLATI----------------------------------------------------------------------------KGKSKHQRK--KN

Query:  PKKLQPK--RKRSKKFSQPQQLVMLNKSFSKAF---HKKTKENLATSYCINVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFHRMSMAATE
         K   PK  + + + F Q ++ + L +   ++F   H +    + T +  ++ EV+N+  S           QRTSVFDRIKP TTR SVF R+SMA  E
Subjt:  PKKLQPK--RKRSKKFSQPQQLVMLNKSFSKAF---HKKTKENLATSYCINVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFHRMSMAATE

Query:  EENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTE
        EENQC     T+ S F+RLS+S SKKNR STS FDRLK+TNDQ +R+M +L+ K F E N D+K+HS +PSRMKRK  V INT+
Subjt:  EENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTE

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]1.9e-16962.31Show/hide
Query:  SFKTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEH
        S K +S  S  + +Y G  T      +   +D  S +A          P    ++K NPL+            E   DV++VMM +   +E  M EM+  
Subjt:  SFKTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEH

Query:  IDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLR
        I+ LMK ++E+D +I  LK Q++ +  AESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMITN IRAQYGGP+Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
         P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPESI+SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRKGE 
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRN-------DEETIEESMVVN
        V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIAS+  +D L+P ++K+ +         + T +ESMVVN
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRN-------DEETIEESMVVN

Query:  TT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLA
        TT L  S  KE R   + +G+    LTLKERQ+K+YPFPD+DI  MLEQLLE QLI+LP+CKRPE+  KVDDP YCKYH+VI HPVE+CFVLK+LIL+LA
Subjt:  TT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLA

Query:  KEGKIELDLDEVAQSNLATI
        +E +IELDL+EVAQ+N A +
Subjt:  KEGKIELDLDEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein7.4e-17551.42Show/hide
Query:  KTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHID
        K +S  S ++ SYIG  T         +++   ++  +        P    +++ NPLF            E   +VV+VMM +  T E  M EM+  I+
Subjt:  KTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHID

Query:  TLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTP
         LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR P
Subjt:  TLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
        +GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM ELTNT QRKGE V+
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTT
        +YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA++  +D L+   R +     +T       + ESM+V  T
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTT

Query:  LPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAK
          KS SK K   +  +H        TL+ERQKK+YPFPD+D+  MLEQL+E QLI+LP+CKRPE++ KVDDP YCKYH+VI H VE+CFVLK+LI KLA+
Subjt:  LPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAK

Query:  EGKIELDLDEVAQSNLATIKGKS--------------------------KHQRKKNPKKLQPKRKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLA--TS
        E KIELD+DEVAQ+N   +   S                          + Q+K      Q K + S+   + + +  L +SF +   ++  E  A  T+
Subjt:  EGKIELDLDEVAQSNLATIKGKS--------------------------KHQRKKNPKKLQPKRKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLA--TS

Query:  YCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNN
          + V       EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EENQC   T  Q SAF+RLS+S SKK+R ST  FDRLK+TNDQ +R+M  
Subjt:  YCINV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL
        L+ K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7SUW1 Retrotransposon gag protein8.5e-17147.38Show/hide
Query:  KTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHID
        K +S  S  + SYIG  T         +++   ++           P    +++ NPLF            E   +VV+VMM +  T E  M EM+  I+
Subjt:  KTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHID

Query:  TLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTP
         LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR P
Subjt:  TLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
        +GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                                   GE V+
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNT
        +YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNT

Query:  TLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLA
        T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+  MLEQL+E QLI+LP+CKRPE+  KVDDP YCKYH+VI HPVE+CFVLK+LILKLA
Subjt:  TLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLA

Query:  KEGKIELDLDEVAQSNLATI--------------------------------------------------------------------------------
        +E KIELD+DEVAQ+N   +                                                                                
Subjt:  KEGKIELDLDEVAQSNLATI--------------------------------------------------------------------------------

Query:  KGKSKHQRK--KNPKKLQPK--RKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLA------TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTR
        KG   H++K  +N K  +PK  + + K F QP++ + L +   ++F +   E +       T+  + V       EEVDNS + +QRT VF RIKP T R
Subjt:  KGKSKHQRK--KNPKKLQPK--RKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLA------TSYCINV-------EEVDNSKKSEQRTSVFDRIKPPTTR

Query:  PSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL
         SVF R+SMA  EEENQC  ST  + SAF+RLS+ST KK+R STS FDRLK+ NDQ +R+M +L+ K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  PSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7TZU9 Ribonuclease H4.7e-16965.18Show/hide
Query:  PNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQ
        P    ++K NP  +             P  ++++VM+T+  T E+RM E+++ ++ LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK I+Q
Subjt:  PNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQ

Query:  DDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTL
        + QPQ S SIASLS+QQLQ+MI N I+ QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTL
Subjt:  DDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTL

Query:  KGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTF
        KGNAFDWYTDLEPESIDSWE+LER+FLNRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF
Subjt:  KGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTF

Query:  EELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPGMLEQ
        EELATRAHDMELSIA++ N DLL+P +RKE +  + T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+P ML+Q
Subjt:  EELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPGMLEQ

Query:  LLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        LLE QLI+LP+CKRP EM +V+DP YCKYH+VI HPVE+CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Subjt:  LLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H4.4e-1037.3Show/hide
Query:  KLQPKRKRSKK--FSQPQQLVMLNKSFSKAFH--KKTKENLATSYCINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMST
        +L P +K+ +K  +S P     +    S+      K K  +A +  I VEE  +S++ +    QR+SVFDRI     RPSVF R+S +  ++ NQ S  +
Subjt:  KLQPKRKRSKK--FSQPQQLVMLNKSFSKAFH--KKTKENLATSYCINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMST

Query:  STQPSAFQRLSVSTSKKNRSS------TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSLK
        ST+ SAFQRL+ S  K    S       S F RL V+  + ++K +         V  DE++ S+ PSRMKRK  V +NTEGSLK
Subjt:  STQPSAFQRLSVSTSKKNRSS------TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSLK

A0A5A7URH1 Ty3-gypsy retrotransposon protein6.9e-19760.6Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RM E+++ ++ LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFL
         QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPESID+WE+LER+FL
Subjt:  AQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA++  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPK
        R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+  MLEQLLE QLI+LP+CKRPE+  KVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPK

Query:  YCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKNPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKAFHKKT
        YCKYH+VI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + K+   LQ +R         RS     P++++ +      A H  +
Subjt:  YCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKNPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKAFHKKT

Query:  KENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNN
           +  +Y  + +EV+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     T+ S  +RLS+ST KK+R STS FDRLK+TNDQ +R+M +
Subjt:  KENLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL
         + K F E N D+K+HS +PSRMKRK  V INTEGSL
Subjt:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTEGSL

A0A5D3BX77 Retrotransposon gag protein1.6e-19351.53Show/hide
Query:  SYIGS--TTHCCFNELTLQEDTTSIVAGQETTLQ-GAYPNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIE
        SYIG    TH C  E+   ED       +    Q    P    ++K NP  +             P  ++++VM+T   T E RM E+++ ++ LMK +E
Subjt:  SYIGS--TTHCCFNELTLQEDTTSIVAGQETTLQ-GAYPNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIE

Query:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKF
        E+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKF
Subjt:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPESID+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKE
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI ++  +D L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKE

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+  MLEQLLE QLI+LPKCKRP++ EKVDDP YCKYH+VI HPVE+CFVLK+LILKLA+E KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDL

Query:  DEVAQSNLATI----------------------------------------------------------------------------KGKSKHQRK--KN
        DEVAQ+N   I                                                                            KG   H++K  +N
Subjt:  DEVAQSNLATI----------------------------------------------------------------------------KGKSKHQRK--KN

Query:  PKKLQPK--RKRSKKFSQPQQLVMLNKSFSKAF---HKKTKENLATSYCINVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFHRMSMAATE
         K   PK  + + + F Q ++ + L +   ++F   H +    + T +  ++ EV+N+  S           QRTSVFDRIKP TTR SVF R+SMA  E
Subjt:  PKKLQPK--RKRSKKFSQPQQLVMLNKSFSKAF---HKKTKENLATSYCINVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFHRMSMAATE

Query:  EENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTE
        EENQC     T+ S F+RLS+S SKKNR STS FDRLK+TNDQ +R+M +L+ K F E N D+K+HS +PSRMKRK  V INT+
Subjt:  EENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLINTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCAAGACTTCCTCAATGGTCTCTGTCATGAACAAGTCTTACATAGGTTCTACTACTCATTGTTGCTTCAACGAACTGACGTTGCAAGAAGATACAACTTCTAT
CGTTGCAGGCCAAGAAACAACCTTGCAGGGGGCATATCCTAATGACAAATTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAGGAGCATATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGAATCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAACAGTTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGTGAGAAC
GCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCGTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCCATGAGCCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTCGCACCTTTGAAGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCAAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGG
AAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGGTATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCAAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGA
AGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAGCATCAAAGAAAGAAGAATCCTAAGAAACTTCAACCCAAGA
GGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAGCTTTCCACAAAAAGACAAAAGAGAACCTTGCGACTTCCTACTGC
ATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTAT
GGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCAACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAATCGATCTTCAACATCTG
TCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGCTTCATAGTAGCATC
CCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGATTTGATCAAGAACATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTT
GTCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTT
CCCCCCAAATTCGAAGGTTCTCACGCGTTTCACTGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCA
CGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCATCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGTTTCACTGCAGTTC
TTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCA
CATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCACTGCAGTTCCTTCCCCAAGTTCGAAGGTTCTCACGTCGCT
TCGCTGAGTTCCTTCCTCCAAGTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCACTGCA
GTTCCTTCCCCAAGTTCGAAGGTTCTCACATCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGA
AGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCA
GTTCCTTCCTCAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCAATTCCTTCCTCA
CAATTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTACAGTTCCTTCCTCCAAGTT
TGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGTGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAA
GGTTCTCACTCGCTTCGTGAAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGT
TCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTC
TCACGCGCTTTGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCTCCAAGTTTGAAGTTCCTTCCCCCAAGTTCGAAGGTTCTCAC
GTCGCTTCGCTGCGCTCATGCGCTTCGCTACAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCT
TCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCC
TAAAGTCCAAAGACGTCAATTGTCCTCACGCTGCGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGGCTACATTGCTACGCTACTTCCTAAAGTCCAAA
GACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGTCCCAAAGCGAGGAACATGTCCCTGTACTCGTGCTGAAAGGCGCGG
CGGCCACACAAGTCCAAGGAACATGTCCCAGCTCAAGGAACATGTCCTTGCACTCGTGCTGGAAGGCGCGACGGCAGCACAAGTCCAAGGAACATGTCCCAACTCAAGGA
ACATGTCCGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATG
TCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGGAA
GGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGTACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAAC
TCAAGGAACACGTCCTTGTACTCGTGCTGAAAGGCGTGGCGACGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCAAGACTTCCTCAATGGTCTCTGTCATGAACAAGTCTTACATAGGTTCTACTACTCATTGTTGCTTCAACGAACTGACGTTGCAAGAAGATACAACTTCTAT
CGTTGCAGGCCAAGAAACAACCTTGCAGGGGGCATATCCTAATGACAAATTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAGGAGCATATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGAATCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAACAGTTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGTGAGAAC
GCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCGTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCCATGAGCCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTCGCACCTTTGAAGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCAAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGG
AAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGGTATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCAAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGA
AGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAGCATCAAAGAAAGAAGAATCCTAAGAAACTTCAACCCAAGA
GGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAGCTTTCCACAAAAAGACAAAAGAGAACCTTGCGACTTCCTACTGC
ATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTAT
GGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCAACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAATCGATCTTCAACATCTG
TCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGCTTCATAGTAGCATC
CCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGATTTGATCAAGAACATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTT
GTCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTT
CCCCCCAAATTCGAAGGTTCTCACGCGTTTCACTGCAGTTCTTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCA
CGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCATCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGTTTCACTGCAGTTC
TTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCA
CATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCACTGCAGTTCCTTCCCCAAGTTCGAAGGTTCTCACGTCGCT
TCGCTGAGTTCCTTCCTCCAAGTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCACTGCA
GTTCCTTCCCCAAGTTCGAAGGTTCTCACATCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGA
AGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCA
GTTCCTTCCTCAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCAATTCCTTCCTCA
CAATTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTACAGTTCCTTCCTCCAAGTT
TGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGTGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAA
GGTTCTCACTCGCTTCGTGAAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGT
TCCTTCCTCAAAGTTCGAAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTC
TCACGCGCTTTGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCTCCAAGTTTGAAGTTCCTTCCCCCAAGTTCGAAGGTTCTCAC
GTCGCTTCGCTGCGCTCATGCGCTTCGCTACAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCT
TCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCC
TAAAGTCCAAAGACGTCAATTGTCCTCACGCTGCGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCATGCTACGCTCGGCTACATTGCTACGCTACTTCCTAAAGTCCAAA
GACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGTCCCAAAGCGAGGAACATGTCCCTGTACTCGTGCTGAAAGGCGCGG
CGGCCACACAAGTCCAAGGAACATGTCCCAGCTCAAGGAACATGTCCTTGCACTCGTGCTGGAAGGCGCGACGGCAGCACAAGTCCAAGGAACATGTCCCAACTCAAGGA
ACATGTCCGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATG
TCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGGAA
GGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGTACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAAC
TCAAGGAACACGTCCTTGTACTCGTGCTGAAAGGCGTGGCGACGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGA
Protein sequenceShow/hide protein sequence
MSFKTSSMVSVMNKSYIGSTTHCCFNELTLQEDTTSIVAGQETTLQGAYPNDKFLVKYNPLFEPDSDVVTVMMTETRTMEERMTEMQEHIDTLMKAIEEKDSQIEQLKSQ
IENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQESLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCEN
AGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKG
IKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPGMLEQLLEAQLIELPKCKRPE
EMEKVDDPKYCKYHQVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKNPKKLQPKRKRSKKFSQPQQLVMLNKSFSKAFHKKTKENLATSYC
INVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPSAFQRLSVSTSKKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSI
PSRMKRKFSVLINTEGSLKDLIKNMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPPSSKVPTRFAAVPSPQIRRFSRVSLQFFPHSSKVLTRFTAVPSPQVRRFS
RRFAEFLPPSLKVLTSHRFAAVPSPKFEGSHAFHCSSFLTVRRFSRASLQFLPQVRRFSRRFAEFLPPSLKVLTSLRFAHALRCVPSSKFEGSHTLHCSSFPKFEGSHVA
SLSSFLQVEGSHIASLRSRASLRSFLQVRRFSHASLQFLPQVRRFSHRFAEFLPPSLKVLTSLRFAAFLPPSSKVLTRFALQFLPQVRRFSRASLQFLPQSSKVLTRCAA
VPSSSSKVLTRFVAVLSSKFEGSHALHCSSFPQFLPHNSKVLTRFAVVPSPKFEGSHVASLRSCASLQFLPPSLKVLTSLRFAHALRCVPSSKFEGSHTLRSAIPSPKFE
GSHSLREVPSSKFEGSHAFRCISFPPNSKVLTRFAAVPSSKFEGSHALRCSSFLKVQRFSRASLQFFPPSSKVLTRFAAVPSPKFEGSHALRCSSFSKFEVPSPKFEGSH
VASLRSCASLQFLPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLMRFVQFLPPNSKVLTRFAALQRYFLKSKDVNCPHAALLPSPSSRVLMLRSATLLRYFLKSK
DVNCPCTHAEKGMAATQVQGHVPKRGTCPCTRAERRGGHTSPRNMSQLKEHVLALVLEGATAAQVQGTCPNSRNMSAWRRHKSKEHVPTQGTRPCTRAERRGGDTSPRNM
SQLKEHVRALVLEGAAATQVQGTCPNSRNMSVHSCWKAWRRHKSKEHVPTQGTRPCTRAERRGGGTSPRNMSQLKEHVLVLVLKGVATTQVQGTCPNSRNTSLHSC