; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036153 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036153
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr3:40549474..40552954
RNA-Seq ExpressionLag0036153
SyntenyLag0036153
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.9e-18051.25Show/hide
Query:  KTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEHID
        K +S  S ++ SYIG            +++   ++  +        P    +++ NPLF            E   +VV+VMM +  T E  M EM+  I+
Subjt:  KTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEHID

Query:  TLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP
         LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR P
Subjt:  TLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
        +GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM ELTNT QRKGE V+
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT
        +YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKP TFEELATRAHDM+LSIA+R  +D L+   R +     +T       + ESM+V  T
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT

Query:  LPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAK
          KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA+
Subjt:  LPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAK

Query:  EGKIELDLDEVAQSNLATIKGNNK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKTKENLA-----
        E KIELD+DEVAQ+N   +   +          QRK        +P  ++ ++K     SQ ++          +  L +SF +   ++  E  A     
Subjt:  EGKIELDLDEVAQSNLATIKGNNK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKTKENLA-----

Query:  -----NSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNN
             N+Y  + EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EENQC   T  Q  AF+RLS+S S K+R ST  FDRLK+TNDQ +R+M  
Subjt:  -----NSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQDHDKYKS
        L+ K F E N D+K+HS +PSRMKRK SV I TEGSL VK   II TNP NEG ++  D+ KS
Subjt:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQDHDKYKS

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]8.6e-17547.04Show/hide
Query:  KTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEHID
        K +S  S  + SYIG            +++   ++           P    +++ NPLF            E   +VV+VMM +  T E  M EM+  I+
Subjt:  KTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEHID

Query:  TLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP
         LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR P
Subjt:  TLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
        +GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                                   GE V+
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNT
        +YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKP TFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNT

Query:  TLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA
        T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA
Subjt:  TLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA

Query:  KEGKIELDLDEVAQSNLATI--------------------------------------------------------------------------------
        +E KIELD+DEVAQ+N   +                                                                                
Subjt:  KEGKIELDLDEVAQSNLATI--------------------------------------------------------------------------------

Query:  KGNNKHQRK----KDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF--------------HKKTKENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTT
        KGN  H++K    K   K +P + + K F QP++ + L +   ++F              H  +   + N+Y  + EEVDNS + +QRT VF RIKP T 
Subjt:  KGNNKHQRK----KDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF--------------HKKTKENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTT

Query:  RPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGS
        R SVF R+SMA  EEENQC  ST  +  AF+RLS+ST  K+R STS FDRLK+ NDQ +R+M +L+ K F E N D+K+HS +PSRMKRK SV I TEGS
Subjt:  RPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGS

Query:  LKVKSNLIILTNPANEGFDQDHDKYKS
        L VK   II TNP NEG ++  D+ KS
Subjt:  LKVKSNLIILTNPANEGFDQDHDKYKS

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-20260.37Show/hide
Query:  PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RM E+++ ++ LMK +EE+D +I  LK+ IE +  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFL
         QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPESID+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELATRAHDMELSIA+R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKT
        YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  +   + KD   LQ +R         RS     P++++ +        H  +
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKT

Query:  KENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNN
           + N+Y  + +EV+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     T+    +RLS+ST  K+R STS FDRLK+TNDQ +R+M +
Subjt:  KENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQ
         + K F E N D+K+HS +PSRMKRK  V I TEGSL VK   II TNP NEG +Q
Subjt:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQ

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.0e-19251.72Show/hide
Query:  SYIGS--TAHCCFNELTLQEDTTSI-VAGQETTLHGAYPNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIE
        SYIG     H C  E+   ED     VA          P    ++K NP  +             P  ++++VM+T   T E RM E+++ ++ LMK +E
Subjt:  SYIGS--TAHCCFNELTLQEDTTSI-VAGQETTLHGAYPNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIE

Query:  EKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKF
        E+D +I  LK+ IE +  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKF
Subjt:  EKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPESID+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKE
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKE

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDL

Query:  DEVAQSN-----------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----------------PKRKRS
        DEVAQ+N                                   + TI   NK    KD               P  +Q                  K +R+
Subjt:  DEVAQSN-----------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----------------PKRKRS

Query:  KKFSQP-------QQLVMLNKSFS--------------------KTFHKKTKENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAAT
        KK   P       +  + L +S +                     T H  +   + N+Y  + +EV+N  +  QRTSVFDRIKP TTR SVF R+SMA  
Subjt:  KKFSQP-------QQLVMLNKSFS--------------------KTFHKKTKENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAAT

Query:  EEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTE
        EEENQC     T+   F+RLS+S S KNR STS FDRLK+TNDQ +R+M +L+ K F E N D+K+HS +PSRMKRK  V I T+
Subjt:  EEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTE

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]1.1e-16962.5Show/hide
Query:  SFKTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEH
        S K +S  S  + +Y G         +   +D  S +A          P    ++K NPL+            E   DV++VMM +   +E  M EM+  
Subjt:  SFKTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEH

Query:  IDTLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK ++E+D +I  LK Q++ +  AESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMITN IRAQYGGP+Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
         P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPESI+SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRKGE 
Subjt:  TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESMVVN
        V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESMVVN
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESMVVN

Query:  TT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA
        TT L  S  KE R   + +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA
Subjt:  TT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA

Query:  KEGKIELDLDEVAQSNLATI
        +E +IELDL+EVAQ+N A +
Subjt:  KEGKIELDLDEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein4.3e-18051.25Show/hide
Query:  KTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEHID
        K +S  S ++ SYIG            +++   ++  +        P    +++ NPLF            E   +VV+VMM +  T E  M EM+  I+
Subjt:  KTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEHID

Query:  TLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP
         LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR P
Subjt:  TLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
        +GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM ELTNT QRKGE V+
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT
        +YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKP TFEELATRAHDM+LSIA+R  +D L+   R +     +T       + ESM+V  T
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT

Query:  LPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAK
          KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA+
Subjt:  LPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAK

Query:  EGKIELDLDEVAQSNLATIKGNNK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKTKENLA-----
        E KIELD+DEVAQ+N   +   +          QRK        +P  ++ ++K     SQ ++          +  L +SF +   ++  E  A     
Subjt:  EGKIELDLDEVAQSNLATIKGNNK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKTFHKKTKENLA-----

Query:  -----NSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNN
             N+Y  + EE+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EENQC   T  Q  AF+RLS+S S K+R ST  FDRLK+TNDQ +R+M  
Subjt:  -----NSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQDHDKYKS
        L+ K F E N D+K+HS +PSRMKRK SV I TEGSL VK   II TNP NEG ++  D+ KS
Subjt:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQDHDKYKS

A0A5A7SUW1 Retrotransposon gag protein4.2e-17547.04Show/hide
Query:  KTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEHID
        K +S  S  + SYIG            +++   ++           P    +++ NPLF            E   +VV+VMM +  T E  M EM+  I+
Subjt:  KTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLF------------EPDSDVVTVMMTETRTMEERMIEMQEHID

Query:  TLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP
         LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR P
Subjt:  TLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTP

Query:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV
        +GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                                   GE V+
Subjt:  IGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVV

Query:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNT
        +YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKP TFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  
Subjt:  NYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNT

Query:  TLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA
        T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA
Subjt:  TLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLA

Query:  KEGKIELDLDEVAQSNLATI--------------------------------------------------------------------------------
        +E KIELD+DEVAQ+N   +                                                                                
Subjt:  KEGKIELDLDEVAQSNLATI--------------------------------------------------------------------------------

Query:  KGNNKHQRK----KDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF--------------HKKTKENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTT
        KGN  H++K    K   K +P + + K F QP++ + L +   ++F              H  +   + N+Y  + EEVDNS + +QRT VF RIKP T 
Subjt:  KGNNKHQRK----KDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF--------------HKKTKENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTT

Query:  RPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGS
        R SVF R+SMA  EEENQC  ST  +  AF+RLS+ST  K+R STS FDRLK+ NDQ +R+M +L+ K F E N D+K+HS +PSRMKRK SV I TEGS
Subjt:  RPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGS

Query:  LKVKSNLIILTNPANEGFDQDHDKYKS
        L VK   II TNP NEG ++  D+ KS
Subjt:  LKVKSNLIILTNPANEGFDQDHDKYKS

A0A5A7TZU9 Ribonuclease H9.0e-17065.61Show/hide
Query:  PNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQ
        P    ++K NP  +             P  ++++VM+T+  T E+RM E+++ ++ LMKA+EE+D +I  LK+ IE +  AESS T  +KN +KGK I+Q
Subjt:  PNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQ

Query:  DDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTL
        + QPQ S SIASLS+QQLQ+MI N I+ QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTL
Subjt:  DDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTL

Query:  KGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTF
        KGNAFDWYTDLEPESIDSWE+LER+FLNRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TF
Subjt:  KGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTF

Query:  EELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQ
        EELATRAHDMELSIA+R N DLL+P +RKE +  + T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+Q
Subjt:  EELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQ

Query:  LLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        LLE QLI+LP+CKRP EM +V+DP YCKYHRVI HPVE+CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Subjt:  LLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H1.3e-1129.28Show/hide
Query:  HLTLKERQKKIYPFPDAD------IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLA
        ++ L  R+K   PF +          ++L++   A L K+ K    +++EK D   Y    R +     + + L      +AK G      D   ++ L 
Subjt:  HLTLKERQKKIYPFPDAD------IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLA

Query:  TIKGNNKHQRKKDPKKLQPKRKRSKK--FSQPQQLVMLNKSFSKTFH--KKTKENLANSYCINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRM
        ++K  ++        +L P +K+ +K  +S P     +    S+      K K  +AN+  I VEE  +S++ +    QR+SVFDRI     RPSVF R+
Subjt:  TIKGNNKHQRKKDPKKLQPKRKRSKK--FSQPQQLVMLNKSFSKTFH--KKTKENLANSYCINVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRM

Query:  SMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSS------TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLK
        S +  ++ NQ S  +ST+  AFQRL+ S       S       S F RL V+  + ++K +         V  DE++ S+ PSRMKRK  V + TEGSLK
Subjt:  SMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSS------TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLK

Query:  VKSNLIILTNPANEGFDQDHD
        VK + ++ T P +   + + D
Subjt:  VKSNLIILTNPANEGFDQDHD

A0A5A7URH1 Ty3-gypsy retrotransposon protein6.2e-20360.37Show/hide
Query:  PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RM E+++ ++ LMK +EE+D +I  LK+ IE +  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIEEKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFL
         QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPESID+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELATRAHDMELSIA+R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKT
        YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  +   + KD   LQ +R         RS     P++++ +        H  +
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKT

Query:  KENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNN
           + N+Y  + +EV+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     T+    +RLS+ST  K+R STS FDRLK+TNDQ +R+M +
Subjt:  KENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNN

Query:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQ
         + K F E N D+K+HS +PSRMKRK  V I TEGSL VK   II TNP NEG +Q
Subjt:  LELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQ

A0A5D3BX77 Retrotransposon gag protein2.9e-19251.72Show/hide
Query:  SYIGS--TAHCCFNELTLQEDTTSI-VAGQETTLHGAYPNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIE
        SYIG     H C  E+   ED     VA          P    ++K NP  +             P  ++++VM+T   T E RM E+++ ++ LMK +E
Subjt:  SYIGS--TAHCCFNELTLQEDTTSI-VAGQETTLHGAYPNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIE

Query:  EKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKF
        E+D +I  LK+ IE +  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR P GYQPPKF
Subjt:  EKDSQIEQLKSQIEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPESID+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKE
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP TFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKE

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDL

Query:  DEVAQSN-----------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----------------PKRKRS
        DEVAQ+N                                   + TI   NK    KD               P  +Q                  K +R+
Subjt:  DEVAQSN-----------------------------------LATIKGNNKHQRKKD---------------PKKLQ-----------------PKRKRS

Query:  KKFSQP-------QQLVMLNKSFS--------------------KTFHKKTKENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAAT
        KK   P       +  + L +S +                     T H  +   + N+Y  + +EV+N  +  QRTSVFDRIKP TTR SVF R+SMA  
Subjt:  KKFSQP-------QQLVMLNKSFS--------------------KTFHKKTKENLANSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAAT

Query:  EEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTE
        EEENQC     T+   F+RLS+S S KNR STS FDRLK+TNDQ +R+M +L+ K F E N D+K+HS +PSRMKRK  V I T+
Subjt:  EEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSIPSRMKRKFSVLIYTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCAAGACTTCCTCAATGGTCTCTGTCATGAACAAGTCTTACATAGGTTCTACTGCTCATTGTTGCTTCAACGAACTGACGTTGCAAGAAGATACAACTTCTAT
CGTTGCAGGCCAAGAAACAACCTTGCATGGGGCATATCCTAATGACAAATTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGATTGAGATGCAGGAGCATATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGGACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACCTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAACAGTTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGTGAGAAC
GCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCGTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTTGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGG
AAGGAATGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGA
AGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATAAAAGGAAATAACAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGA
GGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGACAAAAGAGAACCTTGCGAATTCCTACTGC
ATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTAT
GGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCAACCTTTAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAATAAAAATCGATCTTCAACATCTG
TCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGCTTCATAGTAGCATC
CCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATATATACGGAAGGTTCCTTGAAGGTGAAGTCAAATCTCATTATCTTGACCAATCCTGCAAATGAAGGATTTGATCA
AGACCATGACAAATATAAGAGTTTTAAATGTAAAAGCTCCTTGTCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGCAGCTTAAAGAAAACT
TTAAGTTCAGTCTCTACAAACAAAAAAAAAAGGAAAGTGCAACAAATATTGAAGCACGACGATATAAATTGAGGAAATTCATCACTAGTGGGGGCAACACAGCGAATGGA
AGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGG
TTCTCACGCGCTTCGCTACAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTGCC
TTCCCCCAAGTTCAAAGGTTCTCACTTCGCTTTGTTGCGCTCATGCGCTTCGCTACAGTTCATCCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTC
CCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCG
CTTCGCTACAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCACTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCTTCGTTGCATTTCCTTCCCCCTAA
ATTCGAAGGTTCTCACGCGCTTCGCTGGAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTC
GTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGCTCTCACGCGCTTCGCTGGAGTTCCTTCCTCCAAGTTCG
AAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGC
AGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTTGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAA
GGTTCTCACGCGCTTCGCTGCACTCCAGTGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCAAGACTTCCTCAATGGTCTCTGTCATGAACAAGTCTTACATAGGTTCTACTGCTCATTGTTGCTTCAACGAACTGACGTTGCAAGAAGATACAACTTCTAT
CGTTGCAGGCCAAGAAACAACCTTGCATGGGGCATATCCTAATGACAAATTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGATTGAGATGCAGGAGCATATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGAGCAACTAAAGAGCCAA
ATTGAGGACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGATGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACCTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAACAGTTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGTGAGAAC
GCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCGTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTTGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGG
AAGGAATGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGA
AGGCAAAATCGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATAAAAGGAAATAACAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGA
GGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGACAAAAGAGAACCTTGCGAATTCCTACTGC
ATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTAT
GGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCAACCTTTAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAATAAAAATCGATCTTCAACATCTG
TCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGCTTCATAGTAGCATC
CCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATATATACGGAAGGTTCCTTGAAGGTGAAGTCAAATCTCATTATCTTGACCAATCCTGCAAATGAAGGATTTGATCA
AGACCATGACAAATATAAGAGTTTTAAATGTAAAAGCTCCTTGTCGCAAGAGCCTAAACTGCATGATGCTCCTAGCCCACACGAGCTTAAAAGGCAGCTTAAAGAAAACT
TTAAGTTCAGTCTCTACAAACAAAAAAAAAAGGAAAGTGCAACAAATATTGAAGCACGACGATATAAATTGAGGAAATTCATCACTAGTGGGGGCAACACAGCGAATGGA
AGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGG
TTCTCACGCGCTTCGCTACAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTGCC
TTCCCCCAAGTTCAAAGGTTCTCACTTCGCTTTGTTGCGCTCATGCGCTTCGCTACAGTTCATCCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTC
CCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGCG
CTTCGCTACAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCACTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCTTCGTTGCATTTCCTTCCCCCTAA
ATTCGAAGGTTCTCACGCGCTTCGCTGGAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGCGCTTC
GTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCTTCGTTGCATTTCCTTCCCCCAAATTCGAAGGCTCTCACGCGCTTCGCTGGAGTTCCTTCCTCCAAGTTCG
AAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGC
AGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTTGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAA
GGTTCTCACGCGCTTCGCTGCACTCCAGTGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAA
Protein sequenceShow/hide protein sequence
MSFKTSSMVSVMNKSYIGSTAHCCFNELTLQEDTTSIVAGQETTLHGAYPNDKFLVKYNPLFEPDSDVVTVMMTETRTMEERMIEMQEHIDTLMKAIEEKDSQIEQLKSQ
IEDQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCEN
AGTRGDLLVKQFVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKG
IKPCTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPE
EMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGNNKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTFHKKTKENLANSYC
INVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTQPLAFQRLSVSTSNKNRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDEKLHSSI
PSRMKRKFSVLIYTEGSLKVKSNLIILTNPANEGFDQDHDKYKSFKCKSSLSQEPKLHDAPSPHELKRQLKENFKFSLYKQKKKESATNIEARRYKLRKFITSGGNTANG
SCFLQVRRFPRASLQFLPPNSKVLTRFAAVPSSQFEGSHALRYSSSSKFEGSHALRCSSFLQVRGFSRASLQCLPPSSKVLTSLCCAHALRYSSSLQVRRFSRASLCNSF
PQVRRFSRASCSSFLQIRRFSRASLQFLPPNSKVLTRFATVPSPKFEGSHALRAVPSSKFEGSHTLRCISFPLNSKVLTRFAGVPSSKFEGSHALRSAIPSPSSKVLMRF
VQFLPPNSKVLTRFVAFPSPKFEGSHALRWSSFLQVRRFSRASLCNSFPQVRRFSRASCSSFLQIRRFSRASLQFLPPKFEGSHALRCSSFPPSLKVLTHFAAVPSSKFK
GSHALRCTPVLLPKVQRRQLSSCYARLHCCATS