; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016556 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016556
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr12:38945841..38950734
RNA-Seq ExpressionLag0016556
SyntenyLag0016556
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]5.7e-16868.24Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T+  T E+RM E+++ +N LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK I+Q+ QPQ S SIASLS+QQLQ+MI N I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL
         QYGG  Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FL
Subjt:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQYLLLPNM
        NRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N  LL+P +
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQYLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+MVV+TT    + K     KRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        KYHRVI HPVE+CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Subjt:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.0e-0840.62Show/hide
Query:  RKKDPKKLQPKRKRK-------EVDNSKKSE-------QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST--
        +  +P ++  K K K        V+ SK SE       QR+SVFDRI     RPSVF R+S +  ++ NQ S  +STR SAFQRL+ S  K +S S T  
Subjt:  RKKDPKKLQPKRKRK-------EVDNSKKSE-------QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST--

Query:  ---SVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK
           S F RL V+  + ++K +         V  D+++ S+ PSRMKRK  V +NTEGSLK
Subjt:  ---SVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]1.3e-16752.99Show/hide
Query:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-Q
        +++ NPLF            E   +VV+VMM +  T E  M EM+  IN LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q
Subjt:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-Q

Query:  CSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAF
         S S+ASLS+QQLQDMI N IRAQYGG  Q S +YSKPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF
Subjt:  CSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAF

Query:  DWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELAT
        +                          TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELAT
Subjt:  DWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELAT

Query:  RAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLE
        RAHDM+LSIA+R  +  L+   R +     +T       + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E
Subjt:  RAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLE

Query:  AQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN--------------------------------LATI
         QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA+E KIELD+DEVAQ+N                                L   
Subjt:  AQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN--------------------------------LATI

Query:  KEK---SKHQRKKDPKKLQPKR------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEE
        ++K   S  Q K++P + + +                                       +E+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EE
Subjt:  KEK---SKHQRKKDPKKLQPKR------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEE

Query:  NQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        NQC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  NQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.4e-19558.97Show/hide
Query:  KASIIASEETTLHGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y++ K      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKG
Subjt:  KASIIASEETTLHGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG

Query:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ
        K ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQ
Subjt:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ

Query:  FVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
        FVRTLKGNA DWY DLEPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  FVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +  L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKR---
        + DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + KD   LQ +R   
Subjt:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKR---

Query:  ------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS
                                              KEV+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     TR S  +RLS+ST 
Subjt:  ------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS

Query:  KKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        KK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK  V INTEGSL
Subjt:  KKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]7.9e-18650.44Show/hide
Query:  SYMGS--TAHCCFNELRLQEDKASI-IASEETTLHGAYTNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIE
        SY+G     H C  E++  ED     +A               ++K NP  +             P  ++++VM+T   T E RM E+++ +N LMK +E
Subjt:  SYMGS--TAHCCFNELRLQEDKASI-IASEETTLHGAYTNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIE

Query:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKF
        E+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LYSKPYTKRIDNLR P GYQPPKF
Subjt:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKG
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +  L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKG

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDL

Query:  DEVAQSN-----------------------------------LATIKEKSK-----------------------------------------HQRK----
        DEVAQ+N                                   + TI  ++K                                         H++K    
Subjt:  DEVAQSN-----------------------------------LATIKEKSK-----------------------------------------HQRK----

Query:  -----------KDPKKLQPKR---------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATE
                   KD   LQ +R                                         KEV+N  +  QRTSVFDRIKP TTR SVF R+SMA  E
Subjt:  -----------KDPKKLQPKR---------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATE

Query:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKF
        EENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK  V INT+    F
Subjt:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKF

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]5.1e-16965.9Show/hide
Query:  LQEDKASIIASEETTLHGAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHD
        ++  KA I+  E         ND        L E   DV++VMM +   +E  M EM+  IN LMK ++E+D +I  LK Q++ +  AESSQT VVK  D
Subjt:  LQEDKASIIASEETTLHGAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHD

Query:  KGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLL
        KGK +VQ++QP Q S S+ASLS+QQLQDMITN IRAQYGG +Q S +YSKPYTKRIDNLR P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD L
Subjt:  KGKTIVQDDQP-QCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLL

Query:  VKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYIL
        V+QFVR+LKGNAF+WYTDLEPES++SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL
Subjt:  VKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYIL

Query:  KGIKPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRN-------DEETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFP
        +GIKPRTFEELATRAHDMELSIASR  +  L+P ++K+ +         + T +ESMVVNTT P   SKGK     ++ +G+    LTLKERQ+K+YPFP
Subjt:  KGIKPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRN-------DEETIEESMVVNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFP

Query:  DADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        D+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA+E +IELDL+EVAQ+N A +
Subjt:  DADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H2.7e-16868.24Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR
        P  ++++VM+T+  T E+RM E+++ +N LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK I+Q+ QPQ S SIASLS+QQLQ+MI N I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIR

Query:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL
         QYGG  Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FL
Subjt:  AQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQYLLLPNM
        NRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N  LL+P +
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQYLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+MVV+TT    + K     KRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        KYHRVI HPVE+CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Subjt:  KYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H9.9e-0940.62Show/hide
Query:  RKKDPKKLQPKRKRK-------EVDNSKKSE-------QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST--
        +  +P ++  K K K        V+ SK SE       QR+SVFDRI     RPSVF R+S +  ++ NQ S  +STR SAFQRL+ S  K +S S T  
Subjt:  RKKDPKKLQPKRKRK-------EVDNSKKSE-------QRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST--

Query:  ---SVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK
           S F RL V+  + ++K +         V  D+++ S+ PSRMKRK  V +NTEGSLK
Subjt:  ---SVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK

A0A5A7TZU9 Ribonuclease H6.1e-16852.99Show/hide
Query:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-Q
        +++ NPLF            E   +VV+VMM +  T E  M EM+  IN LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK +V+++QP Q
Subjt:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQP-Q

Query:  CSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAF
         S S+ASLS+QQLQDMI N IRAQYGG  Q S +YSKPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF
Subjt:  CSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAF

Query:  DWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELAT
        +                          TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELAT
Subjt:  DWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELAT

Query:  RAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLE
        RAHDM+LSIA+R  +  L+   R +     +T       + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E
Subjt:  RAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLE

Query:  AQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN--------------------------------LATI
         QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA+E KIELD+DEVAQ+N                                L   
Subjt:  AQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN--------------------------------LATI

Query:  KEK---SKHQRKKDPKKLQPKR------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEE
        ++K   S  Q K++P + + +                                       +E+DNS + +QRTSVFD IKP TTR SVF R+SMA  +EE
Subjt:  KEK---SKHQRKKDPKKLQPKR------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEE

Query:  NQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        NQC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  NQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7URH1 Ty3-gypsy retrotransposon protein1.2e-19558.97Show/hide
Query:  KASIIASEETTLHGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y++ K      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKG
Subjt:  KASIIASEETTLHGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG

Query:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ
        K ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQ
Subjt:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ

Query:  FVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
        FVRTLKGNA DWY DLEPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  FVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +  L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKR---
        + DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + KD   LQ +R   
Subjt:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKR---

Query:  ------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS
                                              KEV+NS +  QRTSVFDRIKP TTR SVF R+S+A  EEENQC     TR S  +RLS+ST 
Subjt:  ------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTS

Query:  KKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        KK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK  V INTEGSL
Subjt:  KKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7UXF0 Ty3-gypsy retrotransposon protein3.0e-16764.52Show/hide
Query:  KASIIASEETTLHGAYTNDKFLVKYNPLFEPDS--DVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG
        K  I+  E   +   Y++ K      P  E  S  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS   +VKN DKG
Subjt:  KASIIASEETTLHGAYTNDKFLVKYNPLFEPDS--DVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKG

Query:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ
        K ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQ
Subjt:  KTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQ

Query:  FVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
        FVRTLKGNAFDWY DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  FVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +  L+P  R +    ++T       I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKEKS
        + DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N   I+  S
Subjt:  IPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKEKS

A0A5D3BX77 Retrotransposon gag protein3.8e-18650.44Show/hide
Query:  SYMGS--TAHCCFNELRLQEDKASI-IASEETTLHGAYTNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIE
        SY+G     H C  E++  ED     +A               ++K NP  +             P  ++++VM+T   T E RM E+++ +N LMK +E
Subjt:  SYMGS--TAHCCFNELRLQEDKASI-IASEETTLHGAYTNDKFLVKYNPLFE-------------PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIE

Query:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKF
        E+D +I  LK+ IE++  AESS    VKN DKGK ++Q+ QPQ S SIASLS+QQLQ+MI + I+ QYGG  Q   LYSKPYTKRIDNLR P GYQPPKF
Subjt:  EKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKG
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +  L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKG

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDL

Query:  DEVAQSN-----------------------------------LATIKEKSK-----------------------------------------HQRK----
        DEVAQ+N                                   + TI  ++K                                         H++K    
Subjt:  DEVAQSN-----------------------------------LATIKEKSK-----------------------------------------HQRK----

Query:  -----------KDPKKLQPKR---------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATE
                   KD   LQ +R                                         KEV+N  +  QRTSVFDRIKP TTR SVF R+SMA  E
Subjt:  -----------KDPKKLQPKR---------------------------------------KRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATE

Query:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKF
        EENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK  V INT+    F
Subjt:  EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTCAAGACTTCTTCGATGGTCGCTTTCGATATCAAGTCTTACATGGGTTCAACTGCCCATTGTTGCTTCAATGAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCATGGGGCATATACCAATGACAAGTTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGACGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACTTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGGTACTCGAGGGGATCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAATACCTTCTCCTCCCTAACATGAGAAAAGAAGG
AAGGAATGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAAAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGA
AGGCAAAATTGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAAA
GGAAGAGAAAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATG
GCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGT
CTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCC
CGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCTCCAAGGTCGAG
GGTCCTTACATTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGA
CTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGGAGTTTCT
TCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCA
CGCACTTCGTTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTC
CCCCAAGTTTGAAGGTTCTCACACGCTTCGCTGCAATTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTTACGC
GCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCC
AAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTT
CGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGG
TTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCCTCCA
AGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTG
CGTTGCAGTTCTTTCTCCCCAAGTTCGAATGTTCACGCAATTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGT
TCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCCAAGTTTGAAGGTTCTCACACGCTTCGCTGCAATTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGC
TGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTCCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCG
AAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCT
TCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTTGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCT
CACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTT
CGAAGGTTCTCATGGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCG
CTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTACTCCTTCTCCAAGTTCGAAGGCGTTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCT
CCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGTTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAA
GTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTT
CAAAAATTGACTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTCAAGACTTCTTCGATGGTCGCTTTCGATATCAAGTCTTACATGGGTTCAACTGCCCATTGTTGCTTCAATGAACTGAGGTTGCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCATGGGGCATATACCAATGACAAGTTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGACTATAGTGCAAGACGATCAACCACAGTGTTCTGCTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACTTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGGTACTCGAGGGGATCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAATACCTTCTCCTCCCTAACATGAGAAAAGAAGG
AAGGAATGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAAAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGA
AGGCAAAATTGAGCTCGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAAA
GGAAGAGAAAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATG
GCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGT
CTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCC
CGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCTCCAAGGTCGAG
GGTCCTTACATTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGA
CTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGGAGTTTCT
TCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCA
CGCACTTCGTTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTC
CCCCAAGTTTGAAGGTTCTCACACGCTTCGCTGCAATTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTTACGC
GCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCC
AAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTT
CGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGG
TTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCCTCCA
AGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCTCACTCGCTG
CGTTGCAGTTCTTTCTCCCCAAGTTCGAATGTTCACGCAATTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGT
TCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCCAAGTTTGAAGGTTCTCACACGCTTCGCTGCAATTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGC
TGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTCCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCG
AAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCT
TCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTTGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCT
CACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTT
CGAAGGTTCTCATGGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCG
CTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTACTCCTTCTCCAAGTTCGAAGGCGTTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCT
CCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGTTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAA
GTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTT
CAAAAATTGACTGTTGA
Protein sequenceShow/hide protein sequence
MSFKTSSMVAFDIKSYMGSTAHCCFNELRLQEDKASIIASEETTLHGAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQ
IENQHIAESSQTQVVKNHDKGKTIVQDDQPQCSASIASLSIQQLQDMITNCIRAQYGGLTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCEN
AGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKG
IKPRTFEELATRAHDMELSIASRENQYLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKGKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPE
EMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRKEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSM
AATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLSKVE
GPYIVRYCVVPSPSSKVLRCILLRCSFSKFEGSQLYDCYVVPPPSVKDLMWCVVALFPLLSSSMVLTQLRWSFFSPSSKVLTRSVAVPSFQGRRFSLAALQFFLPKFEGS
RTSLQFLLPNSKVLTRFALQFLPPSSKVLTRFAVVPSPKFEGSHTLRCNSFLQVQRFSRASLQFLPPSSKVLTRFAAVPSSQFKGSHALRCSSFPQVRRFSRRFAAVPSS
KFEGSHVASLQFLPPSLKVLTSLRFALRFVAVPSSKFEVPSSKFEGSHALRCSFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKAPLSTVPSSKFEGSHALRCYLPP
SSKVLSRAAAAPSSKFEGSHALRCSSFFSRSKVLTRCVAVLSPQVRMFTQFAAVPSPKFEGSHTLRSAIPSPKFEGSHALRCSSFPPSLKVLTRFAAIPSSKFKGSHALR
CSSFPQVRRFLRALLQFLPHSSKVLTRFAAVPSPQVRRFSRRFAAVPSSKFEGSHVASLQFLPPSLKVLTSLRFALRFVAVPSSKFEVPSSKFEGSHALCCSFLQVRRFS
HALLQLLPPSSKVPSRASLAPSPSSKAPLSTVPSSKFEGSHGFVATFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTTPSPSSKAFLSAAPSPSSKALLSTA
PSPSSKALLSTVPSPSSKVLLSTPLFEGSPLRFSFSKFEGSPLLLFKCLAAVDVLVPLHLQMLAVDGVRFASSSKIDC