; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031541 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031541
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr11:9753395..9757649
RNA-Seq ExpressionLag0031541
SyntenyLag0031541
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.7e-17054.07Show/hide
Query:  VKYNPLF------------EPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QC
        ++ NPLF            E   +VV+VMM +  T E  MAEM+  IN LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK++V+++ P Q 
Subjt:  VKYNPLF------------EPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QC

Query:  SASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD
        S SVASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+
Subjt:  SASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD

Query:  WYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATH
                                  TRR VSM ELTNT QRK E V++YINRWRA+SLD KD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELAT 
Subjt:  WYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATH

Query:  AHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEA
        AHDM+LSIA+R  +D L+   R +     +T       + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E 
Subjt:  AHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEA

Query:  QLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSK--------HQRKK-------DPKKLQPK
        QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA E KIELD+DEVAQ+N       S          QRK        +P  ++ +
Subjt:  QLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSK--------HQRKK-------DPKKLQPK

Query:  RKISKKSFQPQQ----------LVILNKSFSKTFYKKKKENLA--TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEEN
        +K    + Q ++          +  L +SF +   ++  E  A  T+  ++V       EE+DNS + +QRTSVFD IKP TTR SVFQR+SMA  +EEN
Subjt:  RKISKKSFQPQQ----------LVILNKSFSKTFYKKKKENLA--TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEEN

Query:  QCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP
        QC   T  + SAF+RLS+S SKK R ST  FDR K+TNDQ +R+M  L+ K F E N D K+HS +PS MKRK SV INTEGSL   P
Subjt:  QCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]9.8e-16868.33Show/hide
Query:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR
        P  ++++VM+T+  T E+RMAE+++ +N LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK+I+Q+  PQ S S+ASLS+QQLQ+MI N I+
Subjt:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM
        NRFYSTRR VSM ELT TKQRK E V++YINRWRA+SLD KDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSIA+R N DLL+P +
Subjt:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLA
        KYHRVI HPVE+CFVLK+LILKLA+++KIEL+LD+VAQ+N A
Subjt:  KYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLA

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]7.9e-0830Show/hide
Query:  HLTLKERQKKIYPFPDAD------IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEV----AQ
        ++ L  R+K   PF +          ++L++   A L K+ K    +++EK D   Y    R +     + +    L+ K   +     +L  V     +
Subjt:  HLTLKERQKKIYPFPDAD------IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEV----AQ

Query:  SNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRM
          L+ T++K + Q    P     +  I  +S +P ++             K K  +A +  I VEE  +S++ +    QR+SVFDRI     RPSVFQR+
Subjt:  SNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRM

Query:  SMAATEEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST-----SVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLK
        S +  ++ NQ S  +STR SAFQRL+ S  K +S S T     S F R  V+  + ++K +         V  D+++ S+ PS MKRK  V +NTEGSLK
Subjt:  SMAATEEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST-----SVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLK

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.2e-16763.51Show/hide
Query:  SYMGSTVHRCFNGLTLQEDKASVVAGQETTLQGAYTNDK--FFVKYNPLF------------EPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEK
        +Y G        G+   +D+ S +A  ++ L+    + K    +K NPL+            E   DV++VMM +   +E  MAEM+  IN LMK ++E+
Subjt:  SYMGSTVHRCFNGLTLQEDKASVVAGQETTLQGAYTNDK--FFVKYNPLF------------EPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEK

Query:  DSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQ
        D +I  LK Q++ +  AESSQT VVK  DKGK++VQ++ P Q S SVASLS+QQLQDMITN IRAQYGGP+Q S +YSKPYTKRIDNLR P+GYQPPKFQ
Subjt:  DSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQ

Query:  QFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAM
        QFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPES++SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRK E V++YINRWRA+
Subjt:  QFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAM

Query:  SLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESMVVNTT-LPKSSSKE
        SLD KDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSIASR  +D L+P ++K+ +         + T +ESMVVNTT L  S  KE
Subjt:  SLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESMVVNTT-LPKSSSKE

Query:  KR---QTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDE
         R   + +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LIL+LA E++IELDL+E
Subjt:  KR---QTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDE

Query:  VAQSNLA
        VAQ+N A
Subjt:  VAQSNLA

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.1e-19560.59Show/hide
Query:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL
         QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+K ELV++YINRWRA+SLD KDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSIA+R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSY
        YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A  +  S   + KD   LQ +R I+   F P       +SF +   ++  E  A   
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSY

Query:  CIDVE---------EVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMN
           VE         EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+ST KK R STS FDR K+TNDQ +R+M 
Subjt:  CIDVE---------EVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMN

Query:  NLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP
        + + K F E N D K+HS +PS MKRK  V INTEGSL   P
Subjt:  NLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.0e-18853.27Show/hide
Query:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM
        NRFYSTRR VSM ELTNT+Q+K ELV++YINRWRA+SLD KDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSI +R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATTKEKSKHQRKKD--------
        YCKYHRVI HPVE+CFVLK+LILKLA E+KIEL++DEVAQ+N                                   + T   ++K    KD        
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATTKEKSKHQRKKD--------

Query:  -------PKKLQ-----------------PKRKISKKSFQPQQL-------------VILNKSFSKTFYKKKKENL---ATSYCIDVEEVDNSKKS----
               P  +Q                  K + +KK + P+ +             + L +   ++F +   E +    T +   + EV+N+  S    
Subjt:  -------PKKLQ-----------------PKRKISKKSFQPQQL-------------VILNKSFSKTFYKKKKENL---ATSYCIDVEEVDNSKKS----

Query:  ------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKL
               QRTSVFDRIKP TTR SVFQR+SMA  EEENQC     TR S F+RLS+S SKK+R STS FDR K+TNDQ +R+M +L+ K F E N D K+
Subjt:  ------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKL

Query:  HSSIPSGMKRKFSVLINTEGSLKFLPPSSKVLTL
        HS +PS MKRK  V INT+    F+   S VL L
Subjt:  HSSIPSGMKRKFSVLINTEGSLKFLPPSSKVLTL

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein2.3e-17054.07Show/hide
Query:  VKYNPLF------------EPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QC
        ++ NPLF            E   +VV+VMM +  T E  MAEM+  IN LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK++V+++ P Q 
Subjt:  VKYNPLF------------EPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QC

Query:  SASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD
        S SVASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+
Subjt:  SASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD

Query:  WYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATH
                                  TRR VSM ELTNT QRK E V++YINRWRA+SLD KD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELAT 
Subjt:  WYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATH

Query:  AHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEA
        AHDM+LSIA+R  +D L+   R +     +T       + ESM+V  T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E 
Subjt:  AHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEA

Query:  QLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSK--------HQRKK-------DPKKLQPK
        QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA E KIELD+DEVAQ+N       S          QRK        +P  ++ +
Subjt:  QLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSK--------HQRKK-------DPKKLQPK

Query:  RKISKKSFQPQQ----------LVILNKSFSKTFYKKKKENLA--TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEEN
        +K    + Q ++          +  L +SF +   ++  E  A  T+  ++V       EE+DNS + +QRTSVFD IKP TTR SVFQR+SMA  +EEN
Subjt:  RKISKKSFQPQQ----------LVILNKSFSKTFYKKKKENLA--TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEEN

Query:  QCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP
        QC   T  + SAF+RLS+S SKK R ST  FDR K+TNDQ +R+M  L+ K F E N D K+HS +PS MKRK SV INTEGSL   P
Subjt:  QCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP

A0A5A7TZU9 Ribonuclease H4.8e-16868.33Show/hide
Query:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR
        P  ++++VM+T+  T E+RMAE+++ +N LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK+I+Q+  PQ S S+ASLS+QQLQ+MI N I+
Subjt:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM
        NRFYSTRR VSM ELT TKQRK E V++YINRWRA+SLD KDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSIA+R N DLL+P +
Subjt:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC

Query:  KYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLA
        KYHRVI HPVE+CFVLK+LILKLA+++KIEL+LD+VAQ+N A
Subjt:  KYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLA

A0A5A7TZU9 Ribonuclease H3.8e-0830Show/hide
Query:  HLTLKERQKKIYPFPDAD------IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEV----AQ
        ++ L  R+K   PF +          ++L++   A L K+ K    +++EK D   Y    R +     + +    L+ K   +     +L  V     +
Subjt:  HLTLKERQKKIYPFPDAD------IPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEV----AQ

Query:  SNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRM
          L+ T++K + Q    P     +  I  +S +P ++             K K  +A +  I VEE  +S++ +    QR+SVFDRI     RPSVFQR+
Subjt:  SNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRM

Query:  SMAATEEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST-----SVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLK
        S +  ++ NQ S  +STR SAFQRL+ S  K +S S T     S F R  V+  + ++K +         V  D+++ S+ PS MKRK  V +NTEGSLK
Subjt:  SMAATEEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST-----SVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLK

A0A5A7TZU9 Ribonuclease H2.6e-16649.07Show/hide
Query:  VKYNPLF------------EPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QC
        ++ NPLF            E   +VV+VMM +  T E  M EM+  IN LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK++VQ++ P Q 
Subjt:  VKYNPLF------------EPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QC

Query:  SASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD
        S SVASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+
Subjt:  SASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFD

Query:  WYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATH
        WYTDLEPE                                    E V++YINRWRA+SLD KD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+T 
Subjt:  WYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATH

Query:  AHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLE
        AHDMELSIA+   +D L+   ++  +N+           + ESM+V  T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+ DMLEQL+E
Subjt:  AHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLE

Query:  AQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN------------------------------------
         QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA E KIELD+DEVAQ+N                                    
Subjt:  AQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN------------------------------------

Query:  ------------------------------------------------LATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKE
                                                              K K +R K   K +P +   K  FQP++ + L +   ++F +   E
Subjt:  ------------------------------------------------LATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKE

Query:  NLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKV
         +       T+  ++V       EEVDNS + +QRT VF RIKP T R SVFQR+SMA  EEENQC  ST  R SAF+RLS+ST KK R STS FDR K+
Subjt:  NLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKV

Query:  TNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP
         NDQ +R+M +L+ K F E N D K+HS +PS MKRK SV INTEGSL   P
Subjt:  TNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP

A0A5A7URH1 Ty3-gypsy retrotransposon protein5.4e-19660.59Show/hide
Query:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL
         QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM
        NRFYSTR  VSM ELTNT+Q+K ELV++YINRWRA+SLD KDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSIA+R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSY
        YCKYHRVI HPVE+CFVLK+LILKLA E+KIELD+DEVAQ+N A  +  S   + KD   LQ +R I+   F P       +SF +   ++  E  A   
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSY

Query:  CIDVE---------EVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMN
           VE         EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+ST KK R STS FDR K+TNDQ +R+M 
Subjt:  CIDVE---------EVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMN

Query:  NLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP
        + + K F E N D K+HS +PS MKRK  V INTEGSL   P
Subjt:  NLELKLFDEVNSDKKLHSSIPSGMKRKFSVLINTEGSLKFLP

A0A5D3BX77 Retrotransposon gag protein4.9e-18953.27Show/hide
Query:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM
        NRFYSTRR VSM ELTNT+Q+K ELV++YINRWRA+SLD KDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSI +R  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATHAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATTKEKSKHQRKKD--------
        YCKYHRVI HPVE+CFVLK+LILKLA E+KIEL++DEVAQ+N                                   + T   ++K    KD        
Subjt:  YCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATTKEKSKHQRKKD--------

Query:  -------PKKLQ-----------------PKRKISKKSFQPQQL-------------VILNKSFSKTFYKKKKENL---ATSYCIDVEEVDNSKKS----
               P  +Q                  K + +KK + P+ +             + L +   ++F +   E +    T +   + EV+N+  S    
Subjt:  -------PKKLQ-----------------PKRKISKKSFQPQQL-------------VILNKSFSKTFYKKKKENL---ATSYCIDVEEVDNSKKS----

Query:  ------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKL
               QRTSVFDRIKP TTR SVFQR+SMA  EEENQC     TR S F+RLS+S SKK+R STS FDR K+TNDQ +R+M +L+ K F E N D K+
Subjt:  ------EQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKL

Query:  HSSIPSGMKRKFSVLINTEGSLKFLPPSSKVLTL
        HS +PS MKRK  V INT+    F+   S VL L
Subjt:  HSSIPSGMKRKFSVLINTEGSLKFLPPSSKVLTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTGTCCATCGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTGTCGTTGCAGGCCAAGAAACAAC
CTTGCAGGGGGCATATACTAATGACAAGTTTTTTGTCAAGTATAACCCTTTGTTTGAACCTGATTTTGACGTAGTGACTGTTATGATGACTGAGACAAGAACTATGGAAG
AAAGAATGGCTGAGATGCAAGAACACATCAATAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCC
GAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGTCTATAGTGCAAGATGATCATCCACAATGTTCTGCTTCAGTCGCTTCACTATCCATCCAACAGCT
CCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTACTCCAAACCTTATACTAAGAGGATTGATAATTTGAGAACGCCAA
TCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTG
CTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAGCCGGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAA
TCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAATGTGAACTCGTTGTTAATTATATAAATCGCTGGAGAGCTATGAGTC
TAGATTACAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAG
GAACTAGCAACTCATGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTAT
AGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAAAAGAAAATCT
ATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGAT
CCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCT
TGATGAAGTAGCTCAATCAAATCTTGCTACAACCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAAATAAGTAAAAAGTCTT
TTCAACCTCAACAACTGGTGATATTGAATAAATCCTTCTCCAAAACTTTCTACAAAAAGAAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGAC
AATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAA
TCAATGTTCGATGTCTACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGATTCAAAGTAA
CAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCAGGTATGAAGAGGAAG
TTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCCTCGTCGTGGTTCCTTCTCTCCAAGTACGAATGTGACAGT
TCCTGACAGTTCCTTCCTCCAAGTTCGAAGTTCGAAGGTTCTCAGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTGGTTCTCACGCGCTTCGCCACAGTTCCTT
CCTCCAAGTTCGAAGATCTGGTCGGCGTTAGCAACGTTCGTGGTTGGTGTTCATGGTCGGCGTTCGCAACGTTCGTGGTTGGTGTTCATGGTCGGTGTTCGCAACGTTCG
TGTCGGTGTTCATGGTCGGCGTTCGCAGCGTTCGTGGTCGGTGTTCGTAGTCGGCGTTCGCGACCCAAAATCCAGATCTGGTCGGTGTTCGTGGACGTTCGTCAGGGTTC
ACAGGCGTTCGTGATCAGGGTTCAAGAAACCTTCGCCGCAGATTCATCTTCCCAGTCGTCGACTGCCCCATCTCCGGCCACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTGTCCATCGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTGTCGTTGCAGGCCAAGAAACAAC
CTTGCAGGGGGCATATACTAATGACAAGTTTTTTGTCAAGTATAACCCTTTGTTTGAACCTGATTTTGACGTAGTGACTGTTATGATGACTGAGACAAGAACTATGGAAG
AAAGAATGGCTGAGATGCAAGAACACATCAATAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCC
GAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGTCTATAGTGCAAGATGATCATCCACAATGTTCTGCTTCAGTCGCTTCACTATCCATCCAACAGCT
CCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTACTCCAAACCTTATACTAAGAGGATTGATAATTTGAGAACGCCAA
TCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAGGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTG
CTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAGCCGGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAA
TCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAATGTGAACTCGTTGTTAATTATATAAATCGCTGGAGAGCTATGAGTC
TAGATTACAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAG
GAACTAGCAACTCATGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTAT
AGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAAAAGAAAATCT
ATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGAT
CCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCT
TGATGAAGTAGCTCAATCAAATCTTGCTACAACCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAAATAAGTAAAAAGTCTT
TTCAACCTCAACAACTGGTGATATTGAATAAATCCTTCTCCAAAACTTTCTACAAAAAGAAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGAC
AATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAA
TCAATGTTCGATGTCTACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGATTCAAAGTAA
CAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCAGGTATGAAGAGGAAG
TTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCCTCGTCGTGGTTCCTTCTCTCCAAGTACGAATGTGACAGT
TCCTGACAGTTCCTTCCTCCAAGTTCGAAGTTCGAAGGTTCTCAGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTGGTTCTCACGCGCTTCGCCACAGTTCCTT
CCTCCAAGTTCGAAGATCTGGTCGGCGTTAGCAACGTTCGTGGTTGGTGTTCATGGTCGGCGTTCGCAACGTTCGTGGTTGGTGTTCATGGTCGGTGTTCGCAACGTTCG
TGTCGGTGTTCATGGTCGGCGTTCGCAGCGTTCGTGGTCGGTGTTCGTAGTCGGCGTTCGCGACCCAAAATCCAGATCTGGTCGGTGTTCGTGGACGTTCGTCAGGGTTC
ACAGGCGTTCGTGATCAGGGTTCAAGAAACCTTCGCCGCAGATTCATCTTCCCAGTCGTCGACTGCCCCATCTCCGGCCACCTGA
Protein sequenceShow/hide protein sequence
MVAVMNKSYMGSTVHRCFNGLTLQEDKASVVAGQETTLQGAYTNDKFFVKYNPLFEPDFDVVTVMMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIA
ESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDL
LVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKCELVVNYINRWRAMSLDYKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE
ELATHAHDMELSIASRENQDLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDD
PKYCKYHRVIGHPVERCFVLKDLILKLAMERKIELDLDEVAQSNLATTKEKSKHQRKKDPKKLQPKRKISKKSFQPQQLVILNKSFSKTFYKKKKENLATSYCIDVEEVD
NSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRFKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSGMKRK
FSVLINTEGSLKFLPPSSKVLTLPRRGSFSPSTNVTVPDSSFLQVRSSKVLSASPQFLPPSSKVVLTRFATVPSSKFEDLVGVSNVRGWCSWSAFATFVVGVHGRCSQRS
CRCSWSAFAAFVVGVRSRRSRPKIQIWSVFVDVRQGSQAFVIRVQETFAADSSSQSSTAPSPAT