; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041578 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041578
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr13:21081102..21084822
RNA-Seq ExpressionLag0041578
SyntenyLag0041578
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.1e-14352.63Show/hide
Query:  EPDSDVVTVMMTGTRTMEERM-----SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLK
        +P    V+V     + +++ +     +QYGGP Q S +YSKPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LK
Subjt:  EPDSDVVTVMMTGTRTMEERM-----SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLK

Query:  GNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE
        GNAF+                          TRR VSM ELTNT QRKGE V++YINRWR +SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFE
Subjt:  GNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE

Query:  ELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLE
        ELATRAHDM+LSIA++  +D L+   R +     +T       + ESM+V  T  KS SK K   +  +H          + + KK+YPFPD+D+ DMLE
Subjt:  ELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLE

Query:  QLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK---------D
        QL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA E KIELD+DEVAQ+N   +   S          QRK           
Subjt:  QLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK---------D

Query:  LKKLQPK--RKRSKKFSQPQQ------LVMLNKSFSKNFHKK--EKKNFATSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVA
        L + Q K     S+   +P +      +  L +SF ++  ++  E     T+  ++V       EE+DNS + +QRTSVFD IKP TTR SVF R+SM  
Subjt:  LKKLQPK--RKRSKKFSQPQQ------LVMLNKSFSKNFHKK--EKKNFATSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVA

Query:  TEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
         +EENQC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  TEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.6e-14047.09Show/hide
Query:  EPDSDVVTVMMTGTRTMEERM-----SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLK
        +P    V+V     + +++ +     +QYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LK
Subjt:  EPDSDVVTVMMTGTRTMEERM-----SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLK

Query:  GNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE
        GNAF+WYTDLEPE                                   GE V++YINRWR +SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFE
Subjt:  GNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE

Query:  ELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHHFKG----------KTKKIYPFPDADIPDM
        EL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  T  KS SK K +T    ++ G          + KK+YPFPD+D+ DM
Subjt:  ELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHHFKG----------KTKKIYPFPDADIPDM

Query:  LEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSN------------------------------
        LEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI H VE+CFVLK+LILKLA E KIELD+DEVAQ+N                              
Subjt:  LEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSN------------------------------

Query:  ------------------------------------------------------LATIKEKSKHQRKKDLKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF
                                                                 I  K K +R K + K +P + + K F QP++ + L +   ++F
Subjt:  ------------------------------------------------------LATIKEKSKHQRKKDLKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF

Query:  HKKEKKNFA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSV
         +   +         T+  ++V       EEVDNS + +QRT VF RIKP T R SVF R+SM   EEENQC  ST  R SAF+RLS+ST KK R STS 
Subjt:  HKKEKKNFA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSV

Query:  FDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        FDRLK+ NDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  FDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]7.6e-13871.51Show/hide
Query:  SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL
        +QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FL
Subjt:  SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTRR VSM ELT TKQRKGE V++YINRWR +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA++ N DLL+P +
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNGAHH---FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+MVV+TT    + K    EKRQ  G       K + +K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNGAHH---FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC

Query:  KYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI
        KYHRVI H VE+CFVLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  KYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.3e-0938.38Show/hide
Query:  KLQPKRKRSKK--FSQPQQLVMLNKSFSK--NFHKKEKKNFATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMST
        +L P +K+ +K  +S P     +    S+      K K   A +  I VEE  +S++ +    QR+SVFDRI     RPSVF R+S    ++ NQ S  +
Subjt:  KLQPKRKRSKK--FSQPQQLVMLNKSFSK--NFHKKEKKNFATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMST

Query:  STRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK
        STR SAFQRL+ S  K +S S T     S F RL V+  + ++K +         V  D+++ S+ PSRMKRK  V +NTEGSLK
Subjt:  STRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.8e-16761.57Show/hide
Query:  QYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLN
        QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FLN
Subjt:  QYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLN

Query:  RFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMR
        RFYSTR  VSM ELTNT+Q+KGELV++YINRWR +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA++  +D L+P  R
Subjt:  RFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMR

Query:  KEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKY
         +    ++T       I+ESMVV+ T  KS SK K       H          K + +K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP Y
Subjt:  KEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKY

Query:  CKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDLKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEK
        CKYHRVI H VE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +  + + 
Subjt:  CKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDLKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEK

Query:  KNFATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNL
         N+ +S     +EV+NS +  QRTSVFDRIKP TTR SVF R+S+   EEENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + 
Subjt:  KNFATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNL

Query:  ELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        + K F E N D K+HS +PSRMKRK  V INTEGSL
Subjt:  ELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.9e-16153.99Show/hide
Query:  SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL
        +QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FL
Subjt:  SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTRR VSM ELTNT+Q+KGELV++YINRWR +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI ++  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K       H          K + +K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD--------
        YCKYHRVI H VE+CFVLK+LILKLA E+KIEL++DEVAQ+N                                   + TI  ++K    KD        
Subjt:  YCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD--------

Query:  --LKKLQP----------------------KRKRSKK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDVEEVDNSKKS----
           K  QP                      K +R+KK             F Q ++ + L +   ++F   H +E     T +   + EV+N+  S    
Subjt:  --LKKLQP----------------------KRKRSKK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDVEEVDNSKKS----

Query:  ------EQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKL
               QRTSVFDRIKP TTR SVF R+SM   EEENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+
Subjt:  ------EQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKL

Query:  HSSIPSRMKRKFSVLINTEGSLKFLL
        HS +PSRMKRK  V INT+    F+L
Subjt:  HSSIPSRMKRKFSVLINTEGSLKFLL

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein1.0e-14352.63Show/hide
Query:  EPDSDVVTVMMTGTRTMEERM-----SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLK
        +P    V+V     + +++ +     +QYGGP Q S +YSKPYTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LK
Subjt:  EPDSDVVTVMMTGTRTMEERM-----SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLK

Query:  GNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE
        GNAF+                          TRR VSM ELTNT QRKGE V++YINRWR +SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFE
Subjt:  GNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE

Query:  ELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLE
        ELATRAHDM+LSIA++  +D L+   R +     +T       + ESM+V  T  KS SK K   +  +H          + + KK+YPFPD+D+ DMLE
Subjt:  ELATRAHDMELSIASQENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLE

Query:  QLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK---------D
        QL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE+CFVLK+LI KLA E KIELD+DEVAQ+N   +   S          QRK           
Subjt:  QLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK---------D

Query:  LKKLQPK--RKRSKKFSQPQQ------LVMLNKSFSKNFHKK--EKKNFATSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVA
        L + Q K     S+   +P +      +  L +SF ++  ++  E     T+  ++V       EE+DNS + +QRTSVFD IKP TTR SVF R+SM  
Subjt:  LKKLQPK--RKRSKKFSQPQQ------LVMLNKSFSKNFHKK--EKKNFATSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVA

Query:  TEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
         +EENQC   T  + SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  TEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7SUW1 Retrotransposon gag protein8.0e-14147.09Show/hide
Query:  EPDSDVVTVMMTGTRTMEERM-----SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLK
        +P    V+V     + +++ +     +QYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LK
Subjt:  EPDSDVVTVMMTGTRTMEERM-----SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLK

Query:  GNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE
        GNAF+WYTDLEPE                                   GE V++YINRWR +SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFE
Subjt:  GNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFE

Query:  ELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHHFKG----------KTKKIYPFPDADIPDM
        EL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  T  KS SK K +T    ++ G          + KK+YPFPD+D+ DM
Subjt:  ELATRAHDMELSIASQENQDLLLPNMRKEGRND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHHFKG----------KTKKIYPFPDADIPDM

Query:  LEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSN------------------------------
        LEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI H VE+CFVLK+LILKLA E KIELD+DEVAQ+N                              
Subjt:  LEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSN------------------------------

Query:  ------------------------------------------------------LATIKEKSKHQRKKDLKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF
                                                                 I  K K +R K + K +P + + K F QP++ + L +   ++F
Subjt:  ------------------------------------------------------LATIKEKSKHQRKKDLKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF

Query:  HKKEKKNFA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSV
         +   +         T+  ++V       EEVDNS + +QRT VF RIKP T R SVF R+SM   EEENQC  ST  R SAF+RLS+ST KK R STS 
Subjt:  HKKEKKNFA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSV

Query:  FDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        FDRLK+ NDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  FDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7TZU9 Ribonuclease H3.7e-13871.51Show/hide
Query:  SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL
        +QYGGP Q   LYSKPYTKRIDN+R P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FL
Subjt:  SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTRR VSM ELT TKQRKGE V++YINRWR +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA++ N DLL+P +
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNGAHH---FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC
        RKE +  + T        +E+MVV+TT    + K    EKRQ  G       K + +K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YC
Subjt:  RKEGRNDEET-------IEESMVVNTT----LPKSSSKEKRQTNGAHH---FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYC

Query:  KYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI
        KYHRVI H VE+CFVLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  KYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H1.1e-0938.38Show/hide
Query:  KLQPKRKRSKK--FSQPQQLVMLNKSFSK--NFHKKEKKNFATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMST
        +L P +K+ +K  +S P     +    S+      K K   A +  I VEE  +S++ +    QR+SVFDRI     RPSVF R+S    ++ NQ S  +
Subjt:  KLQPKRKRSKK--FSQPQQLVMLNKSFSK--NFHKKEKKNFATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMST

Query:  STRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK
        STR SAFQRL+ S  K +S S T     S F RL V+  + ++K +         V  D+++ S+ PSRMKRK  V +NTEGSLK
Subjt:  STRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLK

A0A5A7URH1 Ty3-gypsy retrotransposon protein3.8e-16761.57Show/hide
Query:  QYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLN
        QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FLN
Subjt:  QYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLN

Query:  RFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMR
        RFYSTR  VSM ELTNT+Q+KGELV++YINRWR +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA++  +D L+P  R
Subjt:  RFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMR

Query:  KEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKY
         +    ++T       I+ESMVV+ T  KS SK K       H          K + +K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP Y
Subjt:  KEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKY

Query:  CKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDLKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEK
        CKYHRVI H VE+CFVLK+LILKLA E+KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +  + + 
Subjt:  CKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDLKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEK

Query:  KNFATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNL
         N+ +S     +EV+NS +  QRTSVFDRIKP TTR SVF R+S+   EEENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + 
Subjt:  KNFATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNL

Query:  ELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        + K F E N D K+HS +PSRMKRK  V INTEGSL
Subjt:  ELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5D3BX77 Retrotransposon gag protein1.4e-16153.99Show/hide
Query:  SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL
        +QYGGP Q   LYSKPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FL
Subjt:  SQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM
        NRFYSTRR VSM ELTNT+Q+KGELV++YINRWR +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI ++  +D L+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASQENQDLLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K       H          K + +K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH---------FKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD--------
        YCKYHRVI H VE+CFVLK+LILKLA E+KIEL++DEVAQ+N                                   + TI  ++K    KD        
Subjt:  YCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD--------

Query:  --LKKLQP----------------------KRKRSKK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDVEEVDNSKKS----
           K  QP                      K +R+KK             F Q ++ + L +   ++F   H +E     T +   + EV+N+  S    
Subjt:  --LKKLQP----------------------KRKRSKK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDVEEVDNSKKS----

Query:  ------EQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKL
               QRTSVFDRIKP TTR SVF R+SM   EEENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+
Subjt:  ------EQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKL

Query:  HSSIPSRMKRKFSVLINTEGSLKFLL
        HS +PSRMKRK  V INT+    F+L
Subjt:  HSSIPSRMKRKFSVLINTEGSLKFLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTTGCAAGAAGATAAAGCTTCTATCGCTGGAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAATTTCTTGCTAAGTATAACCCTTTGTTTGAACC
TGATTCTGACGTAGTGACTGTCATGATGACTGGGACTAGAACTATGGAAGAAAGAATGAGCCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATA
CTAAGAGGATTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACA
TGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAGCAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCATTAGACAG
TTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTA
ACTATATAAATCGCTGGAGAGTCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATC
CTTAAAGGTATAAAACCTCGCACCTTTGAAGAACTAGCAACTCGCGCCCACGATATGGAACTAAGTATTGCTAGCCAAGAAAACCAAGACCTTCTCCTCCCTAACATGAG
AAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACT
TTAAAGGAAAGACAAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCA
GAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGATCATCAAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATCCTAAAGCTGGCTAT
GGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCTTAAGAAACTTCAACCCA
AGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCCTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACTTTGCGACTTCGTAC
TGTATCGACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAG
TATGGTCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACAT
CTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGC
ATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGT
TCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCATTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGA
AGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGC
TCCGTTGCAGTTCCTTCTTTCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGTAGTTCCTTCTCCCAAAT
TCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCT
GCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCTTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGT
TCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCC
AAGTTCGAAGGTTCTCATGCACTTCGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTC
GCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCT
CACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGACGCTTCTCTCCACTGCTCCTTCTCCAAGTTC
GAAGGCGCTTCTCTCCACTACTCCTTCTTCAAGTTCGAAGGCGTTTCTCTCCAGCTGCTCCTTCTCCAAGTTCAAAGGCGCTTCCCTCCACTGCTCCTTCTCCAAGTTCG
AAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCAAAGGTGCTTCTCTCCACCCCTTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCG
TTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGGCTGTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGACGTTGCAAGAAGATAAAGCTTCTATCGCTGGAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAATTTCTTGCTAAGTATAACCCTTTGTTTGAACC
TGATTCTGACGTAGTGACTGTCATGATGACTGGGACTAGAACTATGGAAGAAAGAATGAGCCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATA
CTAAGAGGATTGATAACTTGAGAACGCCAATCGGGTATCAGCCACCAAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACA
TGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAGCAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCATTAGACAG
TTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACGAAAAGGTGAACTCGTTGTTA
ACTATATAAATCGCTGGAGAGTCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATC
CTTAAAGGTATAAAACCTCGCACCTTTGAAGAACTAGCAACTCGCGCCCACGATATGGAACTAAGTATTGCTAGCCAAGAAAACCAAGACCTTCTCCTCCCTAACATGAG
AAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACT
TTAAAGGAAAGACAAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCA
GAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGATCATCAAGTGGAAAGATGTTTCGTCCTAAAGGACTTAATCCTAAAGCTGGCTAT
GGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCTTAAGAAACTTCAACCCA
AGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCCTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACTTTGCGACTTCGTAC
TGTATCGACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAG
TATGGTCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACAT
CTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGC
ATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGT
TCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCATTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGA
AGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGC
TCCGTTGCAGTTCCTTCTTTCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGTAGTTCCTTCTCCCAAAT
TCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCT
GCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCTTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGT
TCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCC
AAGTTCGAAGGTTCTCATGCACTTCGTTGCTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTC
GCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCCACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCT
CACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGACGCTTCTCTCCACTGCTCCTTCTCCAAGTTC
GAAGGCGCTTCTCTCCACTACTCCTTCTTCAAGTTCGAAGGCGTTTCTCTCCAGCTGCTCCTTCTCCAAGTTCAAAGGCGCTTCCCTCCACTGCTCCTTCTCCAAGTTCG
AAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCAAAGGTGCTTCTCTCCACCCCTTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCG
TTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTTCGCTTCGCTTCATCTTCAAAAATTGGCTGTTG
A
Protein sequenceShow/hide protein sequence
MTLQEDKASIAGGQETTLQGAYTNDKFLAKYNPLFEPDSDVVTVMMTGTRTMEERMSQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVET
CENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESLDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRVMSLDCKDRLTELSSVEMCIQGMHWELLYI
LKGIKPRTFEELATRAHDMELSIASQENQDLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHFKGKTKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRP
EEMEKVDDPKYCKYHRVIDHQVERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDLKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSY
CIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSS
IPSRMKRKFSVLINTEGSLKFLLSKFEGPYTVRYCVVPSPSSKVLRCILLRCSFSKFEGSQLYDCYVVPPPSVKDLMWCVVALFPLLSSSMVLTQLCWSFFSPSSKVLTR
SVAVPSFKVEGSHSLRCSSFSPSSKVHALRCSSFSQIRRFSRASLQFLPPSSKVLTRFAAVPSSQFKGSHALRCSSFPQVRRFSRRFAAVPSSKFEGSHIASLRAALRCS
SFLQVRRFSLASLQFLPPNSKVSKVLKRCDSLQFLPPSSKVLMHFVAPSSKFEGSLTRCCSSFLQVRRFPHALRSLLLQVRRRLSPLFLPPSSKVLMRFVATFLQVRRFS
HALLQLLPPSSKVPSRASLAPSPSSKTLLSTAPSPSSKALLSTTPSSSSKAFLSSCSFSKFKGASLHCSFSKFEGASLYCSFSKFKGASLHPFFEGSPLRFSFSKFEGSP
LLLFKCLAAVDVLVPLHLQMLAVDGVRFASSSKIGC