; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019130 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019130
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr5:38922596..38928484
RNA-Seq ExpressionLag0019130
SyntenyLag0019130
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.0e-17651.83Show/hide
Query:  KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLFEHDS------------DVVTVMMTGTRTMEERMVEMQEH
        K +S  + ++ SY+G        K ++QE +   V  ++ +L+    + K   +++ NPLF + +            +VV+VMM    T E  M EM+  
Subjt:  KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLFEHDS------------DVVTVMMTGTRTMEERMVEMQEH

Query:  IDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK +EE+D +IA LK Q++    +ES+QT V+K  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  IPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
        +P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM ELTNT QRKGE 
Subjt:  IPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVN
        V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA+R  +D L+   R +     +T       + ESM+V 
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVN

Query:  TTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKL
         T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE CFVLK+LI KL
Subjt:  TTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKL

Query:  AKEGKIELDLDEVAQSNLATIKGKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKEKKNLATSY
        A+E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF ++ H +E   +   +
Subjt:  AKEGKIELDLDEVAQSNLATIKGKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKEKKNLATSY

Query:  CIDV----------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKM
           +          EE+DNS + +QRTSVFD IKP TTR SVFQR+SMA  +EENQC   T  + SAF+RLS+S SKK RPST  FDRLK+T+DQ +R+M
Subjt:  CIDV----------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKM

Query:  DKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSL
          L+ KPF E N D K+ S +PSRMKRKLSV INTEGSL
Subjt:  DKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSL

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.1e-17247.88Show/hide
Query:  KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLF------------EHDSDVVTVMMTGTRTMEERMVEMQEH
        K +S  +  + SY+G        K ++QE +   V  ++ +L+    + K   I++ NPLF            E   +VV+VMM    T E  M EM+  
Subjt:  KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLF------------EHDSDVVTVMMTGTRTMEERMVEMQEH

Query:  IDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK  EE+D +IA LK Q++     ES+QT V+K  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  IPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
        +P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                                   GE 
Subjt:  IPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVV
        V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVV

Query:  NTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILK
          T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE CFVLK+LILK
Subjt:  NTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILK

Query:  LAKEGKIELDLDEVAQSN----------------------------------------------------------------------------------
        LA+E KIELD+DEVAQ+N                                                                                  
Subjt:  LAKEGKIELDLDEVAQSN----------------------------------------------------------------------------------

Query:  --LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPT
             I  K K +R K   K +P + + K F QP++ + L +   ++F +   + +       T+  ++V       EEVDNS + +QRT VF RIKP T
Subjt:  --LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPT

Query:  TRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEG
         R SVFQR+SMA  EEENQC  ST  R SAF+RLS+ST KK RPSTS FDRLK+ +DQ +R+M  L+ KPF E N D K+ S +PSRMKRKLSV INTEG
Subjt:  TRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEG

Query:  SL
        SL
Subjt:  SL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.0e-20059.37Show/hide
Query:  KASIVAGQETTLQGAYTNDKFIVKYNPLFEHDSDVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKT
        K  IV  +   +   Y++ K   +  P      ++++VM+T   T E RM E+++ ++ LMK +EE+D +IA LK  IE++  AES+    +KN DKGK 
Subjt:  KASIVAGQETTLQGAYTNDKFIVKYNPLFEHDSDVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKT

Query:  IVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFV
        ++Q+ QPQ STSIASLS+QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR+P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFV
Subjt:  IVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFV

Query:  RTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP
        RTLKGNA DWY DLEPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP
Subjt:  RTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP

Query:  RTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIP
        RTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ 
Subjt:  RTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIP

Query:  DMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK----
        DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + KD   LQ +R     
Subjt:  DMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK----

Query:  ----RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLS
            RS     P++++ +    + +  + +  N  +S     +EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS
Subjt:  ----RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLS

Query:  VSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSL
        +ST KK RPSTS FDRLK+T+DQ +R+M   + KPF E N D K+ S +PSRMKRKL V INTEGSL
Subjt:  VSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSL

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.0e-19651.96Show/hide
Query:  SYMGS--TAHCCFNKLALQEDKASIVAGQETTLQ-GAYTNDKFIVKYNP-LFEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIE
        SY+G     H C  ++   ED       +    Q         ++K NP + EH+S            ++++VM+T   T E RM E+++ ++ LMK +E
Subjt:  SYMGS--TAHCCFNKLALQEDKASIVAGQETTLQ-GAYTNDKFIVKYNP-LFEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIE

Query:  EKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKF
        E+D +IA LK  IE++  AES+    +KN DKGK ++Q+ QPQ STSIASLS+QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR+P GYQPPKF
Subjt:  EKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKG
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKG

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE CFVLK+LILKLA+E KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDL

Query:  DEVAQSN-----------------------------------LATIKGKSKHQRKKD---------------PKKLQ-----------------PKRKRS
        DEVAQ+N                                   + TI  ++K    KD               P  +Q                  K +R+
Subjt:  DEVAQSN-----------------------------------LATIKGKSKHQRKKD---------------PKKLQ-----------------PKRKRS

Query:  KK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNLATSYCIDVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATE
        KK             F Q ++ + L +   ++F   H +E   + T +   + EV+N+  S           QRTSVFDRIKP TTR SVFQR+SMA  E
Subjt:  KK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNLATSYCIDVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATE

Query:  EENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSLKFLL
        EENQC     TR S F+RLS+S SKK+RPSTS FDRLK+T+DQ +R+M  L+ KPF E N D K+ S +PSRMKRKL V INT+    F+L
Subjt:  EENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSLKFLL

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]4.0e-17262.33Show/hide
Query:  SFKTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLF-EHDS-----------DVVTVMMTGTRTMEERMVEMQ
        S K +S  +  + +Y G         +   +D+ S +A  ++ L+    + K   ++K NPL+ ++DS           DV++VMM     +E  M EM+
Subjt:  SFKTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLF-EHDS-----------DVVTVMMTGTRTMEERMVEMQ

Query:  EHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN
          I+ LMK ++E+D +IA LK Q++ +  AES+QT V+K  DKGK +VQ++QP Q STS+ASLS+QQLQDMITN IRAQYGGP+Q S +YSKPYTKRIDN
Subjt:  EHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDN

Query:  LRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKG
        LR+P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPES++SWE+LE+EFLNRFYSTRRTVSM ELTNTKQRKG
Subjt:  LRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKG

Query:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESMV
        E V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         + T +ESMV
Subjt:  ELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIEESMV

Query:  VNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLIL
        VNTT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE CFVLK+LIL
Subjt:  VNTTLPKSSSKGK-----RQTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLIL

Query:  KLAKEGKIELDLDEVAQSNLATI
        +LA+E +IELDL+EVAQ+N A +
Subjt:  KLAKEGKIELDLDEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein9.9e-17751.83Show/hide
Query:  KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLFEHDS------------DVVTVMMTGTRTMEERMVEMQEH
        K +S  + ++ SY+G        K ++QE +   V  ++ +L+    + K   +++ NPLF + +            +VV+VMM    T E  M EM+  
Subjt:  KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLFEHDS------------DVVTVMMTGTRTMEERMVEMQEH

Query:  IDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK +EE+D +IA LK Q++    +ES+QT V+K  DKGK +V+++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSKPYTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  IPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
        +P+GYQP KFQQFDGKGNPKQHI HFVETCENAG+RGD LV+QFVR+LKGNAF+                          TRR VSM ELTNT QRKGE 
Subjt:  IPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVN
        V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA+R  +D L+   R +     +T       + ESM+V 
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVN

Query:  TTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKL
         T  KS SK K   +  +H        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI H VE CFVLK+LI KL
Subjt:  TTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKL

Query:  AKEGKIELDLDEVAQSNLATIKGKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKEKKNLATSY
        A+E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF ++ H +E   +   +
Subjt:  AKEGKIELDLDEVAQSNLATIKGKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKKEKKNLATSY

Query:  CIDV----------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKM
           +          EE+DNS + +QRTSVFD IKP TTR SVFQR+SMA  +EENQC   T  + SAF+RLS+S SKK RPST  FDRLK+T+DQ +R+M
Subjt:  CIDV----------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKM

Query:  DKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSL
          L+ KPF E N D K+ S +PSRMKRKLSV INTEGSL
Subjt:  DKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSL

A0A5A7SUW1 Retrotransposon gag protein1.5e-17247.88Show/hide
Query:  KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLF------------EHDSDVVTVMMTGTRTMEERMVEMQEH
        K +S  +  + SY+G        K ++QE +   V  ++ +L+    + K   I++ NPLF            E   +VV+VMM    T E  M EM+  
Subjt:  KTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDK--FIVKYNPLF------------EHDSDVVTVMMTGTRTMEERMVEMQEH

Query:  IDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        I+ LMK  EE+D +IA LK Q++     ES+QT V+K  DKGK +VQ++QP Q S S+ASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR
Subjt:  IDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQP-QCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  IPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL
        +P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE                                   GE 
Subjt:  IPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVV
        V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRND--------EETIEESMVV

Query:  NTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILK
          T  KS SK K   +  ++        TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE CFVLK+LILK
Subjt:  NTTLPKSSSKGKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILK

Query:  LAKEGKIELDLDEVAQSN----------------------------------------------------------------------------------
        LA+E KIELD+DEVAQ+N                                                                                  
Subjt:  LAKEGKIELDLDEVAQSN----------------------------------------------------------------------------------

Query:  --LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPT
             I  K K +R K   K +P + + K F QP++ + L +   ++F +   + +       T+  ++V       EEVDNS + +QRT VF RIKP T
Subjt:  --LATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPT

Query:  TRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEG
         R SVFQR+SMA  EEENQC  ST  R SAF+RLS+ST KK RPSTS FDRLK+ +DQ +R+M  L+ KPF E N D K+ S +PSRMKRKLSV INTEG
Subjt:  TRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEG

Query:  SL
        SL
Subjt:  SL

A0A5A7TZU9 Ribonuclease H6.2e-17166.74Show/hide
Query:  IVKYNP-LFEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQ
        ++K NP + EH+S            ++++VM+T   T E+RM E+++ ++ LMKA+EE+D +IA LK  IE++  AES+ T  IKN +KGK I+Q+ QPQ
Subjt:  IVKYNP-LFEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQ

Query:  CSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAF
         STSIASLS+QQLQ+MI N I+ QYGGP Q   LYSKPYTKRIDN+R+P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAF
Subjt:  CSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAF

Query:  DWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELAT
        DWYTDLEPES+DSWE+LER+FLNRFYSTRR VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT
Subjt:  DWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELAT

Query:  RAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQ
        RAHDMELSIA+R N DLL+P +RKE +  + T        +E+MVV+TT    + K     KRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE Q
Subjt:  RAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQ

Query:  LIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        LI+LP+CKRP EM +V+DP YCKYHRVI HPVE CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Subjt:  LIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H1.9e-1037.84Show/hide
Query:  KLQPKRKRSKK--FSQPQQLVMLNKSFSK--NFHKKEKKNLATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMST
        +L P +K+ +K  +S P     +    S+      K K  +A +  I VEE  +S++ +    QR+SVFDRI     RPSVFQR+S +  ++ NQ    +
Subjt:  KLQPKRKRSKK--FSQPQQLVMLNKSFSK--NFHKKEKKNLATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMST

Query:  STRPSAFQRLSVSTSK----KSRPST--SVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSLK
        STR SAFQRL+ S  K       P+T  S F RL V+  + ++K           V  D++++S+ PSRMKRK+ V +NTEGSLK
Subjt:  STRPSAFQRLSVSTSK----KSRPST--SVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSLK

A0A5A7URH1 Ty3-gypsy retrotransposon protein9.8e-20159.37Show/hide
Query:  KASIVAGQETTLQGAYTNDKFIVKYNPLFEHDSDVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKT
        K  IV  +   +   Y++ K   +  P      ++++VM+T   T E RM E+++ ++ LMK +EE+D +IA LK  IE++  AES+    +KN DKGK 
Subjt:  KASIVAGQETTLQGAYTNDKFIVKYNPLFEHDSDVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKT

Query:  IVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFV
        ++Q+ QPQ STSIASLS+QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLR+P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFV
Subjt:  IVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFV

Query:  RTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP
        RTLKGNA DWY DLEPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKP
Subjt:  RTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKP

Query:  RTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIP
        RTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ 
Subjt:  RTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHHLTLKERQKKIYPFPDADIP

Query:  DMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK----
        DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + KD   LQ +R     
Subjt:  DMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRK----

Query:  ----RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLS
            RS     P++++ +    + +  + +  N  +S     +EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS
Subjt:  ----RSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLS

Query:  VSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSL
        +ST KK RPSTS FDRLK+T+DQ +R+M   + KPF E N D K+ S +PSRMKRKL V INTEGSL
Subjt:  VSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSL

A0A5D3BX77 Retrotransposon gag protein1.9e-19651.96Show/hide
Query:  SYMGS--TAHCCFNKLALQEDKASIVAGQETTLQ-GAYTNDKFIVKYNP-LFEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIE
        SY+G     H C  ++   ED       +    Q         ++K NP + EH+S            ++++VM+T   T E RM E+++ ++ LMK +E
Subjt:  SYMGS--TAHCCFNKLALQEDKASIVAGQETTLQ-GAYTNDKFIVKYNP-LFEHDS------------DVVTVMMTGTRTMEERMVEMQEHIDTLMKAIE

Query:  EKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKF
        E+D +IA LK  IE++  AES+    +KN DKGK ++Q+ QPQ STSIASLS+QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR+P GYQPPKF
Subjt:  EKDSQIAQLKCQIENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FLNRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRA

Query:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKG
        +SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ESMVV+ T  KS SK 
Subjt:  MSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKG

Query:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDL
        K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI HPVE CFVLK+LILKLA+E KIEL++
Subjt:  K-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDL

Query:  DEVAQSN-----------------------------------LATIKGKSKHQRKKD---------------PKKLQ-----------------PKRKRS
        DEVAQ+N                                   + TI  ++K    KD               P  +Q                  K +R+
Subjt:  DEVAQSN-----------------------------------LATIKGKSKHQRKKD---------------PKKLQ-----------------PKRKRS

Query:  KK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNLATSYCIDVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATE
        KK             F Q ++ + L +   ++F   H +E   + T +   + EV+N+  S           QRTSVFDRIKP TTR SVFQR+SMA  E
Subjt:  KK-------------FSQPQQLVMLNKSFSKNF---HKKEKKNLATSYCIDVEEVDNSKKS----------EQRTSVFDRIKPPTTRPSVFQRMSMAATE

Query:  EENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSLKFLL
        EENQC     TR S F+RLS+S SKK+RPSTS FDRLK+T+DQ +R+M  L+ KPF E N D K+ S +PSRMKRKL V INT+    F+L
Subjt:  EENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSIPSRMKRKLSVLINTEGSLKFLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTGCCCATTGTTGCTTCAATAAACTGGCGTTGCAAGAAGATAAAGCTTCTAT
CGTTGCAGGCCAAGAAACAACCTTGCAAGGGGCATATACTAATGACAAGTTTATTGTTAAGTATAACCCTTTGTTTGAACATGATTCTGATGTAGTGACTGTCATGATGA
CTGGGACTAGAACTATGGAAGAAAGAATGGTTGAGATGCAGGAGCACATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGCGCAACTGAAGTGCCAA
ATTGAGAACCAACATATCGCCGAATCAAATCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTACTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAATTCCAATCGGGTATCAGCCACCGAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAATATGAGAAAAGAAGG
AAGGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAGGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGA
AGGCAAAATTGAGCTTGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGA
GGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGC
ATCGACGTAGAAGAAGTTGACAATTCCAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTAT
GGCTGCGACAGAGGAAGAAAATCAATGTTTGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCAACATCTG
TTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAAATTGGAGGTAAAACCTTTCGATGAAGTAAACAGCGACAAGAAGCTTCAAAGTAGCATC
CCGTCACGTATGAAGAGGAAGTTATCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCC
TTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATG
ATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTTTCACGCCCTCC
GTTGTAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTCCAGTTCCTTCTCCCAAATTC
GAAGGTTCTCATGCGCTTCGCGCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGC
TCTGCAATTCTTTCCCCCAAGTTCGAAGGTTCTCAGGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGCTGCATTTCCTTCCCCCCAAATTC
GAAGGTTCTCACGCGCTTCGCTGCGGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCAAACGCTTCGCTG
TCGTTCCTTCCTTACAGTTCGTAGGTTCTCACACGCTTCGTTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCACGCGTTTCACTGCAGT
TCCTTCCTCCAATTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGATCCTTCCCCCAAGTTCAAAGGTTCTCACGGGCATCGCCACAGTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCAC
GCGCATCGCCATAGTTCCTTCCTCCGCGCATCGCCATAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACACGCTTCGCCACAGTTCCTTCCTCCCACGCGCATCGCCAC
AGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAG
TTCCTTCCTCCAAGTTCAAAGGTTCTCTCGCCACAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGT
TCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCATCCAAGTTCGAAGGTTCTCTCA
CGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCATAGTTCCT
TCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCACGCGCACTTCCTCCAAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACG
CGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCGCGCTTCGCCACAGTTCACGCGCACTTCCTCC
AAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCG
AAGGTTGTTCTCACGTGCATCGCCTCCTCCAAGTTTGAAAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTCAAAGCTCTCAAGCAGAACCAAAGAATTCAG
AGAGACTCCACCAAGTCTGAAGACCGAAAACTCTCTGCAATCCATAAGTTCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGA
TCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACC
CAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGGCTGAAAACTCCTTCAAGACTAGAAGACTTCAAGCTCCAAG
AATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCAAGACTTCTTCAATGGTCGCTGTCATGAACAAGTCTTACATGGGTTCTACTGCCCATTGTTGCTTCAATAAACTGGCGTTGCAAGAAGATAAAGCTTCTAT
CGTTGCAGGCCAAGAAACAACCTTGCAAGGGGCATATACTAATGACAAGTTTATTGTTAAGTATAACCCTTTGTTTGAACATGATTCTGATGTAGTGACTGTCATGATGA
CTGGGACTAGAACTATGGAAGAAAGAATGGTTGAGATGCAGGAGCACATCGACACCTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGCGCAACTGAAGTGCCAA
ATTGAGAACCAACATATCGCCGAATCAAATCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTACTTCGATCGC
TTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAATTCCAATCGGGTATCAGCCACCGAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGGTACTCGAGGGGACCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGA
ACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAA
ATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGT
ATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAATATGAGAAAAGAAGG
AAGGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAA
AGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAA
GAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAGGATGTTTCGTCCTAAAGGACTTAATTTTAAAGCTGGCTAAGGA
AGGCAAAATTGAGCTTGACCTTGATGAAGTAGCCCAATCAAATCTTGCTACAATCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGA
GGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGC
ATCGACGTAGAAGAAGTTGACAATTCCAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTAT
GGCTGCGACAGAGGAAGAAAATCAATGTTTGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCAACATCTG
TTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAAATTGGAGGTAAAACCTTTCGATGAAGTAAACAGCGACAAGAAGCTTCAAAGTAGCATC
CCGTCACGTATGAAGAGGAAGTTATCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCC
TTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAATG
ATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTTTCACGCCCTCC
GTTGTAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACACACTTCGCTCCAGTTCCTTCTCCCAAATTC
GAAGGTTCTCATGCGCTTCGCGCTGCAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGC
TCTGCAATTCTTTCCCCCAAGTTCGAAGGTTCTCAGGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGCTGCATTTCCTTCCCCCCAAATTC
GAAGGTTCTCACGCGCTTCGCTGCGGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCAAACGCTTCGCTG
TCGTTCCTTCCTTACAGTTCGTAGGTTCTCACACGCTTCGTTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCGCTCACGCGTTTCACTGCAGT
TCCTTCCTCCAATTTTGAAGGTTCTCACATCGCTTCGCTTCGCTCACGCGCTTCGCTGCGATCCTTCCCCCAAGTTCAAAGGTTCTCACGGGCATCGCCACAGTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATTGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCAC
GCGCATCGCCATAGTTCCTTCCTCCGCGCATCGCCATAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACACGCTTCGCCACAGTTCCTTCCTCCCACGCGCATCGCCAC
AGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAG
TTCCTTCCTCCAAGTTCAAAGGTTCTCTCGCCACAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGT
TCCTTCCTCCAAGTTCGAAGGTTCTCTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCATCCAAGTTCGAAGGTTCTCTCA
CGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCATCGCCATAGTTCCT
TCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCCACAGTTCACGCGCACTTCCTCCAAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACG
CGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCGCGCTTCGCCACAGTTCACGCGCACTTCCTCC
AAGTTCGAAGGATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTGTTTTCCTCCAAGTTCGAAGGTTGTTCTCACGTGCATCGCGCGACAGTTCCTTCCTCCAAGTTCG
AAGGTTGTTCTCACGTGCATCGCCTCCTCCAAGTTTGAAAGAAATTCTACACTCTCACAAAGACAAGAGTTCAGAGTTTCAAAGCTCTCAAGCAGAACCAAAGAATTCAG
AGAGACTCCACCAAGTCTGAAGACCGAAAACTCTCTGCAATCCATAAGTTCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGA
TCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACCCAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACATTTCTTGAAGACC
CAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGGCTGAAAACTCCTTCAAGACTAGAAGACTTCAAGCTCCAAG
AATCCATTGA
Protein sequenceShow/hide protein sequence
MSFKTSSMVAVMNKSYMGSTAHCCFNKLALQEDKASIVAGQETTLQGAYTNDKFIVKYNPLFEHDSDVVTVMMTGTRTMEERMVEMQEHIDTLMKAIEEKDSQIAQLKCQ
IENQHIAESNQTQVIKNHDKGKTIVQDDQPQCSTSIASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRIPIGYQPPKFQQFDGKGNPKQHIAHFVETCEN
AGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKG
IKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKGKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPE
EMEKVDDPKYCKYHRVIGHPVEGCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNLATSYC
IDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTSVFDRLKVTSDQPKRKMDKLEVKPFDEVNSDKKLQSSI
PSRMKRKLSVLINTEGSLKFLLSKFEGPYTVRYCVVPSPSSKVLRCILLRCSFSKFEGSQLYNCYVVPPPSANDLMWCVVALFPLLSSSMVLTQLCWSFFSPSSKVFTPS
VVVPSFQGRRFSLAALQFFLPKFEGSHTSLQFLLPNSKVLMRFALQFLPSKFEGSHIASLRSFLQVRRFSRASLCNSFPQVRRFSGASCSSFLQIRRFSRTSLHFLPPKF
EGSHALRCGSFPPSSKVLTHFAAVPSSKFEGSQTLRCRSFLTVRRFSHASLQFLPPKFEGSHVASLRSRVSLQFLPPILKVLTSLRFAHALRCDPSPKFKGSHGHRHSSF
LQVRRFSRASPQFLPPSSKVLTRIATVPSSKFEGSLTRIAIVPSSAHRHSSFLQVRRFSHTLRHSSFLPRASPQFLPPSSKVLSPQFLPPSSKVLSPQFLPPSSKVLSPQ
FLPPSSKVLSPQFLPSKFEGSLATVPSSKFEGSLATVPSSKFEGSLATVPSSKFEGSLTRFATVPSSKFEGSLTRFATVPSSKFEGSLTRFATVPSSKFEGSLTRIAIVP
SSKFEGSHALRHSSRALPPSSKDRHSSFLQVRRLFSRASPQFLPPSSKVVLTCIARQFLPPSSRFATVHAHFLQVRRIATVPSSKFEGCFPPSSKVVLTCIARQFLPPSS
KVVLTCIASSKFERNSTLSQRQEFRVSKLSSRTKEFRETPPSLKTENSLQSISSSVEHFLKTKHSSRLQHSLKIKDSSGHQHFLKTQHSSRLQHSLKIKDSSGHQHFLKT
QHSSRLQHSLKIKDSSGHQHFLKAENSFKTRRLQAPRIH