; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000618 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000618
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr4:11526890..11533881
RNA-Seq ExpressionLag0000618
SyntenyLag0000618
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]3.6e-13459.68Show/hide
Query:  PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIR
        P  +I++VM+T+  T E+RM  +++ +N LMKA+EE+D +IA LK  IE++  AESS T  IKN +KGKAI+Q+ QPQ STSIASL++QQLQ+MI N I+
Subjt:  PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDN+RMP GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+DSWE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFL

Query:  NRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFYSTRRIVSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +
Subjt:  NRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGKNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK----------------
        RKE K  + T        +E+MVV+TT    + K     KRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+                
Subjt:  RKEGKNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK----------------

Query:  ---------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATI
                 +E+CFVLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  ---------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATI

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]9.5e-0234.04Show/hide
Query:  RKNDPKKLQPKRKRKKENLVTSYCINEEEVDN---SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTS
        + ++P ++  K K K  N  T +   EE  D+    K   QR+SVFDRI     RPSVF R+S    ++ NQ S  +STR SAFQRL+ SA K    S +
Subjt:  RKNDPKKLQPKRKRKKENLVTSYCINEEEVDN---SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTS

Query:  VFDRLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKS
           R     +    +   + K+S+S   K        E++S
Subjt:  VFDRLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKS

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]6.1e-13454.32Show/hide
Query:  KTSSMTAVKNKSYMGSTAHCCFNELRFQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDIVTVMMTETRTMEERMTGMQEH
        K +S ++  + +Y G         +   +D+ S IA  ++ L+    + K   ++K NPL+            E   D+++VMM +   +E  M  M+  
Subjt:  KTSSMTAVKNKSYMGSTAHCCFNELRFQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDIVTVMMTETRTMEERMTGMQEH

Query:  INNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQP-QCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        IN LMK ++E+D +IA LK Q++ +  AESSQT V+K  DKGK +VQ++QP Q STS+ASL++QQLQDMITN IRAQYGGP+Q S +YSKPYTKRIDNLR
Subjt:  INNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQP-QCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  MPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGEL
        MP+GYQPPKFQQFDGKGNPKQH+AHFVETCENA                          EPES++SWE+LE+EFLNRFYSTRR VSM ELTNTKQRKGE 
Subjt:  MPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKN-------DEETIEESMVVN
        V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ K         + T +ESMVVN
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKN-------DEETIEESMVVN

Query:  TTLPKSSSKGK-----RQTNGA--HNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKL
        TT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+                         +E+CFVLK+LIL+L
Subjt:  TTLPKSSSKGK-----RQTNGA--HNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKL

Query:  AMERKIELDLDEVAQSNLATI
        A E++IELDL+EVAQ+N A +
Subjt:  AMERKIELDLDEVAQSNLATI

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.1e-14652.28Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKG
        K  I+  E   +   Y++ K      P  E  P  +I++VM+T   T E RM  +++ +N LMK +EE+D +IA LK  IE++  AESS    +KN DKG
Subjt:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKG

Query:  KAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------
        KA++Q+ QPQ STSIASL++QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLRMP GYQPPKFQQFDGKGNPKQH+AHF++TCE A          
Subjt:  KAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------

Query:  ----------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
                        EPES+D+WE+LER+FLNRFYSTR IVSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  ----------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHNLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHNLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKNKHQRKNDPKKLQPKRK--
        + DMLEQLLE QLI+LP+                         +E+CFVLK+LILKLA E+KIELD+DEVAQ+N A     N  + K D   LQ +R   
Subjt:  IPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKNKHQRKNDPKKLQPKRK--

Query:  ------------RKKENLVTSYCI------------NEEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSAS
                      +E L  + C             + +EV+NS +  QRTSVFDRIKP TTR SVF R+S+   EEENQC     TR S  +RLS+S  
Subjt:  ------------RKKENLVTSYCI------------NEEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSAS

Query:  KKSRSSTSVFDRLK
        KK R STS FDRLK
Subjt:  KKSRSSTSVFDRLK

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.5e-14044.92Show/hide
Query:  TSSMTAVKNKSYMGS--TAHCCFNELRFQEDKASIIASEETTLQ-GAYTNDKFLVKYNPLFE-------------PDSDIVTVMMTETRTMEERMTGMQE
        TS     K  SY+G     H C  E++  ED      ++    Q         ++K NP  +             P  +I++VM+T   T E RM  +++
Subjt:  TSSMTAVKNKSYMGS--TAHCCFNELRFQEDKASIIASEETTLQ-GAYTNDKFLVKYNPLFE-------------PDSDIVTVMMTETRTMEERMTGMQE

Query:  HINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
         +N LMK +EE+D +IA LK  IE++  AESS    +KN DKGKA++Q+ QPQ STSIASL++QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR
Subjt:  HINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  MPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGEL
        MP GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+D+WE+LER+FLNRFYSTRRIVSM ELTNT+Q+KGEL
Subjt:  MPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVN
        V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ESMVV+
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVN

Query:  TTLPKSSSKGK-----RQTNG--AHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKL
         T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPK                         +E+CFVLK+LILKL
Subjt:  TTLPKSSSKGK-----RQTNG--AHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKL

Query:  AMERKIELDLDEVAQSN-----------------------------------LATIKEKNK---------------HQRKNDPKKLQPK-----------
        A E+KIEL++DEVAQ+N                                   + TI  +NK                Q+   P  +Q +           
Subjt:  AMERKIELDLDEVAQSN-----------------------------------LATIKEKNK---------------HQRKNDPKKLQPK-----------

Query:  -------RKRKK-------------------------------------ENLVTSYCI------------NEEEVDNSKKSEQRTSVFDRIKPPTTRPSV
               R+ KK                                     E L  + C             + +EV+N  +  QRTSVFDRIKP TTR SV
Subjt:  -------RKRKK-------------------------------------ENLVTSYCI------------NEEEVDNSKKSEQRTSVFDRIKPPTTRPSV

Query:  FHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTSVFDRLK
        F R+SM   EEENQC     TR S F+RLS+SASKK+R STS FDRLK
Subjt:  FHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTSVFDRLK

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]2.8e-0139Show/hide
Query:  KRKKENLVTSYCINEEEVDNSKKSE---QRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKK-------SRSSTSVFDRL
        K+ KE +V    I  EE DN+   E   QR SVFDRI+P   RP VF R+SM   E E   S+ +  R S F+RL+ +  K+       + +  S F+RL
Subjt:  KRKKENLVTSYCINEEEVDNSKKSE---QRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKK-------SRSSTSVFDRL

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]3.6e-0139Show/hide
Query:  KRKKENLVTSYCINEEEVDNSKKSE---QRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKK-------SRSSTSVFDRL
        K+ KE +V    I  EE DN+   E   QR SVFDRI+P   RP VF R+SM   E E   S+    R S F+RL+ +  K+       + +  S F+RL
Subjt:  KRKKENLVTSYCINEEEVDNSKKSE---QRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKK-------SRSSTSVFDRL

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]6.1e-13454.32Show/hide
Query:  KTSSMTAVKNKSYMGSTAHCCFNELRFQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDIVTVMMTETRTMEERMTGMQEH
        K +S ++  + +Y G         +   +D+ S IA  ++ L+    + K   ++K NPL+            E   D+++VMM +   +E  M  M+  
Subjt:  KTSSMTAVKNKSYMGSTAHCCFNELRFQEDKASIIASEETTLQGAYTNDK--FLVKYNPLF------------EPDSDIVTVMMTETRTMEERMTGMQEH

Query:  INNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQP-QCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
        IN LMK ++E+D +IA LK Q++ +  AESSQT V+K  DKGK +VQ++QP Q STS+ASL++QQLQDMITN IRAQYGGP+Q S +YSKPYTKRIDNLR
Subjt:  INNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQP-QCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  MPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGEL
        MP+GYQPPKFQQFDGKGNPKQH+AHFVETCENA                          EPES++SWE+LE+EFLNRFYSTRR VSM ELTNTKQRKGE 
Subjt:  MPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKN-------DEETIEESMVVN
        V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ K         + T +ESMVVN
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKN-------DEETIEESMVVN

Query:  TTLPKSSSKGK-----RQTNGA--HNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKL
        TT P   SKGK     ++ +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+                         +E+CFVLK+LIL+L
Subjt:  TTLPKSSSKGK-----RQTNGA--HNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKL

Query:  AMERKIELDLDEVAQSNLATI
        A E++IELDL+EVAQ+N A +
Subjt:  AMERKIELDLDEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H1.7e-13459.68Show/hide
Query:  PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIR
        P  +I++VM+T+  T E+RM  +++ +N LMKA+EE+D +IA LK  IE++  AESS T  IKN +KGKAI+Q+ QPQ STSIASL++QQLQ+MI N I+
Subjt:  PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFL
         QYGGP Q   LYSKPYTKRIDN+RMP GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+DSWE+LER+FL
Subjt:  AQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFL

Query:  NRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFYSTRRIVSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +
Subjt:  NRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGKNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK----------------
        RKE K  + T        +E+MVV+TT    + K     KRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+                
Subjt:  RKEGKNDEET-------IEESMVVNTT----LPKSSSKGKRQTNG-AHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK----------------

Query:  ---------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATI
                 +E+CFVLK+LILKLA+++KIEL+LD+VAQ+N A +
Subjt:  ---------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H4.6e-0234.04Show/hide
Query:  RKNDPKKLQPKRKRKKENLVTSYCINEEEVDN---SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTS
        + ++P ++  K K K  N  T +   EE  D+    K   QR+SVFDRI     RPSVF R+S    ++ NQ S  +STR SAFQRL+ SA K    S +
Subjt:  RKNDPKKLQPKRKRKKENLVTSYCINEEEVDN---SKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTS

Query:  VFDRLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKS
           R     +    +   + K+S+S   K        E++S
Subjt:  VFDRLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKS

A0A5A7TZU9 Ribonuclease H1.8e-13155.74Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKG
        K  I+  E   +   Y++ K      P  E  P S+I++VM+T   T E RM  +++ +N LMK +EE+D +IA LK  IE++  AESS    +KN DKG
Subjt:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKG

Query:  KAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------
        KA++Q+ QPQ STSIASL++QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLRM  GYQPPKFQQFDGKGNPKQH+AHF+ETCE A          
Subjt:  KAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------

Query:  ----------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
                        EPES+D+WE+LER+FLNRFYSTRRIVSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  ----------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVNTTLPKSSSK-------GKRQTNGAHNLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSI +RE +D L+P  R +    ++T       I+ESMVV+ T  KS SK        K   N     TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVNTTLPKSSSK-------GKRQTNGAHNLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATIK
        + DMLEQLLE QLI+LP+                         +E+CFVLK+LILKLA E+KIELD+DEVAQ+N   I+
Subjt:  IPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATIK

A0A5A7URH1 Ty3-gypsy retrotransposon protein4.4e-14652.28Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKG
        K  I+  E   +   Y++ K      P  E  P  +I++VM+T   T E RM  +++ +N LMK +EE+D +IA LK  IE++  AESS    +KN DKG
Subjt:  KASIIASEETTLQGAYTNDKFLVKYNPLFE--PDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKG

Query:  KAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------
        KA++Q+ QPQ STSIASL++QQLQ+MI + I+ QYGGP Q   LY KPYTKRIDNLRMP GYQPPKFQQFDGKGNPKQH+AHF++TCE A          
Subjt:  KAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------

Query:  ----------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
                        EPES+D+WE+LER+FLNRFYSTR IVSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  ----------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHNLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVNTTLPKSSSKGK-----RQTNG--AHNLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKNKHQRKNDPKKLQPKRK--
        + DMLEQLLE QLI+LP+                         +E+CFVLK+LILKLA E+KIELD+DEVAQ+N A     N  + K D   LQ +R   
Subjt:  IPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATIKEKNKHQRKNDPKKLQPKRK--

Query:  ------------RKKENLVTSYCI------------NEEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSAS
                      +E L  + C             + +EV+NS +  QRTSVFDRIKP TTR SVF R+S+   EEENQC     TR S  +RLS+S  
Subjt:  ------------RKKENLVTSYCI------------NEEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSAS

Query:  KKSRSSTSVFDRLK
        KK R STS FDRLK
Subjt:  KKSRSSTSVFDRLK

A0A5A7UXF0 Ty3-gypsy retrotransposon protein2.3e-13155.74Show/hide
Query:  KASIIASEETTLQGAYTNDKFLVKYNPLFEPDS--DIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKG
        K  I+  E   +   Y++ K      P  E  S  +I++VM+T   T E RM  +++ +N LMK +EE+D +IA LK  IE++  AESS   ++KN DKG
Subjt:  KASIIASEETTLQGAYTNDKFLVKYNPLFEPDS--DIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKG

Query:  KAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------
        KA++Q+ QPQ STSIASL++QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLRMP GYQPPKFQQFDGKGNPKQH+AHF+ETCE A          
Subjt:  KAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA----------

Query:  ----------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI
                        EPES+D+WE+LER+FLNRFYSTRRIVSM ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GI
Subjt:  ----------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGI

Query:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVNTTLPKSSSK-------GKRQTNGAHNLTLKERQKKIYPFPDAD
        KPRTFEELATRAHDMELSIA+R  +D L+P  R +    ++T       I+ESMVV+ T  KS SK        K   +     TLKERQ+K+YPFPD+D
Subjt:  KPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVNTTLPKSSSK-------GKRQTNGAHNLTLKERQKKIYPFPDAD

Query:  IPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATIK
        + DMLEQLLE QLI+LP+                         +E+CFVLK+LILKLA E+KIELD+DEVAQ+N   I+
Subjt:  IPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKLAMERKIELDLDEVAQSNLATIK

A0A5D3BX77 Retrotransposon gag protein7.3e-14144.92Show/hide
Query:  TSSMTAVKNKSYMGS--TAHCCFNELRFQEDKASIIASEETTLQ-GAYTNDKFLVKYNPLFE-------------PDSDIVTVMMTETRTMEERMTGMQE
        TS     K  SY+G     H C  E++  ED      ++    Q         ++K NP  +             P  +I++VM+T   T E RM  +++
Subjt:  TSSMTAVKNKSYMGS--TAHCCFNELRFQEDKASIIASEETTLQ-GAYTNDKFLVKYNPLFE-------------PDSDIVTVMMTETRTMEERMTGMQE

Query:  HINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR
         +N LMK +EE+D +IA LK  IE++  AESS    +KN DKGKA++Q+ QPQ STSIASL++QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLR
Subjt:  HINNLMKAIEEKDSQIAQLKCQIENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLR

Query:  MPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGEL
        MP GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+D+WE+LER+FLNRFYSTRRIVSM ELTNT+Q+KGEL
Subjt:  MPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGEL

Query:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVN
        V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     +T       I+ESMVV+
Subjt:  VVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGKNDEET-------IEESMVVN

Query:  TTLPKSSSKGK-----RQTNG--AHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKL
         T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPK                         +E+CFVLK+LILKL
Subjt:  TTLPKSSSKGK-----RQTNG--AHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPK-------------------------LERCFVLKDLILKL

Query:  AMERKIELDLDEVAQSN-----------------------------------LATIKEKNK---------------HQRKNDPKKLQPK-----------
        A E+KIEL++DEVAQ+N                                   + TI  +NK                Q+   P  +Q +           
Subjt:  AMERKIELDLDEVAQSN-----------------------------------LATIKEKNK---------------HQRKNDPKKLQPK-----------

Query:  -------RKRKK-------------------------------------ENLVTSYCI------------NEEEVDNSKKSEQRTSVFDRIKPPTTRPSV
               R+ KK                                     E L  + C             + +EV+N  +  QRTSVFDRIKP TTR SV
Subjt:  -------RKRKK-------------------------------------ENLVTSYCI------------NEEEVDNSKKSEQRTSVFDRIKPPTTRPSV

Query:  FHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTSVFDRLK
        F R+SM   EEENQC     TR S F+RLS+SASKK+R STS FDRLK
Subjt:  FHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTSVFDRLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACATTCAAGACTTCTTCGATGACCGCTGTCAAGAACAAGTCTTACATGGGTTCAACTGCCCATTGTTGCTTCAATGAACTGAGGTTCCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACATAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGGGATGCAAGAGCACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAAGTGCCAA
ATTGAGAACCAACATATCGCTGAATCAAGTCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGGCTATAGTGCAAGATGATCAGCCACAGTGTTCTACTTCGATCGC
TTCACTAACCATCCAACAGCTCCAAGATATGATCACAAATTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAATGCCAATCGGATATCAGCCACCAAAATTTCAGCAGTTTGATGGAAAGGGCAATCCTAAACAACACATTGCCCACTTTGTTGAGACATGCGAGAAC
GCTGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAATCGTTAGCATGTTCGAGCTCACCAACACTAAACA
ACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCA
TGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGCCGAGAAAACCAA
GACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAAGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCG
ACAAACAAATGGAGCGCATAACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAAC
TGATAGAGCTTCCTAAGTTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAACGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAAT
CTTGCTACAATCAAAGAAAAGAACAAACATCAAAGAAAGAATGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAAAAAAGAGAACCTTGTGACTTCCTACTGCATCAA
CGAAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGTCG
CGACAGAGGAAGAAAATCAATGTTCGATATCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCGCATCGAAGAAAAGTCGATCTTCAACATCTGTCTTT
GATCGCCTCAAAGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCACGATGCTCCTAGCCCACACGAGCT
TAAAAGCCTCAAGTTCGGTGTTTTCATTCACCCTAACTGCGTTGTTCCTTCTCCAAGTTTGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCTCCAAGTTCGAAAGTT
CTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTATGTCACTGCAGTTCCTTCCTCCAAGTTCGAAGATTCTCATGTACGTTGCTTCGCTGCAA
TTCCTTCTCTCCCCAAATTCGATTCGAAGGTTCTCATGTCGCTACGCTGCAGTTCTTTCTCAAGTCCGAAGGTTCTCACGGTTCTCACATCGCTTCGCTGCAGTTCCTCC
CCCAAGTTCGAAGGATCTCTCACGTCGCTTCGCTGGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTTCTCTCACGTCGCTTCAACAAAGTTCCTTCTCTCCAAGTTTGAAG
GTTCTCACGTCGCATCGCTAGCAGTTCTTTCTCTCCAAGTTCGAAGGTTCTCACGTCGCATCGCTAGCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGC
TGCAGTTTCCTTCCTCCAAGTCCGAAGGCTTCCCCAAGTCGAGTCGAAGGCTCACACGTTGCTGTTTGTTTGCACGTGTTGTTGTTATGCTGCTGTTCCTTCTCCAAAGT
TCGAAGGTTCCCACGTTGTGCGTTGTCCGCTTCGCTGCAGTTCCTTCTCTCCAAGTTTGAAGGTTCTCGGCTTTCGCTCTTGCGTTGTTCCTTCTCCAAGTCCAAAGGTT
TGCACGTGTTGCACGTGTTGAAGGTTCCCACGTTGTTCCCCACGTTGCGCGCGGTTGTGTTGCTTCCCTTCACCAAGTTCGAAGAAGGTTCTCACGCTGGCTGCGTTGTT
TCGTTGTTCCTTCTCCAAAGCTTGAACGTCCTCATGCACTCTGCTACTGTTCCTTCCTATCTCACGTGTTTCTCATGTTCTTACGCTGCAGTTCCTTCCCCACAAGTCCG
AGCTCCGCTCCTTCCTTCCTCCAAGGTGGTGAAATCACTGCAAGTGAAGCTGTTGACGACCGTGGTGACAACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTA
GACAAGTGGTGAAATCACTGCAAGAGAAGTTGATGGCGACCGTGGTGACCACCCCTGCAGGAAACTACAATCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTG
CAAGAGAAAGCTGATGACGACCGTGGTGACCACTCATATAGGAAACTACAACAGGTGGTGAAATCACTGCAGGTGAAAGCTGATGACGACCGTGGTGACCACCCCTGCAG
GAAACTACATGATCCATCAGATAACATCAGCCAAGTTCAATCAGATTATGCCACGTGGCAATTGGACCCAAGCCCAAGCCCATATGAGAGTCCAGAGTCATCAGAAGTCA
GAGAGTCTAGAGAATTAGAAGATCCTAGATTCAGAATTCAACCAACTCAAGACTCAGAAGTAGGCCGATCATCCAAGGAGATCAATAAGCCAATCGACCGATCAAGAAGA
TCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACTGA
TCGAACAGATCATCAAACCAACAGGCCGATCAAGAAGATCAACAACAAGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCG
ATCCAAGAGATCATCAAGTCAGCAGGCTGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCCGATCCAATAGATCATCAAGT
CAGCAAGTCGGTCATCCAAGAGGATCAACAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACATTCAAGACTTCTTCGATGACCGCTGTCAAGAACAAGTCTTACATGGGTTCAACTGCCCATTGTTGCTTCAATGAACTGAGGTTCCAAGAAGATAAAGCTTCTAT
CATTGCAAGCGAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACATAGTGACTGTCATGATGA
CTGAGACAAGAACTATGGAAGAAAGAATGACTGGGATGCAAGAGCACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAAGTGCCAA
ATTGAGAACCAACATATCGCTGAATCAAGTCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGGCTATAGTGCAAGATGATCAGCCACAGTGTTCTACTTCGATCGC
TTCACTAACCATCCAACAGCTCCAAGATATGATCACAAATTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAATGCCAATCGGATATCAGCCACCAAAATTTCAGCAGTTTGATGGAAAGGGCAATCCTAAACAACACATTGCCCACTTTGTTGAGACATGCGAGAAC
GCTGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAATCGTTAGCATGTTCGAGCTCACCAACACTAAACA
ACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCA
TGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGCCGAGAAAACCAA
GACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAAGAACGACGAAGAGACTATAGAAGAATCTATGGTTGTCAACACAACCCTTCCCAAGTCGTCTTCGAAAGGAAAGCG
ACAAACAAATGGAGCGCATAACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAAC
TGATAGAGCTTCCTAAGTTGGAAAGATGTTTCGTCCTAAAGGACTTAATTCTAAAGCTGGCTATGGAACGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAAT
CTTGCTACAATCAAAGAAAAGAACAAACATCAAAGAAAGAATGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAAAAAAGAGAACCTTGTGACTTCCTACTGCATCAA
CGAAGAAGAAGTTGACAATTCTAAGAAGAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGTCG
CGACAGAGGAAGAAAATCAATGTTCGATATCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCGCATCGAAGAAAAGTCGATCTTCAACATCTGTCTTT
GATCGCCTCAAAGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCACGATGCTCCTAGCCCACACGAGCT
TAAAAGCCTCAAGTTCGGTGTTTTCATTCACCCTAACTGCGTTGTTCCTTCTCCAAGTTTGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCTCCAAGTTCGAAAGTT
CTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTATGTCACTGCAGTTCCTTCCTCCAAGTTCGAAGATTCTCATGTACGTTGCTTCGCTGCAA
TTCCTTCTCTCCCCAAATTCGATTCGAAGGTTCTCATGTCGCTACGCTGCAGTTCTTTCTCAAGTCCGAAGGTTCTCACGGTTCTCACATCGCTTCGCTGCAGTTCCTCC
CCCAAGTTCGAAGGATCTCTCACGTCGCTTCGCTGGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTTCTCTCACGTCGCTTCAACAAAGTTCCTTCTCTCCAAGTTTGAAG
GTTCTCACGTCGCATCGCTAGCAGTTCTTTCTCTCCAAGTTCGAAGGTTCTCACGTCGCATCGCTAGCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGC
TGCAGTTTCCTTCCTCCAAGTCCGAAGGCTTCCCCAAGTCGAGTCGAAGGCTCACACGTTGCTGTTTGTTTGCACGTGTTGTTGTTATGCTGCTGTTCCTTCTCCAAAGT
TCGAAGGTTCCCACGTTGTGCGTTGTCCGCTTCGCTGCAGTTCCTTCTCTCCAAGTTTGAAGGTTCTCGGCTTTCGCTCTTGCGTTGTTCCTTCTCCAAGTCCAAAGGTT
TGCACGTGTTGCACGTGTTGAAGGTTCCCACGTTGTTCCCCACGTTGCGCGCGGTTGTGTTGCTTCCCTTCACCAAGTTCGAAGAAGGTTCTCACGCTGGCTGCGTTGTT
TCGTTGTTCCTTCTCCAAAGCTTGAACGTCCTCATGCACTCTGCTACTGTTCCTTCCTATCTCACGTGTTTCTCATGTTCTTACGCTGCAGTTCCTTCCCCACAAGTCCG
AGCTCCGCTCCTTCCTTCCTCCAAGGTGGTGAAATCACTGCAAGTGAAGCTGTTGACGACCGTGGTGACAACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTA
GACAAGTGGTGAAATCACTGCAAGAGAAGTTGATGGCGACCGTGGTGACCACCCCTGCAGGAAACTACAATCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTG
CAAGAGAAAGCTGATGACGACCGTGGTGACCACTCATATAGGAAACTACAACAGGTGGTGAAATCACTGCAGGTGAAAGCTGATGACGACCGTGGTGACCACCCCTGCAG
GAAACTACATGATCCATCAGATAACATCAGCCAAGTTCAATCAGATTATGCCACGTGGCAATTGGACCCAAGCCCAAGCCCATATGAGAGTCCAGAGTCATCAGAAGTCA
GAGAGTCTAGAGAATTAGAAGATCCTAGATTCAGAATTCAACCAACTCAAGACTCAGAAGTAGGCCGATCATCCAAGGAGATCAATAAGCCAATCGACCGATCAAGAAGA
TCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACTGA
TCGAACAGATCATCAAACCAACAGGCCGATCAAGAAGATCAACAACAAGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCG
ATCCAAGAGATCATCAAGTCAGCAGGCTGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCCGATCCAATAGATCATCAAGT
CAGCAAGTCGGTCATCCAAGAGGATCAACAAGCTAA
Protein sequenceShow/hide protein sequence
MTFKTSSMTAVKNKSYMGSTAHCCFNELRFQEDKASIIASEETTLQGAYTNDKFLVKYNPLFEPDSDIVTVMMTETRTMEERMTGMQEHINNLMKAIEEKDSQIAQLKCQ
IENQHIAESSQTQVIKNHDKGKAIVQDDQPQCSTSIASLTIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGKGNPKQHIAHFVETCEN
AEPESVDSWEELEREFLNRFYSTRRIVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQ
DLLLPNMRKEGKNDEETIEESMVVNTTLPKSSSKGKRQTNGAHNLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKLERCFVLKDLILKLAMERKIELDLDEVAQSN
LATIKEKNKHQRKNDPKKLQPKRKRKKENLVTSYCINEEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMVATEEENQCSISTSTRPSAFQRLSVSASKKSRSSTSVF
DRLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSLKFGVFIHPNCVVPSPSLKVLTRFAVVPSSKFESSHALRCSSFLQVRRFSRMSLQFLPPSSKILMYVASLQ
FLLSPNSIRRFSCRYAAVLSQVRRFSRFSHRFAAVPPPSSKDLSRRFAGSSFLQVRRFFSHVASTKFLLSKFEGSHVASLAVLSLQVRRFSRRIASSSFSPSSKVLTLLR
CSFLPPSPKASPSRVEGSHVAVCLHVLLLCCCSFSKVRRFPRCALSASLQFLLSKFEGSRLSLLRCSFSKSKGLHVLHVLKVPTLFPTLRAVVLLPFTKFEEGSHAGCVV
SLFLLQSLNVLMHSATVPSYLTCFSCSYAAVPSPQVRAPLLPSSKVVKSLQVKLLTTVVTTPAGNYSHQSDWSRQVVKSLQEKLMATVVTTPAGNYNHQSDWSRQVVKSL
QEKADDDRGDHSYRKLQQVVKSLQVKADDDRGDHPCRKLHDPSDNISQVQSDYATWQLDPSPSPYESPESSEVRESRELEDPRFRIQPTQDSEVGRSSKEINKPIDRSRR
STSQQADHPRDQQANRPIKKINKSAGRSSKKINKPTDRTDHQTNRPIKKINNKPIIQEDQQANRSNRSSSQQADPRDHQVSRLIIQEDQQANKPIQQIIKPTGRSNRSSS
QQVGHPRGSTS