; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022175 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022175
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr7:20396983..20399967
RNA-Seq ExpressionLag0022175
SyntenyLag0022175
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032454.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-11442.74Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ ++   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F   AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE TV EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +P GL   M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E  E 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                ++  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

KAA0048423.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-11442.74Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ ++   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F   AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE TV EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +P GL   M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E  E 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                ++  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

KAA0052232.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]8.1e-11542.91Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ ++   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F E AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE T+ EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +  GL  +M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E TE 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                +I  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

KAA0065392.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-11442.91Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ K+   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F   AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE TV EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +P GL   M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E  E 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                ++  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

TYK30083.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-11442.74Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ ++   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F   AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE TV EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +P GL   M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E  E 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                ++  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

TrEMBL top hitse value%identityAlignment
A0A5A7SMQ7 Ty3/gypsy retrotransposon protein1.9e-11442.74Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ ++   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F   AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE TV EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +P GL   M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E  E 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                ++  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

A0A5A7TXP9 Ty3/gypsy retrotransposon protein1.9e-11442.74Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ ++   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F   AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE TV EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +P GL   M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E  E 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                ++  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

A0A5A7U908 Transposon Tf2-1 polyprotein isoform X13.9e-11542.91Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ ++   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F E AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE T+ EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +  GL  +M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E TE 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                +I  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

A0A5A7VAR4 Ty3/gypsy retrotransposon protein8.7e-11542.91Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ K+   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F   AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE TV EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +P GL   M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E  E 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                ++  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

A0A5D3DYS4 Ty3/gypsy retrotransposon protein1.9e-11442.74Show/hide
Query:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA
        M   ++EER+   E+ + G+K+   K P +E+++ E+ K+ME +    E Q+Q  +   +A A         + +   + SP+ + + G K     D+  
Subjt:  MAHKQLEERVAESEKHVEGMKE---KFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALA--------NLLQGGFKVSPSGEKEFGQKRKIEEDVSA

Query:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL
        +S+ ++   +EN +DR KFKKVEMP F G+ P+ WLF+AERYF IH+L++ EK++VS + F   AL WYR  E R  F SW NLK R+L RF+ T+EGT+
Subjt:  SSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTL

Query:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP
        C RFL I+QE TV EYR +F+ L  PL  L + V+E TF+ G  P IR+EV+  +P GL   M   Q +ED+   R   +  S+  G+  ++ ITS    
Subjt:  CARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKN--RAIESKTSFGPGRLWKNPITSHANP

Query:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC
            +QN         PI+TITL + NP    KE   +RL D+E QLR+EKGLC++C+EK+   HKCK K   EL++ +V   +EE E     E E  E 
Subjt:  PKLIEQN---------PIKTITLPN-NPSHPPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGK---ELKVLLVA--DEEPEQPFSKEKETTEC

Query:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT
        RV EV+  T     VELSIN+VVG + PGTMKVKG ++ KE+                ++  ++LP+ ET +YG+I+G+ TA++GKGIC+ + I +   T
Subjt:  RVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEM----------------LIEDMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGIT

Query:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA
        V EDFLP+EL GVDVILGMQWL +LGVT  DWK LT+     + +I +KGDPSLTK  VSLK L +TW +HD G+L+E R++  A
Subjt:  VAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKHDQGFLVELRAITAA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein5.4e-0832.47Show/hide
Query:  KKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQ
        +++EMP F+G    +W  + ER+F + +  D +K+ + A+S   VAL+W+    S   F  W + + R+L RF P +
Subjt:  KKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQ

AT3G29750.1 Eukaryotic aspartyl protease family protein5.9e-0731.31Show/hide
Query:  DKEMLIE---DMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGITVAEDFLPIEL--SGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITL
        D  +L+E    +KLP + T    +++G R  ++  G C  I + +  + + E+FL ++L  + VDVILG +WL  LG T ++W+            ITL
Subjt:  DKEMLIE---DMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGITVAEDFLPIEL--SGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITL

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding2.8e-0921.79Show/hide
Query:  ERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTLCARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTF
        E YF  + + + E++ +   +      +W +    ++  TSW+  K  +    + T +      +  I+QE +V EYRE+FEAL      L  + LE  F
Subjt:  ERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTLCARFLSIRQEKTVVEYREKFEALATPLPQLSEEVLENTF

Query:  LNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKNRAIESKTSFGPGRLWKNPITSHANPPKL--IEQNPIKTITLPNNPSHPPKETPLRRLSDSEMQLRR
        L G  P++++ V   +P G+  +M   Q +E+ N    S   +G G        S    PK+    Q  ++++ L        K+TP R  ++    L++
Subjt:  LNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKNRAIESKTSFGPGRLWKNPITSHANPPKL--IEQNPIKTITLPNNPSHPPKETPLRRLSDSEMQLRR

Query:  EKGLCYRCDEKFHMGHKCKGKELKVLLVADEEPEQPFSKEKETTECRVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEMLIEDMKLPVTET
        E              H+  G E+                      CR                             M+  G I  +E             
Subjt:  EKGLCYRCDEKFHMGHKCKGKELKVLLVADEEPEQPFSKEKETTECRVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEMLIEDMKLPVTET

Query:  MNYGIIMGTRTAVKGKGICKKIVIALDGITVAEDFLPIELSG--VDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTW
            +    R A   K  C++I + ++ I + ED+   +L    VDVILG +WL  LG T ++W+  +         +TL   P     E   K++K   
Subjt:  MNYGIIMGTRTAVKGKGICKKIVIALDGITVAEDFLPIELSG--VDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTW

Query:  GKHDQGFLVELRA
         K   G   EL +
Subjt:  GKHDQGFLVELRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGGGTTTGCTGAACGAAAGAGAGAAAACAAGAGAGAGGAAAAGATATTACCATGCGTTAAACAGAGAAAACTTGAGCTTTGGGTTTCCCTCGCAGCTAATCGACA
ACACGAAGAGAGGGAGAAAAAGAAGGGTGGCGACGGAGGAAGAAGGAACAGAGGCAGTGGCGGCGGTGGCTGTGCGTGGCGGCTGGCAGCATGCGGCAGCAGGCGACGGC
GGCGTGGAGAAAACGAGGGGGAGGCGGCGTGGCTGAGCGGTGTGGTCACACGAGGGAGAGAGAGAGACTTGCGAGAGAGAGAGATTGTCGGAGGAGAAGAAATTTCAGGG
GGGGTTGGACGCACGAGGGAGAGAGGAAGGGAGAATGAAGAGCATGTCATCCTGGGTAAGACACGGGGTATGGCGCACAAACAACTGGAGGAGCGGGTAGCAGAATCTGA
GAAACATGTCGAAGGTATGAAAGAAAAATTTCCTGAATTAGAAAATTCTGTAGCAGAATTGAATAAAAGTATGGAAAAGTTGTTTAATAGTATGGAAGATCAACGACAAA
TAACAATAGAAAATCAAAAGGCGTTAGCAAATTTGCTACAAGGAGGGTTTAAAGTCAGCCCTAGCGGGGAAAAAGAGTTCGGACAAAAGCGGAAAATTGAGGAAGATGTA
AGCGCTTCCTCTAGTCGAAAGGAACCCAGGGGAGAAGAAAATTTCCATGATCGACAGAAATTCAAGAAGGTAGAAATGCCTACCTTCGAAGGAGACTACCCAGACGATTG
GCTTTTTCAAGCAGAACGCTATTTCGACATACACCAATTATCAGATCCGGAGAAGGTCGTGGTATCCGCAGTAAGCTTCGCCGAAGTAGCTTTAAGGTGGTACCGGTGGG
CTGAGAGCCGAAGTCCGTTCACAAGCTGGAGAAATCTGAAATACCGAGTACTGGAGCGATTCAGGCCAACGCAAGAAGGGACCCTCTGTGCTCGATTCCTTTCCATTAGG
CAAGAGAAGACAGTGGTCGAATATCGGGAGAAATTTGAAGCCCTCGCAACCCCCCTTCCACAGCTATCTGAGGAAGTCCTTGAAAACACGTTCCTTAACGGATGGTTACC
GGCTATAAGATCTGAAGTCCTGTGTTTTGAACCCATGGGCCTGGAAGCCATCATGAAGGCGGTCCAAAGAATAGAAGACAAGAATCGGGCCATTGAATCTAAGACTAGTT
TCGGCCCAGGAAGATTGTGGAAAAATCCCATCACCAGCCATGCAAATCCACCGAAGTTAATTGAACAAAACCCCATTAAAACCATCACTTTACCCAATAACCCGTCCCAT
CCACCGAAAGAAACACCACTTAGACGCCTATCAGACTCTGAAATGCAATTAAGAAGAGAAAAGGGTCTGTGTTACAGGTGTGATGAGAAGTTCCACATGGGACACAAATG
TAAAGGAAAAGAGTTAAAGGTACTTCTGGTAGCAGACGAGGAACCAGAACAACCATTTTCCAAAGAAAAAGAAACTACAGAGTGTCGGGTAGAAGAAGTCGAAGATCCTA
CTGTGGAAGTCGACATGGTCGAACTCTCCATTAACACCGTAGTTGGATTTTCTTCTCCTGGAACGATGAAAGTAAAAGGGAGAATCGAAGACAAAGAGATGCTAATCGAA
GATATGAAGTTACCAGTGACAGAGACGATGAACTATGGCATTATCATGGGAACAAGAACCGCAGTGAAAGGAAAAGGTATTTGTAAAAAGATTGTGATCGCATTGGATGG
AATCACCGTTGCTGAAGATTTTTTACCCATAGAGTTGAGTGGGGTTGATGTAATATTAGGAATGCAGTGGCTGAGAACTCTAGGAGTAACGACTATTGACTGGAAAACTC
TAACGATGGAGATCAAAGTAGGAGATTCCAAGATTACCCTCAAGGGAGATCCCTCCTTGACCAAGACAGAAGTATCGCTTAAACAACTTAAGCGAACATGGGGCAAACAC
GATCAAGGCTTCTTAGTTGAATTAAGAGCAATTACAGCAGCAGTGGGTGACCCGGCTGTTTTTACGCGATTCCGCCATCACCGAACCCACCCATCGTGGACCAATTATTA
G
mRNA sequenceShow/hide mRNA sequence
ATGCACGGGTTTGCTGAACGAAAGAGAGAAAACAAGAGAGAGGAAAAGATATTACCATGCGTTAAACAGAGAAAACTTGAGCTTTGGGTTTCCCTCGCAGCTAATCGACA
ACACGAAGAGAGGGAGAAAAAGAAGGGTGGCGACGGAGGAAGAAGGAACAGAGGCAGTGGCGGCGGTGGCTGTGCGTGGCGGCTGGCAGCATGCGGCAGCAGGCGACGGC
GGCGTGGAGAAAACGAGGGGGAGGCGGCGTGGCTGAGCGGTGTGGTCACACGAGGGAGAGAGAGAGACTTGCGAGAGAGAGAGATTGTCGGAGGAGAAGAAATTTCAGGG
GGGGTTGGACGCACGAGGGAGAGAGGAAGGGAGAATGAAGAGCATGTCATCCTGGGTAAGACACGGGGTATGGCGCACAAACAACTGGAGGAGCGGGTAGCAGAATCTGA
GAAACATGTCGAAGGTATGAAAGAAAAATTTCCTGAATTAGAAAATTCTGTAGCAGAATTGAATAAAAGTATGGAAAAGTTGTTTAATAGTATGGAAGATCAACGACAAA
TAACAATAGAAAATCAAAAGGCGTTAGCAAATTTGCTACAAGGAGGGTTTAAAGTCAGCCCTAGCGGGGAAAAAGAGTTCGGACAAAAGCGGAAAATTGAGGAAGATGTA
AGCGCTTCCTCTAGTCGAAAGGAACCCAGGGGAGAAGAAAATTTCCATGATCGACAGAAATTCAAGAAGGTAGAAATGCCTACCTTCGAAGGAGACTACCCAGACGATTG
GCTTTTTCAAGCAGAACGCTATTTCGACATACACCAATTATCAGATCCGGAGAAGGTCGTGGTATCCGCAGTAAGCTTCGCCGAAGTAGCTTTAAGGTGGTACCGGTGGG
CTGAGAGCCGAAGTCCGTTCACAAGCTGGAGAAATCTGAAATACCGAGTACTGGAGCGATTCAGGCCAACGCAAGAAGGGACCCTCTGTGCTCGATTCCTTTCCATTAGG
CAAGAGAAGACAGTGGTCGAATATCGGGAGAAATTTGAAGCCCTCGCAACCCCCCTTCCACAGCTATCTGAGGAAGTCCTTGAAAACACGTTCCTTAACGGATGGTTACC
GGCTATAAGATCTGAAGTCCTGTGTTTTGAACCCATGGGCCTGGAAGCCATCATGAAGGCGGTCCAAAGAATAGAAGACAAGAATCGGGCCATTGAATCTAAGACTAGTT
TCGGCCCAGGAAGATTGTGGAAAAATCCCATCACCAGCCATGCAAATCCACCGAAGTTAATTGAACAAAACCCCATTAAAACCATCACTTTACCCAATAACCCGTCCCAT
CCACCGAAAGAAACACCACTTAGACGCCTATCAGACTCTGAAATGCAATTAAGAAGAGAAAAGGGTCTGTGTTACAGGTGTGATGAGAAGTTCCACATGGGACACAAATG
TAAAGGAAAAGAGTTAAAGGTACTTCTGGTAGCAGACGAGGAACCAGAACAACCATTTTCCAAAGAAAAAGAAACTACAGAGTGTCGGGTAGAAGAAGTCGAAGATCCTA
CTGTGGAAGTCGACATGGTCGAACTCTCCATTAACACCGTAGTTGGATTTTCTTCTCCTGGAACGATGAAAGTAAAAGGGAGAATCGAAGACAAAGAGATGCTAATCGAA
GATATGAAGTTACCAGTGACAGAGACGATGAACTATGGCATTATCATGGGAACAAGAACCGCAGTGAAAGGAAAAGGTATTTGTAAAAAGATTGTGATCGCATTGGATGG
AATCACCGTTGCTGAAGATTTTTTACCCATAGAGTTGAGTGGGGTTGATGTAATATTAGGAATGCAGTGGCTGAGAACTCTAGGAGTAACGACTATTGACTGGAAAACTC
TAACGATGGAGATCAAAGTAGGAGATTCCAAGATTACCCTCAAGGGAGATCCCTCCTTGACCAAGACAGAAGTATCGCTTAAACAACTTAAGCGAACATGGGGCAAACAC
GATCAAGGCTTCTTAGTTGAATTAAGAGCAATTACAGCAGCAGTGGGTGACCCGGCTGTTTTTACGCGATTCCGCCATCACCGAACCCACCCATCGTGGACCAATTATTA
G
Protein sequenceShow/hide protein sequence
MHGFAERKRENKREEKILPCVKQRKLELWVSLAANRQHEEREKKKGGDGGRRNRGSGGGGCAWRLAACGSRRRRRGENEGEAAWLSGVVTRGRERDLREREIVGGEEISG
GVGRTRERGRENEEHVILGKTRGMAHKQLEERVAESEKHVEGMKEKFPELENSVAELNKSMEKLFNSMEDQRQITIENQKALANLLQGGFKVSPSGEKEFGQKRKIEEDV
SASSSRKEPRGEENFHDRQKFKKVEMPTFEGDYPDDWLFQAERYFDIHQLSDPEKVVVSAVSFAEVALRWYRWAESRSPFTSWRNLKYRVLERFRPTQEGTLCARFLSIR
QEKTVVEYREKFEALATPLPQLSEEVLENTFLNGWLPAIRSEVLCFEPMGLEAIMKAVQRIEDKNRAIESKTSFGPGRLWKNPITSHANPPKLIEQNPIKTITLPNNPSH
PPKETPLRRLSDSEMQLRREKGLCYRCDEKFHMGHKCKGKELKVLLVADEEPEQPFSKEKETTECRVEEVEDPTVEVDMVELSINTVVGFSSPGTMKVKGRIEDKEMLIE
DMKLPVTETMNYGIIMGTRTAVKGKGICKKIVIALDGITVAEDFLPIELSGVDVILGMQWLRTLGVTTIDWKTLTMEIKVGDSKITLKGDPSLTKTEVSLKQLKRTWGKH
DQGFLVELRAITAAVGDPAVFTRFRHHRTHPSWTNY