; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031384 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031384
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr11:7810636..7813832
RNA-Seq ExpressionLag0031384
SyntenyLag0031384
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.0e-16367.89Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T+  T E+RM +++K +N LMKA+EE+D +IA LK  IE++  +ESS T  IKN +KGK I+Q+ QPQ S S+ASLS+QQLQ+MI N I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LYSKPYTKRIDN+R P  YQPPKFQQFDGK N KQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+ SWE+LER+FLNRFYSTRR
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELT TKQ+KGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +RKE +  +
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTT----LPKSSSKEKRQTNG-THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVIGH
         T        +++MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+Q+LE  LI+LP+CKRP EM +V+DP YCKYHRVI H
Subjt:  ET-------IEKSMVVNTT----LPKSSSKEKRQTNG-THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVIGH

Query:  PVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAIV
        PVE+CFVLK+LILKLA + KIEL+LD+VAQ+N A V
Subjt:  PVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAIV

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]9.7e-0135Show/hide
Query:  KLQPKRKRSKK--FSQPQQQVMLNKSFSKTFH--KKEKVNFATFYCIDVEEVNNSKKSE----QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMST
        +L P +K+ +K  +S P  +  +    S+      K K   A    I VEE  +S++ +    QR+SVFDRI     RPS FQR+S +  ++ NQ S  +
Subjt:  KLQPKRKRSKK--FSQPQQQVMLNKSFSKTFH--KKEKVNFATFYCIDVEEVNNSKKSE----QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMST

Query:  FIRPSAFQRLSVSTSK----KSRPST--SVFYRLKFLVSK
          R SAFQRL+ S  K       P+T  S F RL   V++
Subjt:  FIRPSAFQRLSVSTSK----KSRPST--SVFYRLKFLVSK

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]1.3e-16267.51Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T   T E RM +++K +N LMK +EE+D +IA LK  IE++  +ESS   ++KN DKGK ++Q+ QPQ S S+ASLS+QQLQ+MI + I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LYSKPYTKRIDNLR P  YQPPKFQQFDGK N KQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWY DLEPES+ +WE+LER+FLNRFYSTRR
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+QQKGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I++SMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQ+LE  LI+LP+CKRPE+ EKVDDP YCKYHRVI
Subjt:  ET-------IEKSMVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.9e-17859.97Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T   T E RM +++K +N LMK +EE+D +IA LK  IE++  +ESS    +KN DKGK ++Q+ QPQ S S+ASLS+QQLQ+MI + I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LY KPYTKRIDNLR P  YQPPKFQQFDGK N KQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+ +WE+LER+FLNRFYSTR 
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+QQKGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I++SMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQ+LE  LI+LP+CKRPE+  KVDDP YCKYHRVI
Subjt:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAI------VKGKSKH----QRKKDLKKLQPKRKRSKKFSQPQQQVMLNKSFSKTFHKKEKVNFAT
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N AI      +KGK +     +R   L +  P   RS     P++ + +    + +  + +  N+ +
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAI------VKGKSKH----QRKKDLKKLQPKRKRSKKFSQPQQQVMLNKSFSKTFHKKEKVNFAT

Query:  FYCIDVEEVNNSKKSEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPSTSVFYRLK
              +EVNNS +  QRTSVFDRIKP TTR S FQR+S+A  EEENQC    + R S  +RLS+ST KK RPSTS F RLK
Subjt:  FYCIDVEEVNNSKKSEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPSTSVFYRLK

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.6e-17353.23Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T   T E RM +++K +N LMK +EE+D +IA LK  IE++  +ESS    +KN DKGK ++Q+ QPQ S S+ASLS+QQLQ+MI + I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LYSKPYTKRIDNLR P  YQPPKFQQFDGK N KQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+ +WE+LER+FLNRFYSTRR
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+QQKGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I++SMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQ+LE  LI+LPKCKRP++ EKVDDP YCKYHRVI
Subjt:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-LAI--------------------------------------------------------------
         HPVE+CFVLK+LILKLA+E KIEL++DEVAQ+N +AI                                                              
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-LAI--------------------------------------------------------------

Query:  -----------------VKGKSKHQRKKDLKKLQPKRKRSKKFSQPQQQVMLNKSFSKTF---HKKEKVNFATFYCIDVEEVNNSKKS----------EQ
                         +  K K +R K +   +P + + + F Q ++ + L +   ++F   H +E +   T +   + EVNN+  S           Q
Subjt:  -----------------VKGKSKHQRKKDLKKLQPKRKRSKKFSQPQQQVMLNKSFSKTF---HKKEKVNFATFYCIDVEEVNNSKKS----------EQ

Query:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPSTSVFYRLK
        RTSVFDRIKP TTR S FQR+SMA  EEENQC    + R S F+RLS+S SKK+RPSTS F RLK
Subjt:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPSTSVFYRLK

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]2.9e-16269.14Show/hide
Query:  MEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQP-QCSASVASLSIQQLQDMITNCIRAQYGGPTRDSLLYS
        +E  M +M++ IN LMK ++E+D +IA LK Q++ +  +ESSQT V+K  DKGK +VQ++QP Q S SVASLS+QQLQDMITN IRAQYGGP++ S +YS
Subjt:  MEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQP-QCSASVASLSIQQLQDMITNCIRAQYGGPTRDSLLYS

Query:  KPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRRTVSMFE
        KPYTKRIDNLR P+ YQPPKFQQFDGK N KQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPES+ SWE+LE+EFLNRFYSTRRTVSM E
Subjt:  KPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRRTVSMFE

Query:  LTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------D
        LTNTKQ+KGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         
Subjt:  LTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------D

Query:  EETIEKSMVVNTT-LPKSSSKEKR---QTNGT--HHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERC
        + T ++SMVVNTT L  S  KE R   + +G+    LTLKERQ+K+YPFPD+DI DMLEQ+LE  LI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+C
Subjt:  EETIEKSMVVNTT-LPKSSSKEKR---QTNGT--HHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERC

Query:  FVLKDLILKLAKEGKIELDLDEVAQSNLAIV
        FVLK+LIL+LA+E +IELDL+EVAQ+N A V
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSNLAIV

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H9.7e-16467.89Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T+  T E+RM +++K +N LMKA+EE+D +IA LK  IE++  +ESS T  IKN +KGK I+Q+ QPQ S S+ASLS+QQLQ+MI N I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LYSKPYTKRIDN+R P  YQPPKFQQFDGK N KQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+ SWE+LER+FLNRFYSTRR
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELT TKQ+KGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +RKE +  +
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTT----LPKSSSKEKRQTNG-THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVIGH
         T        +++MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+Q+LE  LI+LP+CKRP EM +V+DP YCKYHRVI H
Subjt:  ET-------IEKSMVVNTT----LPKSSSKEKRQTNG-THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVIGH

Query:  PVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAIV
        PVE+CFVLK+LILKLA + KIEL+LD+VAQ+N A V
Subjt:  PVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAIV

A0A5A7TZU9 Ribonuclease H4.7e-0135Show/hide
Query:  KLQPKRKRSKK--FSQPQQQVMLNKSFSKTFH--KKEKVNFATFYCIDVEEVNNSKKSE----QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMST
        +L P +K+ +K  +S P  +  +    S+      K K   A    I VEE  +S++ +    QR+SVFDRI     RPS FQR+S +  ++ NQ S  +
Subjt:  KLQPKRKRSKK--FSQPQQQVMLNKSFSKTFH--KKEKVNFATFYCIDVEEVNNSKKSE----QRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMST

Query:  FIRPSAFQRLSVSTSK----KSRPST--SVFYRLKFLVSK
          R SAFQRL+ S  K       P+T  S F RL   V++
Subjt:  FIRPSAFQRLSVSTSK----KSRPST--SVFYRLKFLVSK

A0A5A7TZU9 Ribonuclease H6.3e-16367.51Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T   T E RM +++K +N LMK +EE+D +IA LK  IE++  +ESS   ++KN DKGK ++Q+ QPQ S S+ASLS+QQLQ+MI + I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LYSKPYTKRIDNLR P  YQPPKFQQFDGK N KQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWY DLEPES+ +WE+LER+FLNRFYSTRR
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+QQKGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I++SMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQ+LE  LI+LP+CKRPE+ EKVDDP YCKYHRVI
Subjt:  ET-------IEKSMVVNTTLPKSSSKEKRQTNGTHH-------LTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN

A0A5A7URH1 Ty3-gypsy retrotransposon protein2.4e-17859.97Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T   T E RM +++K +N LMK +EE+D +IA LK  IE++  +ESS    +KN DKGK ++Q+ QPQ S S+ASLS+QQLQ+MI + I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LY KPYTKRIDNLR P  YQPPKFQQFDGK N KQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+ +WE+LER+FLNRFYSTR 
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+QQKGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I++SMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQ+LE  LI+LP+CKRPE+  KVDDP YCKYHRVI
Subjt:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAI------VKGKSKH----QRKKDLKKLQPKRKRSKKFSQPQQQVMLNKSFSKTFHKKEKVNFAT
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N AI      +KGK +     +R   L +  P   RS     P++ + +    + +  + +  N+ +
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAI------VKGKSKH----QRKKDLKKLQPKRKRSKKFSQPQQQVMLNKSFSKTFHKKEKVNFAT

Query:  FYCIDVEEVNNSKKSEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPSTSVFYRLK
              +EVNNS +  QRTSVFDRIKP TTR S FQR+S+A  EEENQC    + R S  +RLS+ST KK RPSTS F RLK
Subjt:  FYCIDVEEVNNSKKSEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPSTSVFYRLK

A0A5D3BX77 Retrotransposon gag protein7.9e-17453.23Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T   T E RM +++K +N LMK +EE+D +IA LK  IE++  +ESS    +KN DKGK ++Q+ QPQ S S+ASLS+QQLQ+MI + I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LYSKPYTKRIDNLR P  YQPPKFQQFDGK N KQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+ +WE+LER+FLNRFYSTRR
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+QQKGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I++SMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQ+LE  LI+LPKCKRP++ EKVDDP YCKYHRVI
Subjt:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-LAI--------------------------------------------------------------
         HPVE+CFVLK+LILKLA+E KIEL++DEVAQ+N +AI                                                              
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-LAI--------------------------------------------------------------

Query:  -----------------VKGKSKHQRKKDLKKLQPKRKRSKKFSQPQQQVMLNKSFSKTF---HKKEKVNFATFYCIDVEEVNNSKKS----------EQ
                         +  K K +R K +   +P + + + F Q ++ + L +   ++F   H +E +   T +   + EVNN+  S           Q
Subjt:  -----------------VKGKSKHQRKKDLKKLQPKRKRSKKFSQPQQQVMLNKSFSKTF---HKKEKVNFATFYCIDVEEVNNSKKS----------EQ

Query:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPSTSVFYRLK
        RTSVFDRIKP TTR S FQR+SMA  EEENQC    + R S F+RLS+S SKK+RPSTS F RLK
Subjt:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPSTSVFYRLK

A0A5D3D4X3 Ty3-gypsy retrotransposon protein5.9e-16167.05Show/hide
Query:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR
        M+T+    E+RM +++K +N LMK +EE+D +IA LK  IE++  +ESS    +KN DKGK ++Q+ QPQ S S+ASLS+QQLQ+MI + I+ QYGGP +
Subjt:  MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTR

Query:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR
           LYSKPYTKRIDNLR P  YQPPKFQQFDGK N KQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWY DLEPES+ +WE+LER+FLNRFYSTRR
Subjt:  DSLLYSKPYTKRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+QQKGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQQKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I++SMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQ+LE  LI+LP+CKRPE+  KVDDP YCKYHRVI
Subjt:  ET-------IEKSMVVNTTLPKSSSKEK-----RQTNG--THHLTLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAACTGAGACAAGAACTATGGAAGAAAGAATGACTCAGATGCAAAAGCACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAA
GTGCCAAATTGAGAACCAACATATCTCCGAATCAAGTCAAACACAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTGCTT
CGGTTGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTATGGTGGACCTACTCGAGATTCCCTCTTGTATTCCAAACCTTATACT
AAGAGGATTGATAACTTGAGAACGCCAATCAGGTATCAACCACCAAAATTTCAACAGTTTGATGGAAAGAGCAATCATAAACAACATATTGCCCACTTCGTTGAGACATG
CGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAGCAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTACACAGTT
GGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACAAAAAGGTGAACTCGTAGTTAAC
TATATAAATCGCTGGAGAGCCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCT
TAAAGGTATAAAGCCTCGCACCTTCGAAGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAA
AAGAAGGAAGGAACGACGAAGAGACTATAGAAAAATCTATGGTTGTAAACACAACTCTTCCTAAGTCGTCTTCAAAAGAAAAGCGACAAACAAATGGAACGCATCACTTA
ACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAAGTACTAGAAGCGCACCTGATAGAGCTTCCTAAGTGTAAACG
ACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTTAAGGACTTAATTTTAAAGCTGG
CTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTATAGTCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCTTAAGAAACTTCAA
CCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACAGGTGATGCTGAATAAATCATTCTCCAAAACTTTTCACAAAAAGGAAAAAGTGAACTTTGCAACTTT
CTACTGCATCGACGTAGAAGAAGTTAACAATTCCAAGAAGAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGCATTCCAAAGAA
TGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTTCATTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCA
ACATCTGTTTTTTATCGCCTCAAATTCCTTGTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGTGTTGTTCCTTCTCCAAGTTCGAAGGTTCTCCGTTGTATCCT
GTTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGCTGTTGCATTGTTCCCTT
TTCTCTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGGAGTTCATTCTCCCCAAGTTCGAAGGTTCTCACGCGCTTCATTGCAATTCCTTCTCTCCAAGGTCGAAGGTTC
TCACGCGCTTCGTTGAAGTTCCTTCCTCCAAGTTTGAAGGTCCTCATGCTACGCTCGGCTGCACTGCTGCGCTACTTCCTAAAGTCCAAAGATGTCAATTGTCCTCACGT
TGCGCTGCTTCCTTCTCCAAGTTCGAAGGCGTGGCAGCGACACAAGTCCAAGGACACGCCCCAAAATAAGGAACATGTCCCTACACTCATGTTGAAAGGGCGTGACGGCG
ACACAAGTCTAAGGAACATGTCCCATAGCGAGGAACATGTCCCTATAATCGTGCTGAAAGGGCGTGACGGCGACACAAGTCCAAGGAACATGTCCCATAGCGAGGAACAT
GTCCCTGTACTCGTGCTGAAAGGGCGTGACGGCGACACAAGTCCAAGGAACATGTCCCAAAGCGAGGAACATGTCCCTGCACTCGTGCTGAAAGGGCGTGACGACGACAC
AAGTCCAAGGAACATGTCCCAAAGCGAGGAACATGTCCCTGTACTCGTGCTGAAAGGGTGTGACGGCGACACAAGTCCAAGGAACATGTCCCTGCACTTGTGCCAAAAGG
CATGGCGACAACACAAGTCCAAGGAACATGTCCCAAAGCCAGGAACATGTCCCTACACTCGTGCTGAAAGGGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAACTGAGACAAGAACTATGGAAGAAAGAATGACTCAGATGCAAAAGCACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATCGCGCAACTAAA
GTGCCAAATTGAGAACCAACATATCTCCGAATCAAGTCAAACACAAGTCATAAAAAATCATGACAAAGGAAAGACTATAGTGCAAGATGATCAGCCACAGTGTTCTGCTT
CGGTTGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTATGGTGGACCTACTCGAGATTCCCTCTTGTATTCCAAACCTTATACT
AAGAGGATTGATAACTTGAGAACGCCAATCAGGTATCAACCACCAAAATTTCAACAGTTTGATGGAAAGAGCAATCATAAACAACATATTGCCCACTTCGTTGAGACATG
CGAGAACGCTGGTACTCGAGGGGACCTACTAGTCAAGCAGTTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAGTACACAGTT
GGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGAAGAACCGTTAGCATGTTCGAGCTCACCAACACTAAACAACAAAAAGGTGAACTCGTAGTTAAC
TATATAAATCGCTGGAGAGCCATGAGTCTAGATTGCAAAGATCGTCTCACTGAACTCTCTTCCGTCGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCT
TAAAGGTATAAAGCCTCGCACCTTCGAAGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAA
AAGAAGGAAGGAACGACGAAGAGACTATAGAAAAATCTATGGTTGTAAACACAACTCTTCCTAAGTCGTCTTCAAAAGAAAAGCGACAAACAAATGGAACGCATCACTTA
ACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAAGTACTAGAAGCGCACCTGATAGAGCTTCCTAAGTGTAAACG
ACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTTAAGGACTTAATTTTAAAGCTGG
CTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTATAGTCAAAGGAAAGAGCAAACATCAAAGAAAGAAGGATCTTAAGAAACTTCAA
CCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACAGGTGATGCTGAATAAATCATTCTCCAAAACTTTTCACAAAAAGGAAAAAGTGAACTTTGCAACTTT
CTACTGCATCGACGTAGAAGAAGTTAACAATTCCAAGAAGAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGCATTCCAAAGAA
TGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTTCATTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGACCTTCA
ACATCTGTTTTTTATCGCCTCAAATTCCTTGTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGTGTTGTTCCTTCTCCAAGTTCGAAGGTTCTCCGTTGTATCCT
GTTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGCTGTTGCATTGTTCCCTT
TTCTCTCAAGTTCGATGGTTCTCACGCAGCTTCGCTGGAGTTCATTCTCCCCAAGTTCGAAGGTTCTCACGCGCTTCATTGCAATTCCTTCTCTCCAAGGTCGAAGGTTC
TCACGCGCTTCGTTGAAGTTCCTTCCTCCAAGTTTGAAGGTCCTCATGCTACGCTCGGCTGCACTGCTGCGCTACTTCCTAAAGTCCAAAGATGTCAATTGTCCTCACGT
TGCGCTGCTTCCTTCTCCAAGTTCGAAGGCGTGGCAGCGACACAAGTCCAAGGACACGCCCCAAAATAAGGAACATGTCCCTACACTCATGTTGAAAGGGCGTGACGGCG
ACACAAGTCTAAGGAACATGTCCCATAGCGAGGAACATGTCCCTATAATCGTGCTGAAAGGGCGTGACGGCGACACAAGTCCAAGGAACATGTCCCATAGCGAGGAACAT
GTCCCTGTACTCGTGCTGAAAGGGCGTGACGGCGACACAAGTCCAAGGAACATGTCCCAAAGCGAGGAACATGTCCCTGCACTCGTGCTGAAAGGGCGTGACGACGACAC
AAGTCCAAGGAACATGTCCCAAAGCGAGGAACATGTCCCTGTACTCGTGCTGAAAGGGTGTGACGGCGACACAAGTCCAAGGAACATGTCCCTGCACTTGTGCCAAAAGG
CATGGCGACAACACAAGTCCAAGGAACATGTCCCAAAGCCAGGAACATGTCCCTACACTCGTGCTGAAAGGGCGTAG
Protein sequenceShow/hide protein sequence
MITETRTMEERMTQMQKHINNLMKAIEEKDSQIAQLKCQIENQHISESSQTQVIKNHDKGKTIVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGPTRDSLLYSKPYT
KRIDNLRTPIRYQPPKFQQFDGKSNHKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVHSWEELEREFLNRFYSTRRTVSMFELTNTKQQKGELVVN
YINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEETIEKSMVVNTTLPKSSSKEKRQTNGTHHL
TLKERQKKIYPFPDADIPDMLEQVLEAHLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLAIVKGKSKHQRKKDLKKLQ
PKRKRSKKFSQPQQQVMLNKSFSKTFHKKEKVNFATFYCIDVEEVNNSKKSEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQCSMSTFIRPSAFQRLSVSTSKKSRPS
TSVFYRLKFLVSKFEGPYTVRYCVVPSPSSKVLRCILLRCSFSKFEGSQLYDCYVVPPPSAKDLMWCAVALFPFLSSSMVLTQLRWSSFSPSSKVLTRFIAIPSLQGRRF
SRASLKFLPPSLKVLMLRSAALLRYFLKSKDVNCPHVALLPSPSSKAWQRHKSKDTPQNKEHVPTLMLKGRDGDTSLRNMSHSEEHVPIIVLKGRDGDTSPRNMSHSEEH
VPVLVLKGRDGDTSPRNMSQSEEHVPALVLKGRDDDTSPRNMSQSEEHVPVLVLKGCDGDTSPRNMSLHLCQKAWRQHKSKEHVPKPGTCPYTRAERA