; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023071 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023071
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr7:43746337..43752470
RNA-Seq ExpressionLag0023071
SyntenyLag0023071
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]7.6e-26249.16Show/hide
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +      PDVMSVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME

Query:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------
        RK+NLLMK VDERD EIA LK Q+QT +    ++  + +   +GK                            +S+R  +  P Q               
Subjt:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------

Query:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG
          +PL    PKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++I+SWEQ+E+EFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV
        EPVIDYINRWRALSLDCKDRL+                              LATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS +KESMV
Subjt:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV

Query:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL
        VNTTPLKF SKGKE                            +SD+ DMLEQLLEKQLI+LPECKRPE+ GKV+DPNYC+YHRV+SHPVEKCFVLKELIL
Subjt:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL

Query:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------
        +LAREK+IELDL+EVAQTNHA  T+M+   +   +    +  + L+QFGT +P++                                             
Subjt:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------

Query:  -------------------------------------------------------------------------------LLGVAVGSGDLSSFSIKDLLS
                                                                                       L G  V S DLS F+++DLLS
Subjt:  -------------------------------------------------------------------------------LLGVAVGSGDLSSFSIKDLLS

Query:  LPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKL
        LPQE K +LI+AL+ S  +   T +      S C++I FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+GSAVNIMPKSTM QLGI M+ELSNSKL
Subjt:  LPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKL

Query:  VIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTG
        VIQ                               DS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVEAD+NPFSEAESHFADAKFY K++N  
Subjt:  VIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTG

Query:  ETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKC--SPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMP
        E +P E PL K  D   L+  A    E  E+ +  N   GEA TS  K  ++KDE    +PVLRY+PLSRRKKGESPF E  K +KVGD+EI+KE FT P
Subjt:  ETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKC--SPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMP

Query:  LTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAK
        LTKI +QE++    D ++A LP++RTKDGFDPKAYKL+AK                         QKKLL+EG+S+P +RKGLGYKSPEP+RIT+KGK K
Subjt:  LTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAK

Query:  VADINHITAEEVDDSKEEENVNNEL
        V DINHIT EE D++  +E  N  +
Subjt:  VADINHITAEEVDDSKEEENVNNEL

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]4.5e-26249.24Show/hide
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +      PDVMSVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME

Query:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------
        RK+NLLMK VDERD EIA LK Q+QT +    ++  + +   +GK                            SS+R  +  P Q               
Subjt:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------

Query:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG
          +PL    PKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++I+SWEQ+E+EFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV
        EPVIDYINRWRALSLDCKDRL+                              LATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS  KESMV
Subjt:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV

Query:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL
        VNTTPLKF SKGKE                            +SD+ DMLEQLLEKQLI+LPECKRPE+ GKV+DPNYC+YHRV+SHPVEKCFVLKELIL
Subjt:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL

Query:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------
        +LAREK+IELDL+EVAQTNHA  T+M+   +   +    +  + L+QFGT +P++                                             
Subjt:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------

Query:  -------------------------------------------------------------------------------LLGVAVGSGDLSSFSIKDLLS
                                                                                       L G  V S DLS F+++DLLS
Subjt:  -------------------------------------------------------------------------------LLGVAVGSGDLSSFSIKDLLS

Query:  LPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKL
        LPQE K +LI+AL+ S  +   T +      S C++I FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+GSAVNIMPKSTM QLGI M+ELSNSKL
Subjt:  LPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKL

Query:  VIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTG
        VIQ                               DS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVEAD+NPFSEAESHFADAKFY K++N  
Subjt:  VIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTG

Query:  ETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKC--SPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMP
        E +P E PL K  D   L+  A    E  E+ +  N   GEA TS  K  ++KDE    +PVLRY+PLSRRKKGESPF E  K +KVGD+EI+KE FT P
Subjt:  ETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKC--SPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMP

Query:  LTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAK
        LTKI +QE++    D ++A LP++RTKDGFDPKAYKL+AK                         QKKLL+EG+S+P +RKGLGYKSPEP+RIT+KGK K
Subjt:  LTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAK

Query:  VADINHITAEEVDDSKEEENVNNEL
        V DINHIT EE D++  +E  N  +
Subjt:  VADINHITAEEVDDSKEEENVNNEL

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]3.4e-26249.24Show/hide
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +      PDVMSVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME

Query:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------
        RK+NLLMK VDERD EIA LK Q+QT +    ++  + +   +GK                            SS+R  +  P Q               
Subjt:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------

Query:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG
          +PL    PKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++I+SWEQ+E+EFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV
        EPVIDYINRWRALSLDCKDRL+                              LATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS +KESMV
Subjt:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV

Query:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL
        VNTTPLKF SKGKE                            +SD+ DMLEQLLEKQLI+LPECKRPE+ GKV+DPNYC+YHRV+SHPVEKCFVLKELIL
Subjt:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL

Query:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------
        +LAREK+IELDL+EVAQTNHA  T+M+   +   +    +  + L+QFGT +P++                                             
Subjt:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------

Query:  -------------------------------------------------------------------------------LLGVAVGSGDLSSFSIKDLLS
                                                                                       L G  V S DLS F+++DLLS
Subjt:  -------------------------------------------------------------------------------LLGVAVGSGDLSSFSIKDLLS

Query:  LPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKL
        LPQE K +LI+AL+ S  +   T +      S C++I FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+GSAVNIMPKSTM QLGI M+ELSNSKL
Subjt:  LPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKL

Query:  VIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTG
        VIQ                               DS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVEAD+NPFSEAESHFADAKFY K++N  
Subjt:  VIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTG

Query:  ETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKC--SPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMP
        E +P E PL K  D   L+  A    E  E+ +  N   GEA TS  K  ++KDE    +PVLRY+PLSRRKKGESPF E  K +KVGD+EI+KE FT P
Subjt:  ETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKC--SPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMP

Query:  LTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAK
        LTKI +QE++    D ++A LP++RTKDGFDPKAYKL+AK                         QKKLL+EG+S+P +RKGLGYKSPEP+RIT+KGK K
Subjt:  LTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAK

Query:  VADINHITAEEVDDSKEEENVNNEL
        V DINHIT EE D++  +E  N  +
Subjt:  VADINHITAEEVDDSKEEENVNNEL

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]7.6e-26249.24Show/hide
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +      PDVMSVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME

Query:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------
        RK+NLLMK VDERD EIA LK Q+QT +    ++  + +   +GK                            SS+R  +  P Q               
Subjt:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------

Query:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG
          +PL    PKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++I+SWEQ+E+EFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV
        EPVIDYINRWRALSLDCKDRL+                              LATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS  KESMV
Subjt:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV

Query:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL
        VNTTPLKF SKGKE                            +SD+ DMLEQLLEKQLI+LPECKRPE+ GKV+DPNYC+YHRV+SHPVEKCFVLKELIL
Subjt:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL

Query:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------
        +LAREK+IELDL+EVAQTNHA  T+M+   +   +    +  + L+QFGT +P++                                             
Subjt:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------

Query:  -------------------------------------------------------------------------------LLGVAVGSGDLSSFSIKDLLS
                                                                                       L G  V S DLS F+++DLLS
Subjt:  -------------------------------------------------------------------------------LLGVAVGSGDLSSFSIKDLLS

Query:  LPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKL
        LPQE K +LI+AL+ S  +   T +      S C++I FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+GSAVNIMPKSTM QLGI M+ELSNSKL
Subjt:  LPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKL

Query:  VIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTG
        VIQ                               DS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVEAD+NPFSEAESHFADAKFY K++N  
Subjt:  VIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTG

Query:  ETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKC--SPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMP
        E +P E PL K  D   L+  A    E  E+ +  N   GEA TS  K  ++KDE    +PVLRY+PLSRRKKGESPF E  K +KVGD+EI+KE FT P
Subjt:  ETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKC--SPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMP

Query:  LTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAK
        LTKI +QE++    D ++A LP++RTKDGFDPKAYKL+AK                         QKKLL+EG+S+P +RKGLGYKSPEP+RIT+KGK K
Subjt:  LTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAK

Query:  VADINHITAEEVDDSKEEENVNNEL
        V DINHIT EE D++  +E  N  +
Subjt:  VADINHITAEEVDDSKEEENVNNEL

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]6.9e-23948.62Show/hide
Query:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME
        MA KKA +K + +SD+YTGP+TRSRS+GI I+      A+A  I K + ES K  + +K+NPL+      S +      PDVMSVMMADV  +  MAEME
Subjt:  MAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEME

Query:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------
        RK+NLLMK VDERD EIA LK Q+QT +    ++  + +   +GK                            +S+R  +  P Q               
Subjt:  RKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGK---------------------------LSSVRTNHNTPHQ---------------

Query:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG
          +PL    PKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEP++I+SWEQ+E+EFLNRFYSTRRTVSM ELT+TKQRKG
Subjt:  --LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKG

Query:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV
        EPVIDYINRWRALSLDCKDRL+                              LATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS +KESMV
Subjt:  EPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMV

Query:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL
        VNTTPLKF SKGKE                            +SD+ DMLEQLLEKQLI+LPECKRPE+ GKV+DPNYC+YHRV+SHPVEKCFVLKELIL
Subjt:  VNTTPLKFSSKGKE----------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL

Query:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------
        +LAREK+IELDL+EVAQTNHA  T+M+   +   +    +  + L+QFGT +P++                                             
Subjt:  KLAREKKIELDLDEVAQTNHA--TVMANVGNENPLISSCDYEERLIQFGTLDPVM---------------------------------------------

Query:  -------LLGVAVGSGDLSSFSIKDLLSLPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILID
               L G  V S DLS F+++DLLSLPQE K +LI+AL+ S  +   T +      S C++I FSDEDLLLGSK HNRPL+VSGY+REQRV RILID
Subjt:  -------LLGVAVGSGDLSSFSIKDLLSLPQEAKNVLINALIESDGTKIPT-SEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILID

Query:  DGSAVNIMPKSTMKQLGISMEELSNSKLVIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKK
        +GSAVNIMPKSTM QLGI M+ELSNSKLVIQ                               DS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KK
Subjt:  DGSAVNIMPKSTMKQLGISMEELSNSKLVIQ-------------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKK

Query:  VEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKCSPVLRYIPLSRRKKG
        VEAD+NPFSEAESHFADA F +             P +K  ++          +  +  V                                        
Subjt:  VEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKCSPVLRYIPLSRRKKG

Query:  ESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGY
               SK  +  D+EI+KE FT PLTKI +QE++    D ++A LP++RTKDGFDPKAYKL+AK                         QKKLL+EG+
Subjt:  ESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK-------------------------QKKLLKEGY

Query:  SLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEVDDSKEEENVNNEL
        S+P +RKGLGYKSPEP+RIT+KGK KV DINHIT EE D++  +E  N  +
Subjt:  SLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEVDDSKEEENVNNEL

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H6.3e-23846.13Show/hide
Query:  SDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTL-SPDVMSVMMADVD-QDERMAEMERKLNLLMKAVD
        SD    P TRSRS+ I+  ED  P  VA  I + I +  K  + +K+NP  +     S++  + +  P++MSVM+ DVD  ++RMAE+E+K+N+LMKAV+
Subjt:  SDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTL-SPDVMSVMMADVD-QDERMAEMERKLNLLMKAVD

Query:  ERDLEIAYLKNQLQ-----------TEKRLSLARPLLQERMIRGKLS---------------SVRTNHNTPHQ-----------------LP---LSPKF
        ERD EIA LKN ++           T K  +  + ++QE   +   S               S++T +  P Q                 +P     PKF
Subjt:  ERDLEIAYLKNQLQ-----------TEKRLSLARPLLQERMIRGKLS---------------SVRTNHNTPHQ-----------------LP---LSPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKGEPVIDYINRWRA
        QQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEP++IDSWEQ+ER+FLNRFYSTRR VSM ELT+TKQRKGEPVIDYINRWRA
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKGEPVIDYINRWRA

Query:  LSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSKG
        LSLDCKDRL+                              LATRAHDMELSIA+RG  D +VP++RKEKKEVK T+K  K  TKE+MVV+TTPLK  SK 
Subjt:  LSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSKG

Query:  KE--------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELILKLAREKKIELDLDE
        K+                          +SD+PDML+QLLEKQLI+LPECKRP E+G+VNDPNYC+YHRV+SHPVEKCFVLKELILKLA +KKIEL+LD+
Subjt:  KE--------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELILKLAREKKIELDLDE

Query:  VAQTNHATVMANVGNENPLISSCDYEERLIQFGTLDPVMLLG----------------------------------------------------------
        VAQTNHA V+    +    I S      LIQFG+L+PV++                                                            
Subjt:  VAQTNHATVMANVGNENPLISSCDYEERLIQFGTLDPVMLLG----------------------------------------------------------

Query:  ----------------VAVGSG------------------------------------------------DLSSFSIKDLLSLPQEAKNVLINALIESDG
                        +    G                                                DL    I DLL+L +E K+ +I  L   D 
Subjt:  ----------------VAVGSG------------------------------------------------DLSSFSIKDLLSLPQEAKNVLINALIESDG

Query:  TKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKLVIQ----------------
        + I TS A T  S C++I FSDEDLLLGSK HNRPL+VSG++REQ++++ILID+GSAVNI+PKSTM QLGIS+EELSNSKLVIQ                
Subjt:  TKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKLVIQ----------------

Query:  ---------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLE
                       DS+TTYK+LLGRPWIH NG++TSTLHQCFKFY+ GIKKV+AD+ PF++AESHFADAKFY KS++  E I TE+P+ K   K   E
Subjt:  ---------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLE

Query:  PQADAGKEVVENVKASNPKNGEASTSL-----EKSKVV---KDEKCSPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPE
         +    K+  +    ++ +NGE +T       E  K+    K+    PVLRYIPLSRRKKGESPFTECSK++ V + EILKE FT PLTKI + E +K E
Subjt:  PQADAGKEVVENVKASNPKNGEASTSL-----EKSKVV---KDEKCSPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPE

Query:  DDQIKAILPEKRTKDGFDPKAYKLLAK--------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEV
           ++A LPE+RT +GFDPKAYKL+AK                          QKKL K+GYS+P +R G+GY+S EPVRIT KGKAKVA+  HIT EE 
Subjt:  DDQIKAILPEKRTKDGFDPKAYKLLAK--------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEV

Query:  DDSKEEENVNNE
         DS+E + V ++
Subjt:  DDSKEEENVNNE

A0A5A7UWW2 Ty3-gypsy retrotransposon protein9.8e-23152.46Show/hide
Query:  ESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGKLSSV
        ES K  + +K+NPL+++    S + K    PDVMSVMMAD+  +  MAEMERK+N LMK V+ERD EI  L+ Q++T +     + L  + MI    +S+
Subjt:  ESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGKLSSV

Query:  RTNHNTPHQ-----------------LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREF
        R  +  P Q                 +PL     KFQQFDGK NPKQHIAHFVETCENAG RGD LV+QF+R+LKGNAF+                    
Subjt:  RTNHNTPHQ-----------------LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREF

Query:  LNRFYSTRRTVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSR-----------------------------LATRAHDMELSIASRGGKDPVVPD
              TRRTVSM ELT+TKQ KGEPVIDYINRWRALSLDC+DRL++                             LATRAHDMELSIA RG K+  V +
Subjt:  LNRFYSTRRTVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSR-----------------------------LATRAHDMELSIASRGGKDPVVPD

Query:  MRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSK----------GKE-----------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPN
        +RK+KKE K  EKV KS  KESMVVNTT LKFS +          G E                 +SD+ DMLEQLLEKQLI+L +CKRPE+ GKVNDPN
Subjt:  MRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSK----------GKE-----------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPN

Query:  YCRYHRVVSHPVEKCFVLKELILKLAR-EKKIELDLDEVAQTNHATVMANVGNENPLISSC----DYEERLIQFGTLDPVMLLGVAVGSGDLSSFSIKDL
        YC+YHRV+SHPVEKCFVLKELIL+L R +KKIELDL+E              +ENP + +C    + EE  I   +L+   L      S DLS F++ DL
Subjt:  YCRYHRVVSHPVEKCFVLKELILKLAR-EKKIELDLDEVAQTNHATVMANVGNENPLISSC----DYEERLIQFGTLDPVMLLGVAVGSGDLSSFSIKDL

Query:  LSLPQEAKNVLINALIESD--GTKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSN
        LSLPQE K +LINAL+ S    +  PT+   +    C++I FSDEDLLLGSK HNRPL+VSGY+REQRV +ILID+GSA+NIMPKSTM+QLGI M+ELSN
Subjt:  LSLPQEAKNVLINALIESD--GTKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSN

Query:  SKLVIQDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEV
        SKLVIQDS+TTYKLLLGRPWIHGN V+TSTLHQCFKFYQDG+KKVEAD NPFSEAESHFADAKFY+K+DN+ + +  E+PL+   D L L  ++ A +E 
Subjt:  SKLVIQDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEV

Query:  VENVKASNPKNGEASTSLEKSKVVKDEKCSP--VLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKD
         ++    + + GEASTS  KS ++ DEK S   +LRYIPLSRRKK ESPF +  + +KVGD+E+LKE  T PLTKIT+QE++    D  +A L ++RTKD
Subjt:  VENVKASNPKNGEASTSLEKSKVVKDEKCSP--VLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKD

Query:  GFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEVDDSKEEENVNNELLFL
        GFDPKAYKL+ K                         +KKLL+EG+ +P +RKGLGYKSPEP+RITRKGK KV D NHIT EEVD  KE+E  +  +   
Subjt:  GFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEVDDSKEEENVNNELLFL

Query:  GASGHQLLVLQYFKD
              +   QY KD
Subjt:  GASGHQLLVLQYFKD

A0A5A7VE63 Uncharacterized protein4.1e-21348.53Show/hide
Query:  NHNTPHQLPLSPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQR
        N   PH    +PKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFV+TLKGNAFDWYTDLEP++IDSWEQ+ER+FLNRFYSTRR VSM ELT+TKQR
Subjt:  NHNTPHQLPLSPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQR

Query:  KGEPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKES
        KGEPVIDYINRWRALSLDCKDRL+                              LATRAHDMELSIA+RG  D +VP++RKEKKEVK T+KV K ATKE+
Subjt:  KGEPVIDYINRWRALSLDCKDRLS-----------------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKES

Query:  MVVNTTPLKFSSKGKE--------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL
        MVV+TTPLK  SK K+                          +SD+PDML+QLLEKQLI+LPECKRP E+G+VNDPNYC+YHRV+SHPVEKCFVLKELIL
Subjt:  MVVNTTPLKFSSKGKE--------------------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELIL

Query:  KLAREKKIELDLDEVAQTNHATVMANVGNENPLISSCDYEERLIQFGTLDPVMLLG--------------------------------------------
        KLA +KKIEL++D+VAQ NHA V+    +   +I S      LIQFG+L+PV++                                              
Subjt:  KLAREKKIELDLDEVAQTNHATVMANVGNENPLISSCDYEERLIQFGTLDPVMLLG--------------------------------------------

Query:  ------------------------------VAVGSG------------------------------------------------DLSSFSIKDLLSLPQE
                                      +    G                                                DL    I DLL+L +E
Subjt:  ------------------------------VAVGSG------------------------------------------------DLSSFSIKDLLSLPQE

Query:  AKNVLINALIESDGTKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKLVIQ--
         K+ +I  L   D + I TS      S C++I FSDEDLLL SK HNRPL+VSGYI+EQ++++ILID+GSAV I+PKSTM QLGIS+EELSNSKLVIQ  
Subjt:  AKNVLINALIESDGTKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKLVIQ--

Query:  -----------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPT
                                     DSKTTYK+LLGRPWIH NG++TSTLHQCFKFY+  IKKV+AD+ PF++AESHFADAKFY KS++  E I T
Subjt:  -----------------------------DSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPT

Query:  EIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVV--------KDEKCSPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTM
        E+P+ K   K   E +    K+  +    ++ +NGE++T  +    V        K+    PVLRYIPLSRRKKGESPF ECSK++ V + +ILKE F  
Subjt:  EIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVV--------KDEKCSPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTM

Query:  PLTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK--------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGK
         LTKI + E +K E   ++A LPE+RT +GFDPKAYKL+AK                          QKKL K+GYS+P  R G+GY+S EPV+IT KGK
Subjt:  PLTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAK--------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGK

Query:  AKVADINHITAEEVDDSKE
        AKVA+  HIT EE  DSKE
Subjt:  AKVADINHITAEEVDDSKE

A0A5D3BIH8 Uncharacterized protein9.4e-23446.06Show/hide
Query:  SDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTL-SPDVMSVMMADVD-QDERMAEMERKLNLLMKAVD
        SD    P TRSRS+ I+  ED  P  VA  I K I +  K  + +K+NP  +     S++  + +  P++MSVM+ DVD  ++RM  +E+K+N+ MKAV+
Subjt:  SDSYTGPVTRSRSQGIEIKEDHTPLAVASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTL-SPDVMSVMMADVD-QDERMAEMERKLNLLMKAVD

Query:  ERDLEIAYLKNQLQ-----------TEKRLSLARPLLQERMIRGKLS---------------SVRTNHNTPHQLPLSPKFQQFDGKGNPKQHIAHFVETC
        ERD EIA LKN ++           T K  +  + ++QE   +   S               S++T +  P Q    P F     +GNPKQH+AHF+ETC
Subjt:  ERDLEIAYLKNQLQ-----------TEKRLSLARPLLQERMIRGKLS---------------SVRTNHNTPHQLPLSPKFQQFDGKGNPKQHIAHFVETC

Query:  ENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLS----------
        E AGTRGDLLVKQFVRTLKGNAFDWYTDLEP++IDSWEQ+ER+FLNRFYSTRR VSM ELT+TKQRKGEPVIDYINRWRALSLDCKDRL+          
Subjt:  ENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLS----------

Query:  -------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSKGKE------------------
                            LATRAHDMELSIA+RG  D +VP++RKEKKEVK T+K  K  TKE+MVV+TTPLK  SK K+                  
Subjt:  -------------------RLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSKGKE------------------

Query:  --------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELILKLAREKKIELDLDEVAQTNHATVMANVGNENPLI
                +SD+PDML+QLLEKQLI+LPECKRP E+G+VNDPNYC+YHRV+SHPVEKCFVLKELILKLA +KKIEL+LD+VAQTNHA V+    +    I
Subjt:  --------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPNYCRYHRVVSHPVEKCFVLKELILKLAREKKIELDLDEVAQTNHATVMANVGNENPLI

Query:  SSCDYEERLIQFGTLDPVMLLG--------------------------------------------------------------------------VAVG
         S      LIQFG+L+PV++                                                                            +   
Subjt:  SSCDYEERLIQFGTLDPVMLLG--------------------------------------------------------------------------VAVG

Query:  SG------------------------------------------------DLSSFSIKDLLSLPQEAKNVLINALIESDGTKIPTSEARTCASCCVTIGF
         G                                                DL    I DLL+L +E K+ +I  L   D + I TS A T  S C++I F
Subjt:  SG------------------------------------------------DLSSFSIKDLLSLPQEAKNVLINALIESDGTKIPTSEARTCASCCVTIGF

Query:  SDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKLVIQ-------------------------------DSKTT
        SDEDLLLGSK HNRPL+VSG++REQ++++ILID+GSAVNI+PKSTM QLGIS+EELSNSKLVIQ                               DS+TT
Subjt:  SDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKLVIQ-------------------------------DSKTT

Query:  YKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKN
        YK+LLGRPWIH NG++TSTLHQCFKFY+ GIKKV+AD+ PF++AESHFADAKFY KS++  E I TE+P+ K   K   E +    K+  +    ++ +N
Subjt:  YKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKN

Query:  GEASTSL-----EKSKVV---KDEKCSPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKDGFDPK
        GE +T       E  K+    K+    PVLRYIPLSRRKKGESPFTECSK++ V + EILKE FT PLTKI + E +K E   ++A LPE+RT +GFDPK
Subjt:  GEASTSL-----EKSKVV---KDEKCSPVLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKDGFDPK

Query:  AYKLLAK--------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEVDDSKEEENVNNE
        AYKL+AK                          QKKL K+GYS+P +R G+GY+S EPVRIT KGKAKVA+  HIT EE  DS+E + V ++
Subjt:  AYKLLAK--------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEVDDSKEEENVNNE

A0A5D3DR73 Ty3-gypsy retrotransposon protein3.7e-23052.35Show/hide
Query:  ESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGKLSSV
        ES K  + +K+NPL+++    S + K    PDVMSVMMAD+  +  MAEMERK+N LMK V+ERD EI  L+ Q++T +     + L  + MI    +S+
Subjt:  ESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGKLSSV

Query:  RTNHNTPHQ-----------------LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREF
        R  +  P Q                 +PL     KFQQFDGK NPKQHIAHFVETCENAG RGD LV+QF+R+LKGNAF+                    
Subjt:  RTNHNTPHQ-----------------LPLS---PKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREF

Query:  LNRFYSTRRTVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSR-----------------------------LATRAHDMELSIASRGGKDPVVPD
              TRRTVSM ELT+TKQ KGEPVIDYINRWRALSLDC+DRL++                             LATRAHDMELSIA RG K+  V +
Subjt:  LNRFYSTRRTVSMTELTSTKQRKGEPVIDYINRWRALSLDCKDRLSR-----------------------------LATRAHDMELSIASRGGKDPVVPD

Query:  MRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSK----------GKE-----------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPN
        +RK+KKE K  EKV KS  KESMVVNTT LKFS +          G E                 +SD+ DMLEQLLEKQLI+L +CKRPE+ GKVNDPN
Subjt:  MRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSK----------GKE-----------------NSDVPDMLEQLLEKQLIELPECKRPEELGKVNDPN

Query:  YCRYHRVVSHPVEKCFVLKELILKLAR-EKKIELDLDEVAQTNHATVMANVGNENPLISSC----DYEERLIQFGTLDPVMLLGVAVGSGDLSSFSIKDL
        YC+YHRV+SHPVEKCFVLKELIL+L R +KKIELDL+E              +ENP + +C    + EE  I   +L+   L      S DLS F++ DL
Subjt:  YCRYHRVVSHPVEKCFVLKELILKLAR-EKKIELDLDEVAQTNHATVMANVGNENPLISSC----DYEERLIQFGTLDPVMLLGVAVGSGDLSSFSIKDL

Query:  LSLPQEAKNVLINALIESD--GTKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSN
        LSLPQE K +LINAL+ S    +  PT+   +    C++I FSDEDLLLGSK HNRPL+VSGY+REQRV +ILID+GSA+NIMPKSTM+QLGI M+ELSN
Subjt:  LSLPQEAKNVLINALIESD--GTKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSN

Query:  SKLVIQDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEV
        SKLVIQDS+TTYKLLLGRPWIHGN V+TSTLHQCFKFYQDG+KKVEAD NPFSEAESHFADAKFY+K+DN+ + +  E+PL+   D L L  ++ A +E 
Subjt:  SKLVIQDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEV

Query:  VENVKASNPKNGEASTSLEKSKVVKDEKCSP--VLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKD
         ++    + + GEASTS  KS ++ DEK S   +LRYIPLSR KK ESPF +  + +KVGD+E+LKE  T PLTKIT+QE++    D  +A L ++RTKD
Subjt:  VENVKASNPKNGEASTSLEKSKVVKDEKCSP--VLRYIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKD

Query:  GFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEVDDSKEEENVNNELLFL
        GFDPKAYKL+ K                         +KKLL+EG+ +P +RKGLGYKSPEP+RITRKGK KV D NHIT EEVD  KE+E  +  +   
Subjt:  GFDPKAYKLLAK-------------------------QKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKAKVADINHITAEEVDDSKEEENVNNELLFL

Query:  GASGHQLLVLQYFKD
              +   QY KD
Subjt:  GASGHQLLVLQYFKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTGCTATACCAAATCCCCTGCAGCATCTGGATCGAGTGGTAAGGCCATGATCTGCAGCAATAGAAAAGGTTGTTTCATCTCTCATCATCAAGTCATTTGTGCCGA
TGTTTGCCCTTTAGAGAAAGACGAAGGGGTTGTGCGAGTTCCCCCTCATACGACTCAAGTGGTGAAGGCCCTTTCTTTCACGCTTGATCCAACCAACGTGATGGCACCAA
AGAAAGCTCCCGCGAAGGTTACCACATCAAGCGACTCTTACACTGGTCCTGTCACTCGTAGTCGCTCCCAAGGAATTGAGATCAAGGAGGATCATACTCCTCTAGCTGTT
GCAAGCAAGATCTCAAAGCTGATTGAAGAATCCTCTAAGGATAAGGTTGCAGTCAAAGACAACCCACTGTTCGAATCTGTCGTTCCAACATCTAAGCAGCCAAAGGATAC
GCTAAGTCCTGATGTGATGTCCGTCATGATGGCTGATGTAGATCAGGATGAAAGAATGGCAGAGATGGAAAGGAAACTCAATCTCTTAATGAAGGCAGTTGATGAGAGAG
ATCTTGAGATTGCCTATTTGAAGAACCAGCTGCAAACCGAGAAACGGCTGAGTCTAGCCAGACCCCTGCTGCAGGAAAGAATGATAAGGGGAAAGCTGTCATCCGTGAGG
ACCAACCACAACACTCCGCATCAGTTGCCTCTTTCCCCCAAATTCCAGCAGTTTGACGGAAAGGGCAATCCAAAGCAACACATCGCCCACTTTGTTGAAACCTGTGAGAA
CGCCGGTACTCGGGGAGATTTGCTTGTTAAGCAATTTGTCCGAACACTGAAGGGAAACGCGTTCGATTGGTACACTGACCTGGAGCCTGATACAATTGACAGCTGGGAAC
AGATGGAGAGAGAGTTCCTGAATCGCTTCTACAGTACGAGGCGAACAGTTAGTATGACAGAACTCACGAGCACAAAGCAGCGAAAGGGTGAGCCAGTCATTGATTACATC
AATCGTTGGAGAGCCTTGAGTCTTGACTGTAAAGATAGACTCTCGAGATTGGCGACTCGCGCTCATGATATGGAGTTGAGCATCGCTAGTAGAGGGGGCAAAGATCCTGT
AGTCCCAGATATGAGGAAAGAAAAGAAGGAGGTCAAGGGCACTGAGAAAGTTGCGAAGAGTGCCACGAAAGAATCCATGGTCGTCAACACGACCCCGTTGAAATTTTCCT
CCAAAGGAAAAGAAAACTCCGATGTCCCAGATATGTTGGAACAATTGTTGGAAAAGCAATTAATTGAACTCCCCGAATGCAAACGACCGGAAGAGTTAGGGAAAGTCAAT
GATCCTAACTACTGCAGGTATCACCGAGTTGTTAGTCATCCAGTGGAGAAATGTTTTGTATTGAAGGAGCTAATCCTTAAGTTGGCTCGTGAAAAGAAAATTGAGCTGGA
CCTTGATGAGGTGGCTCAAACAAATCATGCAACCGTAATGGCAAATGTAGGGAATGAAAATCCTCTAATCTCCTCGTGTGATTATGAAGAGAGGCTGATTCAATTTGGGA
CCCTTGATCCCGTAATGCTCCTCGGAGTGGCGGTAGGTTCAGGAGACTTATCCTCTTTCAGCATAAAGGACTTACTGTCACTTCCTCAGGAAGCTAAAAACGTCCTTATT
AACGCATTGATAGAGTCCGATGGTACAAAGATTCCAACCTCTGAGGCACGCACATGTGCTTCGTGTTGCGTGACTATAGGATTTTCCGATGAAGATTTGCTGCTAGGGTC
AAAGCCTCACAATAGACCTTTATTCGTCTCGGGATACATCCGAGAACAGAGGGTAAGTCGGATCCTCATCGACGATGGATCAGCTGTCAATATAATGCCTAAGTCAACTA
TGAAGCAATTAGGCATCTCGATGGAAGAATTATCAAACAGTAAATTGGTGATTCAAGACTCCAAGACCACTTATAAGTTGCTATTAGGTCGTCCTTGGATCCACGGCAAT
GGAGTGATAACTTCTACATTGCACCAGTGCTTCAAATTCTATCAAGATGGCATTAAGAAAGTTGAAGCTGACACTAACCCATTTTCAGAAGCAGAATCCCATTTCGCTGA
TGCAAAATTTTATATGAAGAGTGACAACACAGGGGAAACTATACCTACAGAAATCCCCTTGATAAAAAAGAATGATAAACTCGATTTAGAGCCACAAGCAGATGCAGGGA
AGGAAGTTGTTGAGAATGTGAAGGCATCTAACCCGAAGAATGGTGAAGCATCTACAAGCCTTGAGAAATCTAAAGTTGTAAAGGATGAGAAATGTTCACCTGTTCTACGA
TACATCCCTTTGTCCCGGCGTAAAAAGGGTGAATCACCTTTCACTGAATGTTCGAAAAGCATAAAGGTCGGTGACGTTGAAATTCTGAAGGAAGGTTTCACTATGCCTCT
TACAAAGATCACAGAGCAAGAGATTAGGAAACCTGAGGATGACCAGATAAAAGCAATTTTGCCTGAAAAACGAACTAAAGACGGGTTCGACCCGAAGGCATACAAACTTC
TAGCAAAGCAGAAGAAGCTTCTAAAGGAAGGCTATAGTCTGCCTACGACGAGAAAAGGACTTGGATATAAGTCGCCTGAACCAGTTCGCATAACAAGAAAAGGGAAGGCG
AAAGTGGCAGACATAAATCATATAACAGCAGAGGAGGTCGATGACTCAAAAGAAGAAGAGAACGTCAACAACGAGCTTCTGTTTTTAGGCGCATCAGGCCACCAGTTGCT
CGTGCTTCAGTATTTCAAAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTGCTATACCAAATCCCCTGCAGCATCTGGATCGAGTGGTAAGGCCATGATCTGCAGCAATAGAAAAGGTTGTTTCATCTCTCATCATCAAGTCATTTGTGCCGA
TGTTTGCCCTTTAGAGAAAGACGAAGGGGTTGTGCGAGTTCCCCCTCATACGACTCAAGTGGTGAAGGCCCTTTCTTTCACGCTTGATCCAACCAACGTGATGGCACCAA
AGAAAGCTCCCGCGAAGGTTACCACATCAAGCGACTCTTACACTGGTCCTGTCACTCGTAGTCGCTCCCAAGGAATTGAGATCAAGGAGGATCATACTCCTCTAGCTGTT
GCAAGCAAGATCTCAAAGCTGATTGAAGAATCCTCTAAGGATAAGGTTGCAGTCAAAGACAACCCACTGTTCGAATCTGTCGTTCCAACATCTAAGCAGCCAAAGGATAC
GCTAAGTCCTGATGTGATGTCCGTCATGATGGCTGATGTAGATCAGGATGAAAGAATGGCAGAGATGGAAAGGAAACTCAATCTCTTAATGAAGGCAGTTGATGAGAGAG
ATCTTGAGATTGCCTATTTGAAGAACCAGCTGCAAACCGAGAAACGGCTGAGTCTAGCCAGACCCCTGCTGCAGGAAAGAATGATAAGGGGAAAGCTGTCATCCGTGAGG
ACCAACCACAACACTCCGCATCAGTTGCCTCTTTCCCCCAAATTCCAGCAGTTTGACGGAAAGGGCAATCCAAAGCAACACATCGCCCACTTTGTTGAAACCTGTGAGAA
CGCCGGTACTCGGGGAGATTTGCTTGTTAAGCAATTTGTCCGAACACTGAAGGGAAACGCGTTCGATTGGTACACTGACCTGGAGCCTGATACAATTGACAGCTGGGAAC
AGATGGAGAGAGAGTTCCTGAATCGCTTCTACAGTACGAGGCGAACAGTTAGTATGACAGAACTCACGAGCACAAAGCAGCGAAAGGGTGAGCCAGTCATTGATTACATC
AATCGTTGGAGAGCCTTGAGTCTTGACTGTAAAGATAGACTCTCGAGATTGGCGACTCGCGCTCATGATATGGAGTTGAGCATCGCTAGTAGAGGGGGCAAAGATCCTGT
AGTCCCAGATATGAGGAAAGAAAAGAAGGAGGTCAAGGGCACTGAGAAAGTTGCGAAGAGTGCCACGAAAGAATCCATGGTCGTCAACACGACCCCGTTGAAATTTTCCT
CCAAAGGAAAAGAAAACTCCGATGTCCCAGATATGTTGGAACAATTGTTGGAAAAGCAATTAATTGAACTCCCCGAATGCAAACGACCGGAAGAGTTAGGGAAAGTCAAT
GATCCTAACTACTGCAGGTATCACCGAGTTGTTAGTCATCCAGTGGAGAAATGTTTTGTATTGAAGGAGCTAATCCTTAAGTTGGCTCGTGAAAAGAAAATTGAGCTGGA
CCTTGATGAGGTGGCTCAAACAAATCATGCAACCGTAATGGCAAATGTAGGGAATGAAAATCCTCTAATCTCCTCGTGTGATTATGAAGAGAGGCTGATTCAATTTGGGA
CCCTTGATCCCGTAATGCTCCTCGGAGTGGCGGTAGGTTCAGGAGACTTATCCTCTTTCAGCATAAAGGACTTACTGTCACTTCCTCAGGAAGCTAAAAACGTCCTTATT
AACGCATTGATAGAGTCCGATGGTACAAAGATTCCAACCTCTGAGGCACGCACATGTGCTTCGTGTTGCGTGACTATAGGATTTTCCGATGAAGATTTGCTGCTAGGGTC
AAAGCCTCACAATAGACCTTTATTCGTCTCGGGATACATCCGAGAACAGAGGGTAAGTCGGATCCTCATCGACGATGGATCAGCTGTCAATATAATGCCTAAGTCAACTA
TGAAGCAATTAGGCATCTCGATGGAAGAATTATCAAACAGTAAATTGGTGATTCAAGACTCCAAGACCACTTATAAGTTGCTATTAGGTCGTCCTTGGATCCACGGCAAT
GGAGTGATAACTTCTACATTGCACCAGTGCTTCAAATTCTATCAAGATGGCATTAAGAAAGTTGAAGCTGACACTAACCCATTTTCAGAAGCAGAATCCCATTTCGCTGA
TGCAAAATTTTATATGAAGAGTGACAACACAGGGGAAACTATACCTACAGAAATCCCCTTGATAAAAAAGAATGATAAACTCGATTTAGAGCCACAAGCAGATGCAGGGA
AGGAAGTTGTTGAGAATGTGAAGGCATCTAACCCGAAGAATGGTGAAGCATCTACAAGCCTTGAGAAATCTAAAGTTGTAAAGGATGAGAAATGTTCACCTGTTCTACGA
TACATCCCTTTGTCCCGGCGTAAAAAGGGTGAATCACCTTTCACTGAATGTTCGAAAAGCATAAAGGTCGGTGACGTTGAAATTCTGAAGGAAGGTTTCACTATGCCTCT
TACAAAGATCACAGAGCAAGAGATTAGGAAACCTGAGGATGACCAGATAAAAGCAATTTTGCCTGAAAAACGAACTAAAGACGGGTTCGACCCGAAGGCATACAAACTTC
TAGCAAAGCAGAAGAAGCTTCTAAAGGAAGGCTATAGTCTGCCTACGACGAGAAAAGGACTTGGATATAAGTCGCCTGAACCAGTTCGCATAACAAGAAAAGGGAAGGCG
AAAGTGGCAGACATAAATCATATAACAGCAGAGGAGGTCGATGACTCAAAAGAAGAAGAGAACGTCAACAACGAGCTTCTGTTTTTAGGCGCATCAGGCCACCAGTTGCT
CGTGCTTCAGTATTTCAAAGATTAA
Protein sequenceShow/hide protein sequence
MSCYTKSPAASGSSGKAMICSNRKGCFISHHQVICADVCPLEKDEGVVRVPPHTTQVVKALSFTLDPTNVMAPKKAPAKVTTSSDSYTGPVTRSRSQGIEIKEDHTPLAV
ASKISKLIEESSKDKVAVKDNPLFESVVPTSKQPKDTLSPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQTEKRLSLARPLLQERMIRGKLSSVR
TNHNTPHQLPLSPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPDTIDSWEQMEREFLNRFYSTRRTVSMTELTSTKQRKGEPVIDYI
NRWRALSLDCKDRLSRLATRAHDMELSIASRGGKDPVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTPLKFSSKGKENSDVPDMLEQLLEKQLIELPECKRPEELGKVN
DPNYCRYHRVVSHPVEKCFVLKELILKLAREKKIELDLDEVAQTNHATVMANVGNENPLISSCDYEERLIQFGTLDPVMLLGVAVGSGDLSSFSIKDLLSLPQEAKNVLI
NALIESDGTKIPTSEARTCASCCVTIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGISMEELSNSKLVIQDSKTTYKLLLGRPWIHGN
GVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKFYMKSDNTGETIPTEIPLIKKNDKLDLEPQADAGKEVVENVKASNPKNGEASTSLEKSKVVKDEKCSPVLR
YIPLSRRKKGESPFTECSKSIKVGDVEILKEGFTMPLTKITEQEIRKPEDDQIKAILPEKRTKDGFDPKAYKLLAKQKKLLKEGYSLPTTRKGLGYKSPEPVRITRKGKA
KVADINHITAEEVDDSKEEENVNNELLFLGASGHQLLVLQYFKD