; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg016829 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg016829
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationscaffold9:39916085..39926942
RNA-Seq ExpressionSpg016829
SyntenySpg016829
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]3.8e-1040.34Show/hide
Query:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA
        +++ + V+N +EGWTLVTRRKK+K+ + Q+E   +R YR K  SQ++  + K  RK   ++EE++     R+P+ LKDFFPKNF         EIV+CH 
Subjt:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA

Query:  VNTCEEGDSSSSPSQNAKK
         +T EE    S+  +   K
Subjt:  VNTCEEGDSSSSPSQNAKK

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]3.3e-0641.76Show/hide
Query:  SSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ
        S +   E   P++MSVM+TD  TSE+RMAE+EKK++ML++ VEE+D EIA L+N IESRD   +         +++N ++G  ++   + Q
Subjt:  SSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]5.4e-6546.93Show/hide
Query:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD
        ED+ EI++     T  +G   +       K+S  + DA   Q         K +AP+ E         SN P+LRY+PLSRRKKGESPFTECS  L V +
Subjt:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD

Query:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP
         EILKENFT PLT I K   ++ +    +  LP+RRT  GFDP AYKL+ KAGYDFTT TE KS+KIFD R E SPTQKKLQK+GY+IP SRAG+GY   
Subjt:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP

Query:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R
        EPVRIT K   KV +  HITVEE  D E+ K   SQ+ SVFDRI     R SV QR+  S +K   Q  T + TR S FQRL+ S  K     P     +
Subjt:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R

Query:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM
        S F RL  +  + + K     +++S  +  +    S  PSRMKRK F+ +NTEG+LK+
Subjt:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM

KAA0061113.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.5e-6738.82Show/hide
Query:  MLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFV
        ++LR+  EK  ++  L  ++   D        Q++   +E +DEGWT+VTRRKK+K   +Q+E  ++ NYRR N +QK K+K K  RK KLV E+++DF 
Subjt:  MLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFV

Query:  MSRQPVTLKDFFPKNFLNDDHEDIPEIVA-----------------CHAVNTCEEG----------------------------DSSSS-----------
         +++ VTL DFFP  FL D  ++ P +VA                 C +++  +E                             D+ S+           
Subjt:  MSRQPVTLKDFFPKNFLNDDHEDIPEIVA-----------------CHAVNTCEEG----------------------------DSSSS-----------

Query:  ---------------------------------------------PSQN----------AKKESIEEIDACEVQSKETSAHPAKSKAPKEEASSNVPILR
                                                     P  N          A +ES +          E S   AKS    +E +SN PILR
Subjt:  ---------------------------------------------PSQN----------AKKESIEEIDACEVQSKETSAHPAKSKAPKEEASSNVPILR

Query:  YVPLSRRKKGESPFTECSGILKVGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPS
        YVPLSRRKKGESPF E    LKVGD+E+LKE+FTTPLT ITK   QE K D  + +LP+RRTK GFDP AYK + KAGYDFTTHTEFKSLKI + + + S
Subjt:  YVPLSRRKKGESPFTECSGILKVGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPS

Query:  PTQKKLQKEGYAIPFSRAGLGYNPPEPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTR
         TQKKL +EG+AIP SR GLGY PPEP+RITRK   KVVD++HITV+E+ D  ++K  DSQ+ S FDR+     R  V +RL    +++K    TS+  R
Subjt:  PTQKKLQKEGYAIPFSRAGLGYNPPEPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTR

Query:  CSVFQRLSVS
         S FQRL+++
Subjt:  CSVFQRLSVS

TYJ98225.1 uncharacterized protein E5676_scaffold180G001270 [Cucumis melo var. makuwa]5.4e-6546.93Show/hide
Query:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD
        ED+ EI++     T  +G   +       K+S  + DA   Q         K +AP+ E         SN P+LRY+PLSRRKKGESPFTECS  L V +
Subjt:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD

Query:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP
         EILKENFT PLT I K   ++ +    +  LP+RRT  GFDP AYKL+ KAGYDFTT TE KS+KIFD R E SPTQKKLQK+GY+IP SRAG+GY   
Subjt:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP

Query:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R
        EPVRIT K   KV +  HITVEE  D E+ K   SQ+ SVFDRI     R SV QR+  S +K   Q  T + TR S FQRL+ S  K     P     +
Subjt:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R

Query:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM
        S F RL  +  + + K     +++S  +  +    S  PSRMKRK F+ +NTEG+LK+
Subjt:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM

TYJ98225.1 uncharacterized protein E5676_scaffold180G001270 [Cucumis melo var. makuwa]3.8e-1040.34Show/hide
Query:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA
        +++ + V+N +EGWTLVTRRKK+K+ + Q+E   +R YR K  SQ++  + K  RK   ++EE++     R+P+ LKDFFPKNF         EIV+CH 
Subjt:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA

Query:  VNTCEEGDSSSSPSQNAKK
         +T EE    S+  +   K
Subjt:  VNTCEEGDSSSSPSQNAKK

TYJ98225.1 uncharacterized protein E5676_scaffold180G001270 [Cucumis melo var. makuwa]4.1e-0438.46Show/hide
Query:  SSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ
        S +   E   P++MSVM+TD  TSE+RM  +EKK++M ++ VEE+D EIA L+N IESRD   +         +++N ++G  ++   + Q
Subjt:  SSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ

TYJ98225.1 uncharacterized protein E5676_scaffold180G001270 [Cucumis melo var. makuwa]5.4e-6546.93Show/hide
Query:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD
        ED+ EI++     T  +G   +       K+S  + DA   Q         K +AP+ E         SN P+LRY+PLSRRKKGESPFTECS  L V +
Subjt:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD

Query:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP
         EILKENFT PLT I K   ++ +    +  LP+RRT  GFDP AYKL+ KAGYDFTT TE KS+KIFD R E SPTQKKLQK+GY+IP SRAG+GY   
Subjt:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP

Query:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R
        EPVRIT K   KV +  HITVEE  D E+ K   SQ+ SVFDRI     R SV QR+  S +K   Q  T + TR S FQRL+ S  K     P     +
Subjt:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R

Query:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM
        S F RL  +  + + K     +++S  +  +    S  PSRMKRK F+ +NTEG+LK+
Subjt:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM

TYK05005.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-6632.4Show/hide
Query:  MLLRMVEEKD-----QEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEE
        ++LR+  EK      +E  +    +    Q+ A +  Q++   +E +DE WT+VTRRKK+K   +Q+E   +RNYRR N +QK K+K K  RK KL+ +E
Subjt:  MLLRMVEEKD-----QEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEE

Query:  NQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHAVNTCEEG--------------------------------------------------------
        ++DF  +++ +TL DFFP  FL D  ++ P +VACHA+N  EE                                                         
Subjt:  NQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHAVNTCEEG--------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------DSSSSPSQNA-
                                                                                                 ++ S+P   A 
Subjt:  -----------------------------------------------------------------------------------------DSSSSPSQNA-

Query:  ----------KKESIEEIDACEV----------------------------QSKETSAHPAKSKAPKEEASSNVPILRYVPLSRRKKGESPFTECSGILK
                  K +S  E+ + EV                            +  E S   AKS    +E +SN  ILRYVPLSRRKKGESPF E    LK
Subjt:  ----------KKESIEEIDACEV----------------------------QSKETSAHPAKSKAPKEEASSNVPILRYVPLSRRKKGESPFTECSGILK

Query:  VGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGY
        VGD+E+LKE+FTTPLT ITK   QE K D T+ +LP+RRTK  FDP AYKL+ KAGYDFTTHTEFKSLKI + + + S TQKKL +EG+AIP SR GLGY
Subjt:  VGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGY

Query:  NPPEPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPPRSV
          PEP+RITRK   K+VD++HITV+E+ D   +K  DSQ+ S FDRI     R  V +RL  + +++K    TS   R S F+RLS++  K    P   +
Subjt:  NPPEPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPPRSV

Query:  FDRL-------------------QATYGQCKGKLKSLETDESD------EMNDNNGFFSTVPSRMKRKPFILIN-TEGALKM
         +RL                     +      ++K +  +         E+       S VPSRMKRK F+ +N ++G+LK+
Subjt:  FDRL-------------------QATYGQCKGKLKSLETDESD------EMNDNNGFFSTVPSRMKRKPFILIN-TEGALKM

TYK18071.1 uncharacterized protein E5676_scaffold306G004020 [Cucumis melo var. makuwa]3.8e-1040.34Show/hide
Query:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA
        +++ + V+N +EGWTLVTRRKK+K+ + Q+E   +R YR K  SQ++  + K  RK   ++EE++     R+P+ LKDFFPKNF         EIV+CH 
Subjt:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA

Query:  VNTCEEGDSSSSPSQNAKK
         +T EE    S+  +   K
Subjt:  VNTCEEGDSSSSPSQNAKK

TYK18071.1 uncharacterized protein E5676_scaffold306G004020 [Cucumis melo var. makuwa]2.2e-0542.35Show/hide
Query:  ETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ
        E S P++MSVM+TD  TSE+RMAE+EKK++ML++ VEE+D  IA  +N IESRD   +        ++++N ++G  ++   + Q
Subjt:  ETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H2.6e-6546.93Show/hide
Query:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD
        ED+ EI++     T  +G   +       K+S  + DA   Q         K +AP+ E         SN P+LRY+PLSRRKKGESPFTECS  L V +
Subjt:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD

Query:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP
         EILKENFT PLT I K   ++ +    +  LP+RRT  GFDP AYKL+ KAGYDFTT TE KS+KIFD R E SPTQKKLQK+GY+IP SRAG+GY   
Subjt:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP

Query:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R
        EPVRIT K   KV +  HITVEE  D E+ K   SQ+ SVFDRI     R SV QR+  S +K   Q  T + TR S FQRL+ S  K     P     +
Subjt:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R

Query:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM
        S F RL  +  + + K     +++S  +  +    S  PSRMKRK F+ +NTEG+LK+
Subjt:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM

A0A5A7TZU9 Ribonuclease H1.8e-1040.34Show/hide
Query:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA
        +++ + V+N +EGWTLVTRRKK+K+ + Q+E   +R YR K  SQ++  + K  RK   ++EE++     R+P+ LKDFFPKNF         EIV+CH 
Subjt:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA

Query:  VNTCEEGDSSSSPSQNAKK
         +T EE    S+  +   K
Subjt:  VNTCEEGDSSSSPSQNAKK

A0A5A7TZU9 Ribonuclease H1.6e-0641.76Show/hide
Query:  SSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ
        S +   E   P++MSVM+TD  TSE+RMAE+EKK++ML++ VEE+D EIA L+N IESRD   +         +++N ++G  ++   + Q
Subjt:  SSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ

A0A5A7TZU9 Ribonuclease H2.6e-6546.93Show/hide
Query:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD
        ED+ EI++     T  +G   +       K+S  + DA   Q         K +AP+ E         SN P+LRY+PLSRRKKGESPFTECS  L V +
Subjt:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD

Query:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP
         EILKENFT PLT I K   ++ +    +  LP+RRT  GFDP AYKL+ KAGYDFTT TE KS+KIFD R E SPTQKKLQK+GY+IP SRAG+GY   
Subjt:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP

Query:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R
        EPVRIT K   KV +  HITVEE  D E+ K   SQ+ SVFDRI     R SV QR+  S +K   Q  T + TR S FQRL+ S  K     P     +
Subjt:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R

Query:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM
        S F RL  +  + + K     +++S  +  +    S  PSRMKRK F+ +NTEG+LK+
Subjt:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM

A0A5D3BIH8 Uncharacterized protein1.8e-1040.34Show/hide
Query:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA
        +++ + V+N +EGWTLVTRRKK+K+ + Q+E   +R YR K  SQ++  + K  RK   ++EE++     R+P+ LKDFFPKNF         EIV+CH 
Subjt:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA

Query:  VNTCEEGDSSSSPSQNAKK
         +T EE    S+  +   K
Subjt:  VNTCEEGDSSSSPSQNAKK

A0A5D3BIH8 Uncharacterized protein2.0e-0438.46Show/hide
Query:  SSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ
        S +   E   P++MSVM+TD  TSE+RM  +EKK++M ++ VEE+D EIA L+N IESRD   +         +++N ++G  ++   + Q
Subjt:  SSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ

A0A5D3BIH8 Uncharacterized protein2.6e-6546.93Show/hide
Query:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD
        ED+ EI++     T  +G   +       K+S  + DA   Q         K +AP+ E         SN P+LRY+PLSRRKKGESPFTECS  L V +
Subjt:  EDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEIDACEVQSKETSAHPAKSKAPKEE-------ASSNVPILRYVPLSRRKKGESPFTECSGILKVGD

Query:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP
         EILKENFT PLT I K   ++ +    +  LP+RRT  GFDP AYKL+ KAGYDFTT TE KS+KIFD R E SPTQKKLQK+GY+IP SRAG+GY   
Subjt:  VEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPP

Query:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R
        EPVRIT K   KV +  HITVEE  D E+ K   SQ+ SVFDRI     R SV QR+  S +K   Q  T + TR S FQRL+ S  K     P     +
Subjt:  EPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPP-----R

Query:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM
        S F RL  +  + + K     +++S  +  +    S  PSRMKRK F+ +NTEG+LK+
Subjt:  SVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKM

A0A5D3BY54 Ty3-gypsy retrotransposon protein3.6e-6738.82Show/hide
Query:  MLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFV
        ++LR+  EK  ++  L  ++   D        Q++   +E +DEGWT+VTRRKK+K   +Q+E  ++ NYRR N +QK K+K K  RK KLV E+++DF 
Subjt:  MLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFV

Query:  MSRQPVTLKDFFPKNFLNDDHEDIPEIVA-----------------CHAVNTCEEG----------------------------DSSSS-----------
         +++ VTL DFFP  FL D  ++ P +VA                 C +++  +E                             D+ S+           
Subjt:  MSRQPVTLKDFFPKNFLNDDHEDIPEIVA-----------------CHAVNTCEEG----------------------------DSSSS-----------

Query:  ---------------------------------------------PSQN----------AKKESIEEIDACEVQSKETSAHPAKSKAPKEEASSNVPILR
                                                     P  N          A +ES +          E S   AKS    +E +SN PILR
Subjt:  ---------------------------------------------PSQN----------AKKESIEEIDACEVQSKETSAHPAKSKAPKEEASSNVPILR

Query:  YVPLSRRKKGESPFTECSGILKVGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPS
        YVPLSRRKKGESPF E    LKVGD+E+LKE+FTTPLT ITK   QE K D  + +LP+RRTK GFDP AYK + KAGYDFTTHTEFKSLKI + + + S
Subjt:  YVPLSRRKKGESPFTECSGILKVGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPS

Query:  PTQKKLQKEGYAIPFSRAGLGYNPPEPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTR
         TQKKL +EG+AIP SR GLGY PPEP+RITRK   KVVD++HITV+E+ D  ++K  DSQ+ S FDR+     R  V +RL    +++K    TS+  R
Subjt:  PTQKKLQKEGYAIPFSRAGLGYNPPEPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTR

Query:  CSVFQRLSVS
         S FQRL+++
Subjt:  CSVFQRLSVS

A0A5D3C0W6 Ty3-gypsy retrotransposon protein6.2e-6732.4Show/hide
Query:  MLLRMVEEKD-----QEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEE
        ++LR+  EK      +E  +    +    Q+ A +  Q++   +E +DE WT+VTRRKK+K   +Q+E   +RNYRR N +QK K+K K  RK KL+ +E
Subjt:  MLLRMVEEKD-----QEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEE

Query:  NQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHAVNTCEEG--------------------------------------------------------
        ++DF  +++ +TL DFFP  FL D  ++ P +VACHA+N  EE                                                         
Subjt:  NQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHAVNTCEEG--------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------DSSSSPSQNA-
                                                                                                 ++ S+P   A 
Subjt:  -----------------------------------------------------------------------------------------DSSSSPSQNA-

Query:  ----------KKESIEEIDACEV----------------------------QSKETSAHPAKSKAPKEEASSNVPILRYVPLSRRKKGESPFTECSGILK
                  K +S  E+ + EV                            +  E S   AKS    +E +SN  ILRYVPLSRRKKGESPF E    LK
Subjt:  ----------KKESIEEIDACEV----------------------------QSKETSAHPAKSKAPKEEASSNVPILRYVPLSRRKKGESPFTECSGILK

Query:  VGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGY
        VGD+E+LKE+FTTPLT ITK   QE K D T+ +LP+RRTK  FDP AYKL+ KAGYDFTTHTEFKSLKI + + + S TQKKL +EG+AIP SR GLGY
Subjt:  VGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGYDFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGY

Query:  NPPEPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPPRSV
          PEP+RITRK   K+VD++HITV+E+ D   +K  DSQ+ S FDRI     R  V +RL  + +++K    TS   R S F+RLS++  K    P   +
Subjt:  NPPEPVRITRKW--KVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKGQSKTSTPTRCSVFQRLSVSTSKKVDEPPRSV

Query:  FDRL-------------------QATYGQCKGKLKSLETDESD------EMNDNNGFFSTVPSRMKRKPFILIN-TEGALKM
         +RL                     +      ++K +  +         E+       S VPSRMKRK F+ +N ++G+LK+
Subjt:  FDRL-------------------QATYGQCKGKLKSLETDESD------EMNDNNGFFSTVPSRMKRKPFILIN-TEGALKM

A0A5D3D1E5 Ribonuclease H1.8e-1040.34Show/hide
Query:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA
        +++ + V+N +EGWTLVTRRKK+K+ + Q+E   +R YR K  SQ++  + K  RK   ++EE++     R+P+ LKDFFPKNF         EIV+CH 
Subjt:  QDQLESVENEDEGWTLVTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHA

Query:  VNTCEEGDSSSSPSQNAKK
         +T EE    S+  +   K
Subjt:  VNTCEEGDSSSSPSQNAKK

A0A5D3D1E5 Ribonuclease H1.0e-0542.35Show/hide
Query:  ETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ
        E S P++MSVM+TD  TSE+RMAE+EKK++ML++ VEE+D  IA  +N IESRD   +        ++++N ++G  ++   + Q
Subjt:  ETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTLVTRRKKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAGACGCATGACAGAATCTGCTCAACTTGAAATCGCTACTGCAGAAGATCTGATGCTTGGCAAATTCATCTATTCTTCTCAGCAACCAAATGAAACATCCAGTCC
TGATGTGATGTCTGTTATGATGACTGATGCTGGAACTAGTGAAGAACGAATGGCTGAGGTTGAGAAGAAAATCAGCATGCTACTGAGGATGGTCGAAGAGAAAGACCAAG
AAATCGCCTCTCTCAGGAACCAGATAGAAAGTCGGGATCAAAAATCTGCAAAAGACATCTGCCAAGACCAACTTGAGTCTGTTGAAAATGAAGACGAAGGATGGACTTTG
GTAACTCGTCGAAAGAAGCAAAAGAAACATTACGTTCAAAGAGAACCTCACATGTTCCGAAACTACAGGAGAAAGAACATGTCACAAAAACAGAAAAGAAAGATAAAAGG
TTTGAGAAAGCCTAAACTAGTTGTCGAAGAGAATCAAGATTTTGTTATGTCTCGACAACCAGTGACCCTTAAAGATTTCTTCCCGAAGAACTTTCTTAATGACGATCATG
AGGACATCCCTGAGATAGTCGCATGTCATGCTGTCAATACGTGCGAAGAAGGCGATTCTTCCTCAAGTCCATCTCAGAATGCGAAGAAGGAGTCGATCGAAGAAATAGAT
GCTTGTGAAGTTCAATCAAAGGAAACATCCGCTCACCCTGCAAAATCGAAAGCTCCAAAGGAAGAGGCATCGTCAAACGTCCCTATATTGCGCTACGTTCCTTTATCTCG
ACGCAAGAAAGGGGAATCACCATTCACAGAATGTTCAGGAATTTTGAAGGTTGGTGATGTGGAGATTCTAAAAGAAAATTTCACCACGCCTCTGACAGCAATCACCAAGC
AAGTGATTCAGGAATCAAAGACGGATCGTACCAAAGTGACTTTGCCGAAAAGACGAACGAAGGCTGGATTTGACCCAAATGCATACAAACTCCTAGAAAAGGCAGGATAC
GACTTTACAACTCATACGGAGTTTAAAAGTCTAAAAATCTTCGATGGGAGATCTGAGCCTTCTCCGACACAGAAGAAGCTCCAGAAAGAAGGTTATGCCATACCATTCTC
GAGAGCCGGACTAGGGTACAATCCCCCAGAGCCAGTTCGTATAACCAGAAAATGGAAAGTTGTAGATGCACATCATATAACAGTAGAAGAGATGGGCGATCCAGAAGATA
AAAAGAATGATGATAGCCAGAAAATCTCTGTCTTCGATCGCATTAAAGCGCCGACTACTCGTCTTTCAGTTCATCAGCGGCTGAAATACTCATTTTCGAAAAAGAAAGGT
CAGAGCAAAACTTCTACTCCTACCCGATGTTCAGTGTTTCAGCGTTTGAGTGTGTCTACATCGAAGAAAGTAGATGAACCGCCGAGATCCGTGTTCGATCGCCTTCAAGC
AACCTATGGCCAATGCAAGGGAAAGTTGAAAAGTCTTGAAACAGATGAATCCGACGAGATGAATGACAACAACGGATTTTTCAGTACTGTTCCTTCACGGATGAAGAGAA
AACCTTTTATTCTCATAAATACAGAAGGTGCTTTAAAAATGGCTTCGTTCTTCAAGTTCAGTAATTCGCTCGCTCCAAGTTTGGCATCTCGTCTCAAGTTTGGTGCTTCG
CTTCTTCAAAGTTTGGCGTCTCACCCCTTCAAGTTTGGCACCTTGTTTTCTCTAAAGTTTGCATTCTCCATGCTGGCATTTGTCGACCTCACTTCACTTTCAAGGGTGAC
AACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCTTGCAGGAAACTACA
GTCATCAAAGCGACTGGTCTAGACAGAGTGGAATCACTGTAGGCGAGTCTGGTGACTACTCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGA
GTGCATCACTGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGAGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCGAATCTGGTTACTCAG
ATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGAAATCACTACAAGTGAATTTGGAAACGACTGTTGCACGATGCTTACATATTCAAGTCAAACACATAAGGAAA
AGTTGATTCTTGGCCAAGGAAATTTTTGCCTAATCAAATATGGCAATGAATTTCCTACAATTGCTATTAAGAGTCAGAGAAGCCAGAGCCCAGAGCATTCTCCCAAGGTC
CAGAGTCTTCAGAAGTCAGAGAGTCCAGAGAATTCAGAAAATCCAAGATTCAGAATTCGTCAATCTCAAGACTCAGAAGATCCAACGACTGGAAAACTCCGAAGAATCAA
CCGTTTCTTCATCAAGATACAAGTTGAAGATTCAACCTTCTTCCGTCTGAGATCAAGCTCGCCAGCCCTCAGATCAGACTCTACATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGAGACGCATGACAGAATCTGCTCAACTTGAAATCGCTACTGCAGAAGATCTGATGCTTGGCAAATTCATCTATTCTTCTCAGCAACCAAATGAAACATCCAGTCC
TGATGTGATGTCTGTTATGATGACTGATGCTGGAACTAGTGAAGAACGAATGGCTGAGGTTGAGAAGAAAATCAGCATGCTACTGAGGATGGTCGAAGAGAAAGACCAAG
AAATCGCCTCTCTCAGGAACCAGATAGAAAGTCGGGATCAAAAATCTGCAAAAGACATCTGCCAAGACCAACTTGAGTCTGTTGAAAATGAAGACGAAGGATGGACTTTG
GTAACTCGTCGAAAGAAGCAAAAGAAACATTACGTTCAAAGAGAACCTCACATGTTCCGAAACTACAGGAGAAAGAACATGTCACAAAAACAGAAAAGAAAGATAAAAGG
TTTGAGAAAGCCTAAACTAGTTGTCGAAGAGAATCAAGATTTTGTTATGTCTCGACAACCAGTGACCCTTAAAGATTTCTTCCCGAAGAACTTTCTTAATGACGATCATG
AGGACATCCCTGAGATAGTCGCATGTCATGCTGTCAATACGTGCGAAGAAGGCGATTCTTCCTCAAGTCCATCTCAGAATGCGAAGAAGGAGTCGATCGAAGAAATAGAT
GCTTGTGAAGTTCAATCAAAGGAAACATCCGCTCACCCTGCAAAATCGAAAGCTCCAAAGGAAGAGGCATCGTCAAACGTCCCTATATTGCGCTACGTTCCTTTATCTCG
ACGCAAGAAAGGGGAATCACCATTCACAGAATGTTCAGGAATTTTGAAGGTTGGTGATGTGGAGATTCTAAAAGAAAATTTCACCACGCCTCTGACAGCAATCACCAAGC
AAGTGATTCAGGAATCAAAGACGGATCGTACCAAAGTGACTTTGCCGAAAAGACGAACGAAGGCTGGATTTGACCCAAATGCATACAAACTCCTAGAAAAGGCAGGATAC
GACTTTACAACTCATACGGAGTTTAAAAGTCTAAAAATCTTCGATGGGAGATCTGAGCCTTCTCCGACACAGAAGAAGCTCCAGAAAGAAGGTTATGCCATACCATTCTC
GAGAGCCGGACTAGGGTACAATCCCCCAGAGCCAGTTCGTATAACCAGAAAATGGAAAGTTGTAGATGCACATCATATAACAGTAGAAGAGATGGGCGATCCAGAAGATA
AAAAGAATGATGATAGCCAGAAAATCTCTGTCTTCGATCGCATTAAAGCGCCGACTACTCGTCTTTCAGTTCATCAGCGGCTGAAATACTCATTTTCGAAAAAGAAAGGT
CAGAGCAAAACTTCTACTCCTACCCGATGTTCAGTGTTTCAGCGTTTGAGTGTGTCTACATCGAAGAAAGTAGATGAACCGCCGAGATCCGTGTTCGATCGCCTTCAAGC
AACCTATGGCCAATGCAAGGGAAAGTTGAAAAGTCTTGAAACAGATGAATCCGACGAGATGAATGACAACAACGGATTTTTCAGTACTGTTCCTTCACGGATGAAGAGAA
AACCTTTTATTCTCATAAATACAGAAGGTGCTTTAAAAATGGCTTCGTTCTTCAAGTTCAGTAATTCGCTCGCTCCAAGTTTGGCATCTCGTCTCAAGTTTGGTGCTTCG
CTTCTTCAAAGTTTGGCGTCTCACCCCTTCAAGTTTGGCACCTTGTTTTCTCTAAAGTTTGCATTCTCCATGCTGGCATTTGTCGACCTCACTTCACTTTCAAGGGTGAC
AACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCTTGCAGGAAACTACA
GTCATCAAAGCGACTGGTCTAGACAGAGTGGAATCACTGTAGGCGAGTCTGGTGACTACTCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGA
GTGCATCACTGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGAGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCGAATCTGGTTACTCAG
ATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGAAATCACTACAAGTGAATTTGGAAACGACTGTTGCACGATGCTTACATATTCAAGTCAAACACATAAGGAAA
AGTTGATTCTTGGCCAAGGAAATTTTTGCCTAATCAAATATGGCAATGAATTTCCTACAATTGCTATTAAGAGTCAGAGAAGCCAGAGCCCAGAGCATTCTCCCAAGGTC
CAGAGTCTTCAGAAGTCAGAGAGTCCAGAGAATTCAGAAAATCCAAGATTCAGAATTCGTCAATCTCAAGACTCAGAAGATCCAACGACTGGAAAACTCCGAAGAATCAA
CCGTTTCTTCATCAAGATACAAGTTGAAGATTCAACCTTCTTCCGTCTGAGATCAAGCTCGCCAGCCCTCAGATCAGACTCTACATTTTGA
Protein sequenceShow/hide protein sequence
MLRRMTESAQLEIATAEDLMLGKFIYSSQQPNETSSPDVMSVMMTDAGTSEERMAEVEKKISMLLRMVEEKDQEIASLRNQIESRDQKSAKDICQDQLESVENEDEGWTL
VTRRKKQKKHYVQREPHMFRNYRRKNMSQKQKRKIKGLRKPKLVVEENQDFVMSRQPVTLKDFFPKNFLNDDHEDIPEIVACHAVNTCEEGDSSSSPSQNAKKESIEEID
ACEVQSKETSAHPAKSKAPKEEASSNVPILRYVPLSRRKKGESPFTECSGILKVGDVEILKENFTTPLTAITKQVIQESKTDRTKVTLPKRRTKAGFDPNAYKLLEKAGY
DFTTHTEFKSLKIFDGRSEPSPTQKKLQKEGYAIPFSRAGLGYNPPEPVRITRKWKVVDAHHITVEEMGDPEDKKNDDSQKISVFDRIKAPTTRLSVHQRLKYSFSKKKG
QSKTSTPTRCSVFQRLSVSTSKKVDEPPRSVFDRLQATYGQCKGKLKSLETDESDEMNDNNGFFSTVPSRMKRKPFILINTEGALKMASFFKFSNSLAPSLASRLKFGAS
LLQSLASHPFKFGTLFSLKFAFSMLAFVDLTSLSRVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTLAGNYSHQSDWSRQSGITVGESGDYSCRLLRSPNKMGTGLAG
VHHCESGDYPCRLLRSPNKMRTGLAGVHHCRRIWLLRSPNEIGDWSSRSEITTSEFGNDCCTMLTYSSQTHKEKLILGQGNFCLIKYGNEFPTIAIKSQRSQSPEHSPKV
QSLQKSESPENSENPRFRIRQSQDSEDPTTGKLRRINRFFIKIQVEDSTFFRLRSSSPALRSDSTF