CuGenDBv2

Lag0026342 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026342
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr10:35203581..35205326
RNA-Seq Expression: Lag0026342
Synteny: Lag0026342
Gene Ontology terms: NA
InterPro domains: NA
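The genome location field above packs the chromosome and both coordinates into one string. The following is a minimal sketch of how such a string could be split and the locus length computed. It assumes the coordinates are 1-based and inclusive (a common convention, not stated on this page); the function names `parse_location` and `locus_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def locus_length(location: str) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(parse_location("chr10:35203581..35205326"))  # ('chr10', 35203581, 35205326)
print(locus_length("chr10:35203581..35205326"))    # 1746
```

If the database instead used 0-based, half-open coordinates, the `+ 1` would have to be dropped.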


Homology
GenBank top hits (e value, % identity, alignment)
CDH30699.1 putative Ty1-copia-like retrotransposon [Cercis chinensis] (e value: 7.0e-72; % identity: 38.01)
Query:  SFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYL---DTAQTQ-----VNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTA
        + PT    L IKL  +N+L+W NQLLN ++A   E ++ GT   P Q++   DTA T      +NP ++LWQ+ NR++MSWIYSSLTE  + +I+  ++A
Subjt:  SFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYL---DTAQTQ-----VNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTA

Query:  FDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAF
         +IW  LR    S+S AR+M LR QLQ  RK  L+V  Y+ +I+ I D   AIGE +S  D    +L GLGS+YNP V SI +R D   +  ++S L  +
Subjt:  FDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAF

Query:  DARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKI
        + RLE Q++V+    +QAN A  +++  +++P       + +++ S  P   T  F  +  + G G  G           S K   N  SRPQCQIC KI
Subjt:  DARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKI

Query:  GHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPV
        GH+A  C++R +  +Q P AS    F +  Q P A +  P                 +  WF+D+GATHH+T DLANL   + + G+++V++GNG  L V
Subjt:  GHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPV

Query:  RHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDLASRIIV
         H G + I  S    + LK++LH P+++ NL+S+ +LC DN A+VEFY  +F V D  ++ I+
Subjt:  RHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDLASRIIV

GFZ12741.1 UBX domain-containing protein [Actinidia rufa] (e value: 2.3e-70; % identity: 36.99)
Query:  THPSDFPYPPPPHSSLPFFQNYNGAQPFYPPPQPSP---------SFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYLDTAQTQ
        T  S  P  PPP +S P         P    P+P+P         + P++  PL +KL D NY++W  QLLN +IA  +E  ++G+   P ++LD  Q Q
Subjt:  THPSDFPYPPPPHSSLPFFQNYNGAQPFYPPPQPSP---------SFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYLDTAQTQ

Query:  VNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHF
         NP F  WQ++NR++MSWIY+S+ E  +G+I+G ++A  IWE L  +  ++S A L  LR+ LQ  +K+ LT   Y+ K + + +   +IGEP++Y DH 
Subjt:  VNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHF

Query:  AYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQK-RPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPT
         Y L GLG DYNPFVTSIQ++  RP + +  S                             TS  +K +   PS N  P+ NS   P             
Subjt:  AYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQK-RPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPT

Query:  PGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWF
           G    P  S  PSS  P        RP+CQIC K GH A  CY+  N  YQ P    P   FN     +   PNP+ S+      +   S PD SW+
Subjt:  PGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWF

Query:  MDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLV
        MDSGA+HH TPDL  L   S Y G +QV +GNG + P+ +IG +  L + +  + L  + H+P L+ NL+S++RLC DN AF+EFY   FLV
Subjt:  MDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLV

RVW25398.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 2.1e-68; % identity: 36.01)
Query:  PSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLING-TPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWE
        P+ +   L  PL IKL   N++LW+ Q+ N I A   E  I G  P P + + T   +VNP FL+W++++R+++SWIYSSLT + +G+I+G  T+ + W 
Subjt:  PSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLING-TPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWE

Query:  QLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLE
         L+    +S+ AR M LR   Q  +K +LT+ +Y+ K+K I+D   AIGEP+  +D    +L GLG++YNP V S+  R D   L  V S+LL  + RL 
Subjt:  QLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLE

Query:  KQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNN----NSRPQCQICGKIG
         Q+T    +++ AN A         RP R   N +PS    P+   PT   P   P             Q P    P    NN    N+RPQCQ+CGK G
Subjt:  KQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNN----NSRPQCQICGKIG

Query:  HMALVCYNRHNPLYQAPTASSPQAFFNHLQSPS-ATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPV
        HM L CY+R +  YQ P A +       L  P+ A +  P+++S D             SWF+DSGATHH++   AN+   + Y G + +++GNG SLP+
Subjt:  HMALVCYNRHNPLYQAPTASSPQAFFNHLQSPS-ATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPV

Query:  RHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDLASRI
          +G + +   +   V L ++LH P+L+ NLIS+++ C DN   +EF+   F V D  +++
Subjt:  RHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDLASRI

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 6.2e-76; % identity: 39.06)
Query:  SDFPYPPPPHSSLPFFQNYNGAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLIN-GTPAPVQYLDTAQTQVNPHFLLWQKFN
        S FP  P  +S+     N N A        PSPS   L+  L+IKL ++N LL  +QLLN IIA  +E  I+    +P +YLD A  QVNP F+ W + N
Subjt:  SDFPYPPPPHSSLPFFQNYNGAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLIN-GTPAPVQYLDTAQTQVNPHFLLWQKFN

Query:  RILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYN
        +++MSWIYSSLT   +G+I+  STA DIW  L    ES S A +MSL SQLQ+ +K ++ +S+YL+++K + D+F  IGEPLSYRD    ILEGL  +Y+
Subjt:  RILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYN

Query:  PFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQ
         FVTSI NR+DRP L +V SLL  ++ RL ++S   NLN  QAN                                                        
Subjt:  PFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQ

Query:  RPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHP--------DESWFMDSGA
              P+ P  NNS PQCQICGK GH+AL  Y+R N  Y  P   +  AF     +P+     P  +S  +S  ++T++ P        D SW+MDSGA
Subjt:  RPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHP--------DESWFMDSGA

Query:  THHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVND
        THH TP+  ++     Y   +  ++GN   + + HIG + +L SS + + L  +LHTP +S  LIS+ RL  DN+AFVEFY   FLV D
Subjt:  THHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVND

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] (e value: 2.3e-99; % identity: 50.71)
Query:  YPPPPHSSLPFFQNYNGAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYLDTAQTQVNPHFLLWQKFNRILM
        +PPP  + L        AQP  P P  +  FPTL  PLN+KL+D+N+LLW NQLLN +IA  +   ++GT   P Q+LD  Q Q NP +  W+++NR+LM
Subjt:  YPPPPHSSLPFFQNYNGAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYLDTAQTQVNPHFLLWQKFNRILM

Query:  SWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVT
         WIYSSL+E+K+GE++   T  DIW  L  V +S +TAR+M L+++LQ  RKD  +VSQYLAKIK+I D+F A+GEPLSYRDH A++L+GLGS+YN FVT
Subjt:  SWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVT

Query:  SIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSS
        SI NR D P L DVRSLLLA++ARL+KQ+TVD LN+ QANL NLS   N KRP       + S+ +  + SFP  P           +LG+PQ       
Subjt:  SIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSS

Query:  SSPKWPPN-NNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANL
        S  KWPP  ++S+ QCQICGK+GH A VCY+R N  Y     +SPQA ++H+Q PS T P+     Q          HPDESWFMDSGATHHMTPD + L
Subjt:  SSPKWPPN-NNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANL

Query:  QQPSSYFGNEQVVIGNGMSL
          P+ Y G EQV +GNG S+
Subjt:  QQPSSYFGNEQVVIGNGMSL

TrEMBL top hits (e value, % identity, alignment)
A0A438CQD7 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.0e-68; % identity: 36.01)
Query:  PSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLING-TPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWE
        P+ +   L  PL IKL   N++LW+ Q+ N I A   E  I G  P P + + T   +VNP FL+W++++R+++SWIYSSLT + +G+I+G  T+ + W 
Subjt:  PSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLING-TPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWE

Query:  QLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLE
         L+    +S+ AR M LR   Q  +K +LT+ +Y+ K+K I+D   AIGEP+  +D    +L GLG++YNP V S+  R D   L  V S+LL  + RL 
Subjt:  QLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLE

Query:  KQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNN----NSRPQCQICGKIG
         Q+T    +++ AN A         RP R   N +PS    P+   PT   P   P             Q P    P    NN    N+RPQCQ+CGK G
Subjt:  KQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNN----NSRPQCQICGKIG

Query:  HMALVCYNRHNPLYQAPTASSPQAFFNHLQSPS-ATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPV
        HM L CY+R +  YQ P A +       L  P+ A +  P+++S D             SWF+DSGATHH++   AN+   + Y G + +++GNG SLP+
Subjt:  HMALVCYNRHNPLYQAPTASSPQAFFNHLQSPS-ATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPV

Query:  RHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDLASRI
          +G + +   +   V L ++LH P+L+ NLIS+++ C DN   +EF+   F V D  +++
Subjt:  RHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDLASRI

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.0e-76; % identity: 39.06)
Query:  SDFPYPPPPHSSLPFFQNYNGAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLIN-GTPAPVQYLDTAQTQVNPHFLLWQKFN
        S FP  P  +S+     N N A        PSPS   L+  L+IKL ++N LL  +QLLN IIA  +E  I+    +P +YLD A  QVNP F+ W + N
Subjt:  SDFPYPPPPHSSLPFFQNYNGAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLIN-GTPAPVQYLDTAQTQVNPHFLLWQKFN

Query:  RILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYN
        +++MSWIYSSLT   +G+I+  STA DIW  L    ES S A +MSL SQLQ+ +K ++ +S+YL+++K + D+F  IGEPLSYRD    ILEGL  +Y+
Subjt:  RILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYN

Query:  PFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQ
         FVTSI NR+DRP L +V SLL  ++ RL ++S   NLN  QAN                                                        
Subjt:  PFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQ

Query:  RPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHP--------DESWFMDSGA
              P+ P  NNS PQCQICGK GH+AL  Y+R N  Y  P   +  AF     +P+     P  +S  +S  ++T++ P        D SW+MDSGA
Subjt:  RPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHP--------DESWFMDSGA

Query:  THHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVND
        THH TP+  ++     Y   +  ++GN   + + HIG + +L SS + + L  +LHTP +S  LIS+ RL  DN+AFVEFY   FLV D
Subjt:  THHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVND

A0A6J1DQX7 uncharacterized protein LOC111022315 (e value: 1.1e-99; % identity: 50.71)
Query:  YPPPPHSSLPFFQNYNGAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYLDTAQTQVNPHFLLWQKFNRILM
        +PPP  + L        AQP  P P  +  FPTL  PLN+KL+D+N+LLW NQLLN +IA  +   ++GT   P Q+LD  Q Q NP +  W+++NR+LM
Subjt:  YPPPPHSSLPFFQNYNGAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYLDTAQTQVNPHFLLWQKFNRILM

Query:  SWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVT
         WIYSSL+E+K+GE++   T  DIW  L  V +S +TAR+M L+++LQ  RKD  +VSQYLAKIK+I D+F A+GEPLSYRDH A++L+GLGS+YN FVT
Subjt:  SWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVT

Query:  SIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSS
        SI NR D P L DVRSLLLA++ARL+KQ+TVD LN+ QANL NLS   N KRP       + S+ +  + SFP  P           +LG+PQ       
Subjt:  SIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSS

Query:  SSPKWPPN-NNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANL
        S  KWPP  ++S+ QCQICGK+GH A VCY+R N  Y     +SPQA ++H+Q PS T P+     Q          HPDESWFMDSGATHHMTPD + L
Subjt:  SSPKWPPN-NNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANL

Query:  QQPSSYFGNEQVVIGNGMSL
          P+ Y G EQV +GNG S+
Subjt:  QQPSSYFGNEQVVIGNGMSL

A0A7J0GPN0 UBX domain-containing protein (e value: 1.1e-70; % identity: 36.99)
Query:  THPSDFPYPPPPHSSLPFFQNYNGAQPFYPPPQPSP---------SFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYLDTAQTQ
        T  S  P  PPP +S P         P    P+P+P         + P++  PL +KL D NY++W  QLLN +IA  +E  ++G+   P ++LD  Q Q
Subjt:  THPSDFPYPPPPHSSLPFFQNYNGAQPFYPPPQPSP---------SFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYLDTAQTQ

Query:  VNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHF
         NP F  WQ++NR++MSWIY+S+ E  +G+I+G ++A  IWE L  +  ++S A L  LR+ LQ  +K+ LT   Y+ K + + +   +IGEP++Y DH 
Subjt:  VNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHF

Query:  AYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQK-RPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPT
         Y L GLG DYNPFVTSIQ++  RP + +  S                             TS  +K +   PS N  P+ NS   P             
Subjt:  AYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQANLANLSTSSNQK-RPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPT

Query:  PGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWF
           G    P  S  PSS  P        RP+CQIC K GH A  CY+  N  YQ P    P   FN     +   PNP+ S+      +   S PD SW+
Subjt:  PGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWF

Query:  MDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLV
        MDSGA+HH TPDL  L   S Y G +QV +GNG + P+ +IG +  L + +  + L  + H+P L+ NL+S++RLC DN AF+EFY   FLV
Subjt:  MDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLV

U6EFK2 Putative Ty1-copia-like retrotransposon (e value: 3.4e-72; % identity: 38.01)
Query:  SFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYL---DTAQTQ-----VNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTA
        + PT    L IKL  +N+L+W NQLLN ++A   E ++ GT   P Q++   DTA T      +NP ++LWQ+ NR++MSWIYSSLTE  + +I+  ++A
Subjt:  SFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGT-PAPVQYL---DTAQTQ-----VNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTA

Query:  FDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAF
         +IW  LR    S+S AR+M LR QLQ  RK  L+V  Y+ +I+ I D   AIGE +S  D    +L GLGS+YNP V SI +R D   +  ++S L  +
Subjt:  FDIWEQLRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAF

Query:  DARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKI
        + RLE Q++V+    +QAN A  +++  +++P       + +++ S  P   T  F  +  + G G  G           S K   N  SRPQCQIC KI
Subjt:  DARLEKQSTVDNLNVIQANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKI

Query:  GHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPV
        GH+A  C++R +  +Q P AS    F +  Q P A +  P                 +  WF+D+GATHH+T DLANL   + + G+++V++GNG  L V
Subjt:  GHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPV

Query:  RHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDLASRIIV
         H G + I  S    + LK++LH P+++ NL+S+ +LC DN A+VEFY  +F V D  ++ I+
Subjt:  RHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDLASRIIV

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.1e-39; % identity: 26.8)
Query:  KLSDSNYLLWNNQLLNHIIAFDMESLING--TPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTAR
        KL+ +NYL+W+ Q+      +++   ++G  T  P      A  +VNP +  W++ ++++ S +  +++      +   +TA  IWE LR +  + S   
Subjt:  KLSDSNYLLWNNQLLNHIIAFDMESLING--TPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTAR

Query:  LMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARL--EKQSTVDNLNVI
        +  LR+QL++  K   T+  Y+  +    DQ   +G+P+ + +    +LE L  +Y P +  I  +   P L ++   LL  ++++     +TV  +   
Subjt:  LMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARL--EKQSTVDNLNVI

Query:  QANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRP---QCQICGKIGHMALVCYNRHNP
          +  N +T++N    NR +     + N++ +P                             SS+   P NN S+P   +CQICG  GH A  C      
Subjt:  QANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRP---QCQICGKIGHMALVCYNRHNP

Query:  LYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSS
               S  Q F + + S     P+P    Q  +     + +   +W +DSGATHH+T D  NL     Y G + V++ +G ++P+ H GS   L + S
Subjt:  LYQAPTASSPQAFFNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSS

Query:  RHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDL
        R ++L +IL+ P +  NLIS+ RLC  N   VEF+   F V DL
Subjt:  RHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 5.5e-35; % identity: 27.7)
Query:  KLSDSNYLLWNNQLLNHIIAFDMESLING-TPAPVQYLDT-AQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTAR
        KL+ +NYL+W+ Q+      +++   ++G TP P   + T A  +VNP +  W++ ++++ S I  +++      +   +TA  IWE LR +  + S   
Subjt:  KLSDSNYLLWNNQLLNHIIAFDMESLING-TPAPVQYLDT-AQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQLRVVIESSSTAR

Query:  LMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQA
        +  LR               ++ +     DQ   +G+P+ + +    +LE L  DY P +  I  +   P L ++   L+  +++L   ++ + + +   
Subjt:  LMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVIQA

Query:  NLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRP---QCQICGKIGHMALVCYNRHNPLY
         + + +T++N+ + NR  G+ R   N++ R +                       S +PSSS  +   N   +P   +CQIC   GH A  C   H   +
Subjt:  NLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRP---QCQICGKIGHMALVCYNRHNPLY

Query:  QAPTASSPQAFFNHLQSPSATVP-NPAASSQDLSQNISTTS-HPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSS
        Q+ T        N  QS S   P  P A       N++  S +   +W +DSGATHH+T D  NL     Y G + V+I +G ++P+ H GS   L +SS
Subjt:  QAPTASSPQAFFNHLQSPSATVP-NPAASSQDLSQNISTTS-HPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSS

Query:  RHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDL
        R + L  +L+ P +  NLIS+ RLC  NR  VEF+   F V DL
Subjt:  RHVSLKSILHTPRLSHNLISIARLCFDNRAFVEFYHDHFLVNDL

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 5.8e-16; % identity: 26.18)
Query:  PLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGTPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKI-GEIIGCSTAFDIWEQLRVVIESSS
        P+ + + +SNY  W    L H ++FD+   I+GT  P           N + + WQK + I+   +Y +LT  +  G  +  ST+ DIW +++    ++ 
Subjt:  PLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGTPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKI-GEIIGCSTAFDIWEQLRVVIESSS

Query:  TARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEK
         AR + L S+L+     ++ V+ Y  K+K + D    +  P++ R+   Y+L GL   ++  +  I++R   P   D  ++L   + RL++
Subjt:  TARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 1.1e-11; % identity: 22.31)
Query:  LNIKLSDSNYLLWNNQLLNHIIAFDMESLINGTPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEII--GCSTAFDIWEQLRVVIESSS
        + + L+  NY +W        ++F +   I+G+  P     T  T+       W++ + ++  WIY ++T+  +  II  GC TA D+W  L  +   + 
Subjt:  LNIKLSDSNYLLWNNQLLNHIIAFDMESLINGTPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEII--GCSTAFDIWEQLRVVIESSS

Query:  TARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNV
         AR +   ++L+    D+L+V +Y  K+K ++D    +  P+S R    ++L GL   Y+  +  I++++  P   + RS+LL  ++RL  +S     + 
Subjt:  TARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNV

Query:  IQANLANL----------------STSSN-----QKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRPQC
           +L+N+                + +SN      K+ NR  G+    YN++            +   P   + G PQ+        P++       PQ 
Subjt:  IQANLANL----------------STSSN-----QKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRPQC

Query:  QICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATV-PNPAASSQDLSQNISTTSHPDESWFMDS
                          P+Y + T+  P        SP++ + P       D   N  +TS  D+S FM S
Subjt:  QICGKIGHMALVCYNRHNPLYQAPTASSPQAFFNHLQSPSATV-PNPAASSQDLSQNISTTSHPDESWFMDS


Sequences
CDS sequence
ATGGCTTCCGGCTCTAGCTCTTCTTCGATTGACAGTACCCTGGTTCCTCCAATCGTCTCCTCAATGGTCTCTTCCCCTCCGATAACAACTCCGGTGGCCTCTTCCTCTAT
CTCAACTTCACCTGTGCCACCACGCCAACATCAAGTTATCTCTCCTCTCCCTCAACGTTTTTTCCCTAATACCTCATCTCGCCCCCTACACCCTAGCGTTCCCCCCTTCC
CAACCTCAAATCCCACCCAACCTCAAATTCTGAATGTTGCTACGCATCCTTCTGATTTTCCCTATCCACCTCCTCCTCATTCATCTCTACCATTCTTCCAAAACTATAAC
GGTGCTCAACCCTTTTACCCTCCCCCACAGCCATCACCCTCTTTTCCTACCCTCACTCCTCCCCTGAATATTAAACTTTCAGACTCAAATTATTTGTTGTGGAATAATCA
ACTCCTCAATCACATTATTGCGTTTGATATGGAGTCTCTTATCAATGGCACTCCTGCTCCTGTGCAGTATTTGGATACGGCTCAAACCCAGGTAAACCCACATTTCTTGT
TATGGCAGAAGTTTAATCGTATTCTAATGAGCTGGATTTATTCATCTTTAACTGAGGATAAGATAGGTGAGATTATTGGATGCTCTACTGCTTTTGATATCTGGGAGCAA
CTTCGTGTTGTTATTGAATCTTCATCCACTGCTCGTTTAATGTCTCTGCGATCTCAACTTCAGAAGGCTCGAAAAGATAATCTTACTGTATCTCAGTATTTAGCTAAAAT
CAAGGATATCACAGACCAATTTGTGGCCATTGGCGAACCCCTGTCTTACCGGGATCATTTTGCTTATATCCTCGAAGGTTTGGGCTCTGATTACAACCCCTTTGTGACAT
CCATTCAAAATCGAACCGACCGTCCATTGCTTGCTGACGTTAGGAGCCTTCTGCTTGCTTTTGATGCACGTTTGGAGAAGCAATCAACTGTGGACAATCTCAATGTTATC
CAGGCAAATCTTGCTAACCTCTCTACTTCTTCTAACCAAAAACGCCCCAATCGTCCCTCTGGCAACCAAAGACCATCATATAACTCTTCCCCGCGACCATCTTTCCCCAC
CTTTCCCTTCCCTCAACAATTCCCTACCCCAGGTCCTGGTGTTTTAGGTCGACCTCAAGCTTCCCAGCGCCCATCTTCTTCCTCTCCGAAATGGCCACCTAATAATAATT
CTCGTCCACAATGTCAAATATGTGGAAAGATTGGTCACATGGCCTTGGTTTGCTACAATCGCCATAACCCCTTATATCAAGCCCCCACTGCCTCTTCTCCTCAGGCCTTT
TTCAACCACCTGCAATCGCCCTCAGCTACCGTCCCTAATCCAGCTGCCTCTTCCCAGGATCTTTCTCAAAATATCTCCACTACCTCTCATCCGGATGAATCATGGTTCAT
GGACAGTGGTGCCACTCACCACATGACTCCTGACCTAGCTAATCTCCAGCAGCCTTCCTCTTATTTCGGCAATGAGCAGGTTGTAATCGGTAATGGTATGTCTCTTCCTG
TCCGTCATATTGGTTCTAATATTATTCTTGTGTCTTCTTCTCGTCATGTTTCTTTGAAATCTATCTTACACACTCCTCGCTTATCTCATAATCTAATCAGTATTGCACGA
TTATGCTTTGACAATAGAGCCTTTGTTGAATTTTATCATGATCATTTTCTGGTGAATGATCTAGCTTCCAGGATAATTGTTGAGTTCCTAGAGTGA
mRNA sequence
ATGGCTTCCGGCTCTAGCTCTTCTTCGATTGACAGTACCCTGGTTCCTCCAATCGTCTCCTCAATGGTCTCTTCCCCTCCGATAACAACTCCGGTGGCCTCTTCCTCTAT
CTCAACTTCACCTGTGCCACCACGCCAACATCAAGTTATCTCTCCTCTCCCTCAACGTTTTTTCCCTAATACCTCATCTCGCCCCCTACACCCTAGCGTTCCCCCCTTCC
CAACCTCAAATCCCACCCAACCTCAAATTCTGAATGTTGCTACGCATCCTTCTGATTTTCCCTATCCACCTCCTCCTCATTCATCTCTACCATTCTTCCAAAACTATAAC
GGTGCTCAACCCTTTTACCCTCCCCCACAGCCATCACCCTCTTTTCCTACCCTCACTCCTCCCCTGAATATTAAACTTTCAGACTCAAATTATTTGTTGTGGAATAATCA
ACTCCTCAATCACATTATTGCGTTTGATATGGAGTCTCTTATCAATGGCACTCCTGCTCCTGTGCAGTATTTGGATACGGCTCAAACCCAGGTAAACCCACATTTCTTGT
TATGGCAGAAGTTTAATCGTATTCTAATGAGCTGGATTTATTCATCTTTAACTGAGGATAAGATAGGTGAGATTATTGGATGCTCTACTGCTTTTGATATCTGGGAGCAA
CTTCGTGTTGTTATTGAATCTTCATCCACTGCTCGTTTAATGTCTCTGCGATCTCAACTTCAGAAGGCTCGAAAAGATAATCTTACTGTATCTCAGTATTTAGCTAAAAT
CAAGGATATCACAGACCAATTTGTGGCCATTGGCGAACCCCTGTCTTACCGGGATCATTTTGCTTATATCCTCGAAGGTTTGGGCTCTGATTACAACCCCTTTGTGACAT
CCATTCAAAATCGAACCGACCGTCCATTGCTTGCTGACGTTAGGAGCCTTCTGCTTGCTTTTGATGCACGTTTGGAGAAGCAATCAACTGTGGACAATCTCAATGTTATC
CAGGCAAATCTTGCTAACCTCTCTACTTCTTCTAACCAAAAACGCCCCAATCGTCCCTCTGGCAACCAAAGACCATCATATAACTCTTCCCCGCGACCATCTTTCCCCAC
CTTTCCCTTCCCTCAACAATTCCCTACCCCAGGTCCTGGTGTTTTAGGTCGACCTCAAGCTTCCCAGCGCCCATCTTCTTCCTCTCCGAAATGGCCACCTAATAATAATT
CTCGTCCACAATGTCAAATATGTGGAAAGATTGGTCACATGGCCTTGGTTTGCTACAATCGCCATAACCCCTTATATCAAGCCCCCACTGCCTCTTCTCCTCAGGCCTTT
TTCAACCACCTGCAATCGCCCTCAGCTACCGTCCCTAATCCAGCTGCCTCTTCCCAGGATCTTTCTCAAAATATCTCCACTACCTCTCATCCGGATGAATCATGGTTCAT
GGACAGTGGTGCCACTCACCACATGACTCCTGACCTAGCTAATCTCCAGCAGCCTTCCTCTTATTTCGGCAATGAGCAGGTTGTAATCGGTAATGGTATGTCTCTTCCTG
TCCGTCATATTGGTTCTAATATTATTCTTGTGTCTTCTTCTCGTCATGTTTCTTTGAAATCTATCTTACACACTCCTCGCTTATCTCATAATCTAATCAGTATTGCACGA
TTATGCTTTGACAATAGAGCCTTTGTTGAATTTTATCATGATCATTTTCTGGTGAATGATCTAGCTTCCAGGATAATTGTTGAGTTCCTAGAGTGA
Protein sequence
MASGSSSSSIDSTLVPPIVSSMVSSPPITTPVASSSISTSPVPPRQHQVISPLPQRFFPNTSSRPLHPSVPPFPTSNPTQPQILNVATHPSDFPYPPPPHSSLPFFQNYN
GAQPFYPPPQPSPSFPTLTPPLNIKLSDSNYLLWNNQLLNHIIAFDMESLINGTPAPVQYLDTAQTQVNPHFLLWQKFNRILMSWIYSSLTEDKIGEIIGCSTAFDIWEQ
LRVVIESSSTARLMSLRSQLQKARKDNLTVSQYLAKIKDITDQFVAIGEPLSYRDHFAYILEGLGSDYNPFVTSIQNRTDRPLLADVRSLLLAFDARLEKQSTVDNLNVI
QANLANLSTSSNQKRPNRPSGNQRPSYNSSPRPSFPTFPFPQQFPTPGPGVLGRPQASQRPSSSSPKWPPNNNSRPQCQICGKIGHMALVCYNRHNPLYQAPTASSPQAF
FNHLQSPSATVPNPAASSQDLSQNISTTSHPDESWFMDSGATHHMTPDLANLQQPSSYFGNEQVVIGNGMSLPVRHIGSNIILVSSSRHVSLKSILHTPRLSHNLISIAR
LCFDNRAFVEFYHDHFLVNDLASRIIVEFLE
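
The relationship between the CDS and protein sequences above can be spot-checked by translating a few codons. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used. `translate` is a hypothetical helper written for illustration, and only the first 30 and last 15 nucleotides of the CDS are used rather than the full sequence.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGCTTCCGGCTCTAGCTCTTCTTCGATT"  # first 30 nt of the CDS above
cds_end = "GAGTTCCTAGAGTGA"                   # last 15 nt of the CDS above

print(translate(cds_start))  # MASGSSSSSI
print(translate(cds_end))    # EFLE*
```

The two outputs match the first ten and last four residues of the protein sequence, with the trailing `*` corresponding to the terminal TGA stop codon.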