CuGenDBv2

Lag0035713 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035713
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr3:28182874..28187367
RNA-Seq Expression: Lag0035713
Synteny: Lag0035713
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis] (e value: 2.5e-23; % identity: 62.07)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL
        M +   + GSP +DPN HL  FL+IC T KINGV+ED I LRLFPFSL+DKAR WLQS+ PGSI +W  + + FL KFFP AKT QL
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL

KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis] (e value: 3.2e-74; % identity: 45.62)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKT----------------
        MAR+ A+RG   EDP+ HL+SFL+ICGT K+NGVS DAI LRLFPFSLQD+A+DWL++I P SITTW+ L QAFL K+FP AK+                
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKT----------------

Query:  --------------------------VQLFYNGLTPSTKTIVDAAAGRTLLSKTVENARTLLEDMATNSYQCPSERSAPK-KIATGVFEVDKVSALQAQM
                                  +QLFYNGL  STK+I+DA AG ++ SK  + A T+LED+AT SY  P ER++P    A G++EVD+V++L+AQM
Subjt:  --------------------------VQLFYNGLTPSTKTIVDAAAGRTLLSKTVENARTLLEDMATNSYQCPSERSAPK-KIATGVFEVDKVSALQAQM

Query:  TSLANAFMKFSGTGSAQSIEPA-AVLASRPQEETI----EQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLN-PPGF-ASQTQENKKLED
         SL NA  K +  G AQ+  P+ A LA+   E  +    E   YV   + R Y +   PTHYHPN RNHENFSYAN KNVL  P GF  +   +   LED
Subjt:  TSLANAFMKFSGTGSAQSIEPA-AVLASRPQEETI----EQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLN-PPGF-ASQTQENKKLED

Query:  IVGAFIAESSNRTTKLEDAVIAINSTVNGYSAAIENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYD
        I+  F+ ES +RTT LE++V AI STV     A++N+E QL Q+ + + TM KGK P+  E    E CKA+ +   +    P   D D
Subjt:  IVGAFIAESSNRTTKLEDAVIAINSTVNGYSAAIENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYD

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis] (e value: 2.5e-23; % identity: 62.07)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL
        M +   + GSP +DPN HL  FL+IC T KINGV+ED I LRLFPFSL+DKAR WLQS+ PGSI +W  + + FL KFFP AKT QL
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis] (e value: 1.9e-74; % identity: 37.22)
Query:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG
        +DKAR WLQS+ PGSI +W  + + FL KFFPPAKT                                          VQ+FYNGL   T+TIVDAA+GG
Subjt:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG

Query:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIE--SAVVLASRPQEETIEQVQYVSNFNSRG
        TL+SKT E A  LLE+MA+N+YQ P+ER+  KK+ AG+ E++ ++AL AQ+ +L++     +     QS E  ++  +     E + EQVQYV+N N   
Subjt:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIE--SAVVLASRPQEETIEQVQYVSNFNSRG

Query:  YNNNSTPTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVSVVST
        Y  N  P +YHP  RNHEN SY NTKNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I +  +   A +KN+E Q+GQL + ++ 
Subjt:  YNNNSTPTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVSVVST

Query:  MNKGKAPAEQEKTQMEYCKAIIVHQ-EEAEEEPESEDYDTPIG--------EVEED----TSSDEAEKP-----EPEPPIPYPTLMVPKEKKKEKEEKEQ
          +G  P+  E    E CKAI +   +E E  P  E   TP          +VEE+     + +E + P        PPI  P L  P+  +K+K +K+ 
Subjt:  MNKGKAPAEQEKTQMEYCKAIIVHQ-EEAEEEPESEDYDTPIG--------EVEED----TSSDEAEKP-----EPEPPIPYPTLMVPKEKKKEKEEKEQ

Query:  LEAL-----------------EMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLC
         + L                 +MP Y +F+K+ + KKR+ ++ +TV L+  CS  +++K+P+K+ DP SF++PC+ G   F R LC+LGASIN++P  +C
Subjt:  LEAL-----------------EMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLC

Query:  KNLDIGEIKSTPVKFQLTDQPVVRPVGIIDDL
        + L +GE+K T +  QL D+ +  P GII+D+
Subjt:  KNLDIGEIKSTPVKFQLTDQPVVRPVGIIDDL

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis] (e value: 2.5e-23; % identity: 62.07)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL
        M +   + GSP +DPN HL  FL+IC T KINGV+ED I LRLFPFSL+DKAR WLQS+ PGSI +W  + + FL KFFP AKT QL
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis] (e value: 6.4e-75; % identity: 37.41)
Query:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG
        +DKAR WLQS+ PGSI +W  + + FL KFFPPAKT                                          VQ+FYNGL   T+TIVDAA+GG
Subjt:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG

Query:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIE--SAVVLASRPQEETIEQVQYVSNFNSRG
        TL+SKT E A  LLE+MA+N+YQ P+ER+  KK+ AG+ E++ ++AL AQ+ +L++     +     QS E  ++  +     E + EQVQYV+N N   
Subjt:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIE--SAVVLASRPQEETIEQVQYVSNFNSRG

Query:  YNNNSTPTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVSVVST
        Y  N  P +YHP  RNHEN SY NTKNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I +  +   A +KN+E Q+GQL + ++ 
Subjt:  YNNNSTPTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVSVVST

Query:  MNKGKAPAEQEKTQMEYCKAIIVHQ-EEAEEEPESEDYDTPIG--------EVEED----TSSDEAEKP-----EPEPPIPYPTLMVPKEKKKEKEEKEQ
          +G  P+  E    E CKAI +   +E E  P  E   TP          +VEE+     + +E + P        PPI  P L  P+  +K+K +K+ 
Subjt:  MNKGKAPAEQEKTQMEYCKAIIVHQ-EEAEEEPESEDYDTPIG--------EVEED----TSSDEAEKP-----EPEPPIPYPTLMVPKEKKKEKEEKEQ

Query:  LEAL-----------------EMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLC
         + L                 +MP Y +F+K+ + KKR+ ++ +TV L+  CS  +++K+P+K+ DP SF++PC+ G   F R LC+LGASIN++P S+C
Subjt:  LEAL-----------------EMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLC

Query:  KNLDIGEIKSTPVKFQLTDQPVVRPVGIIDDL
        + L +GE+K T +  QL D+ +  P GII+D+
Subjt:  KNLDIGEIKSTPVKFQLTDQPVVRPVGIIDDL

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris] (e value: 1.7e-75; % identity: 38.61)
Query:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVQL------------------------------------------FYNGLTPSTKTIVDAAAGG
        +DKA+ W QS+  GSITTWD L Q FL K+FPP+K+ QL                                          FYNGL   T+T+VDAAAGG
Subjt:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVQL------------------------------------------FYNGLTPSTKTIVDAAAGG

Query:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIESAVVLASRPQEETI--EQVQYVS--NFNS
         L++KT E A  LL+D+ATNSYQ PSERS  KK+ AG+ EVD ++AL AQ+ SL N  +  +   + Q+++S +  +S  QE  +  EQVQY+   N+N 
Subjt:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIESAVVLASRPQEETI--EQVQYVS--NFNS

Query:  R-GYNNNSTPTHYHPNNRNHENFSYANTKNVLN-PPGFAPQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVSVVS
        R GY  N    HYHP  RNHEN SY N +N L  PPGF  Q  + K  LEDI+G FI E+ +R  K E  +  I + V+   A +KN+E Q+GQL +++ 
Subjt:  R-GYNNNSTPTHYHPNNRNHENFSYANTKNVLN-PPGFAPQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVSVVS

Query:  TMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYDTPIGEV-EEDTSSDEAEKPEPE---------------PPIPYPTLMVPKEKKKEKEEKEQL
        +  KGK P++ E    E+C AI +   +  EE + +    P  +V   D    E +K E E               PPI  P L  P+   K+K + +  
Subjt:  TMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYDTPIGEV-EEDTSSDEAEKPEPE---------------PPIPYPTLMVPKEKKKEKEEKEQL

Query:  EALE-----------------MPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLCK
        + LE                 MP Y +F+KE +  K+K ++ +T+ L   CS  + QK+P K+ DP SF++PC+ G  +F RALC+ GASIN++PLS+ K
Subjt:  EALE-----------------MPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLCK

Query:  NLDIGEIKSTPVKFQLTDQPVVRPVGIIDDL
         L +GE+K T +  QL D+ +  P G+I+D+
Subjt:  NLDIGEIKSTPVKFQLTDQPVVRPVGIIDDL

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris] (e value: 4.1e-21; % identity: 59.77)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL
        M +   + G+  EDPN+HL SFL+IC T K+NGV+EDAI LRLF FSL+DKA+ W QS+  GSITTWD L Q FL K+FP +K+ QL
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris] (e value: 3.7e-75; % identity: 38.16)
Query:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG
        +DKAR WLQS+ PGSI +W  + + FL KFFPPAKT                                          VQ+FYNGL   T+TIVDAA+GG
Subjt:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG

Query:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIE--SAVVLASRPQEETIEQVQYVSNFNSRG
        TL+SKT E A  LLE+MA+N+YQ P+ER+  KK+ AG+ +++ ++AL AQ+ +L++     +     QS E  ++  +     E + EQVQYV+N N   
Subjt:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIE--SAVVLASRPQEETIEQVQYVSNFNSRG

Query:  YNNNSTPTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVSVVST
        Y  N  P +YHP  RNHEN SY NTKNVL   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I +  +   AAIKNIE Q+GQL + ++ 
Subjt:  YNNNSTPTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVSVVST

Query:  MNKGKAPAEQEKTQMEYCKAIIVHQ-EEAEEEPESEDYDTP----IGE----VEED-TSSDEAEKPE--------PEPPIPYPTLMVPKEKKKEKEEKEQ
          +G  P+  E    E CKAI +   +E E  P  E   TP    IG+    VEED   +D  E+ +          PPI  P L  P+  +K+K +K+ 
Subjt:  MNKGKAPAEQEKTQMEYCKAIIVHQ-EEAEEEPESEDYDTP----IGE----VEED-TSSDEAEKPE--------PEPPIPYPTLMVPKEKKKEKEEKEQ

Query:  LEAL-----------------EMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLC
         + L                 +MP Y +F+K+ + KKR+ ++ +TV L+  CS  +++K+P+K+ DP SF++PC+ G   F + LC+LGASIN++PLS+C
Subjt:  LEAL-----------------EMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLC

Query:  KNLDIGEIKSTPVKFQLTDQPVVRPVGIIDDL
        + L + E+K T +  QL D+ +  P GII+D+
Subjt:  KNLDIGEIKSTPVKFQLTDQPVVRPVGIIDDL

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A2I4G4Q3 uncharacterized protein LOC109004712 (e value: 2.1e-60; % identity: 41.19)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKT----------------
        M +   + GSP +DPN HL  FL+IC T KINGV+ED I LRLFPFSL+D+AR WLQS+ P SIT+W  + + F  KFFP AKT                
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKT----------------

Query:  --------------------------VQLFYNGLTPSTKTIVDAAAGRTLLSKTVENARTLLEDMATNSYQCPSERSAPKKIATGVFEVDKVSALQAQMT
                                  VQ+FYNGL   T+TIVD  +G TL+ KT+E A  LLE+MA+N+YQ P ER+  KK+A  + E++ ++AL AQ+ 
Subjt:  --------------------------VQLFYNGLTPSTKTIVDAAAGRTLLSKTVENARTLLEDMATNSYQCPSERSAPKKIATGVFEVDKVSALQAQMT

Query:  SLANAFMKFSGTGSAQSIE--PAAVLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLN---PPGFASQTQENK-KLEDIV
        +L++     +     QS E   A  +     E + EQVQY++N N   Y  N  P +YHP  +NHEN SY NTKNVL    PPGF SQ+ E K  LED +
Subjt:  SLANAFMKFSGTGSAQSIE--PAAVLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLN---PPGFASQTQENK-KLEDIV

Query:  GAFIAESSNRTTKLEDAVIAINSTVNGYSAAI-ENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQ-EEAEEEPESED
         +FI E++ R  K +  +  I +  +   AAI +NIE Q+GQL + ++   +G  P+  E    E CKAII+    E E + E E+
Subjt:  GAFIAESSNRTTKLEDAVIAINSTVNGYSAAI-ENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQ-EEAEEEPESED

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 (e value: 1.3e-57; % identity: 33.16)
Query:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG
        +DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT                                          VQ FYNGL  S KTI+DAAAGG
Subjt:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG

Query:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIESAVVLASRPQEE--------TIEQVQYVS
         L+SK   +A  LLE+MA+N+YQ PSERS  +K A G +E+D +  L  Q+ +L+    K   T    ++++++V+     +           E VQ+V 
Subjt:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQSIESAVVLASRPQEE--------TIEQVQYVS

Query:  NFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLN-----PPGF----APQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIE
        NFN +   NN     Y+P  RNH NFS++N     N     PPGF     PQ  E K +LE+++  +I ++              ++ +    A+++N+E
Subjt:  NFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLN-----PPGF----APQTQENK-KLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIKNIE

Query:  TQLGQLVSVVSTMNKGKAPAEQE--KTQMEYCKAII---------VHQEEAEEEPESED------YDTPIGEVEEDTSSDEAEKPEPEPPIPYPTLMVPK
        TQ+GQL + ++   +G  P++ +      E C+AI          V+Q+  E E E  D       +  I + ++D + ++       PP P+P  +  +
Subjt:  TQLGQLVSVVSTMNKGKAPAEQE--KTQMEYCKAII---------VHQEEAEEEPESED------YDTPIGEVEEDTSSDEAEKPEPEPPIPYPTLMVPK

Query:  EKKKEKEEKEQL-------------EALE-MPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASI
        ++K EK+ ++ L             EALE MP Y +F+K+ L KKRK  + +TV+L   CS  ++ K+P K+ DP SF++PC+ G   F +AL +LGASI
Subjt:  EKKKEKEEKEQL-------------EALE-MPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASI

Query:  NIIPLSLCKNLDIGEIKSTPVKFQLTDQPVVRPVGIIDD-LTASRRYDEKTEYNVFCGE-------VLGDSFLGLLGAV
        N++P S+ + L +GE K T V  QL D+  V P GII+D L    ++    ++ +   E       +LG  FL   GA+
Subjt:  NIIPLSLCKNLDIGEIKSTPVKFQLTDQPVVRPVGIIDD-LTASRRYDEKTEYNVFCGE-------VLGDSFLGLLGAV

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 (e value: 5.2e-22; % identity: 62.96)
Query:  YRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL
        + G P++DPNSHL +FL+IC T K NGV++DAI LRLFPFSL+DKA+ WL S+  GSITTW+ L Q FL KFFP AKT ++
Subjt:  YRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 (e value: 3.2e-56; % identity: 33.16)
Query:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG
        +DKA+ WLQS  P + TTWD L +AFL KFFPP KT                                          VQ FYNGLT  TKT VDAAAGG
Subjt:  KDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT------------------------------------------VQLFYNGLTPSTKTIVDAAAGG

Query:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQS--IESAVVLASRPQE---ETIEQVQYVSNFN
         L+ KT E A+ L+E+MA N+YQ  +ER   ++  AG+ EVD ++ L A+M ++     +  G+ S Q   + S  +      +    + EQVQY++N+N
Subjt:  TLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQS--IESAVVLASRPQE---ETIEQVQYVSNFN

Query:  SRGYNNNSTPTHYHPNNRNHENFSY---ANTKNVLNPPGF-APQTQENKK--LEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAI----KNIETQL
         R   NN     Y+P  RNH NF +    N +  +NPPGF   QT    K   E  +      S+++  KL  A       + G    +    +N+E QL
Subjt:  SRGYNNNSTPTHYHPNNRNHENFSY---ANTKNVLNPPGF-APQTQENKK--LEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAI----KNIETQL

Query:  GQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYDTPIGEVEEDTSSDEAEKPEPEPPIPYPTLMVPKEKKKEKEEKEQLEALEM---
        GQ+ + V+  N+G  P++ E    E+ KAI +   +   EP        +G      S  E EK E +       L   KE  KE++ KE++E  E+   
Subjt:  GQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYDTPIGEVEEDTSSDEAEKPEPEPPIPYPTLMVPKEKKKEKEEKEQLEALEM---

Query:  ---------PQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLCKNLDIGEIKSTPVK
                 P Y +F+KE + KKRK    +T+ L   CS  ++ K+P K+ DP SF+VPC+ G   F +ALC+LGAS+++IPL++ + L + E+K T + 
Subjt:  ---------PQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSF-RALCNLGASINIIPLSLCKNLDIGEIKSTPVK

Query:  FQLTDQPVVRPVGIIDD-LTASRRYDEKTEYNVFCGE-------VLGDSFLGLLGA---VKQGR
         QL D+ +  P+GI+++ L   +++    ++ V   E       +LG  FL   G    VK+G+
Subjt:  FQLTDQPVVRPVGIIDD-LTASRRYDEKTEYNVFCGE-------VLGDSFLGLLGA---VKQGR

A0A6P6XAQ1 Reverse transcriptase (e value: 7.2e-24; % identity: 64.37)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL
        M +   Y G+ TEDPNSHL +FL+IC T K NGVSEDAI LRLFPFSL+DKA+ WLQS  P + TTWD L +AFL KFFP  KT +L
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL

A0A6P6XAQ1 Reverse transcriptase (e value: 4.2e-56; % identity: 35.71)
Query:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKT----------------
        M R+  +RG+ TEDPN+HL  FLD+CGT K+NGV +DAI LRLFP SLQDK                  +VQAFL  FFP AKT                
Subjt:  MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKT----------------

Query:  --------------------------VQLFYNGLTPSTKTIVDAAAGRTLLSKTVENARTLLEDMATNSYQCPSERSAPKKIATGVFEVDKVSALQAQMT
                                  +Q+FYNGL   T+TI+DAAAG TLLS+T ENA  LL+DMA NS+Q PSERS  KK+A G++E+D++S+L+AQ+ 
Subjt:  --------------------------VQLFYNGLTPSTKTIVDAAAGRTLLSKTVENARTLLEDMATNSYQCPSERSAPKKIATGVFEVDKVSALQAQMT

Query:  SLANAFMKFSGTGSAQSIE-PAAVLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLNPPGFASQTQENKKLEDIVGAFIA
        +L NA  K SG G++ S E  AA       E TIEQ Q+ S                HP                          ++   LED++GAFI 
Subjt:  SLANAFMKFSGTGSAQSIE-PAAVLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLNPPGFASQTQENKKLEDIVGAFIA

Query:  ESSNRTTKLEDAVIAINSTVNGYSAAIENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYDTPIGEVEEDTSSDEAEKP-
        E  +R +++E+ V  +   + G + +I+N+E Q+GQ+   ++TM KGK P++ E    E+CKA+ +   +  +EPE +  + P+   EE  + +E  K  
Subjt:  ESSNRTTKLEDAVIAINSTVNGYSAAIENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYDTPIGEVEEDTSSDEAEKP-

Query:  ----EPEPPIPSPTLMVPKEKKKEKEEKEQLEALEMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVQQK--DKARDWLQSITPGSIT
            + + P  S     P      +   EQ     MP Y RFMK+ +  KRK +  +TV L   CS  +Q+K   K +D      PGS T
Subjt:  ----EPEPPIPSPTLMVPKEKKKEKEEKEQLEALEMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVQQK--DKARDWLQSITPGSIT

A0A803PT47 Uncharacterized protein (e value: 6.7e-54; % identity: 39.2)
Query:  TEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL------------------------
        TEDPN HL  FL++C   K+NGV++DAI LRLFP SL+D+ R WLQS+ P SI+TWD + + F+ KFFP +K+ QL                        
Subjt:  TEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQL------------------------

Query:  ------------------FYNGLTPSTKTIVDAAAGRTLLSKTVENARTLLEDMATNSYQCPSERSAPKKIATGVFEVDKVSALQAQMTSLAN---AFMK
                          FYNGL   T+T++DAA G  LLSK +  A  LLE+MATNSY  P+ER+  KK+A G+ EVD ++ + AQ+++L+N   A + 
Subjt:  ------------------FYNGLTPSTKTIVDAAAGRTLLSKTVENARTLLEDMATNSYQCPSERSAPKKIATGVFEVDKVSALQAQMTSLAN---AFMK

Query:  FSGTGSAQSIEPAAVLASRPQEETIEQVQYVS-NFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLN-PPGFASQTQENKK-LEDIVGAFIAESSNRT
           T + +++   A   S+  E +IEQ QY++    +  Y  N  P +YHP  RNHEN SY NTKNVL  P GF +Q QE+KK LEDI+G F+ ES  R 
Subjt:  FSGTGSAQSIEPAAVLASRPQEETIEQVQYVS-NFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLN-PPGFASQTQENKK-LEDIVGAFIAESSNRT

Query:  TKLEDAVIAINSTVNGYSAAIENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYD
         K E  +  I + ++   A+++NIE Q+ +L +V S     +A  E     +E C +   ++E    E   +  D
Subjt:  TKLEDAVIAINSTVNGYSAAIENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYD

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGCTCGAGATTGTGCATACAGAGGATCACCCACCGAGGACCCAAATTCTCATCTAAAATCATTTTTAGACATTTGTGGGACGGAAAAAATTAATGGAGTTTCTGAAGA
TGCTATTCACTTACGCTTATTTCCTTTTTCTTTGCAAGATAAAGCACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACCTGGGACGCTTTGGTCCAGGCCT
TTCTAAAGAAATTCTTCCCTCATGCAAAGACGGTTCAATTATTTTATAATGGTTTAACTCCTAGTACAAAAACCATTGTTGATGCAGCTGCAGGTAGGACTCTGTTGTCC
AAAACCGTGGAAAATGCTCGTACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGCCCATCTGAGCGGTCTGCACCTAAAAAGATTGCTACTGGAGTGTTTGAGGT
TGACAAGGTAAGTGCACTCCAGGCCCAGATGACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAACCAGCTGCTGTTTTAGCAT
CTAGACCTCAGGAGGAGACAATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAATTCCACACCTACACATTATCACCCTAACAATAGGAAC
CATGAAAATTTCTCTTATGCCAATACTAAGAATGTTCTTAACCCTCCTGGGTTTGCCTCTCAAACTCAAGAAAATAAAAAGTTAGAAGATATTGTTGGAGCTTTCATTGC
AGAGTCAAGTAACAGGACAACCAAGTTGGAGGACGCAGTCATTGCCATCAACTCAACAGTGAATGGCTACAGTGCTGCCATCGAGAATATTGAGACTCAGCTGGGACAGT
TGGTAAGTGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAGCAGGAGAAAACCCAGATGGAGTACTGTAAAGCCATCATTGTACACCAGGAGGAAGCTGAAGAG
GAACCTGAGTCTGAGGATTATGACACGCCCATTGGGGAAGTTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTGAACCTGAACCTCCTATTCCTTCTCCCACACT
CATGGTTCCCAAAGAAAAGAAAAAAGAAAAAGAAGAAAAAGAACAATTAGAAGCATTAGAGATGCCCCAGTACAACAGGTTCATGAAGGAATGGTTAGGTAAGAAGCGAA
AGGAAAAGAAGGTTGACACCGTATATCTCGCTTCCACATGCAGCACCAGAGTACAACAGAAGGATAAAGCACGAGATTGGTTGCAATCTATTACCCCTGGGAGCATCACC
ACCTGGGACGCTTTGGTCCAGGCCTTTCTAAAGAAATTCTTCCCTCCTGCAAAGACGGTTCAATTATTTTATAATGGTTTAACTCCTAGTACAAAAACCATTGTTGATGC
AGCTGCAGGTGGGACTCTGTTGTCCAAAACCGTGGAAAATGCTCGTACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGCCCATCTGAGCGGTCTGCACCTAAAA
AGATTGCTGCTGGAGTGTTTGAGGTTGACAAGGTAAGTGCACTCCAGGCCCAGATGACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACATGGAGTGCACAGTCA
ATTGAATCAGCTGTTGTTTTAGCATCTAGACCTCAGGAGGAGACAATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAATTCCACACCTAC
ACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCCAATACTAAGAATGTTCTTAACCCTCCTGGGTTTGCCCCTCAAACTCAAGAAAATAAAAAGTTAG
AAGATATTGTTGGAGCTTTCATTGGAGAGTCAAGTAACAGGACAACCAAGTTGGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACAGTGCTGCCATCAAG
AATATCGAGACTCAGCTGGGACAGTTGGTAAGTGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAGCAGGAGAAAACCCAGATGGAGTACTGTAAAGCCATCAT
TGTACACCAGGAGGAAGCTGAAGAGGAACCTGAGTCTGAGGATTATGACACGCCCATTGGGGAAGTTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTGAACCTG
AACCTCCTATTCCTTATCCCACACTCATGGTTCCCAAAGAAAAGAAAAAAGAAAAAGAAGAAAAAGAACAATTAGAAGCATTAGAGATGCCCCAGTACAACAGGTTCATG
AAGGAATGGTTAGGTAAGAAGCGAAAGGAAAAGAAGGTTGACACCGTATATCTCGCTTCCACATGCAGCACCAGAGTAAAACAGAAGGTACCTGAAAAAGTAGTAGATCC
AAGGAGTTTTTCTGTTCCTTGTAGCTTTGGTACTTATTCTTTTAGAGCATTATGTAATTTAGGTGCTAGCATTAATATTATTCCTCTATCTCTGTGCAAAAATTTAGATA
TAGGTGAGATTAAATCTACTCCTGTAAAGTTCCAATTGACTGATCAGCCTGTGGTTAGACCAGTTGGCATTATAGATGATTTGACAGCGTCTCGACGCTATGACGAAAAA
ACAGAATATAATGTCTTTTGCGGCGAGGTTTTGGGGGATTCGTTTTTGGGACTTCTTGGAGCCGTAAAGCAGGGCAGAACAGAGCATTTTGGAGCTGAATCAAAGGGAGC
AAGTTGGAAATCAACCCATTGTTCGTGGGGATCGTGA
mRNA sequence
ATGGCTCGAGATTGTGCATACAGAGGATCACCCACCGAGGACCCAAATTCTCATCTAAAATCATTTTTAGACATTTGTGGGACGGAAAAAATTAATGGAGTTTCTGAAGA
TGCTATTCACTTACGCTTATTTCCTTTTTCTTTGCAAGATAAAGCACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACCTGGGACGCTTTGGTCCAGGCCT
TTCTAAAGAAATTCTTCCCTCATGCAAAGACGGTTCAATTATTTTATAATGGTTTAACTCCTAGTACAAAAACCATTGTTGATGCAGCTGCAGGTAGGACTCTGTTGTCC
AAAACCGTGGAAAATGCTCGTACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGCCCATCTGAGCGGTCTGCACCTAAAAAGATTGCTACTGGAGTGTTTGAGGT
TGACAAGGTAAGTGCACTCCAGGCCCAGATGACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAACCAGCTGCTGTTTTAGCAT
CTAGACCTCAGGAGGAGACAATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAATTCCACACCTACACATTATCACCCTAACAATAGGAAC
CATGAAAATTTCTCTTATGCCAATACTAAGAATGTTCTTAACCCTCCTGGGTTTGCCTCTCAAACTCAAGAAAATAAAAAGTTAGAAGATATTGTTGGAGCTTTCATTGC
AGAGTCAAGTAACAGGACAACCAAGTTGGAGGACGCAGTCATTGCCATCAACTCAACAGTGAATGGCTACAGTGCTGCCATCGAGAATATTGAGACTCAGCTGGGACAGT
TGGTAAGTGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAGCAGGAGAAAACCCAGATGGAGTACTGTAAAGCCATCATTGTACACCAGGAGGAAGCTGAAGAG
GAACCTGAGTCTGAGGATTATGACACGCCCATTGGGGAAGTTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTGAACCTGAACCTCCTATTCCTTCTCCCACACT
CATGGTTCCCAAAGAAAAGAAAAAAGAAAAAGAAGAAAAAGAACAATTAGAAGCATTAGAGATGCCCCAGTACAACAGGTTCATGAAGGAATGGTTAGGTAAGAAGCGAA
AGGAAAAGAAGGTTGACACCGTATATCTCGCTTCCACATGCAGCACCAGAGTACAACAGAAGGATAAAGCACGAGATTGGTTGCAATCTATTACCCCTGGGAGCATCACC
ACCTGGGACGCTTTGGTCCAGGCCTTTCTAAAGAAATTCTTCCCTCCTGCAAAGACGGTTCAATTATTTTATAATGGTTTAACTCCTAGTACAAAAACCATTGTTGATGC
AGCTGCAGGTGGGACTCTGTTGTCCAAAACCGTGGAAAATGCTCGTACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGCCCATCTGAGCGGTCTGCACCTAAAA
AGATTGCTGCTGGAGTGTTTGAGGTTGACAAGGTAAGTGCACTCCAGGCCCAGATGACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACATGGAGTGCACAGTCA
ATTGAATCAGCTGTTGTTTTAGCATCTAGACCTCAGGAGGAGACAATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAATTCCACACCTAC
ACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCCAATACTAAGAATGTTCTTAACCCTCCTGGGTTTGCCCCTCAAACTCAAGAAAATAAAAAGTTAG
AAGATATTGTTGGAGCTTTCATTGGAGAGTCAAGTAACAGGACAACCAAGTTGGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACAGTGCTGCCATCAAG
AATATCGAGACTCAGCTGGGACAGTTGGTAAGTGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCTGAGCAGGAGAAAACCCAGATGGAGTACTGTAAAGCCATCAT
TGTACACCAGGAGGAAGCTGAAGAGGAACCTGAGTCTGAGGATTATGACACGCCCATTGGGGAAGTTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTGAACCTG
AACCTCCTATTCCTTATCCCACACTCATGGTTCCCAAAGAAAAGAAAAAAGAAAAAGAAGAAAAAGAACAATTAGAAGCATTAGAGATGCCCCAGTACAACAGGTTCATG
AAGGAATGGTTAGGTAAGAAGCGAAAGGAAAAGAAGGTTGACACCGTATATCTCGCTTCCACATGCAGCACCAGAGTAAAACAGAAGGTACCTGAAAAAGTAGTAGATCC
AAGGAGTTTTTCTGTTCCTTGTAGCTTTGGTACTTATTCTTTTAGAGCATTATGTAATTTAGGTGCTAGCATTAATATTATTCCTCTATCTCTGTGCAAAAATTTAGATA
TAGGTGAGATTAAATCTACTCCTGTAAAGTTCCAATTGACTGATCAGCCTGTGGTTAGACCAGTTGGCATTATAGATGATTTGACAGCGTCTCGACGCTATGACGAAAAA
ACAGAATATAATGTCTTTTGCGGCGAGGTTTTGGGGGATTCGTTTTTGGGACTTCTTGGAGCCGTAAAGCAGGGCAGAACAGAGCATTTTGGAGCTGAATCAAAGGGAGC
AAGTTGGAAATCAACCCATTGTTCGTGGGGATCGTGA
Protein sequence
MARDCAYRGSPTEDPNSHLKSFLDICGTEKINGVSEDAIHLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPHAKTVQLFYNGLTPSTKTIVDAAAGRTLLS
KTVENARTLLEDMATNSYQCPSERSAPKKIATGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIEPAAVLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRN
HENFSYANTKNVLNPPGFASQTQENKKLEDIVGAFIAESSNRTTKLEDAVIAINSTVNGYSAAIENIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEE
EPESEDYDTPIGEVEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKEKEEKEQLEALEMPQYNRFMKEWLGKKRKEKKVDTVYLASTCSTRVQQKDKARDWLQSITPGSIT
TWDALVQAFLKKFFPPAKTVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQCPSERSAPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTWSAQS
IESAVVLASRPQEETIEQVQYVSNFNSRGYNNNSTPTHYHPNNRNHENFSYANTKNVLNPPGFAPQTQENKKLEDIVGAFIGESSNRTTKLEEAVIAINSTVNGHSAAIK
NIETQLGQLVSVVSTMNKGKAPAEQEKTQMEYCKAIIVHQEEAEEEPESEDYDTPIGEVEEDTSSDEAEKPEPEPPIPYPTLMVPKEKKKEKEEKEQLEALEMPQYNRFM
KEWLGKKRKEKKVDTVYLASTCSTRVKQKVPEKVVDPRSFSVPCSFGTYSFRALCNLGASINIIPLSLCKNLDIGEIKSTPVKFQLTDQPVVRPVGIIDDLTASRRYDEK
TEYNVFCGEVLGDSFLGLLGAVKQGRTEHFGAESKGASWKSTHCSWGS