; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015607 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015607
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag protease polyprotein
Genome locationchr12:17511997..17513000
RNA-Seq ExpressionLag0015607
SyntenyLag0015607
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia]3.6e-6347.27Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW+S+AAAEDHANVP+   RFKDLLY+YYFP TV+++K  E L  TQG++ V QY+RKFTELSRF +      + KI +FI GLR EI+G +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC---------------------
         L   TT+ AA+  ALV+DK L +  Q+   +GS+SGVKRK     +       +   Q+QT+  P C  C +NH G C                     
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC---------------------

Query:  ------------------ESGGKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ
                            GG Q+ARVFAL    VE  +AVVTGTILVL +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  Q
Subjt:  ------------------ESGGKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ

Query:  VLRTGEVSIAG
        V++ G++S  G
Subjt:  VLRTGEVSIAG

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]3.0e-7352.27Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW S+AAAED+ANVPI   RFK+LLYDYY+PETVKD KE E LH  QGT+ V QYERKFTELSRF+L+L      KIKRF+KGLR+ IRG +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGK----------------
         L  PTT+  A+ GALV+DK+++  A    EVGS+SGVKRK P T A  + +AP++Q Q Q    P C  C + H GQC +G K                
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGK----------------

Query:  -----------------------QKARVFAL-NDEIVEDDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ
                               Q+ARVFAL   E  + + VVTGT+LV  VPA+VLFDSGSSH+FISS FV QA L+LEPLGF+L VSTPSG +++A Q
Subjt:  -----------------------QKARVFAL-NDEIVEDDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ

Query:  VLRTGEVS
         +R  E+S
Subjt:  VLRTGEVS

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]2.7e-6650.34Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW+S+AAAEDH NVP+   RFKDLLY+YYFP TV+++K  E L  TQG++ V QYERKFTELSRF +      + KI +FI GLR EI+G +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC------------------ESG
         +  PTT+ AA+  ALV+DK L +  Q+   +GS+SGVKRK     +    +  +   Q+QT+  P C  C +NH G C                    G
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC------------------ESG

Query:  GKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
        G Q+ARVFAL    VE  +AVVTGTILV+ +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  QV++ G++S  G
Subjt:  GKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]1.8e-6246.3Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW+S+AAAEDHANVP+   RFKDLLY+YYFP TV+++K  E L  TQG++ V +YERKFTELSRF +      + KI +FI GLR EI+G +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC---------------------
         L  PTT+ AA+  ALV+DK L +  Q+   +GS+SGVKRK     +    +  +   Q+QT+  P C  C ++H G C                     
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC---------------------

Query:  ------------------ESGGKQKARVFALNDEIVE-DDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ
                            GG  +ARVFAL    VE  +AVVT T+LVL +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  Q
Subjt:  ------------------ESGGKQKARVFALNDEIVE-DDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ

Query:  VLRTGEVSIAG
        V++ G++S  G
Subjt:  VLRTGEVSIAG

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]6.0e-6650.18Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW S+A AEDHANVPI   RFKDLLYDYY+P+T+KD KE E LH++ GT+ V QYERKFTELS F+ +L      KIKRF+KGLR+ IRG +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGK------------QKAR
         L  P T+  A+ G L++D +++ + Q   EVGS+SGVKRK  P  A    +AP++  Q+Q    P C  C +   GQC +G +            ++  
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGK------------QKAR

Query:  VFALNDEIVEDDAVVT-----GTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVS
        + A N + +   A  T     GT LV  VPA+VLFD GSSH+FIS+AFV QA L+LEPLGF+L VSTPSG V++A Q++R GE+S
Subjt:  VFALNDEIVEDDAVVT-----GTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVS

TrEMBL top hitse value%identityAlignment
A0A6J1DR22 uncharacterized protein LOC1110230351.8e-6347.27Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW+S+AAAEDHANVP+   RFKDLLY+YYFP TV+++K  E L  TQG++ V QY+RKFTELSRF +      + KI +FI GLR EI+G +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC---------------------
         L   TT+ AA+  ALV+DK L +  Q+   +GS+SGVKRK     +       +   Q+QT+  P C  C +NH G C                     
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC---------------------

Query:  ------------------ESGGKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ
                            GG Q+ARVFAL    VE  +AVVTGTILVL +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  Q
Subjt:  ------------------ESGGKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ

Query:  VLRTGEVSIAG
        V++ G++S  G
Subjt:  VLRTGEVSIAG

A0A6J1DTA8 uncharacterized protein LOC1110241141.3e-6650.34Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW+S+AAAEDH NVP+   RFKDLLY+YYFP TV+++K  E L  TQG++ V QYERKFTELSRF +      + KI +FI GLR EI+G +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC------------------ESG
         +  PTT+ AA+  ALV+DK L +  Q+   +GS+SGVKRK     +    +  +   Q+QT+  P C  C +NH G C                    G
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC------------------ESG

Query:  GKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG
        G Q+ARVFAL    VE  +AVVTGTILV+ +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  QV++ G++S  G
Subjt:  GKQKARVFALNDEIVED-DAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG

A0A6J1DUM2 uncharacterized protein LOC1110232471.4e-7352.27Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW S+AAAED+ANVPI   RFK+LLYDYY+PETVKD KE E LH  QGT+ V QYERKFTELSRF+L+L      KIKRF+KGLR+ IRG +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGK----------------
         L  PTT+  A+ GALV+DK+++  A    EVGS+SGVKRK P T A  + +AP++Q Q Q    P C  C + H GQC +G K                
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGK----------------

Query:  -----------------------QKARVFAL-NDEIVEDDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ
                               Q+ARVFAL   E  + + VVTGT+LV  VPA+VLFDSGSSH+FISS FV QA L+LEPLGF+L VSTPSG +++A Q
Subjt:  -----------------------QKARVFAL-NDEIVEDDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ

Query:  VLRTGEVS
         +R  E+S
Subjt:  VLRTGEVS

A0A6J1DWP4 uncharacterized protein LOC1110252158.8e-6346.3Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW+S+AAAEDHANVP+   RFKDLLY+YYFP TV+++K  E L  TQG++ V +YERKFTELSRF +      + KI +FI GLR EI+G +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC---------------------
         L  PTT+ AA+  ALV+DK L +  Q+   +GS+SGVKRK     +    +  +   Q+QT+  P C  C ++H G C                     
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQC---------------------

Query:  ------------------ESGGKQKARVFALNDEIVE-DDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ
                            GG  +ARVFAL    VE  +AVVT T+LVL +PA+ LFDSGSSHSFI+S FV  ADL+LE LGF+L VSTPSG V++  Q
Subjt:  ------------------ESGGKQKARVFALNDEIVE-DDAVVTGTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQ

Query:  VLRTGEVSIAG
        V++ G++S  G
Subjt:  VLRTGEVSIAG

A0A6J1DYU5 uncharacterized protein LOC1110255172.9e-6650.18Show/hide
Query:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI
        M+R EA NWW S+A AEDHANVPI   RFKDLLYDYY+P+T+KD KE E LH++ GT+ V QYERKFTELS F+ +L      KIKRF+KGLR+ IRG +
Subjt:  MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTI

Query:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGK------------QKAR
         L  P T+  A+ G L++D +++ + Q   EVGS+SGVKRK  P  A    +AP++  Q+Q    P C  C +   GQC +G +            ++  
Subjt:  ALNAPTTFVAALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGK------------QKAR

Query:  VFALNDEIVEDDAVVT-----GTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVS
        + A N + +   A  T     GT LV  VPA+VLFD GSSH+FIS+AFV QA L+LEPLGF+L VSTPSG V++A Q++R GE+S
Subjt:  VFALNDEIVEDDAVVT-----GTILVLKVPAFVLFDSGSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGGGATGAAGCCCGGAATTGGTGGAAGTCAATTGCAGCTGCTGAAGATCATGCCAATGTACCAATCTTGTTGGGGAGATTTAAGGATCTGCTTTACGATTACTA
CTTCCCGGAGACCGTCAAGGACGACAAAGAGACAGAGCTCCTGCACTATACTCAGGGCACTATGTATGTAATCCAGTACGAGCGAAAGTTCACGGAGCTGTCGCGTTTTT
CTCTGGATCTGTTTAGCATGCCGAAAAGAAAAATCAAGAGGTTCATCAAGGGCCTCCGAGAAGAAATTCGAGGAACTATTGCCCTGAATGCACCTACTACTTTTGTTGCG
GCCCTCCATGGGGCGTTGGTCTTGGATAAAAACCTGGCCAAGAATGCACAGACTCACTGGGAGGTCGGTTCGACCTCTGGGGTTAAAAGGAAGCCCCCACCCACTCAAGC
GAGACCTCTGCAGAAGGCACCTCGCCAGCAGTTCCAGAAGCAGACCTCGACCATTCCTTGTTGCAATGTGTGCAGCAGAAACCATGTGGGTCAATGCGAGTCAGGTGGCA
AGCAAAAAGCTCGTGTTTTTGCCCTGAATGATGAAATTGTGGAGGATGATGCCGTGGTGACAGGAACTATTCTTGTTTTGAAAGTCCCTGCTTTTGTGTTATTTGACTCG
GGGTCGAGTCACTCTTTTATCTCATCGGCGTTTGTCGATCAAGCTGATCTGAAGTTAGAGCCGCTAGGGTTTGTTCTCTTAGTGTCCACCCCCTCCGGATTTGTGATGCT
TGCTCAGCAAGTCTTAAGGACGGGTGAAGTTTCAATCGCGGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGAGGGATGAAGCCCGGAATTGGTGGAAGTCAATTGCAGCTGCTGAAGATCATGCCAATGTACCAATCTTGTTGGGGAGATTTAAGGATCTGCTTTACGATTACTA
CTTCCCGGAGACCGTCAAGGACGACAAAGAGACAGAGCTCCTGCACTATACTCAGGGCACTATGTATGTAATCCAGTACGAGCGAAAGTTCACGGAGCTGTCGCGTTTTT
CTCTGGATCTGTTTAGCATGCCGAAAAGAAAAATCAAGAGGTTCATCAAGGGCCTCCGAGAAGAAATTCGAGGAACTATTGCCCTGAATGCACCTACTACTTTTGTTGCG
GCCCTCCATGGGGCGTTGGTCTTGGATAAAAACCTGGCCAAGAATGCACAGACTCACTGGGAGGTCGGTTCGACCTCTGGGGTTAAAAGGAAGCCCCCACCCACTCAAGC
GAGACCTCTGCAGAAGGCACCTCGCCAGCAGTTCCAGAAGCAGACCTCGACCATTCCTTGTTGCAATGTGTGCAGCAGAAACCATGTGGGTCAATGCGAGTCAGGTGGCA
AGCAAAAAGCTCGTGTTTTTGCCCTGAATGATGAAATTGTGGAGGATGATGCCGTGGTGACAGGAACTATTCTTGTTTTGAAAGTCCCTGCTTTTGTGTTATTTGACTCG
GGGTCGAGTCACTCTTTTATCTCATCGGCGTTTGTCGATCAAGCTGATCTGAAGTTAGAGCCGCTAGGGTTTGTTCTCTTAGTGTCCACCCCCTCCGGATTTGTGATGCT
TGCTCAGCAAGTCTTAAGGACGGGTGAAGTTTCAATCGCGGGCTAG
Protein sequenceShow/hide protein sequence
MVRDEARNWWKSIAAAEDHANVPILLGRFKDLLYDYYFPETVKDDKETELLHYTQGTMYVIQYERKFTELSRFSLDLFSMPKRKIKRFIKGLREEIRGTIALNAPTTFVA
ALHGALVLDKNLAKNAQTHWEVGSTSGVKRKPPPTQARPLQKAPRQQFQKQTSTIPCCNVCSRNHVGQCESGGKQKARVFALNDEIVEDDAVVTGTILVLKVPAFVLFDS
GSSHSFISSAFVDQADLKLEPLGFVLLVSTPSGFVMLAQQVLRTGEVSIAG