; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0004380 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0004380
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr05:17335129..17336322
RNA-Seq ExpressionPI0004380
SyntenyPI0004380
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048713.1 hypothetical protein E6C27_scaffold43G00050 [Cucumis melo var. makuwa]4.8e-4456.89Show/hide
Query:  LRDEAKRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNR------RGGRGRSEEGMDKNA
        L D  KRFKRM+K CP++ IPEC+LME FYFGL+K T Q+ + VF GGM +SSYNQIK  LDTMA+N++EW ++ F +R      +G RGR E+G+D + 
Subjt:  LRDEAKRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNR------RGGRGRSEEGMDKNA

Query:  VVALQRQMTAINILLNSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDP
        +VALQ Q+  +NI L SMA+ QVNVV  SV    Q+++MGCVGC  PH+T+ACPLNTE VA+++NDP
Subjt:  VVALQRQMTAINILLNSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDP

XP_038880527.1 uncharacterized protein LOC120072192 [Benincasa hispida]3.5e-4736.02Show/hide
Query:  MAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDISPEELRFALFPLILRDEAK---
        MA++  RP+R YA+P LY+F+PGI YP+  +  RFE+K +MLQM+Q   QFG   GEDPH H++ F   C  F +P I+PE++R +LFP  LRD+AK   
Subjt:  MAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDISPEELRFALFPLILRDEAK---

Query:  ---------------------------------------------------RFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYN
                                                           RF  +VK CP + +   I ME FY GLN+A+Q   DA    G+   SY 
Subjt:  ---------------------------------------------------RFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYN

Query:  QIKATLDTMANNNEEWDEDDFDNRRGGRGRSE--EGMDKNAVVALQRQMTAINILL---------NSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDT
        + K  L  +A +N EW +D +D R   R RS+    +D NA+  L  Q+  +  LL         N   ++QV V G           + CVGC   H  
Subjt:  QIKATLDTMANNNEEWDEDDFDNRRGGRGRSE--EGMDKNAVVALQRQMTAINILL---------NSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDT

Query:  DACPLNTETVAFVRNDPFSNTYNLGWRNHPYFGWGETGQQNQGRHGG
          CP N ++V F++N+PFSNTYN GW NHP F W  TG   Q  H G
Subjt:  DACPLNTETVAFVRNDPFSNTYNLGWRNHPYFGWGETGQQNQGRHGG

XP_038882276.1 uncharacterized protein LOC120073506 [Benincasa hispida]1.3e-4637.27Show/hide
Query:  RNAPLPQAAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDISPEE--
        +N  L  AAQ P    MA++  RPIR Y +P LY+F P I YP      RFE+K +MLQM++  G+FG   GE PH H++ F   C SF +P I+PEE  
Subjt:  RNAPLPQAAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDISPEE--

Query:  ----------------LRFALFPLILRDEA------KRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNE
                         R  +      D        + FK +VK CP +G+   I ME FY  LN+A+Q  VDA    G+ + SY + K  LD +A +N 
Subjt:  ----------------LRFALFPLILRDEA------KRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNE

Query:  EWDEDDFDNRRGGRGRSE--EGMDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVHAANQIDDMG-----CVGCDGPHDTDACPLNTETVAFVRNDPF
        EW +D +D R   R RS+    +D NA+  L  Q+  +  LL ++ +        +    NQ++  G     CVGC   H    C  N ++V F++N+PF
Subjt:  EWDEDDFDNRRGGRGRSE--EGMDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVHAANQIDDMG-----CVGCDGPHDTDACPLNTETVAFVRNDPF

Query:  SNTYNLGWRNHPYFGWGETGQQ
        SNTYN GWRNHP F W    QQ
Subjt:  SNTYNLGWRNHPYFGWGETGQQ

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]3.1e-5136.16Show/hide
Query:  MENNNNRNAP-----LPQAAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSF
        M +NNN  AP     + +  Q+P   ++A D + PIR+YAAPNLY+F+PGI+ P+  ENARFEIKP+M+QMIQN  QF S   E+PH H+  F  +C +F
Subjt:  MENNNNRNAP-----LPQAAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSF

Query:  HMPDISPEELRFALFPLILRDEAK------------------------------------------------------RFKRMVKACPYNGIPECILMEV
         +P I+P  +R  LFP  LRD+AK                                                      RF+R+VK CP+ GI +C+LME+
Subjt:  HMPDISPEELRFALFPLILRDEAK------------------------------------------------------RFKRMVKACPYNGIPECILMEV

Query:  FYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG-MDKNAVVALQRQMTAINILLNSMAISQVNVVGGSV--H
        FY GLN++TQ   DA  V      +Y + K  LD ++ N ++W +D +  R   R R++   +  N +  L  QM  +  LL  MA++Q     GS   +
Subjt:  FYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG-MDKNAVVALQRQMTAINILLNSMAISQVNVVGGSV--H

Query:  AANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDPFSNTYNLGWRNHPYFGWG
        A  Q+  +  +     H  + CP N + V  ++N+P++NTYN  WRNHP FGWG
Subjt:  AANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDPFSNTYNLGWRNHPYFGWG

XP_038896595.1 uncharacterized protein LOC120084850 [Benincasa hispida]1.8e-5137.65Show/hide
Query:  IRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDISPEELRFALFPLILRDEAKRF---------
        +R+YAAPN YNF P I+ P   ENA F+I+ +MLQMIQN G+FG   GEDPH H+  F  IC +F +P +SPE +R  +FP  LRDEA+R+         
Subjt:  IRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDISPEELRFALFPLILRDEAKRF---------

Query:  --------KRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG--MDKNAVVAL
                +++VK C + GIP C+LME  Y GL+++ Q   DA    G+   +Y + K  LD ++ N +EW  + ++  RG   R  EG  +  + +  L
Subjt:  --------KRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG--MDKNAVVAL

Query:  QRQMTAINILLNSMAISQVNVVGGS--VHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDPFSNTYNLGWRNHPYFGWGETGQQNQGRHGGQGDRR
          QMT + +LL +MAI Q ++   S   +A   +  + CV C   H  + CPLN + V  + N+PF N YN  WRNHP F W   G  NQG         
Subjt:  QRQMTAINILLNSMAISQVNVVGGS--VHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDPFSNTYNLGWRNHPYFGWGETGQQNQGRHGGQGDRR

Query:  EEASGSHTRYHNNRPHTPII----NNNPPSL--LHPLLHP
          +  +H   H     +P I    N+NP +L  + P  HP
Subjt:  EEASGSHTRYHNNRPHTPII----NNNPPSL--LHPLLHP

TrEMBL top hitse value%identityAlignment
A0A5A7TNS8 Uncharacterized protein1.1e-3836.77Show/hide
Query:  ENNNNRNAPLPQAAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDIS
        +N  + N      A E N   +AH+L+ PI  YA+P+LY FN GI YP FG NARF++KP ML MIQ T QFG +  EDP +HI+S Y +C SFH+P +S
Subjt:  ENNNNRNAPLPQAAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDIS

Query:  PEELRFALFPLILRDEAKRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVF--VGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRS
         E+L    F   L+D+ K++   ++                     +ATQ  VD +F   G + + +Y QIK TLD M+ N+E                 
Subjt:  PEELRFALFPLILRDEAKRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVF--VGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRS

Query:  EEGMDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDPFSNTYNLGWRNHPYFG
                                   +SQV +       A +I+ MGCVGCDGPH T+A   NTE+V +++   +SNTYNL WRNH  FG
Subjt:  EEGMDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDPFSNTYNLGWRNHPYFG

A0A5D3CC26 Uncharacterized protein2.3e-4456.89Show/hide
Query:  LRDEAKRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNR------RGGRGRSEEGMDKNA
        L D  KRFKRM+K CP++ IPEC+LME FYFGL+K T Q+ + VF GGM +SSYNQIK  LDTMA+N++EW ++ F +R      +G RGR E+G+D + 
Subjt:  LRDEAKRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNR------RGGRGRSEEGMDKNA

Query:  VVALQRQMTAINILLNSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDP
        +VALQ Q+  +NI L SMA+ QVNVV  SV    Q+++MGCVGC  PH+T+ACPLNTE VA+++NDP
Subjt:  VVALQRQMTAINILLNSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDP

A0A6J1EEI2 uncharacterized protein LOC1114333944.0e-4432.79Show/hide
Query:  NNNRNAPLPQAAQE---PNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDI
        N     P   A QE    N  ++A D +R IR+YA P +   NP I  P   +   FE+KP+M QM+Q  GQF   P EDPH H++SF  +  SF    +
Subjt:  NNNRNAPLPQAAQE---PNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDI

Query:  SPEELRFALFPLILRDEAK------------------------------------------------------RFKRMVKACPYNGIPECILMEVFYFGL
          + +R +LFP  LRD AK                                                      RFK M++ CP++G+P CI ME FY GL
Subjt:  SPEELRFALFPLILRDEAK------------------------------------------------------RFKRMVKACPYNGIPECILMEVFYFGL

Query:  NKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG-MDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVHAA---NQ
        N AT+Q VDA   G +   +YN+    L+ +A+NN +W +      R   GR   G ++ +A+ ++  Q+ ++  +L ++A+ Q +++   VH     NQ
Subjt:  NKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG-MDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVHAA---NQ

Query:  IDDMGCVGCDGPHDTDACPLNTETVAFV---------RNDPFSNTYNLGWRNHPYFGWGETGQQNQ
             CV C   H  D CP N  ++ +V         +N+PFSNTYN GWRNHP F W   G  NQ
Subjt:  IDDMGCVGCDGPHDTDACPLNTETVAFV---------RNDPFSNTYNLGWRNHPYFGWGETGQQNQ

A0A6J1EQ90 uncharacterized protein LOC1114364113.1e-4131.9Show/hide
Query:  NNNRNAPLPQAAQE---PNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSI-------CP
        N     P   A QE    N  ++A D +R IR+YA P +   NP I  P   +   FE+KP+M QM+Q  GQF   P EDPH H++SF  +         
Subjt:  NNNRNAPLPQAAQE---PNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSI-------CP

Query:  SFHMPDISPEELRFALFPLILRDEAK------------------------------------------------------RFKRMVKACPYNGIPECILM
        SF    +  + +R +LFP +LRD AK                                                      RFK M++ CP++G+P CI M
Subjt:  SFHMPDISPEELRFALFPLILRDEAK------------------------------------------------------RFKRMVKACPYNGIPECILM

Query:  EVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG-MDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVH
        E FY GLN  T+Q VDA   G +   +YN+    L+ +A+NN +W +      R   GR   G ++ +A+ ++  Q+ ++  +L ++A+ Q +++   VH
Subjt:  EVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG-MDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVH

Query:  AA---NQIDDMGCVGCDGPHDTDACPLNTETVAFV---------RNDPFSNTYNLGWRNHPYFGWGETGQQNQ
         A   NQ     CV C   H  D CP N  ++ +V         +N+PFSNTYN GWRNHP F W      NQ
Subjt:  AA---NQIDDMGCVGCDGPHDTDACPLNTETVAFV---------RNDPFSNTYNLGWRNHPYFGWGETGQQNQ

A0A6J1H7E4 uncharacterized protein LOC1114611689.7e-4332.51Show/hide
Query:  NNNRNAPLPQAAQE---PNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDI
        N     P+  A QE    N   +A D +R IR+YA P +   NP I  P   +   FE+KP+M QM+Q  GQF   P EDPH H++SF  +  SF    +
Subjt:  NNNRNAPLPQAAQE---PNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDI

Query:  SPEELRFALFPLILRDEAK------------------------------------------------------RFKRMVKACPYNGIPECILMEVFYFGL
          + +R +LFP  LRD AK                                                      RFK M++ CP++G+P CI ME FY GL
Subjt:  SPEELRFALFPLILRDEAK------------------------------------------------------RFKRMVKACPYNGIPECILMEVFYFGL

Query:  NKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG-MDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVHAA---NQ
        N AT+Q VDA   G M   +YN+    L+ +A+NN +W +      R   G+   G ++ +A+ ++  Q+ ++  +L ++A  Q  ++    H A    Q
Subjt:  NKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG-MDKNAVVALQRQMTAINILLNSMAISQVNVVGGSVHAA---NQ

Query:  IDDMGCVGCDGPHDTDACPLNTETVAFVR---------NDPFSNTYNLGWRNHPYFGWGETGQQNQ
             CV C   H  D CP N  ++ +VR         N+P SNTYN GWRNHP F W   G  NQ
Subjt:  IDDMGCVGCDGPHDTDACPLNTETVAFVR---------NDPFSNTYNLGWRNHPYFGWGETGQQNQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATAATAACAATAGAAACGCCCCTCTGCCGCAAGCTGCCCAAGAACCAAACGTTGCCTACATGGCACATGACCTGGACAGGCCAATTAGGTCATATGCGGCGCC
CAACCTCTACAACTTCAACCCAGGGATCGCTTACCCTGTATTCGGCGAAAATGCAAGGTTTGAAATCAAGCCTATGATGCTACAAATGATTCAGAACACCGGACAATTTG
GCAGTCACCCTGGAGAAGATCCACACGAGCACATTAGAAGTTTCTACTCTATTTGTCCTTCCTTCCATATGCCAGACATCTCACCTGAGGAATTGAGATTCGCACTATTC
CCGTTAATTCTAAGGGATGAGGCGAAAAGGTTCAAGAGGATGGTCAAAGCATGCCCCTACAATGGCATTCCTGAATGCATCTTGATGGAGGTCTTCTACTTTGGCTTGAA
CAAGGCGACACAACAGACTGTTGATGCTGTGTTTGTAGGTGGTATGTTTAAAAGCTCCTACAACCAAATTAAGGCAACGCTGGACACGATGGCCAACAATAATGAAGAAT
GGGATGAAGATGATTTCGACAATCGTCGAGGAGGACGAGGAAGAAGCGAAGAAGGTATGGATAAGAACGCCGTGGTGGCGTTGCAGAGACAAATGACTGCGATAAACATT
CTTCTCAACTCTATGGCAATATCGCAAGTCAATGTCGTAGGAGGCTCTGTGCACGCGGCAAATCAAATTGATGATATGGGATGTGTGGGATGCGACGGTCCTCATGATAC
TGACGCATGCCCACTCAACACAGAAACAGTCGCGTTCGTAAGGAACGACCCTTTCTCCAACACTTACAACCTTGGTTGGAGGAACCATCCTTACTTTGGATGGGGAGAAA
CGGGTCAACAAAATCAAGGGCGACATGGTGGTCAAGGTGACCGTCGCGAAGAAGCATCTGGCTCCCACACGAGGTACCACAACAACAGACCCCACACTCCCATCATCAAC
AACAACCCACCATCATTACTCCATCCACTTCTTCATCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATAATAACAATAGAAACGCCCCTCTGCCGCAAGCTGCCCAAGAACCAAACGTTGCCTACATGGCACATGACCTGGACAGGCCAATTAGGTCATATGCGGCGCC
CAACCTCTACAACTTCAACCCAGGGATCGCTTACCCTGTATTCGGCGAAAATGCAAGGTTTGAAATCAAGCCTATGATGCTACAAATGATTCAGAACACCGGACAATTTG
GCAGTCACCCTGGAGAAGATCCACACGAGCACATTAGAAGTTTCTACTCTATTTGTCCTTCCTTCCATATGCCAGACATCTCACCTGAGGAATTGAGATTCGCACTATTC
CCGTTAATTCTAAGGGATGAGGCGAAAAGGTTCAAGAGGATGGTCAAAGCATGCCCCTACAATGGCATTCCTGAATGCATCTTGATGGAGGTCTTCTACTTTGGCTTGAA
CAAGGCGACACAACAGACTGTTGATGCTGTGTTTGTAGGTGGTATGTTTAAAAGCTCCTACAACCAAATTAAGGCAACGCTGGACACGATGGCCAACAATAATGAAGAAT
GGGATGAAGATGATTTCGACAATCGTCGAGGAGGACGAGGAAGAAGCGAAGAAGGTATGGATAAGAACGCCGTGGTGGCGTTGCAGAGACAAATGACTGCGATAAACATT
CTTCTCAACTCTATGGCAATATCGCAAGTCAATGTCGTAGGAGGCTCTGTGCACGCGGCAAATCAAATTGATGATATGGGATGTGTGGGATGCGACGGTCCTCATGATAC
TGACGCATGCCCACTCAACACAGAAACAGTCGCGTTCGTAAGGAACGACCCTTTCTCCAACACTTACAACCTTGGTTGGAGGAACCATCCTTACTTTGGATGGGGAGAAA
CGGGTCAACAAAATCAAGGGCGACATGGTGGTCAAGGTGACCGTCGCGAAGAAGCATCTGGCTCCCACACGAGGTACCACAACAACAGACCCCACACTCCCATCATCAAC
AACAACCCACCATCATTACTCCATCCACTTCTTCATCCATGA
Protein sequenceShow/hide protein sequence
MENNNNRNAPLPQAAQEPNVAYMAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPMMLQMIQNTGQFGSHPGEDPHEHIRSFYSICPSFHMPDISPEELRFALF
PLILRDEAKRFKRMVKACPYNGIPECILMEVFYFGLNKATQQTVDAVFVGGMFKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEGMDKNAVVALQRQMTAINI
LLNSMAISQVNVVGGSVHAANQIDDMGCVGCDGPHDTDACPLNTETVAFVRNDPFSNTYNLGWRNHPYFGWGETGQQNQGRHGGQGDRREEASGSHTRYHNNRPHTPIIN
NNPPSLLHPLLHP