; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0005882 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0005882
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr04:22107346..22111820
RNA-Seq ExpressionPI0005882
SyntenyPI0005882
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.0e-7348.08Show/hide
Query:  FMSDSEQP-FELDPEIERTFRG-----NRRRARQTQVRRMENNNRNAPQPQADPKPNA------------AYIAHDLDRPIRSYAGPNLYNFNPGIAYPI
        +MS+ + P F++DPEIERTFR       +RR+ QT    M+        PQA  + NA              +AHD +RP+R YA PNLYNF PGI  P 
Subjt:  FMSDSEQP-FELDPEIERTFRG-----NRRRARQTQVRRMENNNRNAPQPQADPKPNA------------AYIAHDLDRPIRSYAGPNLYNFNPGIAYPI

Query:  FSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYEN
        F  N RFE+KPVMLQM+Q  GQFGG  GEDPH H++SF  IC++F M G+  + +R  LFP +LRDEA++WA + + GE+ TW +++EKFM+K FPP  +
Subjt:  FSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYEN

Query:  ARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEE
        A+RR+++++F+QKD E   +AW+RFKR+V+ CPHNGIP C+ ME+FY GLNK +Q  ADA    G++  TY Q K  LD ++ N  +
Subjt:  ARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEE

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]4.6e-5847.33Show/hide
Query:  FELDPEIERTFRGNR---RRARQTQVRRMENNNRNAPQPQADPKPNAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQF
        F  DPEIERTF   R   R+ +QTQV  M++N  N   P     P  A+I  D DR IR YA P     N GI  P   +  +FE+KPVM QM+Q +GQF
Subjt:  FELDPEIERTFRGNR---RRARQTQVRRMENNNRNAPQPQADPKPNAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQF

Query:  GGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWS
         G P EDPH H+R F  I  SF   G+  + LR  LFP ++RD A+ W N+L  G V TW+ L EKF+ K FPP  NA+ R E+ SFQQ+D E+L+DAW 
Subjt:  GGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWS

Query:  RFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNN
        RFK +++ CPH+GI  CI ME FY GLN  T+   DA     +L  +YNQ    L+T+A+ N
Subjt:  RFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNN

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]4.3e-5645.35Show/hide
Query:  MSDSEQPFEL---DPEIERTFRGNRRRARQTQVRRMENNNRNAPQPQADPKPNAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQM
        M++ E+  EL   DPEIERTF   R+R ++ + ++  N            + N   +A D  R IR YA P     NPGI  P   +   FE+KPVM QM
Subjt:  MSDSEQPFEL---DPEIERTFRGNRRRARQTQVRRMENNNRNAPQPQADPKPNAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQM

Query:  IQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRE
        +Q VGQFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L EKF++K FPP  NA+ R E+MSFQQ + E
Subjt:  IQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRE

Query:  NLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNN
           DAW RFK +++ CPH+GIP CI +E FY GLN A +   DA     +L  +YN+    L+ +ASNN
Subjt:  NLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNN

XP_038880527.1 uncharacterized protein LOC120072192 [Benincasa hispida]1.9e-5646.12Show/hide
Query:  IAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWA
        +A++  RP+R YA P LY+F+PGI YP+  +  RFE+K VMLQM+Q   QFGG  GEDPH H++ F   C  F +P I+PE++R +LFP +LRD+AK+W 
Subjt:  IAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWA

Query:  NALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYN
        ++L+  E+ TW++L+EKFM+K FPP  NARRR+E+M+F+Q+D E L  A  RF  +VK CP++ +   I ME FY GLN+A+Q  ADA   +G++  +Y 
Subjt:  NALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYN

Query:  QIKTTLDTMASNNEECDEDDFGGR--RGRRAKGDDGMDKSAVVAL
        + K  L  +A +N E  +D + GR  R RR++  + +D +A+  L
Subjt:  QIKTTLDTMASNNEECDEDDFGGR--RGRRAKGDDGMDKSAVVAL

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]4.9e-6846.55Show/hide
Query:  SEQPFELDPEIERTF--RGNRRRARQTQVRRMENNNRNAPQPQAD---PKPNAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMI
        ++  FE +PEI+ TF  R ++ RA + ++   +NNN  AP+       P  +  ++A D + PIR+YA PNLY+F+PGI+ PI  ENARFEIKPVM+QMI
Subjt:  SEQPFELDPEIERTF--RGNRRRARQTQVRRMENNNRNAPQPQAD---PKPNAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMI

Query:  QNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDREN
        QN+ QF     E+PH H+  F  +C++F +PGI+P  +R  LFP TLRD+AKRWA++L+  E+ + DQL+E FMKK FPP  N RRRK +++F++ D E 
Subjt:  QNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDREN

Query:  LHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEECDEDDFGGRRGRRAKGDDGM
        L  AW RF+R+VK CPH GI  C+LME+FY GLN++TQ  ADA  V+  +  TY + K  LD ++ N ++  +D + GR   R + D+ +
Subjt:  LHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEECDEDDFGGRRGRRAKGDDGM

TrEMBL top hitse value%identityAlignment
A0A5A7V1F3 Retrotrans_gag domain-containing protein9.3e-4948.15Show/hide
Query:  YIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRW
        Y+AH+L RPIRSYA P+LY FNPGIAYP F ENA +E K                                                       D+AKRW
Subjt:  YIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRW

Query:  ANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTY
        AN+++ GEV TW+ LIEKFMKK FP  + A+RR++L+ F+Q+DR+NLHDAWS FKRMVKAC H+GI K +LME FYFGL+K T+Q+AD++F+ G+L+S+Y
Subjt:  ANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTY

Query:  NQIKTTLDTMASNNEE
        NQIK  LD+MA+N+++
Subjt:  NQIKTTLDTMASNNEE

A0A6J1EEI2 uncharacterized protein LOC1114333948.2e-5345.53Show/hide
Query:  NAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA
        NA ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H++SF  +  SF    +  + +R +LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA

Query:  KRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        K W N L  G + +W+ L+EKF+ K FPP  NAR R E++ FQQ + + L +AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     +L 
Subjt:  KRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMASNNEECDEDDFGGRRGRRAKG
         TYN+    L+ +ASNN  C   D     GR+ +G
Subjt:  STYNQIKTTLDTMASNNEECDEDDFGGRRGRRAKG

A0A6J1EQ90 uncharacterized protein LOC1114364114.9e-5040.68Show/hide
Query:  FELDPEIERTFRGNRRRARQ------TQVRRMENNNRNAPQP-----QADPKPNAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQ
        F LDPEIERTFR   ++ ++       Q+      NR    P     Q     N  ++A D +R IR+YA P +   NP I  P   +   FE+KPVM Q
Subjt:  FELDPEIERTFRGNRRRARQ------TQVRRMENNNRNAPQP-----QADPKPNAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQ

Query:  MIQNVGQFGGHPGEDPHEHIRSFYSI-------CASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELM
        M+Q +GQF G P EDPH H++SF  +         SF   G+  + +R +LFP  LRD AK W N L  G + +W+ L E F+ K FPP  NAR + E++
Subjt:  MIQNVGQFGGHPGEDPHEHIRSFYSI-------CASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELM

Query:  SFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEECDEDDFGGRRGRRAKG
        +FQQ + E L +A  RFK M++ CPH+G+P CI ME FY GLN  T+Q  DA     +L  TYN+    L+ +ASNN  C   D     GR+ +G
Subjt:  SFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEECDEDDFGGRRGRRAKG

A0A6J1H7E4 uncharacterized protein LOC1114611686.2e-5345.96Show/hide
Query:  NAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA
        NA  +A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q +GQF G P EDPH H++SF  +  SF   G+  + +R +LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA

Query:  KRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        K W N L    + +W+ L EKF+ K FPP  NAR R E+++FQQ + E L +AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     ML 
Subjt:  KRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMASNNEECDEDDFGGRRGRRAKG
         TYN+    L+ +ASNN  C   D     G++ +G
Subjt:  STYNQIKTTLDTMASNNEECDEDDFGGRRGRRAKG

U5CUI2 Retrotrans_gag domain-containing protein4.3e-5449.31Show/hide
Query:  NAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA
        N   +A D  R IR YA P     NPGI  P   +  +FE+KPVM QM+Q VGQF G P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEA

Query:  KRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK
        + W N L    V  W+ L EKF++K FPP  NA+ R E+MSFQQ + E+  DAW RFK +++ CPH+GIP CI ME FY GLN A++   DA     +L 
Subjt:  KRWANALKGGEVGTWDQLIEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLK

Query:  STYNQIKTTLDTMASNN
         +YN+    L+T+ASNN
Subjt:  STYNQIKTTLDTMASNN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACCTCATTCGCGAGTATATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGGCAGCTCGCCAGCGATTT
CTCTGGAAGACCGCAAGGATCCCTCCCAAGCAATACAGAAACGCCAAATCAGGATTATTGCGAGGAAGAAGAAGCTTCTAAGTTGCTCAACCCTGAAGAATTGTTTGAAG
AAACATCGGAGACAGAATGTGTAAACGTCGTATCTGGCGAAAGGAAATCAGAACATTGGGATTTCAAGCAAATAAGAGATGTCAGGCGAGGAAGTCTATGCGAGGAAGCT
TCAGGCGAGGAAGCGGCGAAAGAGATAGAGATATTCCCTTCGCCGAACTTATCCTTAGCATATTACTCAGAAATGATGAAATATGTGGCGGAAGCACCGAAAGAAACTCA
GTTTCAGTTGAATCGCACGTTCAAACCCGAAGAACATGAGAACAAGTTTATGAGTGACAGTGAACAACCATTCGAACTTGACCCTGAAATTGAGCGAACATTTCGGGGTA
ATCGGCGAAGAGCAAGGCAGACACAAGTTCGAAGAATGGAAAATAACAACAGAAATGCCCCTCAGCCGCAAGCTGACCCAAAACCAAATGCTGCCTACATAGCACACGAC
TTGGATAGGCCAATTAGATCTTATGCAGGACCCAACCTCTACAACTTCAACCCAGGAATCGCCTACCCTATATTCAGCGAGAACGCCAGGTTTGAAATCAAACCTGTTAT
GCTTCAAATGATTCAGAACGTCGGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCA
TCTCACCTGAAGAATTAAGATTCGCTCTCTTCCCGTTAACTTTGAGGGATGAGGCGAAGAGGTGGGCAAATGCTCTGAAAGGTGGCGAGGTGGGAACATGGGATCAACTA
ATAGAGAAATTTATGAAGAAAATTTTTCCACCTTACGAAAACGCTAGAAGAAGGAAGGAGCTTATGAGCTTCCAGCAGAAGGACAGAGAAAACCTACATGACGCGTGGAG
TAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTATTTTGGACTAAACAAGGCTACACAGCAGACTGCTGATG
CTGTGTTTGTAGACGGTATGCTGAAAAGTACATACAATCAGATTAAGACGACGCTGGATACGATGGCCAGCAACAATGAAGAATGTGATGAAGATGATTTCGGCGGTCGC
CGAGGAAGACGAGCAAAAGGTGATGATGGCATGGATAAAAGCGCCGTGGTGGCATTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACCTCATTCGCGAGTATATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGGCAGCTCGCCAGCGATTT
CTCTGGAAGACCGCAAGGATCCCTCCCAAGCAATACAGAAACGCCAAATCAGGATTATTGCGAGGAAGAAGAAGCTTCTAAGTTGCTCAACCCTGAAGAATTGTTTGAAG
AAACATCGGAGACAGAATGTGTAAACGTCGTATCTGGCGAAAGGAAATCAGAACATTGGGATTTCAAGCAAATAAGAGATGTCAGGCGAGGAAGTCTATGCGAGGAAGCT
TCAGGCGAGGAAGCGGCGAAAGAGATAGAGATATTCCCTTCGCCGAACTTATCCTTAGCATATTACTCAGAAATGATGAAATATGTGGCGGAAGCACCGAAAGAAACTCA
GTTTCAGTTGAATCGCACGTTCAAACCCGAAGAACATGAGAACAAGTTTATGAGTGACAGTGAACAACCATTCGAACTTGACCCTGAAATTGAGCGAACATTTCGGGGTA
ATCGGCGAAGAGCAAGGCAGACACAAGTTCGAAGAATGGAAAATAACAACAGAAATGCCCCTCAGCCGCAAGCTGACCCAAAACCAAATGCTGCCTACATAGCACACGAC
TTGGATAGGCCAATTAGATCTTATGCAGGACCCAACCTCTACAACTTCAACCCAGGAATCGCCTACCCTATATTCAGCGAGAACGCCAGGTTTGAAATCAAACCTGTTAT
GCTTCAAATGATTCAGAACGTCGGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCA
TCTCACCTGAAGAATTAAGATTCGCTCTCTTCCCGTTAACTTTGAGGGATGAGGCGAAGAGGTGGGCAAATGCTCTGAAAGGTGGCGAGGTGGGAACATGGGATCAACTA
ATAGAGAAATTTATGAAGAAAATTTTTCCACCTTACGAAAACGCTAGAAGAAGGAAGGAGCTTATGAGCTTCCAGCAGAAGGACAGAGAAAACCTACATGACGCGTGGAG
TAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTATTTTGGACTAAACAAGGCTACACAGCAGACTGCTGATG
CTGTGTTTGTAGACGGTATGCTGAAAAGTACATACAATCAGATTAAGACGACGCTGGATACGATGGCCAGCAACAATGAAGAATGTGATGAAGATGATTTCGGCGGTCGC
CGAGGAAGACGAGCAAAAGGTGATGATGGCATGGATAAAAGCGCCGTGGTGGCATTATAG
Protein sequenceShow/hide protein sequence
MENLIREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRPQGSLPSNTETPNQDYCEEEEASKLLNPEELFEETSETECVNVVSGERKSEHWDFKQIRDVRRGSLCEEA
SGEEAAKEIEIFPSPNLSLAYYSEMMKYVAEAPKETQFQLNRTFKPEEHENKFMSDSEQPFELDPEIERTFRGNRRRARQTQVRRMENNNRNAPQPQADPKPNAAYIAHD
LDRPIRSYAGPNLYNFNPGIAYPIFSENARFEIKPVMLQMIQNVGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWANALKGGEVGTWDQL
IEKFMKKIFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFVDGMLKSTYNQIKTTLDTMASNNEECDEDDFGGR
RGRRAKGDDGMDKSAVVAL