; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g26190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g26190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:19078601..19079482
RNA-Seq ExpressionMoc10g26190
SyntenyMoc10g26190
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143489.1 uncharacterized protein LOC111013365 [Momordica charantia]6.8e-3953.42Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF
        G++PSYI  L+ QL+ RFDMTDL  +KYFLGL+I+ + NG+ +SQAKY RD+L RF + +AKPC TP+S+ +      PCS  D+  YR++IG+L Y TF
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF

Query:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR
        +RPDI  ++ KLSQFMH+P   HL AAKR+LRY++G++   ++FQR
Subjt:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR

XP_022143496.1 uncharacterized protein LOC111013373 [Momordica charantia]1.2e-3258.91Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF
        GN+PSYI  L+SQL+L F+MTDLG +KYFLGL+I+ S  GI V Q KY RD+L RF M  AK C TP+++ T       CS +D   YRA+IGAL Y TF
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF

Query:  SRPDIVLSISKLSQFMHSPSLFHLAAAKR
        SRPDI   ISKLSQFMH P   HL  AKR
Subjt:  SRPDIVLSISKLSQFMHSPSLFHLAAAKR

XP_022151604.1 uncharacterized protein LOC111019517 [Momordica charantia]9.8e-3856.34Show/hide
Query:  SYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFSRPD
        SYID L+SQL+LRFDMTDLG +++FLGL+I  S +GI V Q+KY +D+L+RF M +AKPC TPI++ +  + S  CS +D   YR+++GAL Y TFSRP+
Subjt:  SYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFSRPD

Query:  IVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR
        I  ++SKLSQ +HSP+  HL AAKRVLRY+ G++  GL+FQ+
Subjt:  IVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR

XP_022152156.1 uncharacterized protein LOC111019945 [Momordica charantia]7.2e-4155.7Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF
        GN+ SYI  L+S L+ RFDMTDLG + YFLGL+IT +  GI V+QAKY RD+L RF M +AKPC TP+++       +PCS  D   YR++IGA  Y TF
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF

Query:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKVYAPLCITAFS
        SRPDI  S+SKLSQ MH P ++HL AAKR+LRY+ GT+  GL+F+R   A L + AFS
Subjt:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKVYAPLCITAFS

XP_022152751.1 uncharacterized protein LOC111020396 isoform X1 [Momordica charantia]4.2e-81100Show/hide
Query:  GTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQSIGLLPPQQLNYSGFVTCFVILVLLCLSHPRFGV
        GTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQSIGLLPPQQLNYSGFVTCFVILVLLCLSHPRFGV
Subjt:  GTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQSIGLLPPQQLNYSGFVTCFVILVLLCLSHPRFGV

Query:  ITSHYSISSKSGCSMAVRSMWRSIFILSRSGLFVKTYAFNIFLLCSTCRLADKTTYW
        ITSHYSISSKSGCSMAVRSMWRSIFILSRSGLFVKTYAFNIFLLCSTCRLADKTTYW
Subjt:  ITSHYSISSKSGCSMAVRSMWRSIFILSRSGLFVKTYAFNIFLLCSTCRLADKTTYW

TrEMBL top hitse value%identityAlignment
A0A2N9ISB5 Integrase catalytic domain-containing protein2.6e-3641.48Show/hide
Query:  NNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFS
        N P+Y+D LV+QL   FD+ DLG + YFLGL +TRS  G++++QAKYA DLL +  M  +KP  +P    T     +     D H YR+++GAL Y TF+
Subjt:  NNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFS

Query:  RPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQ
        RPDI  S+ ++ Q+M +P+  HLAAAKR+LRYI GT+  G                         M  V PL +L  L LI  LG PRN   FP +L  Q
Subjt:  RPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQ

Query:  SIGLLPPQQLNYSGFVTCFVILVLLCLSH
        +  LLP  QL+  GF     IL    L+H
Subjt:  SIGLLPPQQLNYSGFVTCFVILVLLCLSH

A0A6J1CPG5 uncharacterized protein LOC1110133653.3e-3953.42Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF
        G++PSYI  L+ QL+ RFDMTDL  +KYFLGL+I+ + NG+ +SQAKY RD+L RF + +AKPC TP+S+ +      PCS  D+  YR++IG+L Y TF
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF

Query:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR
        +RPDI  ++ KLSQFMH+P   HL AAKR+LRY++G++   ++FQR
Subjt:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR

A0A6J1DDJ2 uncharacterized protein LOC1110195174.7e-3856.34Show/hide
Query:  SYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFSRPD
        SYID L+SQL+LRFDMTDLG +++FLGL+I  S +GI V Q+KY +D+L+RF M +AKPC TPI++ +  + S  CS +D   YR+++GAL Y TFSRP+
Subjt:  SYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFSRPD

Query:  IVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR
        I  ++SKLSQ +HSP+  HL AAKRVLRY+ G++  GL+FQ+
Subjt:  IVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR

A0A6J1DGS4 uncharacterized protein LOC1110199453.5e-4155.7Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF
        GN+ SYI  L+S L+ RFDMTDLG + YFLGL+IT +  GI V+QAKY RD+L RF M +AKPC TP+++       +PCS  D   YR++IGA  Y TF
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF

Query:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKVYAPLCITAFS
        SRPDI  S+SKLSQ MH P ++HL AAKR+LRY+ GT+  GL+F+R   A L + AFS
Subjt:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKVYAPLCITAFS

A0A6J1DIP4 uncharacterized protein LOC111020396 isoform X12.0e-81100Show/hide
Query:  GTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQSIGLLPPQQLNYSGFVTCFVILVLLCLSHPRFGV
        GTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQSIGLLPPQQLNYSGFVTCFVILVLLCLSHPRFGV
Subjt:  GTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQSIGLLPPQQLNYSGFVTCFVILVLLCLSHPRFGV

Query:  ITSHYSISSKSGCSMAVRSMWRSIFILSRSGLFVKTYAFNIFLLCSTCRLADKTTYW
        ITSHYSISSKSGCSMAVRSMWRSIFILSRSGLFVKTYAFNIFLLCSTCRLADKTTYW
Subjt:  ITSHYSISSKSGCSMAVRSMWRSIFILSRSGLFVKTYAFNIFLLCSTCRLADKTTYW

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.6e-1130.88Show/hide
Query:  RFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIK---TIFYNSQPCSPSDVHAYRAIIGALQYFTF-SRPDIVLSISKL
        +F MTDL  +K+F+G+ I    + I++SQ+ Y + +LS+F M       TP+  K    +  + + C+       R++IG L Y    +RPD+  +++ L
Subjt:  RFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIK---TIFYNSQPCSPSDVHAYRAIIGALQYFTF-SRPDIVLSISKL

Query:  SQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKV
        S++    +       KRVLRY+ GTI   L+F++ +
Subjt:  SQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKV

P25600 Putative transposon Ty5-1 protein YCL074W5.6e-1234.93Show/hide
Query:  NPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSING-IHVSQAKYARDLLSRFAMAAAKPCHTPI-SIKTIFYNSQPCSPSDVHAYRAIIGALQY-FT
        +P   D +  +L   + M DLG V  FLGL+I +S NG I +S   Y     S   +   K   TP+ + K +F  + P    D+  Y++I+G L +   
Subjt:  NPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSING-IHVSQAKYARDLLSRFAMAAAKPCHTPI-SIKTIFYNSQPCSPSDVHAYRAIIGALQY-FT

Query:  FSRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQ
          RPDI   +S LS+F+  P   HL +A+RVLRY+  T S  L ++
Subjt:  FSRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQ

P92519 Uncharacterized mitochondrial protein AtMg008103.5e-2240.41Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF
        G++ + ++ L+ QL   F M DLG V YFLG+ I    +G+ +SQ KYA  +L+   M   KP  TP+ +K     S    P D   +R+I+GALQY T 
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF

Query:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR
        +RPDI  +++ + Q MH P+L      KRVLRY+ GTI  GL   +
Subjt:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.9e-2641.89Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPI--SIKTIFYNSQPCSPSDVHAYRAIIGALQYF
        GN+P+ +   +  L  RF + D   + YFLG++  R   G+H+SQ +Y  DLL+R  M  AKP  TP+  S K   Y+      +D   YR I+G+LQY 
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPI--SIKTIFYNSQPCSPSDVHAYRAIIGALQYF

Query:  TFSRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR
         F+RPDI  ++++LSQFMH P+  HL A KR+LRY+AGT + G+  ++
Subjt:  TFSRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-2237.67Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF
        GN+   +   +  L  RF + +   + YFLG++  R   G+H+SQ +Y  DLL+R  M  AKP  TP++              D   YR I+G+LQY  F
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF

Query:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR
        +RPD+  ++++LSQ+MH P+  H  A KRVLRY+AGT   G+  ++
Subjt:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-2541.96Show/hide
Query:  NNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFS
        NN + +D L SQL+  F + DLG +KYFLGL+I RS  GI++ Q KYA DLL    +   KP   P+     F         D  AYR +IG L Y   +
Subjt:  NNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFS

Query:  RPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLF
        R DI  +++KLSQF  +P L H  A  ++L YI GT+  GL +
Subjt:  RPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLF

ATMG00810.1 DNA/RNA polymerases superfamily protein2.5e-2340.41Show/hide
Query:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF
        G++ + ++ L+ QL   F M DLG V YFLG+ I    +G+ +SQ KYA  +L+   M   KP  TP+ +K     S    P D   +R+I+GALQY T 
Subjt:  GNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTF

Query:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR
        +RPDI  +++ + Q MH P+L      KRVLRY+ GTI  GL   +
Subjt:  SRPDIVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAATAATCCTTCTTATATTGATGGCCTTGTATCTCAACTACGCTTGCGTTTCGATATGACTGATCTTGGTGGTGTTAAATATTTTCTTGGGTTGGACATC
ACTCGGTCTATCAATGGTATTCATGTTTCACAAGCTAAATATGCTCGGGATCTTCTATCACGATTTGCCATGGCTGCAGCTAAACCTTGTCACACACCAATCTCT
ATCAAGACTATTTTTTATAATTCACAGCCTTGTTCTCCTTCAGATGTTCATGCTTATCGGGCTATTATTGGTGCTTTGCAATATTTTACATTTTCTCGACCAGAT
ATTGTATTATCTATCAGCAAGTTGTCTCAATTTATGCATTCTCCATCATTATTTCACTTGGCTGCTGCGAAACGGGTTTTACGTTATATTGCAGGAACTATTTCT
TCTGGTTTACTTTTTCAACGTAAAGTTTATGCCCCTTTATGTATTACTGCATTTTCTGCTCGAATTGGGCTGGTGACCTGGATGATAGGCGTTCTACCACTCGAT
TTATTGTGTTTCTTGGTCCTAATCCAGTTTCTTGGTCTGCCAAGAAACAAACAACCATTTCCCAAAGTTCTACTGAGGCAGAGTATCGGGCTCTTGCCTCCACAG
CAGCTGAATTATTCTGGATTCGTCACTTGCTTCGTGATCTTGGTCTTGCTTTGTCTCAGCCACCCAAGATTTGGTGTGATAACCAGTCACTATTCAATTAGTTCG
AAATCCGGTTGTTCCATGGCCGTACGAAGCATGTGGAGATCGATTTTCATTTTATCCAGGAGCGGGTTATTCGTAAAGACATATGCCTTCAATATATTTCTACTT
TGCTCAACTTGCAGACTTGCTGACAAAACCACTTACTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCAATAATCCTTCTTATATTGATGGCCTTGTATCTCAACTACGCTTGCGTTTCGATATGACTGATCTTGGTGGTGTTAAATATTTTCTTGGGTTGGACATC
ACTCGGTCTATCAATGGTATTCATGTTTCACAAGCTAAATATGCTCGGGATCTTCTATCACGATTTGCCATGGCTGCAGCTAAACCTTGTCACACACCAATCTCT
ATCAAGACTATTTTTTATAATTCACAGCCTTGTTCTCCTTCAGATGTTCATGCTTATCGGGCTATTATTGGTGCTTTGCAATATTTTACATTTTCTCGACCAGAT
ATTGTATTATCTATCAGCAAGTTGTCTCAATTTATGCATTCTCCATCATTATTTCACTTGGCTGCTGCGAAACGGGTTTTACGTTATATTGCAGGAACTATTTCT
TCTGGTTTACTTTTTCAACGTAAAGTTTATGCCCCTTTATGTATTACTGCATTTTCTGCTCGAATTGGGCTGGTGACCTGGATGATAGGCGTTCTACCACTCGAT
TTATTGTGTTTCTTGGTCCTAATCCAGTTTCTTGGTCTGCCAAGAAACAAACAACCATTTCCCAAAGTTCTACTGAGGCAGAGTATCGGGCTCTTGCCTCCACAG
CAGCTGAATTATTCTGGATTCGTCACTTGCTTCGTGATCTTGGTCTTGCTTTGTCTCAGCCACCCAAGATTTGGTGTGATAACCAGTCACTATTCAATTAGTTCG
AAATCCGGTTGTTCCATGGCCGTACGAAGCATGTGGAGATCGATTTTCATTTTATCCAGGAGCGGGTTATTCGTAAAGACATATGCCTTCAATATATTTCTACTT
TGCTCAACTTGCAGACTTGCTGACAAAACCACTTACTGGTGA
Protein sequenceShow/hide protein sequence
MGNNPSYIDGLVSQLRLRFDMTDLGGVKYFLGLDITRSINGIHVSQAKYARDLLSRFAMAAAKPCHTPISIKTIFYNSQPCSPSDVHAYRAIIGALQYFTFSRPD
IVLSISKLSQFMHSPSLFHLAAAKRVLRYIAGTISSGLLFQRKVYAPLCITAFSARIGLVTWMIGVLPLDLLCFLVLIQFLGLPRNKQPFPKVLLRQSIGLLPPQ
QLNYSGFVTCFVILVLLCLSHPRFGVITSHYSISSKSGCSMAVRSMWRSIFILSRSGLFVKTYAFNIFLLCSTCRLADKTTYW