CuGenDBv2

Moc04g12190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g12190
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Transposon Ty3-G Gag-Pol polyprotein
Genome location: chr4:9216042..9221134
RNA-Seq Expression: Moc04g12190
Synteny: Moc04g12190
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA
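
The genome location above is written as chromosome:start..end. A minimal sketch of reading that string and computing the span follows. It assumes the coordinates are 1-based and inclusive, which this page does not state. `parse_location` and `span_length` are hypothetical helper names for illustration, not part of CuGenDB.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end = parse_location("chr4:9216042..9221134")
print(chrom, start, end, span_length(start, end))
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention is worth confirming against the source annotation before relying on the length.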


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138327.1 uncharacterized protein LOC111009540 isoform X1 [Momordica charantia]  e value: 6.6e-70  % identity: 49.45
Query:  MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYL-------------------GNRDGQC
        MAGK+   + KG A D S  Q LSPKS   RLLL+EDSLGEVRS+VQ IH L ENL + LE I  EQERL+  L                    N DG+ 
Subjt:  MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYL-------------------GNRDGQC

Query:  E-----------------DKDFLWGNPRFDRFD-YQNGGGVRTDFRMKIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKK----------------
                          D+ FL  +PRF R + Y   GGVRTDF+MKIDLPTFNGKMDVE FLD VKNVENFFDYTNTPEDKK                
Subjt:  E-----------------DKDFLWGNPRFDRFD-YQNGGGVRTDFRMKIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKK----------------

Query:  ----------------------------------------QYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLT
                                                 YQRCRQG K IAD TE FHRLGA+ N+ + EDYKI R++DGLREDIQDQM IQPI LLT
Subjt:  ----------------------------------------QYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLT

Query:  DAITMATKIEDKIDRKRLKNPIRRTPWDKSVTAKFFTFDSGK---VGAASTSTAPKPTDDIAKPPP
        DAI MATKIEDK   KRL+ P RRTPWDK   +K  T D+GK   +G  S ST  KP DD AK  P
Subjt:  DAITMATKIEDKIDRKRLKNPIRRTPWDKSVTAKFFTFDSGK---VGAASTSTAPKPTDDIAKPPP

XP_022138328.1 uncharacterized protein LOC111009540 isoform X2 [Momordica charantia]  e value: 6.6e-70  % identity: 49.45
Query:  MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYL-------------------GNRDGQC
        MAGK+   + KG A D S  Q LSPKS   RLLL+EDSLGEVRS+VQ IH L ENL + LE I  EQERL+  L                    N DG+ 
Subjt:  MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYL-------------------GNRDGQC

Query:  E-----------------DKDFLWGNPRFDRFD-YQNGGGVRTDFRMKIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKK----------------
                          D+ FL  +PRF R + Y   GGVRTDF+MKIDLPTFNGKMDVE FLD VKNVENFFDYTNTPEDKK                
Subjt:  E-----------------DKDFLWGNPRFDRFD-YQNGGGVRTDFRMKIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKK----------------

Query:  ----------------------------------------QYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLT
                                                 YQRCRQG K IAD TE FHRLGA+ N+ + EDYKI R++DGLREDIQDQM IQPI LLT
Subjt:  ----------------------------------------QYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLT

Query:  DAITMATKIEDKIDRKRLKNPIRRTPWDKSVTAKFFTFDSGK---VGAASTSTAPKPTDDIAKPPP
        DAI MATKIEDK   KRL+ P RRTPWDK   +K  T D+GK   +G  S ST  KP DD AK  P
Subjt:  DAITMATKIEDKIDRKRLKNPIRRTPWDKSVTAKFFTFDSGK---VGAASTSTAPKPTDDIAKPPP

XP_024021836.1 uncharacterized protein LOC112091747 [Morus notabilis]  e value: 2.6e-34  % identity: 53.33
Query:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED
        L   TRT+IDA++ GA M KTE  AY+LLE+M  NNYQW SERS +K++ G+++VD I  LT Q+ASL+KQLQS+QL  NAIQ     CE+C G+H+S +
Subjt:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED

Query:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGWVPGRRNLT-KWQN
        CQ  NPF Q Q E  Q+VGN+ RQ NNP    +N GW   R +L   W+N
Subjt:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGWVPGRRNLT-KWQN

XP_024031895.1 uncharacterized protein LOC112094656 [Morus notabilis]  e value: 4.8e-36  % identity: 55.03
Query:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED
        L   TRT+IDA++ GA M KTE  AY+LLE+M  NNYQW SERS  K++ G+++VD I  LT Q+AS +KQLQS+QL  NAIQ     CE+C  +H+S +
Subjt:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED

Query:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGWVPGRRNLTKWQN
        CQ  NPF Q Q EQ Q+VGN+ RQ NNPYSN +N GW     NL+ W+N
Subjt:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGWVPGRRNLTKWQN

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]  e value: 3.0e-38  % identity: 60.58
Query:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED
        L+  TRT+IDAAS GAFM K++  AY+LLEEM +NNYQW +ERS QK+++GV+++D I ALT QV SLT+QL+++QL+ NAI    T C++C+GNH SE+
Subjt:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED

Query:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGW
        CQVGNPF Q +H    FV N++RQ NNPYS TYNPGW
Subjt:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGW

TrEMBL top hits (e value, % identity, alignment)
A0A3S3N117 Retrotrans_gag domain-containing protein  e value: 4.5e-32  % identity: 55.47
Query:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED
        L + TRT IDAA+ G  M K+   AY+L+EEM  NNYQW S+   QK+  GV+++D I+ALT QVA+L+KQ+QS  + V+A+Q     CE+C GNH   D
Subjt:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED

Query:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGW
        CQVGNPF     EQV +V N++RQ NNPYSNTYNPGW
Subjt:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGW

A0A6J0ZYV0 uncharacterized protein LOC110413413  e value: 8.8e-28  % identity: 52.27
Query:  RTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSEDCQVGN
        +TIIDAA+ GA M K    AY+LLEEM  NNYQW SERS  ++++G  ++D +  LTTQVA+L+K+L +  L V+A+Q     CE C  +H  + C   +
Subjt:  RTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSEDCQVGN

Query:  PFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGW
               E VQFVGNFNRQ NNPYSNTYNPGW
Subjt:  PFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGW

A0A6J1CAS9 uncharacterized protein LOC111009540 isoform X1  e value: 3.2e-70  % identity: 49.45
Query:  MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYL-------------------GNRDGQC
        MAGK+   + KG A D S  Q LSPKS   RLLL+EDSLGEVRS+VQ IH L ENL + LE I  EQERL+  L                    N DG+ 
Subjt:  MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYL-------------------GNRDGQC

Query:  E-----------------DKDFLWGNPRFDRFD-YQNGGGVRTDFRMKIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKK----------------
                          D+ FL  +PRF R + Y   GGVRTDF+MKIDLPTFNGKMDVE FLD VKNVENFFDYTNTPEDKK                
Subjt:  E-----------------DKDFLWGNPRFDRFD-YQNGGGVRTDFRMKIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKK----------------

Query:  ----------------------------------------QYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLT
                                                 YQRCRQG K IAD TE FHRLGA+ N+ + EDYKI R++DGLREDIQDQM IQPI LLT
Subjt:  ----------------------------------------QYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLT

Query:  DAITMATKIEDKIDRKRLKNPIRRTPWDKSVTAKFFTFDSGK---VGAASTSTAPKPTDDIAKPPP
        DAI MATKIEDK   KRL+ P RRTPWDK   +K  T D+GK   +G  S ST  KP DD AK  P
Subjt:  DAITMATKIEDKIDRKRLKNPIRRTPWDKSVTAKFFTFDSGK---VGAASTSTAPKPTDDIAKPPP

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X2  e value: 3.2e-70  % identity: 49.45
Query:  MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYL-------------------GNRDGQC
        MAGK+   + KG A D S  Q LSPKS   RLLL+EDSLGEVRS+VQ IH L ENL + LE I  EQERL+  L                    N DG+ 
Subjt:  MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYL-------------------GNRDGQC

Query:  E-----------------DKDFLWGNPRFDRFD-YQNGGGVRTDFRMKIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKK----------------
                          D+ FL  +PRF R + Y   GGVRTDF+MKIDLPTFNGKMDVE FLD VKNVENFFDYTNTPEDKK                
Subjt:  E-----------------DKDFLWGNPRFDRFD-YQNGGGVRTDFRMKIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKK----------------

Query:  ----------------------------------------QYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLT
                                                 YQRCRQG K IAD TE FHRLGA+ N+ + EDYKI R++DGLREDIQDQM IQPI LLT
Subjt:  ----------------------------------------QYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLT

Query:  DAITMATKIEDKIDRKRLKNPIRRTPWDKSVTAKFFTFDSGK---VGAASTSTAPKPTDDIAKPPP
        DAI MATKIEDK   KRL+ P RRTPWDK   +K  T D+GK   +G  S ST  KP DD AK  P
Subjt:  DAITMATKIEDKIDRKRLKNPIRRTPWDKSVTAKFFTFDSGK---VGAASTSTAPKPTDDIAKPPP

A0A6P5S0R1 Reverse transcriptase  e value: 2.9e-31  % identity: 56.93
Query:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED
        LD  ++T+IDAA++GA M KT+  AY+LLE M  N+YQW SER+  K++ GV+DV+ I ALT Q+++L+KQL S  L VNAIQ P   CE+C  +H S D
Subjt:  LDNGTRTIIDAASQGAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSED

Query:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGW
        C  GNPF     EQV  VG+FNRQ NNPYSNTYNP W
Subjt:  CQVGNPFMQGQHEQVQFVGNFNRQPNNPYSNTYNPGW

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCCGGGAAGAATGCAGCGACTTCCGGCAAAGGACTCGCAGCGGATCCTTCGGCAAGTCAACCTCTGTCACCGAAATCTGCCATCGAACGTTTACTGTTGGTGGAAGA
TTCATTAGGAGAGGTTCGTTCCAGTGTACAGACAATCCACGGACTTAAGGAAAATCTTTCCAAACATCTGGAGAATATAGCAATTGAGCAAGAGCGGCTGCGTGATTATC
TAGGCAACCGCGACGGCCAATGTGAAGACAAAGACTTTCTTTGGGGAAATCCTCGTTTTGATCGTTTTGATTATCAAAATGGGGGAGGTGTTCGAACTGATTTTCGAATG
AAGATTGACCTACCAACTTTCAATGGGAAGATGGACGTTGAGGGATTCTTGGATTGGGTCAAGAATGTAGAAAATTTCTTTGATTACACTAATACCCCAGAAGATAAAAA
GCAGTATCAGCGCTGTCGACAAGGGGCAAAGAACATAGCCGATTGTACTGAGGAATTCCATCGACTAGGAGCACGAAACAACTTAACAAAGATAGAAGATTATAAGATTA
CCCGATATATTGATGGTCTTCGTGAGGATATACAAGATCAAATGTATATTCAGCCAATTAGGCTCCTGACCGATGCTATAACCATGGCTACAAAGATCGAAGACAAGATT
GATAGGAAACGCCTCAAGAATCCTATTCGACGTACGCCTTGGGATAAATCTGTAACTGCTAAATTCTTTACCTTTGATTCAGGGAAGGTTGGGGCCGCTTCTACATCAAC
GGCACCCAAACCCACTGATGATATTGCTAAACCTCCTCCTAAGCCTGCGGAGGCCGAAGAAGGTGCCTACAATCTTGATGAAGATGTTCTTGCCGACGACGACGACACTG
CCTATATAGAGCCCGATGAAGGACAAGTGACCGCTTTCGAACATCTTCCTATAACATATGACAATGACAGTGATTTCCATACCATTTGGCAACAATGCAACCAACATGTT
AATTGCAATGACTTTCATATTCTTGATGGCTATCTGTGTAAAGGAGACCGACTGTGCATTCCACATACGTCATTAAGGGAATCCTTAATTCGGGATATGCACAGTGGCGG
ACTTGAGTTGATGAAGAGGAGAAAAAGAGGAAATAAAGAAGAAAAATCGAACCAAATTCCAGTGTGTTGGACGCCTAGGCGCCGAAAATTGACAAAATGGCAGAATGTTG
GGCGCCAAGGCGCTACGAAGTTGGAAGCGGAGTTCGTGTTTGGGCGCCTAGGCCCCAAAATTGGGCGCTTAGATAATGGAACAAGAACTATAATAGATGCAGCATCACAA
GGGGCCTTCATGGGAAAAACTGAAAGTGGAGCATACGATTTGTTGGAAGAAATGACATTGAACAACTACCAGTGGCATAGTGAGAGGTCAGCTCAGAAAAGGTCGATGGG
AGTAAATGATGTGGATGTTATCGCTGCATTGACCACGCAGGTTGCTTCCCTTACCAAGCAACTTCAATCAAGTCAGCTTGCGGTAAATGCTATACAAATACCACCTACAT
TTTGTGAATATTGTTATGGTAACCATCGTAGTGAAGATTGTCAAGTGGGGAACCCATTTATGCAAGGCCAACATGAGCAAGTTCAGTTTGTTGGGAATTTTAATCGCCAG
CCAAATAACCCCTATTCCAACACCTATAATCCAGGTTGGGTGCCTGGGCGTCGAAATTTGACAAAATGGCAGAATGTTGGGTGCCTAGGCGCTTAA
mRNA sequence
ATGGCCGGGAAGAATGCAGCGACTTCCGGCAAAGGACTCGCAGCGGATCCTTCGGCAAGTCAACCTCTGTCACCGAAATCTGCCATCGAACGTTTACTGTTGGTGGAAGA
TTCATTAGGAGAGGTTCGTTCCAGTGTACAGACAATCCACGGACTTAAGGAAAATCTTTCCAAACATCTGGAGAATATAGCAATTGAGCAAGAGCGGCTGCGTGATTATC
TAGGCAACCGCGACGGCCAATGTGAAGACAAAGACTTTCTTTGGGGAAATCCTCGTTTTGATCGTTTTGATTATCAAAATGGGGGAGGTGTTCGAACTGATTTTCGAATG
AAGATTGACCTACCAACTTTCAATGGGAAGATGGACGTTGAGGGATTCTTGGATTGGGTCAAGAATGTAGAAAATTTCTTTGATTACACTAATACCCCAGAAGATAAAAA
GCAGTATCAGCGCTGTCGACAAGGGGCAAAGAACATAGCCGATTGTACTGAGGAATTCCATCGACTAGGAGCACGAAACAACTTAACAAAGATAGAAGATTATAAGATTA
CCCGATATATTGATGGTCTTCGTGAGGATATACAAGATCAAATGTATATTCAGCCAATTAGGCTCCTGACCGATGCTATAACCATGGCTACAAAGATCGAAGACAAGATT
GATAGGAAACGCCTCAAGAATCCTATTCGACGTACGCCTTGGGATAAATCTGTAACTGCTAAATTCTTTACCTTTGATTCAGGGAAGGTTGGGGCCGCTTCTACATCAAC
GGCACCCAAACCCACTGATGATATTGCTAAACCTCCTCCTAAGCCTGCGGAGGCCGAAGAAGGTGCCTACAATCTTGATGAAGATGTTCTTGCCGACGACGACGACACTG
CCTATATAGAGCCCGATGAAGGACAAGTGACCGCTTTCGAACATCTTCCTATAACATATGACAATGACAGTGATTTCCATACCATTTGGCAACAATGCAACCAACATGTT
AATTGCAATGACTTTCATATTCTTGATGGCTATCTGTGTAAAGGAGACCGACTGTGCATTCCACATACGTCATTAAGGGAATCCTTAATTCGGGATATGCACAGTGGCGG
ACTTGAGTTGATGAAGAGGAGAAAAAGAGGAAATAAAGAAGAAAAATCGAACCAAATTCCAGTGTGTTGGACGCCTAGGCGCCGAAAATTGACAAAATGGCAGAATGTTG
GGCGCCAAGGCGCTACGAAGTTGGAAGCGGAGTTCGTGTTTGGGCGCCTAGGCCCCAAAATTGGGCGCTTAGATAATGGAACAAGAACTATAATAGATGCAGCATCACAA
GGGGCCTTCATGGGAAAAACTGAAAGTGGAGCATACGATTTGTTGGAAGAAATGACATTGAACAACTACCAGTGGCATAGTGAGAGGTCAGCTCAGAAAAGGTCGATGGG
AGTAAATGATGTGGATGTTATCGCTGCATTGACCACGCAGGTTGCTTCCCTTACCAAGCAACTTCAATCAAGTCAGCTTGCGGTAAATGCTATACAAATACCACCTACAT
TTTGTGAATATTGTTATGGTAACCATCGTAGTGAAGATTGTCAAGTGGGGAACCCATTTATGCAAGGCCAACATGAGCAAGTTCAGTTTGTTGGGAATTTTAATCGCCAG
CCAAATAACCCCTATTCCAACACCTATAATCCAGGTTGGGTGCCTGGGCGTCGAAATTTGACAAAATGGCAGAATGTTGGGTGCCTAGGCGCTTAA
Protein sequence
MAGKNAATSGKGLAADPSASQPLSPKSAIERLLLVEDSLGEVRSSVQTIHGLKENLSKHLENIAIEQERLRDYLGNRDGQCEDKDFLWGNPRFDRFDYQNGGGVRTDFRM
KIDLPTFNGKMDVEGFLDWVKNVENFFDYTNTPEDKKQYQRCRQGAKNIADCTEEFHRLGARNNLTKIEDYKITRYIDGLREDIQDQMYIQPIRLLTDAITMATKIEDKI
DRKRLKNPIRRTPWDKSVTAKFFTFDSGKVGAASTSTAPKPTDDIAKPPPKPAEAEEGAYNLDEDVLADDDDTAYIEPDEGQVTAFEHLPITYDNDSDFHTIWQQCNQHV
NCNDFHILDGYLCKGDRLCIPHTSLRESLIRDMHSGGLELMKRRKRGNKEEKSNQIPVCWTPRRRKLTKWQNVGRQGATKLEAEFVFGRLGPKIGRLDNGTRTIIDAASQ
GAFMGKTESGAYDLLEEMTLNNYQWHSERSAQKRSMGVNDVDVIAALTTQVASLTKQLQSSQLAVNAIQIPPTFCEYCYGNHRSEDCQVGNPFMQGQHEQVQFVGNFNRQ
PNNPYSNTYNPGWVPGRRNLTKWQNVGCLGA