CuGenDBv2

Moc02g19660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g19660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Transposon protein, putative, CACTA, En/Spm sub-class
Genome location: chr2:14598565..14601768
RNA-Seq Expression: Moc02g19660
Synteny: Moc02g19660
Gene Ontology terms: NA
InterPro domains: IPR025312 - Domain of unknown function DUF4216
                  IPR025452 - Domain of unknown function DUF4218
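
As a small illustration, the genome location string above can be split into its parts to get the length of the annotated locus. This is only a sketch: the helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr2:14598565..14601768")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr2 14598565 14601768 3204
```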


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032028.1 hypothetical protein E6C27_scaffold134G001270 [Cucumis melo var. makuwa] (e value: 2.1e-63; % identity: 50)
Query:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLNDSKA----HLPMRSEVSSNSNAVVHELNTLACGLDIRI
        MYPIERSL+ LKQYV+N ARP GSI EA   NE+L FCSMYL  IET+FNR  RN+D +D  D +      L  +++       +  +L  LACG +   
Subjt:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLNDSKA----HLPMRSEVSSNSNAVVHELNTLACGLDIRI

Query:  RSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFIL
        RSY+GC+ NG +F+T +RD R TT+NSGV V  G       FYG + E+I L YI  K V+L +C+WYDT+ RKNRI     FTSI+TR+ WY+DEPFIL
Subjt:  RSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFIL

Query:  VFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIK----EVDICEGVTNQCEVEEVDLKAHTFHRVDIDP
        V QA QVFYV+D KLG  WK+VQ+I+ RH+WD+ EI+    EV   E ++      E +L + TF+R D++P
Subjt:  VFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIK----EVDICEGVTNQCEVEEVDLKAHTFHRVDIDP

RVW66346.1 hypothetical protein CK203_065230 [Vitis vinifera] (e value: 4.9e-60; % identity: 43)
Query:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLN---------------DSKAHLPMRSE------------
        MYP ER+L TLK+YV+NKARP GSI EA I NEAL FCSMYL GIET+FNR  RN+D  +                  S+ HL    E            
Subjt:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLN---------------DSKAHLPMRSE------------

Query:  -----------------VSSN-------------------------SNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGH
                          SS+                         S     EL +LACG D R+ +Y GC++NG RF+T +RDDR  TQNSG+CV G H
Subjt:  -----------------VSSN-------------------------SNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGH

Query:  NTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE
        + +  +FYGV+  V+ L+Y+   +V+L +C W+DTN +K RI+ D++FT+I     WY ++PFIL  QAQQVFY++D K G  WKVVQK+ HRH+WDVPE
Subjt:  NTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE

RVX21623.1 hypothetical protein CK203_002253 [Vitis vinifera] (e value: 8.9e-62; % identity: 44.18)
Query:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLD----------------------------------------
        MYP ER+L TLK+YV+NKARP GSIVEA I NEAL FCSMYL GIET+FNR  RN+D  +                                        
Subjt:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLD----------------------------------------

Query:  -------LNDSKAHLPMRSEVS--------------SNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFY
               L  S  +L  R E                  S     EL +LACG D R+ +Y GC++NG RF+T +RDDR  TQNSG+CV G H+ +  +FY
Subjt:  -------LNDSKAHLPMRSEVS--------------SNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFY

Query:  GVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE
        GV+  V+ L+Y+   +V+L +C W+DTN +K RI+ D++FT+I     WY ++PFIL  QAQQVFY++D K G  WKVVQK+ HRH+WDVPE
Subjt:  GVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE

XP_022148890.1 uncharacterized protein LOC111017452 [Momordica charantia] (e value: 2.1e-71; % identity: 60.81)
Query:  SEVSSNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKN
        S+V S SN VV+EL  LACG D R+ SY  CV NG RFNT ERDDR TTQNSGVCV GG + + S+FYG+I+EVIEL YI  KRVLL RCDWYDTN +KN
Subjt:  SEVSSNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKN

Query:  RIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIKEVDICEGVTNQCEVEEVDLKAHTFHRVDIDP--INKFE
         +RQ+++FTSI+T HLWY+D+PFILV QAQQVFYV DL+LG+GWKV QKI+HRHLWDVPE++E+D+ E   NQC V+EV+L+  TFHR DIDP  ++   
Subjt:  RIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIKEVDICEGVTNQCEVEEVDLKAHTFHRVDIDP--INKFE

Query:  VDLHVKHHYDYIKYEIGTRYKD
          + +  ++D ++ EI    +D
Subjt:  VDLHVKHHYDYIKYEIGTRYKD

XP_026429044.1 uncharacterized protein LOC113325016 [Papaver somniferum] (e value: 5.2e-62; % identity: 48.99)
Query:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDD-MLDLNDSKAHLPMRSEVSSNSNA-----------VVHELNTL
        MYPIER L TLK+YVKN+ARP GSI EA I  E LTFCSMY  G ETKF R  RNDD   D    K  + + ++ +   +A              E+ TL
Subjt:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDD-MLDLNDSKAHLPMRSEVSSNSNA-----------VVHELNTL

Query:  ACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLW
        A G+D+R+ SY  C +NG ++++ +R+ R TTQNSG+ V G H      FYG +R+VIEL Y +  R++L +CDW+D +  +N+IR+D+  TSI+  +LW
Subjt:  ACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLW

Query:  YQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIKEV
        Y+D+P++L  QAQQVFYV+D K G+ WKVV K+EHRHLWDVPE+ ++
Subjt:  YQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIKEV

TrEMBL top hits (e value, % identity, alignment)
A0A438D9E3 Uncharacterized protein (e value: 2.4e-60; % identity: 43)
Query:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLN---------------DSKAHLPMRSE------------
        MYP ER+L TLK+YV+NKARP GSI EA I NEAL FCSMYL GIET+FNR  RN+D  +                  S+ HL    E            
Subjt:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLN---------------DSKAHLPMRSE------------

Query:  -----------------VSSN-------------------------SNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGH
                          SS+                         S     EL +LACG D R+ +Y GC++NG RF+T +RDDR  TQNSG+CV G H
Subjt:  -----------------VSSN-------------------------SNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGH

Query:  NTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE
        + +  +FYGV+  V+ L+Y+   +V+L +C W+DTN +K RI+ D++FT+I     WY ++PFIL  QAQQVFY++D K G  WKVVQK+ HRH+WDVPE
Subjt:  NTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE

A0A438G2C1 Uncharacterized protein (e value: 2.4e-60; % identity: 43)
Query:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLN---------------DSKAHLPMRSE------------
        MYP ER+L TLK+YV+NKARP GSI EA I NEAL FCSMYL GIET+FNR  RN+D  +                  S+ HL    E            
Subjt:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLN---------------DSKAHLPMRSE------------

Query:  -----------------VSSN-------------------------SNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGH
                          SS+                         S     EL +LACG D R+ +Y GC++NG RF+T +RDDR  TQNSG+CV G H
Subjt:  -----------------VSSN-------------------------SNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGH

Query:  NTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE
        + +  +FYGV+  V+ L+Y+   +V+L +C W+DTN +K RI+ D++FT+I     WY ++PFIL  QAQQVFY++D K G  WKVVQK+ HRH+WDVPE
Subjt:  NTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE

A0A438KKA2 Uncharacterized protein (e value: 4.3e-62; % identity: 44.18)
Query:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLD----------------------------------------
        MYP ER+L TLK+YV+NKARP GSIVEA I NEAL FCSMYL GIET+FNR  RN+D  +                                        
Subjt:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLD----------------------------------------

Query:  -------LNDSKAHLPMRSEVS--------------SNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFY
               L  S  +L  R E                  S     EL +LACG D R+ +Y GC++NG RF+T +RDDR  TQNSG+CV G H+ +  +FY
Subjt:  -------LNDSKAHLPMRSEVS--------------SNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFY

Query:  GVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE
        GV+  V+ L+Y+   +V+L +C W+DTN +K RI+ D++FT+I     WY ++PFIL  QAQQVFY++D K G  WKVVQK+ HRH+WDVPE
Subjt:  GVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPE

A0A5D3D2R6 Uncharacterized protein (e value: 1.0e-63; % identity: 50)
Query:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLNDSKA----HLPMRSEVSSNSNAVVHELNTLACGLDIRI
        MYPIERSL+ LKQYV+N ARP GSI EA   NE+L FCSMYL  IET+FNR  RN+D +D  D +      L  +++       +  +L  LACG +   
Subjt:  MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLNDSKA----HLPMRSEVSSNSNAVVHELNTLACGLDIRI

Query:  RSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFIL
        RSY+GC+ NG +F+T +RD R TT+NSGV V  G       FYG + E+I L YI  K V+L +C+WYDT+ RKNRI     FTSI+TR+ WY+DEPFIL
Subjt:  RSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFIL

Query:  VFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIK----EVDICEGVTNQCEVEEVDLKAHTFHRVDIDP
        V QA QVFYV+D KLG  WK+VQ+I+ RH+WD+ EI+    EV   E ++      E +L + TF+R D++P
Subjt:  VFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIK----EVDICEGVTNQCEVEEVDLKAHTFHRVDIDP

A0A6J1D6Q8 uncharacterized protein LOC111017452 (e value: 1.0e-71; % identity: 60.81)
Query:  SEVSSNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKN
        S+V S SN VV+EL  LACG D R+ SY  CV NG RFNT ERDDR TTQNSGVCV GG + + S+FYG+I+EVIEL YI  KRVLL RCDWYDTN +KN
Subjt:  SEVSSNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFNTIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKN

Query:  RIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIKEVDICEGVTNQCEVEEVDLKAHTFHRVDIDP--INKFE
         +RQ+++FTSI+T HLWY+D+PFILV QAQQVFYV DL+LG+GWKV QKI+HRHLWDVPE++E+D+ E   NQC V+EV+L+  TFHR DIDP  ++   
Subjt:  RIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQKIEHRHLWDVPEIKEVDICEGVTNQCEVEEVDLKAHTFHRVDIDP--INKFE

Query:  VDLHVKHHYDYIKYEIGTRYKD
          + +  ++D ++ EI    +D
Subjt:  VDLHVKHHYDYIKYEIGTRYKD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTATCCGATTGAGAGGAGTTTGCAGACATTGAAACAATATGTAAAAAATAAAGCTCGGCCCGGGGGTTCTATAGTAGAAGCGGTCATTGCAAATGAGGCATTGACGTT
TTGCTCAATGTACCTCGATGGAATTGAAACAAAATTCAATAGGCTAATTCGAAATGATGACATGTTGGACCTGAATGACTCAAAGGCACATCTTCCTATGAGATCGGAAG
TTTCTTCAAATTCGAACGCTGTTGTGCATGAGTTGAATACACTAGCATGTGGTCTTGATATTCGGATTCGTTCATATAATGGTTGTGTTATAAATGGATTTCGATTTAAC
ACAATTGAGCGAGATGATCGGTGTACAACTCAAAATAGTGGAGTATGTGTTATGGGTGGACATAATACTAAGTTATCTAATTTCTACGGTGTCATTAGAGAAGTTATCGA
ATTGAGCTACATTAATAAAAAACGAGTTCTTCTATTAAGGTGTGATTGGTATGATACGAATTCTAGGAAGAATCGTATTCGTCAGGACCATAGTTTTACGAGCATTGACA
CACGTCATTTGTGGTACCAAGACGAACCATTCATACTTGTATTTCAGGCACAACAAGTATTCTATGTTGAAGATCTCAAATTAGGTTCTGGTTGGAAAGTAGTTCAAAAA
ATTGAACATAGACATTTATGGGATGTGCCTGAAATAAAAGAAGTTGATATATGTGAAGGAGTTACAAATCAATGTGAAGTTGAAGAAGTGGATCTTAAAGCACATACGTT
TCATAGAGTAGATATAGATCCAATTAATAAATTTGAGGTGGACCTTCATGTTAAACATCATTATGATTATATCAAGTACGAGATCGGAACCCGCTACAAAGATTATCGAC
ATCGCTTACATAGGTACTATCGTGATTTCGAAGATGCTAAAACGGCTCGACAACGACCCTATGGACAAATTACATCAGAAGTTTGGAATATGCTGTGTGATAGATGGGAA
ACTCGTGAATGGATGGAAAAAATGGTGGAGTTGGTAGAAAAAGCGAATGAAGAAGATAAAGAATTAAGTGAGCAAGATGCGATGGAAATAGTGCTTGGAAAGCAATCATC
GTATACGAAAGGGATGGGTTACGGTCCAAAGCCACCAAGTCAGAAACAGGCAGGAGGATACTCTCAGGAATATGTTCATGCCTTGGAGGATAAACTGGCAAAAAATGAAG
AGTTATTGCAAACTCAACGCCAGGAAACCCAGAGGTTGTTTGAAATGCAACGTCAAGAGTATGAAAGAAAGTTTGACAGTATAGAAGAACTTTTTCGAAGATTTACTGAA
GGAGGAGGAAGTTCATCGTTGAACAAGGGTGGAGTATGCATTGATGTTGGAGATTGA
mRNA sequence
ATGTATCCGATTGAGAGGAGTTTGCAGACATTGAAACAATATGTAAAAAATAAAGCTCGGCCCGGGGGTTCTATAGTAGAAGCGGTCATTGCAAATGAGGCATTGACGTT
TTGCTCAATGTACCTCGATGGAATTGAAACAAAATTCAATAGGCTAATTCGAAATGATGACATGTTGGACCTGAATGACTCAAAGGCACATCTTCCTATGAGATCGGAAG
TTTCTTCAAATTCGAACGCTGTTGTGCATGAGTTGAATACACTAGCATGTGGTCTTGATATTCGGATTCGTTCATATAATGGTTGTGTTATAAATGGATTTCGATTTAAC
ACAATTGAGCGAGATGATCGGTGTACAACTCAAAATAGTGGAGTATGTGTTATGGGTGGACATAATACTAAGTTATCTAATTTCTACGGTGTCATTAGAGAAGTTATCGA
ATTGAGCTACATTAATAAAAAACGAGTTCTTCTATTAAGGTGTGATTGGTATGATACGAATTCTAGGAAGAATCGTATTCGTCAGGACCATAGTTTTACGAGCATTGACA
CACGTCATTTGTGGTACCAAGACGAACCATTCATACTTGTATTTCAGGCACAACAAGTATTCTATGTTGAAGATCTCAAATTAGGTTCTGGTTGGAAAGTAGTTCAAAAA
ATTGAACATAGACATTTATGGGATGTGCCTGAAATAAAAGAAGTTGATATATGTGAAGGAGTTACAAATCAATGTGAAGTTGAAGAAGTGGATCTTAAAGCACATACGTT
TCATAGAGTAGATATAGATCCAATTAATAAATTTGAGGTGGACCTTCATGTTAAACATCATTATGATTATATCAAGTACGAGATCGGAACCCGCTACAAAGATTATCGAC
ATCGCTTACATAGGTACTATCGTGATTTCGAAGATGCTAAAACGGCTCGACAACGACCCTATGGACAAATTACATCAGAAGTTTGGAATATGCTGTGTGATAGATGGGAA
ACTCGTGAATGGATGGAAAAAATGGTGGAGTTGGTAGAAAAAGCGAATGAAGAAGATAAAGAATTAAGTGAGCAAGATGCGATGGAAATAGTGCTTGGAAAGCAATCATC
GTATACGAAAGGGATGGGTTACGGTCCAAAGCCACCAAGTCAGAAACAGGCAGGAGGATACTCTCAGGAATATGTTCATGCCTTGGAGGATAAACTGGCAAAAAATGAAG
AGTTATTGCAAACTCAACGCCAGGAAACCCAGAGGTTGTTTGAAATGCAACGTCAAGAGTATGAAAGAAAGTTTGACAGTATAGAAGAACTTTTTCGAAGATTTACTGAA
GGAGGAGGAAGTTCATCGTTGAACAAGGGTGGAGTATGCATTGATGTTGGAGATTGA
Protein sequence
MYPIERSLQTLKQYVKNKARPGGSIVEAVIANEALTFCSMYLDGIETKFNRLIRNDDMLDLNDSKAHLPMRSEVSSNSNAVVHELNTLACGLDIRIRSYNGCVINGFRFN
TIERDDRCTTQNSGVCVMGGHNTKLSNFYGVIREVIELSYINKKRVLLLRCDWYDTNSRKNRIRQDHSFTSIDTRHLWYQDEPFILVFQAQQVFYVEDLKLGSGWKVVQK
IEHRHLWDVPEIKEVDICEGVTNQCEVEEVDLKAHTFHRVDIDPINKFEVDLHVKHHYDYIKYEIGTRYKDYRHRLHRYYRDFEDAKTARQRPYGQITSEVWNMLCDRWE
TREWMEKMVELVEKANEEDKELSEQDAMEIVLGKQSSYTKGMGYGPKPPSQKQAGGYSQEYVHALEDKLAKNEELLQTQRQETQRLFEMQRQEYERKFDSIEELFRRFTE
GGGSSSLNKGGVCIDVGD
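
The relationship between the CDS and the protein sequence listed above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. The example input is the first 30 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). Assumption: this table applies to the gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS shown above.
print(translate("ATGTATCCGATTGAGAGGAGTTTGCAGACA"))  # MYPIERSLQT
```

Passing the full CDS, with its line breaks, should reproduce the 458-residue protein sequence, since the 1377-nt CDS ends in a TGA stop codon.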