CuGenDBv2

ChyUNG226480 (gene) of Cucumber (hystrix) v1 genome

Gene ID: ChyUNG226480
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: Reverse transcriptase
Genome location: scaffold25_size3045174:867500..883491
RNA-Seq Expression: ChyUNG226480
Synteny: ChyUNG226480
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037547.1 uncharacterized protein E6C27_scaffold277G001660 [Cucumis melo var. makuwa] (e value: 5.52e-97; % identity: 72.65)
Query:  KKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFE
        ++IGR +R EPSDPEKAY IERLKKLGATVFEGSTDPAD ENWLNMLEKCFDVMN PEERKVRLAT LLQKEAEGW KSIL RRSDAR LDW TFRGIFE
Subjt:  KKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFE

Query:  DKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTPVQPLLNERIFLC*WRLPFVWSRV*H*RNQQWSLVV
        DKYYP TYCEAKRDEF+GLKQG LSVAEY RKYTELS Y DVI+ASESD CR F RGL FEIRTPV  +     F    +L     RV     ++ S  V
Subjt:  DKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTPVQPLLNERIFLC*WRLPFVWSRV*H*RNQQWSLVV

Query:  ELQ*----LVVLEAVSSEGSHLG
        EL      LV LEAVSS GS LG
Subjt:  ELQ*----LVVLEAVSSEGSHLG

KAA0042134.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 3.25e-97; % identity: 68.24)
Query:  LLKSVISG-PGGGVTTWY*SSLLHGN*NRAESC----HYVLADDAGRIRTGCKVLLKVSLKG--------------------I*YPKKIGRPERVEPSDP
        + +S ++G PGGGVTTWY SSLLHGN  RA S       V+    GR R   +  ++   +G                    +    +IGRP+R EPSDP
Subjt:  LLKSVISG-PGGGVTTWY*SSLLHGN*NRAESC----HYVLADDAGRIRTGCKVLLKVSLKG--------------------I*YPKKIGRPERVEPSDP

Query:  EKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRD
        EKAY IERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMN PEERKVRLAT LLQKEAEGW KSIL RRSDAR LDW TFRGIFEDKYYP TYCEAKRD
Subjt:  EKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRD

Query:  EFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTPVQPL
        EFLGLKQG LSVAEY RKYTELSRYADVIVASESD CR F RGL FEIRTPV  +
Subjt:  EFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTPVQPL

KAA0051483.1 uncharacterized protein E6C27_scaffold174G00020 [Cucumis melo var. makuwa] (e value: 1.88e-112; % identity: 64.79)
Query:  NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLA
        N  ESCH+VL DDA RIRTGCKVL          PK+IGRP+R EPSDPEKAY IERLKKLG TVFEGSTDPADAENWLNMLEKCFDVMN PEERKVRLA
Subjt:  NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLA

Query:  TILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTP
        T LLQKEAE W KSIL RR DAR LDW TFRGIFEDKYYP TYCEAKRDEFLGLKQG LSVAEY RKYTELSRYADVIVAS+SD CR F +GL F+IRTP
Subjt:  TILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTP

Query:  VQPLLNERIFLC*WRLPFVWSRV*H*RNQQWSLVVELQ*LVVLEAVSSEGSHL----------G*IFQAVKTLRIVVEAKHRGT
        V  +     F          S++     +    + E +  V L   +S  S            G   +AVK LRI +EAKHRGT
Subjt:  VQPLLNERIFLC*WRLPFVWSRV*H*RNQQWSLVVELQ*LVVLEAVSSEGSHL----------G*IFQAVKTLRIVVEAKHRGT

KAA0066440.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa] (e value: 7.63e-98; % identity: 81.03)
Query:  ESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATIL
        +SCH VLADDAGRIRTGCKVL          PK+IGRPERVE SDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMN PEER VRLA+ L
Subjt:  ESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATIL

Query:  LQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIR
        LQKEAEGW KSIL RRSDA  LDW TFRGIFE+KYYP  YCEAKRDEFLGLKQ  L VAEY RKY ELS YADVIVASESD CR F RGL+FEI+
Subjt:  LQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIR

TYK03091.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 2.46e-96; % identity: 73.54)
Query:  GPGGGVTTWY*SSLLHGN*NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNM
        GPGGGVTTWY S          ESCH+VLADDAGRIRTGC +L          PK+IGRPE+  PSD EK Y IERLKKLGATVFEGSTDPADAE WLNM
Subjt:  GPGGGVTTWY*SSLLHGN*NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNM

Query:  LEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVAS
        LEKCFDVM+ P+ERKV+LAT LLQKEAEGW KSI+ RR+DARTLDW TFRGIFE+KYYP TYCEAKRDEFL LKQ  LSVA+Y RKYTELSRYA++IVAS
Subjt:  LEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVAS

Query:  ESDMCRWF*RGLNFEIRTPVQPL
        ESD C  F RGL FEIRTPV  +
Subjt:  ESDMCRWF*RGLNFEIRTPVQPL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T7M6 Reverse transcriptase (e value: 9.5e-86; % identity: 74.09)
Query:  GPGGGVTTWY*SSLLHGN*NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNM
        GPGGGVTTWY S          ESCH+VLADDAGRIRTGC +L          PK+IGRPE+  PSD EK Y IERLKKLGATVFEGSTDPADAE WLNM
Subjt:  GPGGGVTTWY*SSLLHGN*NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNM

Query:  LEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVAS
        LEKCFDVM+ P+ERKV+LAT LL KEAEGW KSI+ RR+DARTLDW TFRGIFE+KYYP TYCEAKRDEFL LKQ  LSVA+Y RKYTELSRYA++IVAS
Subjt:  LEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVAS

Query:  ESDMCRWF*RGLNFEIRTPV
        ESD C  F RGL FEIRTPV
Subjt:  ESDMCRWF*RGLNFEIRTPV

A0A5A7TFC5 Reverse transcriptase (e value: 8.6e-87; % identity: 69.05)
Query:  LLKSVISG-PGGGVTTWY*SSLLHGN*NRAES----CHYVLADDAGRIRTGCKVLLKVSLKG--------------------I*YPKKIGRPERVEPSDP
        + +S ++G PGGGVTTWY SSLLHGN  RA S       V+    GR R   +  ++   +G                    +    +IGRP+R EPSDP
Subjt:  LLKSVISG-PGGGVTTWY*SSLLHGN*NRAES----CHYVLADDAGRIRTGCKVLLKVSLKG--------------------I*YPKKIGRPERVEPSDP

Query:  EKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRD
        EKAY IERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMN PEERKVRLAT LLQKEAEGW KSIL RRSDAR LDW TFRGIFEDKYYP TYCEAKRD
Subjt:  EKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRD

Query:  EFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTPV
        EFLGLKQG LSVAEY RKYTELSRYADVIVASESD CR F RGL FEIRTPV
Subjt:  EFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTPV

A0A5A7U4R7 Reverse transcriptase (e value: 6.1e-85; % identity: 80.1)
Query:  NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLA
        N  ESCH++L DDA RIRTGCKVL           K+IGRP+R EPSDPEKAY IERLKKLGATVFE STDPADAEN LNMLEKCFD MN P+ERKVRLA
Subjt:  NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLA

Query:  TILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTP
        T LLQKEAEGW KSIL RRSDAR LDW TFRGIFEDKYYP TYCEAKRDEFLGLKQG LSVAEY  KYTELSRY+DVIVASESD CR F RGL FEIRTP
Subjt:  TILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTP

Query:  V
        V
Subjt:  V

A0A5A7U6G1 Retrotrans_gag domain-containing protein (e value: 2.0e-91; % identity: 64.79)
Query:  NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLA
        N  ESCH+VL DDA RIRTGCKVL          PK+IGRP+R EPSDPEKAY IERLKKLG TVFEGSTDPADAENWLNMLEKCFDVMN PEERKVRLA
Subjt:  NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLA

Query:  TILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTP
        T LLQKEAE W KSIL RR DAR LDW TFRGIFEDKYYP TYCEAKRDEFLGLKQG LSVAEY RKYTELSRYADVIVAS+SD CR F +GL F+IRTP
Subjt:  TILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVASESDMCRWF*RGLNFEIRTP

Query:  VQPLLNERIFLC*WRLPFVWSRV*H*RNQQWSLVVELQ*LVVLEAVSSEGSHL----------G*IFQAVKTLRIVVEAKHRGT
        V  +     F          S++     +    + E +  V L   +S  S            G   +AVK LRI +EAKHRGT
Subjt:  VQPLLNERIFLC*WRLPFVWSRV*H*RNQQWSLVVELQ*LVVLEAVSSEGSHL----------G*IFQAVKTLRIVVEAKHRGT

A0A5D3BTP3 Reverse transcriptase (e value: 1.5e-86; % identity: 74.55)
Query:  GPGGGVTTWY*SSLLHGN*NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNM
        GPGGGVTTWY S          ESCH+VLADDAGRIRTGC +L          PK+IGRPE+  PSD EK Y IERLKKLGATVFEGSTDPADAE WLNM
Subjt:  GPGGGVTTWY*SSLLHGN*NRAESCHYVLADDAGRIRTGCKVLLKVSLKGI*YPKKIGRPERVEPSDPEKAYRIERLKKLGATVFEGSTDPADAENWLNM

Query:  LEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVAS
        LEKCFDVM+ P+ERKV+LAT LLQKEAEGW KSI+ RR+DARTLDW TFRGIFE+KYYP TYCEAKRDEFL LKQ  LSVA+Y RKYTELSRYA++IVAS
Subjt:  LEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDW*TFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEY*RKYTELSRYADVIVAS

Query:  ESDMCRWF*RGLNFEIRTPV
        ESD C  F RGL FEIRTPV
Subjt:  ESDMCRWF*RGLNFEIRTPV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ACATACTCTATAGTTTATGTAAGCTTTAGGAGCTGCGTGTCTCCAGATTTTCTCTTGAAATCAGTTATTTCTGGTCCGGGAGGGGGTGTGACAACTTGGTATTAG
AGCAGTTTGCTCCATGGGAATTAAAATAGAGCGGAGTCATGCCACTACGTACTAGCAGACGACGCAGGCAGAATCAGGACGGGATGCAAGGTCCTACTCAAGGTC
AGTTTGAAGGGAATCTAATACCCCAAGAAGATAGGAAGGCCAGAGAGAGTAGAGCCTAGTGATCCAGAAAAGGCATATAGAATTGAACGACTGAAGAAATTAGGG
GCTACAGTATTTGAGGGTTCCACAGATCCAGCTGATGCAGAGAATTGGTTGAATATGCTCGAAAAATGTTTTGACGTGATGAATTATCCTGAGGAGCGAAAGGTT
AGATTGGCCACAATTTTGTTGCAGAAGGAGGCTGAAGGATGGTTGAAATCTATATTAGTTAGGCGTAGTGATGCACGTACTTTAGACTGGTAGACTTTTAGAGGC
ATATTCGAAGATAAGTATTATCCCAGAACATACTGCGAGGCCAAGAGGGATGAGTTTTTGGGGTTGAAACAAGGATTACTTTCAGTGGCTGAGTACTAGAGAAAG
TATACCGAGCTTTCACGGTATGCTGATGTTATTGTGGCATCTGAGAGTGACATGTGTCGATGGTTTTAAAGAGGGTTGAATTTTGAAATACGTACCCCAGTACAA
CCATTGCTAAATGAACGGATTTTTCTCTGTTAGTGGAGACTGCCCTTCGTGTGGAGCAGAGTATAACATTAGAGAAATCAGCAGTGGAGCTTAGTCGTGGAGCTT
CAATAGTTAGTGGTTTTAGAGGCCGTGAGTAGCGAAGGTTCACACCTGGGGTAAATATTTCAAGCCGTCAAGACTTTAAGAATCGTTGTGGAGGCCAAGCATCGA
GGAACATGAGTTATGGTAGTGTTTTTCAGAGACAGAGCCAGAGCATACTGAGTCAATCCACTAAATCAACAGTAAGACCACAGCCAGGTAAAGAGTTCGTTGCTA
GTACCACAGTTGAACGATCGAGAGTTTGGAGAGGAAGAAGAAGAAGCTCAAGAAGTATTGGTTGAAGAACACAAGGACAACACAAACAGAATTAAGAGGGGAGCA
AATTCTGATGTTCATTGGGTGAATTTTGACAAAGAAAACAGGGCAGGTGGTTTTCTCCCGAAATCAGGAGACCTCTGACATTTAAAAAGTCAAAAGTCAACATTT
TGACTTTTTACCATTTTATCCATTATATCTTTTTTTCCTAAAGATCGAGTTCCCGGTGCTTGATAAACTGCTTTAGTCTGCTGAAGCTCCAATATCGATTGCC
mRNA sequence
ACATACTCTATAGTTTATGTAAGCTTTAGGAGCTGCGTGTCTCCAGATTTTCTCTTGAAATCAGTTATTTCTGGTCCGGGAGGGGGTGTGACAACTTGGTATTAG
AGCAGTTTGCTCCATGGGAATTAAAATAGAGCGGAGTCATGCCACTACGTACTAGCAGACGACGCAGGCAGAATCAGGACGGGATGCAAGGTCCTACTCAAGGTC
AGTTTGAAGGGAATCTAATACCCCAAGAAGATAGGAAGGCCAGAGAGAGTAGAGCCTAGTGATCCAGAAAAGGCATATAGAATTGAACGACTGAAGAAATTAGGG
GCTACAGTATTTGAGGGTTCCACAGATCCAGCTGATGCAGAGAATTGGTTGAATATGCTCGAAAAATGTTTTGACGTGATGAATTATCCTGAGGAGCGAAAGGTT
AGATTGGCCACAATTTTGTTGCAGAAGGAGGCTGAAGGATGGTTGAAATCTATATTAGTTAGGCGTAGTGATGCACGTACTTTAGACTGGTAGACTTTTAGAGGC
ATATTCGAAGATAAGTATTATCCCAGAACATACTGCGAGGCCAAGAGGGATGAGTTTTTGGGGTTGAAACAAGGATTACTTTCAGTGGCTGAGTACTAGAGAAAG
TATACCGAGCTTTCACGGTATGCTGATGTTATTGTGGCATCTGAGAGTGACATGTGTCGATGGTTTTAAAGAGGGTTGAATTTTGAAATACGTACCCCAGTACAA
CCATTGCTAAATGAACGGATTTTTCTCTGTTAGTGGAGACTGCCCTTCGTGTGGAGCAGAGTATAACATTAGAGAAATCAGCAGTGGAGCTTAGTCGTGGAGCTT
CAATAGTTAGTGGTTTTAGAGGCCGTGAGTAGCGAAGGTTCACACCTGGGGTAAATATTTCAAGCCGTCAAGACTTTAAGAATCGTTGTGGAGGCCAAGCATCGA
GGAACATGAGTTATGGTAGTGTTTTTCAGAGACAGAGCCAGAGCATACTGAGTCAATCCACTAAATCAACAGTAAGACCACAGCCAGGTAAAGAGTTCGTTGCTA
GTACCACAGTTGAACGATCGAGAGTTTGGAGAGGAAGAAGAAGAAGCTCAAGAAGTATTGGTTGAAGAACACAAGGACAACACAAACAGAATTAAGAGGGGAGCA
AATTCTGATGTTCATTGGGTGAATTTTGACAAAGAAAACAGGGCAGGTGGTTTTCTCCCGAAATCAGGAGACCTCTGACATTTAAAAAGTCAAAAGTCAACATTT
TGACTTTTTACCATTTTATCCATTATATCTTTTTTTCCTAAAGATCGAGTTCCCGGTGCTTGATAAACTGCTTTAGTCTGCTGAAGCTCCAATATCGATTGCC
Protein sequence
TYSIVYVSFRSCVSPDFLLKSVISGPGGGVTTWYSSLLHGN*NRAESCHYVLADDAGRIRTGCKVLLKVSLKGIYPKKIGRPERVEPSDPEKAYRIERLKKLGAT
VFEGSTDPADAENWLNMLEKCFDVMNYPEERKVRLATILLQKEAEGWLKSILVRRSDARTLDWTFRGIFEDKYYPRTYCEAKRDEFLGLKQGLLSVAEYRKYTEL
SRYADVIVASESDMCRWF*RGLNFEIRTPVQPLLNERIFLCWRLPFVWSRV*H*RNQQWSLVVELQ*LVVLEAVSSEGSHLG*IFQAVKTLRIVVEAKHRGTVMV
VFFRDRARAY*VNPLNQQ*DHSQVKSSLLVPQLNDREFGEEEEEAQEVLVEEHKDNTNRIKRGANSDVHWVNFDKENRAGGFLPKSGDLHLKSQKSTFLFTILSI
ISFFPKDRVPGA**TALVC*SSNIDCX