CuGenDBv2

Moc03g15080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g15080
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr3:10166151..10177084
RNA-Seq Expression: Moc03g15080
Synteny: Moc03g15080
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR036875 - Zinc finger, CCHC-type superfamily
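
The genome location is given in a `chr:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span of the locus can be computed as shown here. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chr3:10166151..10177084' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr3:10166151..10177084")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)  # chr3 10934
```

Under that assumption the locus covers 10,934 bp on chr3.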


Homology
GenBank top hits | e value | % identity
XP_022149380.1 uncharacterized protein LOC111017810 [Momordica charantia] | 3.7e-21 | 43.14
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R EI+GL  + EPTTY AA   AL +DK + +  Q  Q +G+S  VKRKF+S S SQ  R  Q  +  +  P +C  C   H G CW  KRICFKC+K G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC M+ SNT+ + +KT   A  Q    +  V ALT+ +V+  + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia] | 3.1e-20 | 42.48
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R EI+GL  + E TTY AA   AL +DK +  + Q  Q MG+S  VKRKF+S S SQ+    Q  +  +  P  C  C  +H+G CW  KRICF+C+K G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC M+ SNT+ + +KT   A  Q   Q+  V ALT+ +V+  + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] | 1.4e-17 | 35.95
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R  IRG   +  PTTY  A   AL +DK +   +    ++G+S  VKRKF S+      RAPQ+    +  P +C  C   H+G CW   + CF+C + G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC MS++NT+ + ++  P    Q   Q+  V ALT++E  D + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] | 2.4e-20 | 41.18
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R EI+GL  + EPTTY AA   AL +DK +  + Q  Q +G++  VKRKF+S S SQ+ R  Q     +  P +C  C  +H+  CW  K+ICFKC+K G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC M+ SNT+ + +KT      Q   Q   V ALT+ +V+  + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia] | 2.6e-19 | 40.52
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R EI+GL  + EPTTY AA   AL +DK +  + Q  Q +G+S  VKRKF+S S SQ  R  Q  +  +  P +C  C   H+G CW  KRIC++C+K G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC M+ SNT+ + ++    A  Q    +  V ALT+ +V+  + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
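
The % identity column can be cross-checked against the alignments. The sketch below assumes the middle row follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and that the alignment length equals the total length of the Query rows. `percent_identity` is a hypothetical helper, and the rows are copied from the first GenBank hit (XP_022149380.1).

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity = identical positions / alignment length * 100."""
    # Alignment length: every column of the query rows (gap characters included).
    alignment_length = sum(len(row) for row in query_rows)
    # Identical positions: letters on the midline; '+' and spaces do not count.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / alignment_length

# Query and midline rows of the first GenBank hit, as listed above.
query_rows = [
    "RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG",
    "HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT",
]
midline_rows = [
    "R EI+GL  + EPTTY AA   AL +DK + +  Q  Q +G+S  VKRKF+S S SQ  R  Q  +  +  P +C  C   H G CW  KRICFKC+K G",
    "H+ REC M+ SNT+ + +KT   A  Q    +  V ALT+ +V+  + +V  T",
]

print(round(percent_identity(query_rows, midline_rows), 2))  # 43.14
```

The result, 66 identical positions out of 153 aligned, reproduces the reported 43.14. Only the Query and middle rows are used because the Subjt rows on this page repeat the query text.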

TrEMBL top hits | e value | % identity
A0A6J1D5J7 uncharacterized protein LOC111017810 | 1.8e-21 | 43.14
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R EI+GL  + EPTTY AA   AL +DK + +  Q  Q +G+S  VKRKF+S S SQ  R  Q  +  +  P +C  C   H G CW  KRICFKC+K G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC M+ SNT+ + +KT   A  Q    +  V ALT+ +V+  + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

A0A6J1DQB9 Reverse transcriptase | 1.1e-20 | 41.18
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R EI+GL  + EPTTY AA   AL +DK +  + Q  Q +G++  VKRKF+S S SQ+ R  Q     +  P +C  C  +H+  CW  K+ICFKC+K G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC M+ SNT+ + +KT      Q   Q   V ALT+ +V+  + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

A0A6J1DR22 uncharacterized protein LOC111023035 | 1.5e-20 | 42.48
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R EI+GL  + E TTY AA   AL +DK +  + Q  Q MG+S  VKRKF+S S SQ+    Q  +  +  P  C  C  +H+G CW  KRICF+C+K G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC M+ SNT+ + +KT   A  Q   Q+  V ALT+ +V+  + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

A0A6J1DUM2 uncharacterized protein LOC111023247 | 7.0e-18 | 35.95
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R  IRG   +  PTTY  A   AL +DK +   +    ++G+S  VKRKF S+      RAPQ+    +  P +C  C   H+G CW   + CF+C + G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC MS++NT+ + ++  P    Q   Q+  V ALT++E  D + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

A0A6J1DWP4 uncharacterized protein LOC111025215 | 1.3e-19 | 40.52
Query:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG
        R EI+GL  + EPTTY AA   AL +DK +  + Q  Q +G+S  VKRKF+S S SQ  R  Q  +  +  P +C  C   H+G CW  KRIC++C+K G
Subjt:  RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVG

Query:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT
        H+ REC M+ SNT+ + ++    A  Q    +  V ALT+ +V+  + +V  T
Subjt:  HYVRECSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAAT

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGCTAGGACCAAATCCTTTGATTCTAGTTCTCATAGGACTTCGACCTCTCAAACTCCTTTAGAGAATCCTCCCACTAGGAGTAGTAAGTCTCGTCGGTGCAAG
CGGCGCGAGTCACTAGAAATTAGCTGCGACCTCTCTGACTATTCATATTCGAATTGTCCCGAGGAACTATTAGGGATACTTAGATATAACTATTCCATCCCAAAT
GATATTGAGTTGAGAATCCCTACGGCGGGTGAAACGATCGACAAACCTCCAATAGTTTCTAGTACCTATGGTGGAGATTGTGTGAAGATAGAGCCTGCCCTTTGT
CTGTCGAACAAATGCTTGCTATCCATTACATTAAAAAGTCTTCCAGCCCCTTGCCGATTCTATTTAAGCTGCTTTCCTGGAATTGCTAAATTAGTCAACAGGTCA
GTAATGGAAAAAGGTTTCTCTTCGGTTGTGAATGGTCCCACCTCCATAAAGAACTGGAAACAAAGCCAATTTCTTGTCTTCAAAGCACTAATTGATCCTCCACCG
GAGTTAACTAAAGCGACTAGATCAGTCCTAGCTGACTGTGTTGTCCTTTCAGCTAGAGATCGTTATAGTCCCGACCTCTTATCCAATCGAAATTTGAGGAACTGT
GGTCTCGTAGCTAAGCTCACAGAAGCCGCCCACCCCTCCATTGTCTTATTTTTCAGCCACCGCCGCCTCAGCGTCCCCTTTTCCACAGCAGCCAGCGACGGCGTC
TTCCATGGCAGCTCAGGTTCTTCTCCGGCGGCACATTCCCTAGCCTGTTCCAGCGGCTTCGACGGCATTGCACGCAAAGCAGACCACGCTGGTGTTCCTCGAGCT
GCAGCAGTGTCGGTCTTCAACTGTGCCGCATCCTCGACCAGGTGTGGTTTTTGGCGTCGCCTCTCCTCCACGCTTCGATTTGTGGAGTCTTTAGGGGTTGATGTG
GAGTTGGTTATGGTAAAGAACGTTGGATTCTTGGGAAAAGTGAATGGCAAGGTTCTTAGATCCAACTCTCAAGGGTTTCTTAACACCTTGGGTAAAAATGGTCAA
GGGCCGATAGATGGTGAAGTCATCGGGACCTTGGATAAAGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTTCACCTCCAGTTTGCAGGTGTCGAG
CTAGCTCTTGGGATGATTAATAAAGTGTTTTGTGAAGTTCCATATTTAGGGGATGTCCTAGGATTGATAACTCGTTGTCTTGAGAACCATTCGTGTAGTGGATGG
TTTATGATTAGTGAAGTTTCATTGAAAGCGCAAGTTTTTGTAAGTGTTATTGATCAGAGTAGCGCAGCGGAAGCGTGCACGTATGTGATTTCTGAAGATGATGCA
GAGCCCCATGTTGAACTTTCTATTCGCTCGAGCATACCTCGAGGGTTGATCTCATTAGAAGGTTTGCTAGTATCGGTTGGTATGGCTCGTCAAGGCGACACCCAG
GTTCAAGTCGATAGTCCAGCTTTCACTCTCTACATCGATGAACGTTTTAATGAGGAGACAGAGTTCATACATCGTGCACAAGGAAACACGACAGGGGCTCATCCC
CGTGGGGAAATTAGAGGACTAGATGCAATTATGGAACCAACCACTTACACTGCAGCTCCCGAGAGTGCCCTATTTCTTGATAAGAGTATTCGAAGGGATAGCCAA
GTTGACCAAAAGATGGGCACCTCACATGAAGTTAAAAGGAAGTTTTCATCGTCCTCTTTAAGCCAAACTTTTAGGGCTCCTCAACAACCATTATGGACACGAGAC
ACTCCCTTATTGTGTTACTTCTGTCTGATTCATCATTCTGGGCTGTGTTGGAAGAAGAAGAGGATATGTTTTAAGTGCAAGAAGGTTGGCCATTACGTTAGAGAA
TGTTCGATGAGTAGTTCGAACACCCGGACCATAATACGAAAGACGCTTCCAGTGGCACTAGTGCAAAGTATTGGACAAAAGCAAGACGTCCTTGCACTCACTCAG
GAAGAAGTAAAAGATGTGGATGTTATGGTAGCAGCCACCGCCGCCTCAGCGTCCCCTTTTCCACAGCATCCAGCGACGGCGTCGTCCATGGCAGCTCAAGTTGTC
TCGGCATCTCTCTGCCCCGACATTCCCCTACGACCTGGCTGCGCCACCGTCAAGATCCCTCTCATTCGGATGTGGTATCGGTTCATTCATGTACCCCTCCTCTCG
GGCGTCACAAATACCCATTATAGCAAAGCAGACCACGCCGGCATTCCTCAAGCTGCAGCAGTGTCGGTCTTCAACGACTCGGAGTTTTCGTCACTGAACCCGTTA
CCTAGAAGCAGTAGCATCGGTTTGGGGCGATTTGCAGCAGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTCCCCTCCAGTTTGCAGGAT
GTGAGCTTCTCTTCAAACATTAGTGTTGTTGTTCTAATTGTTCAAGCTCTCGTGAGTGTTGGAAATCAGAGTAGCGCAGCAGAAGCGTGGTAG
mRNA sequence
ATGGCTAGGACCAAATCCTTTGATTCTAGTTCTCATAGGACTTCGACCTCTCAAACTCCTTTAGAGAATCCTCCCACTAGGAGTAGTAAGTCTCGTCGGTGCAAG
CGGCGCGAGTCACTAGAAATTAGCTGCGACCTCTCTGACTATTCATATTCGAATTGTCCCGAGGAACTATTAGGGATACTTAGATATAACTATTCCATCCCAAAT
GATATTGAGTTGAGAATCCCTACGGCGGGTGAAACGATCGACAAACCTCCAATAGTTTCTAGTACCTATGGTGGAGATTGTGTGAAGATAGAGCCTGCCCTTTGT
CTGTCGAACAAATGCTTGCTATCCATTACATTAAAAAGTCTTCCAGCCCCTTGCCGATTCTATTTAAGCTGCTTTCCTGGAATTGCTAAATTAGTCAACAGGTCA
GTAATGGAAAAAGGTTTCTCTTCGGTTGTGAATGGTCCCACCTCCATAAAGAACTGGAAACAAAGCCAATTTCTTGTCTTCAAAGCACTAATTGATCCTCCACCG
GAGTTAACTAAAGCGACTAGATCAGTCCTAGCTGACTGTGTTGTCCTTTCAGCTAGAGATCGTTATAGTCCCGACCTCTTATCCAATCGAAATTTGAGGAACTGT
GGTCTCGTAGCTAAGCTCACAGAAGCCGCCCACCCCTCCATTGTCTTATTTTTCAGCCACCGCCGCCTCAGCGTCCCCTTTTCCACAGCAGCCAGCGACGGCGTC
TTCCATGGCAGCTCAGGTTCTTCTCCGGCGGCACATTCCCTAGCCTGTTCCAGCGGCTTCGACGGCATTGCACGCAAAGCAGACCACGCTGGTGTTCCTCGAGCT
GCAGCAGTGTCGGTCTTCAACTGTGCCGCATCCTCGACCAGGTGTGGTTTTTGGCGTCGCCTCTCCTCCACGCTTCGATTTGTGGAGTCTTTAGGGGTTGATGTG
GAGTTGGTTATGGTAAAGAACGTTGGATTCTTGGGAAAAGTGAATGGCAAGGTTCTTAGATCCAACTCTCAAGGGTTTCTTAACACCTTGGGTAAAAATGGTCAA
GGGCCGATAGATGGTGAAGTCATCGGGACCTTGGATAAAGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTTCACCTCCAGTTTGCAGGTGTCGAG
CTAGCTCTTGGGATGATTAATAAAGTGTTTTGTGAAGTTCCATATTTAGGGGATGTCCTAGGATTGATAACTCGTTGTCTTGAGAACCATTCGTGTAGTGGATGG
TTTATGATTAGTGAAGTTTCATTGAAAGCGCAAGTTTTTGTAAGTGTTATTGATCAGAGTAGCGCAGCGGAAGCGTGCACGTATGTGATTTCTGAAGATGATGCA
GAGCCCCATGTTGAACTTTCTATTCGCTCGAGCATACCTCGAGGGTTGATCTCATTAGAAGGTTTGCTAGTATCGGTTGGTATGGCTCGTCAAGGCGACACCCAG
GTTCAAGTCGATAGTCCAGCTTTCACTCTCTACATCGATGAACGTTTTAATGAGGAGACAGAGTTCATACATCGTGCACAAGGAAACACGACAGGGGCTCATCCC
CGTGGGGAAATTAGAGGACTAGATGCAATTATGGAACCAACCACTTACACTGCAGCTCCCGAGAGTGCCCTATTTCTTGATAAGAGTATTCGAAGGGATAGCCAA
GTTGACCAAAAGATGGGCACCTCACATGAAGTTAAAAGGAAGTTTTCATCGTCCTCTTTAAGCCAAACTTTTAGGGCTCCTCAACAACCATTATGGACACGAGAC
ACTCCCTTATTGTGTTACTTCTGTCTGATTCATCATTCTGGGCTGTGTTGGAAGAAGAAGAGGATATGTTTTAAGTGCAAGAAGGTTGGCCATTACGTTAGAGAA
TGTTCGATGAGTAGTTCGAACACCCGGACCATAATACGAAAGACGCTTCCAGTGGCACTAGTGCAAAGTATTGGACAAAAGCAAGACGTCCTTGCACTCACTCAG
GAAGAAGTAAAAGATGTGGATGTTATGGTAGCAGCCACCGCCGCCTCAGCGTCCCCTTTTCCACAGCATCCAGCGACGGCGTCGTCCATGGCAGCTCAAGTTGTC
TCGGCATCTCTCTGCCCCGACATTCCCCTACGACCTGGCTGCGCCACCGTCAAGATCCCTCTCATTCGGATGTGGTATCGGTTCATTCATGTACCCCTCCTCTCG
GGCGTCACAAATACCCATTATAGCAAAGCAGACCACGCCGGCATTCCTCAAGCTGCAGCAGTGTCGGTCTTCAACGACTCGGAGTTTTCGTCACTGAACCCGTTA
CCTAGAAGCAGTAGCATCGGTTTGGGGCGATTTGCAGCAGATAAGGGCTGCTTACTGAGTACTGTGGTTGTACTCATCCCTCTTTTCCCCTCCAGTTTGCAGGAT
GTGAGCTTCTCTTCAAACATTAGTGTTGTTGTTCTAATTGTTCAAGCTCTCGTGAGTGTTGGAAATCAGAGTAGCGCAGCAGAAGCGTGGTAG
Protein sequence
MARTKSFDSSSHRTSTSQTPLENPPTRSSKSRRCKRRESLEISCDLSDYSYSNCPEELLGILRYNYSIPNDIELRIPTAGETIDKPPIVSSTYGGDCVKIEPALC
LSNKCLLSITLKSLPAPCRFYLSCFPGIAKLVNRSVMEKGFSSVVNGPTSIKNWKQSQFLVFKALIDPPPELTKATRSVLADCVVLSARDRYSPDLLSNRNLRNC
GLVAKLTEAAHPSIVLFFSHRRLSVPFSTAASDGVFHGSSGSSPAAHSLACSSGFDGIARKADHAGVPRAAAVSVFNCAASSTRCGFWRRLSSTLRFVESLGVDV
ELVMVKNVGFLGKVNGKVLRSNSQGFLNTLGKNGQGPIDGEVIGTLDKGCLLSTVVVLIPLFHLQFAGVELALGMINKVFCEVPYLGDVLGLITRCLENHSCSGW
FMISEVSLKAQVFVSVIDQSSAAEACTYVISEDDAEPHVELSIRSSIPRGLISLEGLLVSVGMARQGDTQVQVDSPAFTLYIDERFNEETEFIHRAQGNTTGAHP
RGEIRGLDAIMEPTTYTAAPESALFLDKSIRRDSQVDQKMGTSHEVKRKFSSSSLSQTFRAPQQPLWTRDTPLLCYFCLIHHSGLCWKKKRICFKCKKVGHYVRE
CSMSSSNTRTIIRKTLPVALVQSIGQKQDVLALTQEEVKDVDVMVAATAASASPFPQHPATASSMAAQVVSASLCPDIPLRPGCATVKIPLIRMWYRFIHVPLLS
GVTNTHYSKADHAGIPQAAAVSVFNDSEFSSLNPLPRSSSIGLGRFAADKGCLLSTVVVLIPLFPSSLQDVSFSSNISVVVLIVQALVSVGNQSSAAEAW
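
The protein sequence is the conceptual translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the page does not name a translation table), the relationship can be checked on the first and last codons of the CDS listed above. `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS: matches the start of the protein sequence.
print(translate("ATGGCTAGGACCAAATCCTTTGATTCT"))  # MARTKSFDS
# Last five codons of the CDS, ending in the TAG stop codon.
print(translate("GCAGAAGCGTGGTAG"))  # AEAW
```

Both fragments agree with the listed protein, which begins with MARTKSFDS and ends with AEAW.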