; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0022151 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0022151
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationchr02:4759877..4760767
RNA-Seq ExpressionPay0022151
SyntenyPay0022151
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041594.1 uncharacterized protein E6C27_scaffold93G00610 [Cucumis melo var. makuwa]1.9e-13182.09Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP
        M++IHKAGLEKTISNVGPFYPQLIREFIVNLPDEFN+PSSADYQTVHI+GFKFVISPT +NGFLGNT+DIDCS S              TLSTWPMNGIP
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP

Query:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT
        A ALSVKY ILHKIGIANWFPSSHASSIS ALG FLYQICN DKVDTGAFIYNQLLRHVGSFG+KVPIA  RLF SLLL LNGAVLTAS+ PRPEPK I 
Subjt:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT

Query:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        LSYRLFQG ++ DID DVHPT G RIFDT DWD+S EGFYVDR+LAT IINSLTAES ALTNSI LLSERRLEVDALIRH KSSAP TSRQ PPSG
Subjt:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

KAA0064034.1 uncharacterized protein E6C27_scaffold99G00320 [Cucumis melo var. makuwa]4.9e-13585.87Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPSWTLSTWPMNGIPATALSVKYAILHK
        M++IHK GLEKTISNV PFYPQLIREFIVNLPDEFNDPSSADYQTVHI+GFKF+ISP V+N FLGNTVDIDCS SWTL TWP+NGIPA ALSVKYAILHK
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPSWTLSTWPMNGIPATALSVKYAILHK

Query:  IGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKITLSYRLFQGSNVTD
        IGIANW PSSHASSI AALGTFLYQICN DKVDTG FIYNQLLR+VGSFG+KV IAFPRLFSSLLL LNGAVLTAS+APRPEPK I LSYR+FQGS+V D
Subjt:  IGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKITLSYRLFQGSNVTD

Query:  IDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        IDHDVHPT+  RIFDTTDWD+  EGFYVDRKLAT IINSLT ES ALTNSI LLS+RRLEVDALIRH KSSAP TSRQQPPSG
Subjt:  IDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

TYK29236.1 uncharacterized protein E5676_scaffold1228G00270 [Cucumis melo var. makuwa]2.0e-13383.78Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP
        MN+IHKAGL+KTISNVGPFYPQLIREFIVNLPDEFNDPSSADY TVHI+GFKFVISP V+NGFLGNTVDIDCSPS              TLSTW +NGIP
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP

Query:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT
          ALSVKYAILHKIGIANWFPS HASSIS ALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFG+KVPIAF RLFSSLLL LNGAVLTAS+AP PEPK I 
Subjt:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT

Query:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        LSYRLFQGS++ DIDHDVHPTRG  IF+TTDWDD  EGFYVDR+LATRIINSLTAES ALTNSI LLSERRLEVDALIRH KSSA  TSRQQP SG
Subjt:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

XP_008462856.1 PREDICTED: uncharacterized protein LOC103501137 [Cucumis melo]3.5e-13383.45Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP
        MN+ HKAGLEKTISNVGPFYPQLIREFIVNLPD+FNDPSS +YQTVHI+GFKFVISP V+N FLGNTVDID S S              TLSTWP+NGIP
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP

Query:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT
          ALSVKYAILHKIGIANWFPSSHASSISAAL TFLYQICNSDKVDTGAFIYNQLL HVGSF +KVPIAFPRLFSSLLL LNG VLTAS+AP PEPK I 
Subjt:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT

Query:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        LSYRLFQGS+V DIDHDVHPT G RIFDTTDWD+S EGFYVDR+LATRIINSLTAES ALTNSI LLSERRLEVDALIRH KSSAP TSRQQPPSG
Subjt:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

XP_008466681.1 PREDICTED: uncharacterized protein LOC103504033 [Cucumis melo]1.9e-13182.09Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP
        M++IHKAGLEKTISNVGPFYPQLIREFIVNLPDEFN+PSSADYQTVHI+GFKFVISPT +NGFLGNT+DIDCS S              TLSTWPMNGIP
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP

Query:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT
        A ALSVKY ILHKIGIANWFPSSHASSIS ALG FLYQICN DKVDTGAFIYNQLLRHVGSFG+KVPIA  RLF SLLL LNGAVLTAS+ PRPEPK I 
Subjt:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT

Query:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        LSYRLFQG ++ DID DVHPT G RIFDT DWD+S EGFYVDR+LAT IINSLTAES ALTNSI LLSERRLEVDALIRH KSSAP TSRQ PPSG
Subjt:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

TrEMBL top hitse value%identityAlignment
A0A1S3CHW3 uncharacterized protein LOC1035011371.7e-13383.45Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP
        MN+ HKAGLEKTISNVGPFYPQLIREFIVNLPD+FNDPSS +YQTVHI+GFKFVISP V+N FLGNTVDID S S              TLSTWP+NGIP
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP

Query:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT
          ALSVKYAILHKIGIANWFPSSHASSISAAL TFLYQICNSDKVDTGAFIYNQLL HVGSF +KVPIAFPRLFSSLLL LNG VLTAS+AP PEPK I 
Subjt:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT

Query:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        LSYRLFQGS+V DIDHDVHPT G RIFDTTDWD+S EGFYVDR+LATRIINSLTAES ALTNSI LLSERRLEVDALIRH KSSAP TSRQQPPSG
Subjt:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

A0A1S3CRT1 uncharacterized protein LOC1035040339.3e-13282.09Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP
        M++IHKAGLEKTISNVGPFYPQLIREFIVNLPDEFN+PSSADYQTVHI+GFKFVISPT +NGFLGNT+DIDCS S              TLSTWPMNGIP
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP

Query:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT
        A ALSVKY ILHKIGIANWFPSSHASSIS ALG FLYQICN DKVDTGAFIYNQLLRHVGSFG+KVPIA  RLF SLLL LNGAVLTAS+ PRPEPK I 
Subjt:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT

Query:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        LSYRLFQG ++ DID DVHPT G RIFDT DWD+S EGFYVDR+LAT IINSLTAES ALTNSI LLSERRLEVDALIRH KSSAP TSRQ PPSG
Subjt:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

A0A5A7TDT9 Uncharacterized protein9.3e-13282.09Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP
        M++IHKAGLEKTISNVGPFYPQLIREFIVNLPDEFN+PSSADYQTVHI+GFKFVISPT +NGFLGNT+DIDCS S              TLSTWPMNGIP
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP

Query:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT
        A ALSVKY ILHKIGIANWFPSSHASSIS ALG FLYQICN DKVDTGAFIYNQLLRHVGSFG+KVPIA  RLF SLLL LNGAVLTAS+ PRPEPK I 
Subjt:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT

Query:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        LSYRLFQG ++ DID DVHPT G RIFDT DWD+S EGFYVDR+LAT IINSLTAES ALTNSI LLSERRLEVDALIRH KSSAP TSRQ PPSG
Subjt:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

A0A5A7V9Y4 Uncharacterized protein2.4e-13585.87Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPSWTLSTWPMNGIPATALSVKYAILHK
        M++IHK GLEKTISNV PFYPQLIREFIVNLPDEFNDPSSADYQTVHI+GFKF+ISP V+N FLGNTVDIDCS SWTL TWP+NGIPA ALSVKYAILHK
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPSWTLSTWPMNGIPATALSVKYAILHK

Query:  IGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKITLSYRLFQGSNVTD
        IGIANW PSSHASSI AALGTFLYQICN DKVDTG FIYNQLLR+VGSFG+KV IAFPRLFSSLLL LNGAVLTAS+APRPEPK I LSYR+FQGS+V D
Subjt:  IGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKITLSYRLFQGSNVTD

Query:  IDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        IDHDVHPT+  RIFDTTDWD+  EGFYVDRKLAT IINSLT ES ALTNSI LLS+RRLEVDALIRH KSSAP TSRQQPPSG
Subjt:  IDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

A0A5D3E001 Uncharacterized protein9.9e-13483.78Show/hide
Query:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP
        MN+IHKAGL+KTISNVGPFYPQLIREFIVNLPDEFNDPSSADY TVHI+GFKFVISP V+NGFLGNTVDIDCSPS              TLSTW +NGIP
Subjt:  MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPS-------------WTLSTWPMNGIP

Query:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT
          ALSVKYAILHKIGIANWFPS HASSIS ALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFG+KVPIAF RLFSSLLL LNGAVLTAS+AP PEPK I 
Subjt:  ATALSVKYAILHKIGIANWFPSSHASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKIT

Query:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG
        LSYRLFQGS++ DIDHDVHPTRG  IF+TTDWDD  EGFYVDR+LATRIINSLTAES ALTNSI LLSERRLEVDALIRH KSSA  TSRQQP SG
Subjt:  LSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWDDSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACATTATCCATAAGGCTGGTTTGGAAAAAACTATCTCGAATGTTGGCCCTTTTTATCCTCAGTTAATTAGAGAATTTATTGTCAATCTGCCTGATGAGTTTAATGA
TCCAAGTAGTGCTGACTATCAGACGGTGCACATTAAAGGGTTCAAATTTGTGATTTCACCTACTGTAGTAAATGGGTTTCTTGGAAATACTGTTGATATTGACTGCTCTC
CATCATGGACCTTGTCCACATGGCCTATGAATGGAATCCCTGCAACTGCTCTCAGCGTCAAGTATGCCATTCTGCACAAGATTGGCATTGCCAATTGGTTCCCTTCCTCA
CATGCATCAAGCATATCTGCTGCCTTAGGTACATTCTTGTATCAAATTTGCAATAGTGATAAAGTAGATACGGGTGCCTTCATTTACAATCAACTGTTGAGGCATGTTGG
GTCGTTTGGGCTCAAGGTTCCTATTGCTTTTCCGAGGTTATTCTCCAGTCTGCTACTTGATCTAAATGGAGCGGTGCTTACTGCATCTAATGCTCCTAGACCTGAACCTA
AGAAAATTACACTTAGCTACAGACTCTTTCAAGGCAGTAATGTGACTGATATTGACCATGATGTGCATCCAACTCGAGGCCTACGTATTTTTGACACTACTGACTGGGAT
GACTCTCCTGAAGGCTTCTATGTGGATCGTAAGTTAGCTACCCGTATTATTAATTCCTTGACTGCTGAATCTCTTGCATTGACTAACTCCATCATTCTGTTGTCTGAACG
TCGATTAGAGGTTGATGCTCTCATTCGACATTGGAAGTCTTCGGCACCACCTACTAGTCGTCAGCAGCCACCATCTGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACATTATCCATAAGGCTGGTTTGGAAAAAACTATCTCGAATGTTGGCCCTTTTTATCCTCAGTTAATTAGAGAATTTATTGTCAATCTGCCTGATGAGTTTAATGA
TCCAAGTAGTGCTGACTATCAGACGGTGCACATTAAAGGGTTCAAATTTGTGATTTCACCTACTGTAGTAAATGGGTTTCTTGGAAATACTGTTGATATTGACTGCTCTC
CATCATGGACCTTGTCCACATGGCCTATGAATGGAATCCCTGCAACTGCTCTCAGCGTCAAGTATGCCATTCTGCACAAGATTGGCATTGCCAATTGGTTCCCTTCCTCA
CATGCATCAAGCATATCTGCTGCCTTAGGTACATTCTTGTATCAAATTTGCAATAGTGATAAAGTAGATACGGGTGCCTTCATTTACAATCAACTGTTGAGGCATGTTGG
GTCGTTTGGGCTCAAGGTTCCTATTGCTTTTCCGAGGTTATTCTCCAGTCTGCTACTTGATCTAAATGGAGCGGTGCTTACTGCATCTAATGCTCCTAGACCTGAACCTA
AGAAAATTACACTTAGCTACAGACTCTTTCAAGGCAGTAATGTGACTGATATTGACCATGATGTGCATCCAACTCGAGGCCTACGTATTTTTGACACTACTGACTGGGAT
GACTCTCCTGAAGGCTTCTATGTGGATCGTAAGTTAGCTACCCGTATTATTAATTCCTTGACTGCTGAATCTCTTGCATTGACTAACTCCATCATTCTGTTGTCTGAACG
TCGATTAGAGGTTGATGCTCTCATTCGACATTGGAAGTCTTCGGCACCACCTACTAGTCGTCAGCAGCCACCATCTGGTTAA
Protein sequenceShow/hide protein sequence
MNIIHKAGLEKTISNVGPFYPQLIREFIVNLPDEFNDPSSADYQTVHIKGFKFVISPTVVNGFLGNTVDIDCSPSWTLSTWPMNGIPATALSVKYAILHKIGIANWFPSS
HASSISAALGTFLYQICNSDKVDTGAFIYNQLLRHVGSFGLKVPIAFPRLFSSLLLDLNGAVLTASNAPRPEPKKITLSYRLFQGSNVTDIDHDVHPTRGLRIFDTTDWD
DSPEGFYVDRKLATRIINSLTAESLALTNSIILLSERRLEVDALIRHWKSSAPPTSRQQPPSG