; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0011650 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0011650
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionReverse transcriptase
Genome locationchr07:15851129..15852202
RNA-Seq ExpressionPI0011650
SyntenyPI0011650
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048713.1 hypothetical protein E6C27_scaffold43G00050 [Cucumis melo var. makuwa]9.1e-5443.58Show/hide
Query:  MSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDDFDNR------KGRRG
        M+F+Q+DR+NL D W RFK M+K CPH+ IPEC+LME FYFGL+K T Q+A+ VF   ML++SYNQIK   DTM SN++EW ++ F +R      KG RG
Subjt:  MSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDDFDNR------KGRRG

Query:  QSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFVRNGPFFNTYNLGWTNHPNFGWGGTGQQ
        + E+G+D +++VALQGQ+  M N+L+SMA+ QVN  R SVQ   Q+++MGCVGC  PHNTNACPLNTE   +++N P             +  WGG   Q
Subjt:  QSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFVRNGPFFNTYNLGWTNHPNFGWGGTGQQ

Query:  NQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFAGRPQGSLPSNIEMPNQ
                                                                 + SQASSI N+E+QLGQL SDF+ RP+ S PSN E PNQ
Subjt:  NQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFAGRPQGSLPSNIEMPNQ

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]5.1e-4938.04Show/hide
Query:  MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDE
        ++K FPP  N + R+E+MSF Q + ++  D+W RFK +++ CPH+GIP CI ME FY GLN  +Q   DA     +L  SYN+     +T+ SNN +W  
Subjt:  MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDE

Query:  DDFDNRKGRRGQSEEG-MDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFV-------RNGPFFNT
            N +    +   G ++ + + AL  QM +M N+LK+++I   N+      A  Q DD+ CV C   H    CP N E+  ++        NG F N+
Subjt:  DDFDNRKGRRGQSEEG-MDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFV-------RNGPFFNT

Query:  YNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFAG
        YN  W NHPN  W   G +++ +H  QG        S    H   PQH+Q+ Q      SS+E+L+ +YM KNDA++QSQA+ + NLE+QLG LA++   
Subjt:  YNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFAG

Query:  RPQGSLPSNIEMPNQAGGSGKESVTLRNGRNLTIRDPDTERS-NPTS
        RPQGSLPS+ E P + G    +S+ LR+G++L   + + + S  PTS
Subjt:  RPQGSLPSNIEMPNQAGGSGKESVTLRNGRNLTIRDPDTERS-NPTS

XP_030507648.1 uncharacterized protein LOC115722545 [Cannabis sativa]3.0e-4939.34Show/hide
Query:  MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDE
        ++K FPP  N + R+E+MSFQQ + +   D+W RFK +++ CPH+GIP CI +E FY GLN  T+   DA     +L  SYN+     + + SNN +W  
Subjt:  MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDE

Query:  DDFDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQ--ATNQIDDMGCVGCDGPHNTNACPLNTETADFV-------RNGPFFN
            NR     +    ++ + + AL  QM +M N+LK+M +       GSVQ  A  Q  ++ CV C   H    CP N  +  +V        N P+ N
Subjt:  DDFDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQ--ATNQIDDMGCVGCDGPHNTNACPLNTETADFV-------RNGPFFN

Query:  TYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFA
        +YN  W +HPNF WGG G  + G    QG  Q    G   +  +++PQ SQ        +SS+E+L+ +YM KNDA++QSQA+S+ NLEVQLGQLA+D  
Subjt:  TYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFA

Query:  GRPQGSLPSNIEMPNQAGGSGKESVTLRNGRNL
         RPQG+LPS+ E P + G    +++TLR+G+ L
Subjt:  GRPQGSLPSNIEMPNQAGGSGKESVTLRNGRNL

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]1.8e-4937.47Show/hide
Query:  MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDE
        ++K FPP  N + R+E+MSFQQ + +   D+W RFK +++ CPH+GIP CI +E FY GLN A++   DA     +L  SYN+     + + SNN +W  
Subjt:  MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDE

Query:  DDFDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQ--ATNQIDDMGCVGCDGPHNTNACPLNTETADFV-------RNGPFFN
            NR     +    ++ + + AL  QM +M N+LK+M +       GSVQ  A  Q  ++ CV C   H    CP N  +  +V        N P+ N
Subjt:  DDFDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQ--ATNQIDDMGCVGCDGPHNTNACPLNTETADFV-------RNGPFFN

Query:  TYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFA
        +YN  W +HPNF WGG G  + G    QG        S     + +P+  Q  Q   + +SS+E+L+ +YM KNDA++QSQA+S+ NLEVQLGQLA+D  
Subjt:  TYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFA

Query:  GRPQGSLPSNIEMPNQAGGSGKESVTLRNGRNLTIRDPDTERSNPTS-HSTAEIDSSSNISNS
         RPQG+LPS+ E P + G    ++VTLR+G+ +      T    P+S     E+      SNS
Subjt:  GRPQGSLPSNIEMPNQAGGSGKESVTLRNGRNLTIRDPDTERSNPTS-HSTAEIDSSSNISNS

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]1.0e-4939.46Show/hide
Query:  MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDE
        ++K FPP  N + R+E+MSFQQ + +   D+W RFK +++ CPH+GIP CI +E FY GLN A++   DA     +L  SYN+     + + SNN +W  
Subjt:  MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDE

Query:  DDFDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQ--ATNQIDDMGCVGCDGPHNTNACPLNTETADFV-------RNGPFFN
            NR     +    ++ + + AL  QM +M N+LK+M +       GSVQ  A  Q  ++ CV C   H    CP N  +  +V        N P+ N
Subjt:  DDFDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQ--ATNQIDDMGCVGCDGPHNTNACPLNTETADFV-------RNGPFFN

Query:  TYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPT-TTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDF
        +YN  W +HPNF WGG G  + G  G     QG+ S          P  SQ   QP  + +SS+E+L+ +YM KNDA++QSQA+S+ NLEVQLGQLA+D 
Subjt:  TYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPT-TTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDF

Query:  AGRPQGSLPSNIEMPNQAGGSGKESVTLRNGR
          RPQG+LPS+ E P +      ++VTLR+G+
Subjt:  AGRPQGSLPSNIEMPNQAGGSGKESVTLRNGR

TrEMBL top hitse value%identityAlignment
A0A0A0K6F9 Uncharacterized protein3.1e-3958.87Show/hide
Query:  MSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDDFDNRKGRRGQSEEGM
        M F+Q+D++N+HD+WSRFK +VKACP +GIPEC+ MEVFYFGL+K T Q  + +FV  ML++SYNQIKAT D+M++N++EWD+  F +R   RG+++EG+
Subjt:  MSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDDFDNRKGRRGQSEEGM

Query:  DKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQID
        DK+VVV LQGQM AMNNLL+SM +SQVNAA   + A  Q++
Subjt:  DKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQID

A0A5D3CC26 Uncharacterized protein4.4e-5443.58Show/hide
Query:  MSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDDFDNR------KGRRG
        M+F+Q+DR+NL D W RFK M+K CPH+ IPEC+LME FYFGL+K T Q+A+ VF   ML++SYNQIK   DTM SN++EW ++ F +R      KG RG
Subjt:  MSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDDFDNR------KGRRG

Query:  QSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFVRNGPFFNTYNLGWTNHPNFGWGGTGQQ
        + E+G+D +++VALQGQ+  M N+L+SMA+ QVN  R SVQ   Q+++MGCVGC  PHNTNACPLNTE   +++N P             +  WGG   Q
Subjt:  QSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFVRNGPFFNTYNLGWTNHPNFGWGGTGQQ

Query:  NQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFAGRPQGSLPSNIEMPNQ
                                                                 + SQASSI N+E+QLGQL SDF+ RP+ S PSN E PNQ
Subjt:  NQGRHGGQGDHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFAGRPQGSLPSNIEMPNQ

A0A5D3D2S0 Uncharacterized protein1.2e-4348.15Show/hide
Query:  MLKNSYNQIKATQDTMTSNNEEWDEDDFDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACP
        ML++SY QIK T D +T+N++E  +DD   +   RG+++ GMD+NV+VALQGQ+T M  LL+SMA+SQV+A    VQA  Q+D+M  VGC  PH T+AC 
Subjt:  MLKNSYNQIKATQDTMTSNNEEWDEDDFDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACP

Query:  LNTETADFVRNGPFFNTYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHS--QHQQQPTTTS---SSMENLLHEYMQKNDALLQ
        LN E A +V++ P+ NTYN                         G+++GEA   H + H +RP +S  QHQQ  T T    SSM  LL +YMQ+ DA +Q
Subjt:  LNTETADFVRNGPFFNTYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNRPQHS--QHQQQPTTTS---SSMENLLHEYMQKNDALLQ

Query:  SQASSIHNLEVQLGQLASDFAGRPQGSLPSNIEMPNQAGGSGK
        SQ +SI NLE+ LGQLA DF+GRP GSLPSNIE+PN   G  K
Subjt:  SQASSIHNLEVQLGQLASDFAGRPQGSLPSNIEMPNQAGGSGK

A0A6J1EEI2 uncharacterized protein LOC1114333942.2e-3736.05Show/hide
Query:  KIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDD
        K FPP  N R RNE++ FQQ + D L ++W RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     +L  +YN+     + + SNN +W   D
Subjt:  KIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDD

Query:  FDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQ---VNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFV---------RNGPFF
          +  GR+ +    ++ + + ++  Q+ ++ N+L+++A+ Q   + A   +V   NQ     CV C   H  + CP N  +  +V         +N PF 
Subjt:  FDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQ---VNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFV---------RNGPFF

Query:  NTYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNR--PQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQ
        NTYN GW NHPNF W G G  NQ          G    + + Y + +   Q     Q   T  +S+E+L+ EYM KND ++Q+Q +S+ NLEVQ
Subjt:  NTYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNR--PQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQ

A0A6J1G7Q6 uncharacterized protein LOC1114515983.1e-3934.27Show/hide
Query:  KIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDD
        K FPP  + R RNE+++FQ+ + + L ++W RFK  ++ CPH+G+P CI +E FY GLN AT+Q  DA     +L  +YN+     + + SNN +W    
Subjt:  KIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDD

Query:  FDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQ---VNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFVRN---------GPFF
         D R     ++ E ++ + + ++  Q+ +M N+L+++A  Q   + A   +     Q     CV C   H  + CP N  +  +V N          P  
Subjt:  FDNRKGRRGQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQ---VNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFVRN---------GPFF

Query:  NTYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNR--PQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLAS
        NTYN GW NHPNF   G G  NQ          G    + + Y + +   Q     Q    + + +E+L+ EYM +NDA++QSQ  S+ NLEVQ+GQLA+
Subjt:  NTYNLGWTNHPNFGWGGTGQQNQGRHGGQGDHQGEASGSHVRYHNNR--PQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLAS

Query:  DFAGRPQGSLPSNIEMPNQAG
        +   RP G LP++ EMP + G
Subjt:  DFAGRPQGSLPSNIEMPNQAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAATTTTCCCACCTCACGAGAACGTCAGAAGAAGGAATGAGCTCATGAGCTTCCAGCAAAAAGATAGAGATAACCTACATGACTCGTGGAGTAGGTTCAAGAT
GATGGTCAAAGCATGCCCTCACAATGGCATCCCCGAATGCATATTGATGGAGGTTTTCTATTTTGGCTTGAACAAGGCGACACAGCAGACTGCTGATGCTGTGTTTGTAA
GTTGTATGTTGAAAAACTCATACAACCAGATTAAGGCAACACAGGACACGATGACCAGCAATAATGAAGAATGGGATGAAGATGATTTCGACAATCGCAAAGGAAGACGT
GGACAAAGCGAAGAAGGTATGGATAAGAACGTCGTGGTGGCGTTGCAAGGACAAATGACTGCAATGAACAATCTTCTCAAATCTATGGCAATATCGCAAGTCAATGCCGC
AAGAGGCTCTGTGCAAGCGACAAATCAAATTGATGATATGGGATGTGTGGGATGCGATGGTCCTCATAATACTAACGCATGCCCACTCAATACAGAAACAGCCGACTTCG
TAAGGAATGGCCCTTTCTTTAACACTTACAACCTTGGTTGGACAAACCATCCTAATTTTGGATGGGGAGGAACGGGTCAACAAAATCAAGGACGACATGGTGGTCAAGGT
GACCATCAAGGGGAAGCATCTGGCTCCCACGTGAGGTACCACAACAACAGGCCACAACACTCCCAACATCAACAACAACCCACCACCACTTCTTCATCTATGGAGAACCT
CCTCCACGAATACATGCAGAAAAATGATGCCCTTCTGCAAAGCCAAGCTTCATCAATTCACAATCTGGAAGTGCAATTAGGGCAGCTAGCCAGCGACTTCGCCGGAAGAC
CACAAGGATCCCTCCCAAGTAATATAGAAATGCCAAACCAGGCGGGGGGATCTGGAAAAGAGAGTGTGACACTACGAAACGGAAGGAACCTGACCATCCGCGATCCTGAT
ACTGAACGTAGCAACCCTACTTCTCATTCTACTGCCGAGATTGATAGCTCAAGTAACATTTCTAATTCTGTACATTTCTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAAATTTTCCCACCTCACGAGAACGTCAGAAGAAGGAATGAGCTCATGAGCTTCCAGCAAAAAGATAGAGATAACCTACATGACTCGTGGAGTAGGTTCAAGAT
GATGGTCAAAGCATGCCCTCACAATGGCATCCCCGAATGCATATTGATGGAGGTTTTCTATTTTGGCTTGAACAAGGCGACACAGCAGACTGCTGATGCTGTGTTTGTAA
GTTGTATGTTGAAAAACTCATACAACCAGATTAAGGCAACACAGGACACGATGACCAGCAATAATGAAGAATGGGATGAAGATGATTTCGACAATCGCAAAGGAAGACGT
GGACAAAGCGAAGAAGGTATGGATAAGAACGTCGTGGTGGCGTTGCAAGGACAAATGACTGCAATGAACAATCTTCTCAAATCTATGGCAATATCGCAAGTCAATGCCGC
AAGAGGCTCTGTGCAAGCGACAAATCAAATTGATGATATGGGATGTGTGGGATGCGATGGTCCTCATAATACTAACGCATGCCCACTCAATACAGAAACAGCCGACTTCG
TAAGGAATGGCCCTTTCTTTAACACTTACAACCTTGGTTGGACAAACCATCCTAATTTTGGATGGGGAGGAACGGGTCAACAAAATCAAGGACGACATGGTGGTCAAGGT
GACCATCAAGGGGAAGCATCTGGCTCCCACGTGAGGTACCACAACAACAGGCCACAACACTCCCAACATCAACAACAACCCACCACCACTTCTTCATCTATGGAGAACCT
CCTCCACGAATACATGCAGAAAAATGATGCCCTTCTGCAAAGCCAAGCTTCATCAATTCACAATCTGGAAGTGCAATTAGGGCAGCTAGCCAGCGACTTCGCCGGAAGAC
CACAAGGATCCCTCCCAAGTAATATAGAAATGCCAAACCAGGCGGGGGGATCTGGAAAAGAGAGTGTGACACTACGAAACGGAAGGAACCTGACCATCCGCGATCCTGAT
ACTGAACGTAGCAACCCTACTTCTCATTCTACTGCCGAGATTGATAGCTCAAGTAACATTTCTAATTCTGTACATTTCTCTTAA
Protein sequenceShow/hide protein sequence
MKKIFPPHENVRRRNELMSFQQKDRDNLHDSWSRFKMMVKACPHNGIPECILMEVFYFGLNKATQQTADAVFVSCMLKNSYNQIKATQDTMTSNNEEWDEDDFDNRKGRR
GQSEEGMDKNVVVALQGQMTAMNNLLKSMAISQVNAARGSVQATNQIDDMGCVGCDGPHNTNACPLNTETADFVRNGPFFNTYNLGWTNHPNFGWGGTGQQNQGRHGGQG
DHQGEASGSHVRYHNNRPQHSQHQQQPTTTSSSMENLLHEYMQKNDALLQSQASSIHNLEVQLGQLASDFAGRPQGSLPSNIEMPNQAGGSGKESVTLRNGRNLTIRDPD
TERSNPTSHSTAEIDSSSNISNSVHFS