; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0025851 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0025851
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome locationchr07:15624003..15625575
RNA-Seq ExpressionPI0025851
SyntenyPI0025851
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_024949903.1 uncharacterized protein LOC112496715 [Citrus sinensis]3.5e-5737.64Show/hide
Query:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT
        +++R  S +LRD  + W N+L    + TW  L +KF+ K+FPP +NA+ R E+ SF Q + E+L D+W RFK +++ CPH+ IP CI +E  Y GLN++T
Subjt:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT

Query:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRG------------------RSEEGMDKNVVAVNEIDDMGCVGCNSPHNTDACPL
        +   +    G +L  SYN+    L+ +ANNN +W        RG  G                  +  + M      VN+I DM CV C   H  D CP 
Subjt:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRG------------------RSEEGMDKNVVAVNEIDDMGCVGCNSPHNTDACPL

Query:  NTEIIAFVKN-------DPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQ--HQNQQQHTTTTPSTSSSMENLLREYMQKNN
        N   + +V N       +P+SNTYN GWR H NF W        Q+      SG       NRP  P   +Q  Q+  +      SS+E L+++Y+ +N 
Subjt:  NTEIIAFVKN-------DPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQ--HQNQQQHTTTTPSTSSSMENLLREYMQKNN

Query:  ALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGRNLTIR
        A++QS  +S+RNLE Q+GQL +  S RPQGS PSNTE P + G   KE C V+ LRSGR L ++
Subjt:  ALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGRNLTIR

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]8.7e-5637.67Show/hide
Query:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT
        ++++    +LRD  + W N L    V  W+ L EKF++K+FPP  NA+ R E++SFQQ + E   D+W RFK +++ CPH+ IP CI +E FY GLN A+
Subjt:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT

Query:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRG--------------RSEEGMDKNV--------VAVNEIDDMGCVGCNSPHNTD
        +   +    G +L  SYN+    L+ +A+NN +W  +     R   G               S   + KN+         A  +     CV C   H  +
Subjt:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRG--------------RSEEGMDKNV--------VAVNEIDDMGCVGCNSPHNTD

Query:  ACPLNTEIIAFV-------KNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQK
         CP N   + +V        N+P+SN+YN  W++HPNF WGG          +G+ S         RPQ P HQ Q   T       SS+E+L+R+YM K
Subjt:  ACPLNTEIIAFV-------KNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQK

Query:  NNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGR
        N+ ++QSQA S+RNLEVQLGQL +D   RPQG+ PS+TE P + G   KE C  VTLRSG+
Subjt:  NNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGR

XP_030497851.1 uncharacterized protein LOC115713509 [Cannabis sativa]1.3e-5637.64Show/hide
Query:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT
        ++++    +LRD  + W N L    +  W+ L EKF++K+FPP  NA+ R E++SFQQ + E   D+W RFK +++ CPH+ IP CI +E FY GLN A+
Subjt:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT

Query:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSE-----------------------EGMDKNVVAVNEIDDMGCVGCNSPHNT
        +   +    G +L  SYN+    L+ +A+NN +W  +     R   G  E                        G  + VVA+     + CV C   H  
Subjt:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSE-----------------------EGMDKNVVAVNEIDDMGCVGCNSPHNT

Query:  DACPLNTEIIAFV-------KNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQ
        + CP N   + +V        N+P+SN+YN  W++HPNF WGG G        +G+ S         RPQ P HQ Q        S +SS+E+L+R+YM 
Subjt:  DACPLNTEIIAFV-------KNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQ

Query:  KNNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGRNL
        K+NA++QSQA  +RNLE+QLGQL +D   RPQG+ PS+TE P + G   KE C+ VTLRSG+ L
Subjt:  KNNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGRNL

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]8.7e-5636.57Show/hide
Query:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT
        ++++    +LRD  + W N L    V  W+ L E F++K+FPP  NA+ R E++SFQQ + E   D+W RFK +++ CPH+ IP CI +E FY GLN A+
Subjt:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT

Query:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRG--------------RSEEGMDKNV--------VAVNEIDDMGCVGCNSPHNTD
        +   +    G +L  SYN+    L+ +A+NN +W  +     R   G               S   + KN+         A  +  ++ CV C   H  +
Subjt:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRG--------------RSEEGMDKNV--------VAVNEIDDMGCVGCNSPHNTD

Query:  ACPLNTEIIAFV-------KNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQK
         CP N   + +V        N+P+SN+YN  W++HPNF WGG G        +G+   S     + +P  PQ            S +SS+E+L+R+YM K
Subjt:  ACPLNTEIIAFV-------KNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQK

Query:  NNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGR
        N+A++QSQA S+RNLEVQLGQL +D   RPQG+ PS+TE P +     KE C  VTLRSG+
Subjt:  NNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGR

XP_038902511.1 uncharacterized protein LOC120089170 [Benincasa hispida]3.4e-6036.99Show/hide
Query:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT
        I++      L+DE KRWA++LE  ++  WDQL+E+FMKKFFPP  NARR+ ++++F+  + E L  +W RF+R+VK CPH +I +C+LME FY GL ++ 
Subjt:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT

Query:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEGMDKNVVAVN-------------------------------------EI
        Q  A      G +   Y + K  LD +  N ++W     DN  GGRG      +  ++ V+                                     ++
Subjt:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEGMDKNVVAVN-------------------------------------EI

Query:  DDMGCVGCNSPHNTDACPLNTEIIAFVKNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDD--HRGEASGSHARYHN--NRPQHPQHQNQQQHTTTTPST-S
          + C  C   H+ + CP N + +  ++N+P++NTYN GWRNHPNF WGG    GGQ +  +  E  G+   +H   N+  H   Q+  Q +++  ST S
Subjt:  DDMGCVGCNSPHNTDACPLNTEIIAFVKNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDD--HRGEASGSHARYHN--NRPQHPQHQNQQQHTTTTPST-S

Query:  SSMENLLREYMQKNNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKC
        SS+E LL++Y++KN+A++QSQA SIRNLEVQ+GQL ++   R  G  PSN+E P   G + KE+C
Subjt:  SSMENLLREYMQKNNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKC

TrEMBL top hitse value%identityAlignment
A0A5A7V1F3 Retrotrans_gag domain-containing protein3.3e-5346.53Show/hide
Query:  YMAHDLDRPIR-------------IKIRALSVNL----RDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFK
        Y+AH+L RPIR             I       N     +D+ KRWAN++E G+V TW+ LIEKFMKKFFP  + A+RR++LI F+Q+DR+NL+D+WS FK
Subjt:  YMAHDLDRPIR-------------IKIRALSVNL----RDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFK

Query:  RMVKACPHNDIPECILMEVFYFGLNKATQQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEGMDKNVVAVNEIDDMGCVGCN
        RMVKAC H+ I + +LME FYFGL+K T+Q+A+ +F+GG+L+SSYNQIKA LD+MANN+++                      +  AV  +++M C GC 
Subjt:  RMVKACPHNDIPECILMEVFYFGLNKATQQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEGMDKNVVAVNEIDDMGCVGCN

Query:  SPHNTDACPLNTEIIAFVKNDPFSNTYNLGWRNHPNFGWGGTGRH
         PHNTDACPLN EI+A+VK DP        +   P    GG G+H
Subjt:  SPHNTDACPLNTEIIAFVKNDPFSNTYNLGWRNHPNFGWGGTGRH

A0A5B6VWJ0 Retroelement pol polyprotein-like3.9e-4633.62Show/hide
Query:  NLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKATQQTANVVF
        +LRD  + W N+L    + TW +L E+F+ K+F P +NA+ R E+ +F   D E+LY++W RFK +++ CPH+ IP CI +E FY GL   T+   +   
Subjt:  NLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKATQQTANVVF

Query:  VGGMLKSSYNQIKATLDTMANNNEEW--------------DEDDFDNRRGGRGRSEEGMDKNVVA----------VNEIDDMGCVGCNSPHNTDACPLNT
         G +L  SYN+    ++ +A+NN +W               E D       +  S   M KN+             N+ +++  V C   H  + CP N 
Subjt:  VGGMLKSSYNQIKATLDTMANNNEEW--------------DEDDFDNRRGGRGRSEEGMDKNVVA----------VNEIDDMGCVGCNSPHNTDACPLNT

Query:  EIIAFVKNDP--------FSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQKNNALL
        E + ++ N           SN YN  WRNH +F W   G            +G+   Y   RP    +  QQ         S+S+E+LL+ YM KN+AL+
Subjt:  EIIAFVKNDP--------FSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQKNNALL

Query:  QSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRS
        QSQA +++NLE Q+GQL ++   R QG+ PS+TE P   G   KE C  +TLRS
Subjt:  QSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRS

A0A5D3CC26 Uncharacterized protein1.8e-5141.22Show/hide
Query:  ISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKATQQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNR------RGGRG
        ++F+Q+DRENL D W RFKRM+K CPH+ IPEC+LME FYFGL+K T Q+AN+VF GGML+SSYNQIK  LDTMA+N++EW ++ F +R      +G RG
Subjt:  ISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKATQQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNR------RGGRG

Query:  RSEEGMDKNVVA-----------------------------VNEIDDMGCVGCNSPHNTDACPLNTEIIAFVKNDPFSNTYNLGWRNHPNFGWGGTGRHG
        R E+G+D +++                              V ++++MGCVGC +PHNT+ACPLNTEI+A++KNDP             +  WGG     
Subjt:  RSEEGMDKNVVA-----------------------------VNEIDDMGCVGCNSPHNTDACPLNTEIIAFVKNDPFSNTYNLGWRNHPNFGWGGTGRHG

Query:  GQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQKNNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQ
                                                                 + SQA SI+N+E+QLGQLTSDFS RP+ SFPSNTETPNQ
Subjt:  GQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQKNNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQ

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.8e-5134.75Show/hide
Query:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT
        I++R    +LRD+ K W N+L +G + TW+ L +KF+ KFFPP + A+ R ++ SF Q D E+LY++W RFK +++ CPH+ IP+ + ++ FY GL  + 
Subjt:  IKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKAT

Query:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEGMDKNVVAV----NEIDDMG----------CVGCNSPHNTDACPLNTEI
        +   +    G ++  +       L+ MA+NN +W  +   +R+       + +      V     ++D +G          C  C   H+ D CP N+E 
Subjt:  QQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEGMDKNVVAV----NEIDDMG----------CVGCNSPHNTDACPLNTEI

Query:  IAFV------KNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQKNNALLQSQA
        + FV      +N+P+SNTYN GWRNHPNF W                  ++A   N +P  P    QQQ     P   S +E LL +Y+ K +A++QSQ 
Subjt:  IAFV------KNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLLREYMQKNNALLQSQA

Query:  LSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGRNL
         S+RNLE Q+GQL +  + RPQGS PS+T    Q     KE+C  +TLRSG+ +
Subjt:  LSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGRNL

A0A6J1DW02 uncharacterized protein LOC1110248975.7e-4535.31Show/hide
Query:  NLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKATQQTANVVF
        +L+D+ +   NA   G + TW  L+EKF+ KFFPP  +A  R+E+ISF+Q DRE ++++W RFK +++ C ++ +P C  +E F+ GL+  T+   N   
Subjt:  NLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHENARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKATQQTANVVF

Query:  VGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG---------MDKNVVAVNEIDDMGCVGCNSPHNTDACPLNTEIIAFVKNDPFSNTY
         G   K ++N+I   L+ +A++NE W      +R   + +   G         M K +V +N+      +G  +P  T   P+ ++   +  + P     
Subjt:  VGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEG---------MDKNVVAVNEIDDMGCVGCNSPHNTDACPLNTEIIAFVKNDPFSNTY

Query:  NL-GWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQH--PQHQNQQQHTTTTP--STSSSMENLLREYMQKNNALLQSQALSIRNLEVQLGQLT
        +L  WR+HPNF WGG G  G    ++G++  +   Y     QH  P  Q   Q T T P  + +S++EN+++EYM + +A++QSQA S+RN   QLG L 
Subjt:  NL-GWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQH--PQHQNQQQHTTTTP--STSSSMENLLREYMQKNNALLQSQALSIRNLEVQLGQLT

Query:  SDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSG
        ++   RPQGSFP +TE P + G   KE+C  VTLRSG
Subjt:  SDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGATAGTGAACAACCAGAATTCGAGCTTGACCTTGAGATTGAACAAACTTTTCGGCATAACCGACGAAGAAGGAGGCAGAAAAACGCACAACGAATGAAGAATAA
TAACAATCGGAATGTACCTCCGCCTCAAGCTACCCAAGAACAGAACGTCGTCTACATGGCACACGACTTGGATAGACCAATTAGAATTAAGATTCGCGCTCTTTCCGTTA
ACCTGAGAGACGAGGTGAAAAGGTGGGCAAACGCCTTGGAAGATGGTAAGGTGGGAACGTGGGACCAATTAATAGAAAAATTTATGAAGAAATTCTTCCCACCTCACGAG
AATGCCAGAAGAAGGAAGGAGCTCATCAGCTTCCAACAGAAGGATAGAGAGAACCTATATGACTCGTGGAGTAGGTTCAAGAGGATGGTTAAAGCATGCCCCCACAATGA
CATTCCTGAATGCATATTGATGGAGGTGTTCTATTTTGGTTTGAACAAGGCGACGCAGCAGACTGCTAATGTTGTGTTTGTAGGTGGTATGCTGAAGAGCTCATACAACC
AGATTAAGGCAACGCTGGACACGATGGCCAACAATAACGAAGAATGGGATGAAGATGATTTCGACAATCGTCGAGGAGGACGAGGACGAAGTGAAGAAGGAATGGACAAG
AACGTCGTGGCGGTGAATGAAATTGATGACATGGGATGTGTGGGATGCAACAGTCCTCATAATACTGACGCATGCCCACTAAATACAGAAATTATCGCGTTCGTAAAGAA
CGACCCTTTCTCAAACACCTATAACCTTGGTTGGAGAAATCATCCTAACTTTGGATGGGGAGGAACGGGGCGACATGGTGGTCAAGATGATCATCGTGGGGAAGCATCTG
GCTCCCACGCGAGGTACCACAACAATAGGCCACAACATCCCCAACATCAAAATCAACAGCAACACACCACCACTACTCCATCCACTTCTTCATCTATGGAAAACCTCCTC
CGCGAGTATATGCAGAAAAATAATGCCCTTTTGCAAAGCCAAGCTTTATCAATTCGCAATTTGGAAGTACAATTAGGGCAGTTAACCAGCGACTTCTCTGGAAGGCCGCA
AGGATCCTTCCCAAGCAATACAGAAACGCCAAATCAGGCAGGGGGATCTAGAAAAGAGAAGTGTCACGTGGTGACACTACGAAGCGGAAGGAATTTGACCATCCGCGATC
CTGATTTTGAACGTAGCACTCTCATTTCTCACTCTACTTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGATAGTGAACAACCAGAATTCGAGCTTGACCTTGAGATTGAACAAACTTTTCGGCATAACCGACGAAGAAGGAGGCAGAAAAACGCACAACGAATGAAGAATAA
TAACAATCGGAATGTACCTCCGCCTCAAGCTACCCAAGAACAGAACGTCGTCTACATGGCACACGACTTGGATAGACCAATTAGAATTAAGATTCGCGCTCTTTCCGTTA
ACCTGAGAGACGAGGTGAAAAGGTGGGCAAACGCCTTGGAAGATGGTAAGGTGGGAACGTGGGACCAATTAATAGAAAAATTTATGAAGAAATTCTTCCCACCTCACGAG
AATGCCAGAAGAAGGAAGGAGCTCATCAGCTTCCAACAGAAGGATAGAGAGAACCTATATGACTCGTGGAGTAGGTTCAAGAGGATGGTTAAAGCATGCCCCCACAATGA
CATTCCTGAATGCATATTGATGGAGGTGTTCTATTTTGGTTTGAACAAGGCGACGCAGCAGACTGCTAATGTTGTGTTTGTAGGTGGTATGCTGAAGAGCTCATACAACC
AGATTAAGGCAACGCTGGACACGATGGCCAACAATAACGAAGAATGGGATGAAGATGATTTCGACAATCGTCGAGGAGGACGAGGACGAAGTGAAGAAGGAATGGACAAG
AACGTCGTGGCGGTGAATGAAATTGATGACATGGGATGTGTGGGATGCAACAGTCCTCATAATACTGACGCATGCCCACTAAATACAGAAATTATCGCGTTCGTAAAGAA
CGACCCTTTCTCAAACACCTATAACCTTGGTTGGAGAAATCATCCTAACTTTGGATGGGGAGGAACGGGGCGACATGGTGGTCAAGATGATCATCGTGGGGAAGCATCTG
GCTCCCACGCGAGGTACCACAACAATAGGCCACAACATCCCCAACATCAAAATCAACAGCAACACACCACCACTACTCCATCCACTTCTTCATCTATGGAAAACCTCCTC
CGCGAGTATATGCAGAAAAATAATGCCCTTTTGCAAAGCCAAGCTTTATCAATTCGCAATTTGGAAGTACAATTAGGGCAGTTAACCAGCGACTTCTCTGGAAGGCCGCA
AGGATCCTTCCCAAGCAATACAGAAACGCCAAATCAGGCAGGGGGATCTAGAAAAGAGAAGTGTCACGTGGTGACACTACGAAGCGGAAGGAATTTGACCATCCGCGATC
CTGATTTTGAACGTAGCACTCTCATTTCTCACTCTACTTTCTAG
Protein sequenceShow/hide protein sequence
MSDSEQPEFELDLEIEQTFRHNRRRRRQKNAQRMKNNNNRNVPPPQATQEQNVVYMAHDLDRPIRIKIRALSVNLRDEVKRWANALEDGKVGTWDQLIEKFMKKFFPPHE
NARRRKELISFQQKDRENLYDSWSRFKRMVKACPHNDIPECILMEVFYFGLNKATQQTANVVFVGGMLKSSYNQIKATLDTMANNNEEWDEDDFDNRRGGRGRSEEGMDK
NVVAVNEIDDMGCVGCNSPHNTDACPLNTEIIAFVKNDPFSNTYNLGWRNHPNFGWGGTGRHGGQDDHRGEASGSHARYHNNRPQHPQHQNQQQHTTTTPSTSSSMENLL
REYMQKNNALLQSQALSIRNLEVQLGQLTSDFSGRPQGSFPSNTETPNQAGGSRKEKCHVVTLRSGRNLTIRDPDFERSTLISHSTF