; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0009091 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0009091
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr06:19900446..19901680
RNA-Seq ExpressionPI0009091
SyntenyPI0009091
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]3.1e-6739.95Show/hide
Query:  PTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQL
        P     NPGI  P   +   FE+K VM QM+Q  GQFG  P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L
Subjt:  PTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQL

Query:  IEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNE
         EKF++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+    L+ +ASNN 
Subjt:  IEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNE

Query:  EWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMR--CVGCGGHPNTDACPLNTET--------------
        +W      NR     K    ++  A+ AL  QM +M N+LK+M +      G SV  A  I   +  CV CG     + CP N  +              
Subjt:  EWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMR--CVGCGGHPNTDACPLNTET--------------

Query:  -------------------------------------QP-TTTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASDFSGRQQGSLPINT
                                             QP     S TS +ESL+R+YM KND ++QSQA+S+RNLEVQLGQLA+D   R QG+LP +T
Subjt:  -------------------------------------QP-TTTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASDFSGRQQGSLPINT

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.9e-6438.06Show/hide
Query:  PTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQL
        P     NPGI  P   +  +FE+K VM QM+Q  GQF E P EDPH H+RSF  +  SF + G+S E  R  LFP +LRD A+ W N L    V  W+  
Subjt:  PTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQL

Query:  IEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNE
         EKF++K+FPP  NA+ R E+MSF Q + E+  DAW RFK +++ CPH+GIP CI ME FY GLN  +Q   DA     +L  +YN+    L+T+ASNN 
Subjt:  IEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNE

Query:  EWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMRCVGCGGHPNTDACPLNTET----------------
        +W       R  G  K    ++  A+ AL  QM +M N+LK+++I   N +     AA Q DD+ CV C      + CP N E+                
Subjt:  EWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMRCVGCGGHPNTDACPLNTET----------------

Query:  ----------QPTTT--------------------------------ASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASDFSGRQQGSLPI
                   P  +                                 S  S +ESL+R+YM KND ++QSQA+ +RNLE+QLG LA++   R QGSLP 
Subjt:  ----------QPTTT--------------------------------ASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASDFSGRQQGSLPI

Query:  NT
        +T
Subjt:  NT

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]3.3e-6138.64Show/hide
Query:  PTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQL
        P     NPGI  P   +   FE+K VM QM+Q  GQFG  P EDPH HIRSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L
Subjt:  PTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQL

Query:  IEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNE
         EKF++K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY GLN A +   DA     +L  +YN+    L+ +ASNN 
Subjt:  IEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNE

Query:  EWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVH--AANQIDDMRCVGCG-GHP-------------------NTDAC
        +W      NR     K    ++  A+ AL  QM +M N+LK+M +      G SV   AA Q  ++ CV CG GH                    N +  
Subjt:  EWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVH--AANQIDDMRCVGCG-GHP-------------------NTDAC

Query:  PLNTETQPT-------------------------------------------TTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASD
        P +    P                                               S TS +ESL+R+YM +ND ++QSQA+S+RNLEVQLGQLA+D
Subjt:  PLNTETQPT-------------------------------------------TTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASD

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]1.7e-6539.55Show/hide
Query:  NPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMK
        NPGI  P   +   FE+K VM QM+Q  GQFG  P EDPH HI SF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L E F++
Subjt:  NPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMK

Query:  KFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNEEWDEDD
        K+FPP  NA+ R E+MSFQQ + E   DAW RFK +++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+    L+ +ASNN +W    
Subjt:  KFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNEEWDEDD

Query:  FGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVH--AANQIDDMRCVGCGGHPNTDACPLNTET--------------------
          NR     K    ++  A+ AL  QM +M N+LK+M +      G SV   AA Q  ++ CV CG     + CP N  +                    
Subjt:  FGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVH--AANQIDDMRCVGCGGHPNTDACPLNTET--------------------

Query:  ------------------------------------QP-TTTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASDFSGRQQGSLPINT
                                            QP     S TS +ESL+R+YM KND ++QSQA+S+RNLEVQLGQLA+D   R QG+LP +T
Subjt:  ------------------------------------QP-TTTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASDFSGRQQGSLPINT

XP_038889363.1 uncharacterized protein LOC120079279 [Benincasa hispida]9.7e-6940.54Show/hide
Query:  ENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPYENAR
        EN RF+IK VMLQM+QN GQFG    ED H H+ SF  +C++F + G++ E +R  LFP TLRDEA  WA++LE  E+ +WDQL+E FMKKFFPP  NAR
Subjt:  ENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKKFFPPYENAR

Query:  RRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGNRRGGRAK
        RRK++++F+Q + E L   W   +R+VK C H GIP C+LM+ FY GLN++TQ  ADA +  G +  TY + K  L  ++ N ++  +D +G R   R +
Subjt:  RRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGNRRGGRAK

Query:  GDDG-MDKSAVVALQGQMTAMNNLLKSMTISQ--VNVEGSSVHAANQIDDMRCVGCGG----HPN-------TDACPLNTET------------------
         D+  +    +  L  QM A+ +LL++M ++Q  ++   +  +A  Q+  + CV CGG    HPN           P N ++                  
Subjt:  GDDG-MDKSAVVALQGQMTAMNNLLKSMTISQ--VNVEGSSVHAANQIDDMRCVGCGG----HPN-------TDACPLNTET------------------

Query:  ----QP------TTTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASDFSGRQQGSLPINT
            QP      + T++++S +ESLL++Y++KND ++QSQ SSIRNLE+Q+GQLA++   R  G+LP N+
Subjt:  ----QP------TTTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQLASDFSGRQQGSLPINT

TrEMBL top hitse value%identityAlignment
A0A5D3CC26 Uncharacterized protein8.0e-5350.62Show/hide
Query:  MSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGNR------RGGRA
        M+F+Q+DRENL D W RFKRM+K CPH+ IP+C+LME FYFGL+K T Q+A+ V   GML+S+YNQIK  LDTMASN++EW ++ FG+R      +G R 
Subjt:  MSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGNR------RGGRA

Query:  KGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMRCVGCGGHPNTDACPLNTETQPTTTASSTSPMESLLREYMQKNDPL-----
        + +DG+D S +VALQGQ+  M N+L+SM + QVNV  SSV    Q+++M CVGC    NT+ACPLNTE                +  Y+ KNDP      
Subjt:  KGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMRCVGCGGHPNTDACPLNTETQPTTTASSTSPMESLLREYMQKNDPL-----

Query:  -------LQSQASSIRNLEVQLGQLASDFSGRQQGSLPINT
               + SQASSI+N+E+QLGQL SDFS R + S P NT
Subjt:  -------LQSQASSIRNLEVQLGQLASDFSGRQQGSLPINT

A0A6J1EEI2 uncharacterized protein LOC1114333948.0e-5333.42Show/hide
Query:  HPTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQ
        HP     NP I  P   +   FE+K VM QM+Q  GQF   P EDPH H++SF  +  SF    +  + +R +LFP +LRD AK W N L  G + +W+ 
Subjt:  HPTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQ

Query:  LIEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNN
        L+EKF+ K+FPP  NAR R E++ FQQ + + L +AW RFK M++ CPH+G+P CI ME FY GLN AT+Q  DA     +L  TYN+    L+ +ASNN
Subjt:  LIEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNN

Query:  EEWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAA---NQIDDMRCVGCGGHPNTDACP-----------------
         +W   D  +  G + +G   ++  A+ ++  Q+ ++ N+L+++ + Q ++  + VH     NQ     CV CG     D CP                 
Subjt:  EEWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAA---NQIDDMRCVGCGGHPNTDACP-----------------

Query:  ------------------------------------------------------LNTETQPTTTASST--SPMESLLREYMQKNDPLLQSQASSIRNLEV
                                                              +NT+ +    A  T  + +ESL++EYM KND ++Q+Q +S+RNLEV
Subjt:  ------------------------------------------------------LNTETQPTTTASST--SPMESLLREYMQKNDPLLQSQASSIRNLEV

Query:  Q
        Q
Subjt:  Q

A0A6J1EQ90 uncharacterized protein LOC1114364118.0e-5333.66Show/hide
Query:  HPTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSI-------CASFHMPGISLEELRFALFPLTLRDEAKRWANALEDG
        HP     NP I  P   +   FE+K VM QM+Q  GQF   P EDPH H++SF  +         SF   G+  + +R +LFP  LRD AK W N L  G
Subjt:  HPTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSI-------CASFHMPGISLEELRFALFPLTLRDEAKRWANALEDG

Query:  EVGTWDQLIEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTL
         + +W+ L E F+ K+FPP  NAR + E+++FQQ + E L +A  RFK M++ CPH+G+P CI ME FY GLN  T+Q  DA     +L  TYN+    L
Subjt:  EVGTWDQLIEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTL

Query:  DTMASNNEEWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAA---NQIDDMRCVGCGGHPNTDACP----------
        + +ASNN +W   D  +  G + +G   ++  A+ ++  Q+ ++ N+L+++ + Q ++  + VH A   NQ     CV CG     D CP          
Subjt:  DTMASNNEEWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAA---NQIDDMRCVGCGGHPNTDACP----------

Query:  -------------------------------------------------------------LNTETQPTTTASSTS--PMESLLREYMQKNDPLLQSQAS
                                                                     +NT+ + TT A  TS   +ESL++EYM KND ++QSQ +
Subjt:  -------------------------------------------------------------LNTETQPTTTASSTS--PMESLLREYMQKNDPLLQSQAS

Query:  SIRNLEVQLG
        S+RNLEVQ+G
Subjt:  SIRNLEVQLG

A0A6J1G7Q6 uncharacterized protein LOC1114515984.3e-5432.78Show/hide
Query:  HPTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQ
        HP     NP I  P   +   FE+K VM QM+Q  GQF     +DPH H++SF  +  SF   G+  + +R + F  +LRD AK W N L  G + +W+ 
Subjt:  HPTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQ

Query:  LIEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNN
        L EKF+ K+FPP  +AR R E+++FQ+ + E L +AW RFK  ++ CPH+G+P CI +E FY GLN AT+Q  DA     +L  TYN+    L+ +ASNN
Subjt:  LIEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNN

Query:  EEWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAAN---QIDDMRCVGCGGHPNTDACPLNTET------------
         +W +     R     K  + ++  A+ ++  Q+ +M N+L+++   Q ++  +  H A    Q     CV CG     D CP N  +            
Subjt:  EEWDEDDFGNRRGGRAKGDDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAAN---QIDDMRCVGCGGHPNTDACPLNTET------------

Query:  -----------------------------------------------------QPTTTASSTSP--------MESLLREYMQKNDPLLQSQASSIRNLEV
                                                             Q TT    TS         +ESL++EYM +ND ++QSQ  S+RNLEV
Subjt:  -----------------------------------------------------QPTTTASSTSP--------MESLLREYMQKNDPLLQSQASSIRNLEV

Query:  QLGQLASDFSGRQQGSLPINT
        Q+GQLA++   R  G LP +T
Subjt:  QLGQLASDFSGRQQGSLPINT

U5CUI2 Retrotrans_gag domain-containing protein1.1e-5743.51Show/hide
Query:  PTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQL
        P     NPGI  P   +  +FE+K VM QM+Q  GQF   P EDPH H+RSF  +  SF + G+S E LR  LFP +LRD A+ W N L    V  W+ L
Subjt:  PTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQL

Query:  IEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNE
         EKF++K+FPP  NA+ R E+MSFQQ + E+  DAW RFK +++ CPH+GIP CI ME FY GLN A++   DA     +L  +YN+    L+T+ASNN 
Subjt:  IEKFMKKFFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNE

Query:  EWDEDDFGNRRGGRAKGDDG-MDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMRCVGCGGHPNTDACPLNTET
        +W      N R   ++   G ++  A+ AL  QM +M N+LK+++I   N +     AA Q DD+ CV CG     + CP N E+
Subjt:  EWDEDDFGNRRGGRAKGDDG-MDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMRCVGCGGHPNTDACPLNTET

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGCACCCAACCTCTACAACTTTCAACCCAGGAATCGCCTACCCTGTATTCGGCGAAAACGCCAGGTTTGAAATCAAACATGTTATGCTTCAAATGATTCAGAACGC
CGGACAATTCGGCGAACATCCTAGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACTTGAAGAATTAAGAT
TCGCTCTCTTCCCGTTAACTCTGAGGGATGAGGCGAAAAGGTGGGCAAATGCTCTAGAAGATGGCGAGGTGGGAACATGGGATCAACTAATAGAGAAATTTATGAAGAAA
TTTTTCCCACCTTACGAAAACGCTAGAAGAAGGAAAGAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAA
AGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTATTTCGGACTAAACAAGGCTACACAGCAGACTGCTGATGCTGTGCTTGTAGACGGTATGC
TAAAGAGCACATACAACCAGATTAAGACGACGCTGGACACGATGGCCAGCAACAATGAAGAATGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGGT
GATGATGGCATGGATAAAAGCGCCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTCAAATCAATGACAATATCGCAAGTCAACGTCGAAGGAAGCTC
TGTGCACGCGGCTAACCAAATTGATGACATGAGATGCGTGGGATGCGGCGGTCATCCTAACACTGACGCATGCCCACTCAATACTGAAACCCAGCCCACCACCACCGCCT
CATCCACCTCTCCCATGGAAAGTCTCCTCCGCGAATACATGCAGAAAAATGATCCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAG
CTCGCTAGTGATTTCTCCGGAAGACAGCAAGGATCCCTCCCAATCAATACATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGGCACCCAACCTCTACAACTTTCAACCCAGGAATCGCCTACCCTGTATTCGGCGAAAACGCCAGGTTTGAAATCAAACATGTTATGCTTCAAATGATTCAGAACGC
CGGACAATTCGGCGAACATCCTAGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACTTGAAGAATTAAGAT
TCGCTCTCTTCCCGTTAACTCTGAGGGATGAGGCGAAAAGGTGGGCAAATGCTCTAGAAGATGGCGAGGTGGGAACATGGGATCAACTAATAGAGAAATTTATGAAGAAA
TTTTTCCCACCTTACGAAAACGCTAGAAGAAGGAAAGAGCTTATGAGCTTCCAGCAGAAGGATAGAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAA
AGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTATTTCGGACTAAACAAGGCTACACAGCAGACTGCTGATGCTGTGCTTGTAGACGGTATGC
TAAAGAGCACATACAACCAGATTAAGACGACGCTGGACACGATGGCCAGCAACAATGAAGAATGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGGT
GATGATGGCATGGATAAAAGCGCCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTCAAATCAATGACAATATCGCAAGTCAACGTCGAAGGAAGCTC
TGTGCACGCGGCTAACCAAATTGATGACATGAGATGCGTGGGATGCGGCGGTCATCCTAACACTGACGCATGCCCACTCAATACTGAAACCCAGCCCACCACCACCGCCT
CATCCACCTCTCCCATGGAAAGTCTCCTCCGCGAATACATGCAGAAAAATGATCCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAG
CTCGCTAGTGATTTCTCCGGAAGACAGCAAGGATCCCTCCCAATCAATACATAA
Protein sequenceShow/hide protein sequence
MRHPTSTTFNPGIAYPVFGENARFEIKHVMLQMIQNAGQFGEHPREDPHEHIRSFYSICASFHMPGISLEELRFALFPLTLRDEAKRWANALEDGEVGTWDQLIEKFMKK
FFPPYENARRRKELMSFQQKDRENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVLVDGMLKSTYNQIKTTLDTMASNNEEWDEDDFGNRRGGRAKG
DDGMDKSAVVALQGQMTAMNNLLKSMTISQVNVEGSSVHAANQIDDMRCVGCGGHPNTDACPLNTETQPTTTASSTSPMESLLREYMQKNDPLLQSQASSIRNLEVQLGQ
LASDFSGRQQGSLPINT