; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0009269 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0009269
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr05:14854756..14856309
RNA-Seq ExpressionPI0009269
SyntenyPI0009269
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048713.1 hypothetical protein E6C27_scaffold43G00050 [Cucumis melo var. makuwa]3.2e-5343.81Show/hide
Query:  LRDEAKRFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNR------RGGRSKGDECMDKNV
        L D  KRFK+M+K CPH+ IP+C+LME FYFGL+K T Q+A+ VF  GML+S+YNQIK MLDTMASN++EW ++ FG+R      +G R + ++ +D ++
Subjt:  LRDEAKRFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNR------RGGRSKGDECMDKNV

Query:  VVALQGQMTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIVAYGGQGDHRGEASGSHARYHNNRPQHFHHQQQPTTTASST
        +VALQGQ+  M N+L+SMA+ QVN    SV    Q++EMGCVGC  PHNT+ACPLNTEIVA                 Y  N P   H      TT    
Subjt:  VVALQGQMTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIVAYGGQGDHRGEASGSHARYHNNRPQHFHHQQQPTTTASST

Query:  SSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCSHSNSNSTAEIGSSS
                         + SQASSI+N+E+QLGQL SDFS RP+RS PSNTETPNQ         +          T +E   K S+  ++    IG   
Subjt:  SSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCSHSNSNSTAEIGSSS

Query:  KIPSSIHFPLTDNVS
        K+  S  F +  ++S
Subjt:  KIPSSIHFPLTDNVS

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]1.6e-6036.89Show/hide
Query:  ADSEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLT
        A +E N   +A D  R IR YAAP     NPGI  P   +   FE+KPVM QM+Q   QFGG P EDPH HIRSF  +  SF + G+S E LR  LFP +
Subjt:  ADSEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLT

Query:  LRDEAK------------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFA
        LRD A+                                                      RFK++++ CPH+GIP CI +E FY GLN A++   DA   
Subjt:  LRDEAK------------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFA

Query:  DGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVH--AANQIDEMGCVGCGGPHNT
          +L  +YN+   +L+ +ASNN +W      NR     K    ++ + + AL  QM +M N+LK+M        GGSV   AA Q  +  CV CG  H  
Subjt:  DGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVH--AANQIDEMGCVGCGGPHNT

Query:  DACPLNTEIVAY-GGQGDHRGEASGSHA-----RYHNN----------RPQHFHHQQQP----TTTASSTSSMENLLREYMQKNDALLQSQASSIRNLEV
        + CP N   V Y G Q  +R     S++     ++H N           P  F  Q +P        S TSS+E+L+R+YM KND ++QSQA+S+RNLEV
Subjt:  DACPLNTEIVAY-GGQGDHRGEASGSHA-----RYHNN----------RPQHFHHQQQP----TTTASSTSSMENLLREYMQKNDALLQSQASSIRNLEV

Query:  QLGQLASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCSHSNSNSTAEIGSSSKIP--SSIHFPLTDNVSS
        QLGQLA+D   RPQ +LPS+TE P +     KE C AVTLRSG+   I    A       +S  + G   K P  S++  P     SS
Subjt:  QLGQLASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCSHSNSNSTAEIGSSSKIP--SSIHFPLTDNVSS

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]1.4e-5635.25Show/hide
Query:  IAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAK---
        +  D  R IR YAAP     NPGI  P   +  +FE+KPVM QM+Q   QF   P EDPH H+RSF  +  SF + G+S E  R  LFP +LRD A+   
Subjt:  IAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAK---

Query:  ---------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYN
                                                           RFK++++ CPH+GIP CI ME FY GLN  +Q   DA     +L  +YN
Subjt:  ---------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYN

Query:  QIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIVA
        +   +L+T+ASNN +W       R  G  K    ++ + + AL  QM +M N+LK+++I   N+      AA Q D++ CV C   H  + CP N E V 
Subjt:  QIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIVA

Query:  YGGQGD---HRGEASGSHARYHNNRP----------QHFHHQQQPTTTA-------------SSTSSMENLLREYMQKNDALLQSQASSIRNLEVQLGQL
        Y G  +   + G  S S+ +   N P          +HF   +Q                  S  SS+E+L+R+YM KNDA++QSQA+ +RNLE+QLG L
Subjt:  YGGQGD---HRGEASGSHARYHNNRP----------QHFHHQQQPTTTA-------------SSTSSMENLLREYMQKNDALLQSQASSIRNLEVQLGQL

Query:  ASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCS
        A++   RPQ SLPS+TE P +     KE+C ++ LRSG++L   E + K S
Subjt:  ASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCS

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]2.4e-5637.12Show/hide
Query:  NPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAK-----------------------
        NPGI  P   +   FE+KPVM QM+Q   QFGG P EDPH HI SF  +  SF + G+S E LR  LFP +LRD A+                       
Subjt:  NPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAK-----------------------

Query:  -------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDD
                                       RFK++++ CPH+GIP CI +E FY GLN A++   DA     +L  +YN+   +L+ +ASNN +W    
Subjt:  -------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDD

Query:  FGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVH--AANQIDEMGCVGCGGPHNTDACPLNTEIVAYGGQ--------------
          NR     K    ++ + + AL  QM +M N+LK+M        GGSV   AA Q  E+ CV CG  H  + CP N   V Y G               
Subjt:  FGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVH--AANQIDEMGCVGCGGPHNTDACPLNTEIVAYGGQ--------------

Query:  ------------GDHRGEASGSHARYHNNRPQHFHHQQQPTTTASSTSSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTET
                    G     +SG+  +   + P  F  QQ      S TSS+E+L+R+YM KNDA++QSQA+S+RNLEVQLGQLA+D   RPQ +LPS+TE 
Subjt:  ------------GDHRGEASGSHARYHNNRPQHFHHQQQPTTTASSTSSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTET

Query:  PNQAEGSRKEKCHAVTLRSGR----NLTIRE
        P +     KE C AVTLRSG+    N+  RE
Subjt:  PNQAEGSRKEKCHAVTLRSGR----NLTIRE

XP_038896595.1 uncharacterized protein LOC120084850 [Benincasa hispida]1.3e-5435.66Show/hide
Query:  IRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRF---------
        +R+YAAPN YNF P I+ P   ENA F+I+ +MLQMIQNA +FGG  GEDPH H+  F  IC +F +  +SPE +R  +FP TLRDEA+R+         
Subjt:  IRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEAKRF---------

Query:  --------KKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDE--CMDKNVVVAL
                +++VK C H GIP C+LME  Y GL+++ Q  ADA  A+G++  TY + K +LD ++ N +EW  + +   RG   +  E   +  + +  L
Subjt:  --------KKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDE--CMDKNVVVAL

Query:  QGQMTTMNNLLKSMAISQ--VNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIV--------------------AYGGQGDHRGEASGSHARYH--
          QMTTM  LL++MAI Q  ++ +    +A   +  + CV CG  H+ + CPLN + V                     +   G+H   +  +H   H  
Subjt:  QGQMTTMNNLLKSMAISQ--VNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIV--------------------AYGGQGDHRGEASGSHARYH--

Query:  --NNRPQHFHHQQQP--------------TTTASSTSSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNT
           + P   +    P              + T+ STSS+E LL+ Y++KN+ + QSQASS+RNLE Q+ QL  +   +    LPS++
Subjt:  --NNRPQHFHHQQQP--------------TTTASSTSSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNT

TrEMBL top hitse value%identityAlignment
A0A5D3CC26 Uncharacterized protein1.5e-5343.81Show/hide
Query:  LRDEAKRFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNR------RGGRSKGDECMDKNV
        L D  KRFK+M+K CPH+ IP+C+LME FYFGL+K T Q+A+ VF  GML+S+YNQIK MLDTMASN++EW ++ FG+R      +G R + ++ +D ++
Subjt:  LRDEAKRFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNR------RGGRSKGDECMDKNV

Query:  VVALQGQMTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIVAYGGQGDHRGEASGSHARYHNNRPQHFHHQQQPTTTASST
        +VALQGQ+  M N+L+SMA+ QVN    SV    Q++EMGCVGC  PHNT+ACPLNTEIVA                 Y  N P   H      TT    
Subjt:  VVALQGQMTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIVAYGGQGDHRGEASGSHARYHNNRPQHFHHQQQPTTTASST

Query:  SSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCSHSNSNSTAEIGSSS
                         + SQASSI+N+E+QLGQL SDFS RP+RS PSNTETPNQ         +          T +E   K S+  ++    IG   
Subjt:  SSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCSHSNSNSTAEIGSSS

Query:  KIPSSIHFPLTDNVS
        K+  S  F +  ++S
Subjt:  KIPSSIHFPLTDNVS

A0A5D3D2S0 Uncharacterized protein2.9e-4452.04Show/hide
Query:  MLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACP
        ML+S+Y QIKT LD + +N++E  +DD   +   R + D  MD+NV+VALQGQ+T M  LL+SMA+SQV+  G  V A  Q+DEM  VGCG PH TDAC 
Subjt:  MLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACP

Query:  LNTEIVAY---------GGQGDHRGEASGSHARYHNNRPQHF--HHQQQPTTTASSTSSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERP
        LN EI AY            G++RGEA   H + H +RP +    HQQ  T T SS SSM  LLR+YMQ+ DA +QSQ +SI NLE+ LGQLA DFS RP
Subjt:  LNTEIVAY---------GGQGDHRGEASGSHARYHNNRPQHF--HHQQQPTTTASSTSSMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERP

Query:  QRSLPSNTETPNQAEGSRKEK
          SLPSN E PN   G  K K
Subjt:  QRSLPSNTETPNQAEGSRKEK

A0A6J1EEI2 uncharacterized protein LOC1114333942.3e-4130.62Show/hide
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEA
        NA ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q   QF G P EDPH H++SF  +  SF    +  + +R +LFP +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEA

Query:  K------------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK
        K                                                      RFK+M++ CPH+G+P CI ME FY GLN AT+Q  DA     +L 
Subjt:  K------------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK

Query:  STYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAA---NQIDEMGCVGCGGPHNTDACP
         TYN+   +L+ +ASNN +W   D  +  G +++G   ++ + + ++  Q+ ++ N+L+++A+ Q +     VH     NQ     CV CG  H  D CP
Subjt:  STYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAA---NQIDEMGCVGCGGPHNTDACP

Query:  LNTEIVAYGGQGDHRG-----------------------EASGSHARY---HNNRPQHFHHQQQ---------------PTTTASSTSSMENLLREYMQK
         N   + Y G    +G                       +  GS+ +      N P  F  Q Q               P    +  +S+E+L++EYM K
Subjt:  LNTEIVAYGGQGDHRG-----------------------EASGSHARY---HNNRPQHFHHQQQ---------------PTTTASSTSSMENLLREYMQK

Query:  NDALLQSQASSIRNLEVQ
        ND ++Q+Q +S+RNLEVQ
Subjt:  NDALLQSQASSIRNLEVQ

A0A6J1EQ90 uncharacterized protein LOC1114364114.6e-4230.42Show/hide
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSI-------CASFHMLGISPEELRFALFP
        N  ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q   QF G P EDPH H++SF  +         SF   G+  + +R +LFP
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSI-------CASFHMLGISPEELRFALFP

Query:  LTLRDEAK------------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAV
          LRD AK                                                      RFK+M++ CPH+G+P CI ME FY GLN  T+Q  DA 
Subjt:  LTLRDEAK------------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAV

Query:  FADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAA---NQIDEMGCVGCGGP
            +L  TYN+   +L+ +ASNN +W   D  +  G +++G   ++ + + ++  Q+ ++ N+L+++A+ Q +     VH A   NQ     CV CG  
Subjt:  FADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAA---NQIDEMGCVGCGGP

Query:  HNTDACPLNTEIVAYGGQGDHRGEASGSHA--------RYHNN---RPQHFHHQQQP----------------------------TTTASSTS--SMENL
        H  D CP N   + Y G    +G    +          R H N   + Q  ++QQ P                            TT A  TS  S+E+L
Subjt:  HNTDACPLNTEIVAYGGQGDHRGEASGSHA--------RYHNN---RPQHFHHQQQP----------------------------TTTASSTS--SMENL

Query:  LREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQAEGSRKE
        ++EYM KNDA++QSQ +S+RNLEVQ+G   +           ++T+  N+    +KE
Subjt:  LREYMQKNDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQAEGSRKE

A0A6J1G7Q6 uncharacterized protein LOC1114515981.5e-4530.93Show/hide
Query:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEA
        NA ++A D +R IR+YA P +   NP I  P   +   FE+KPVM QM+Q   QF G   +DPH H++SF  +  SF   G+  + +R + F  +LRD A
Subjt:  NAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEELRFALFPLTLRDEA

Query:  K------------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK
        K                                                      RFK+ ++ CPH+G+P CI +E FY GLN AT+Q  DA     +L 
Subjt:  K------------------------------------------------------RFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLK

Query:  STYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAAN---QIDEMGCVGCGGPHNTDACP
         TYN+   +L+ +ASNN +W +     R     K  E ++ + + ++  Q+ +M N+L+++A  Q +      H A    Q     CV CG  H  D CP
Subjt:  STYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQMTTMNNLLKSMAISQVNTAGGSVHAAN---QIDEMGCVGCGGPHNTDACP

Query:  LNTEIVAYGG----QGDHRGEASGS----------------HARYHNNRP-------------QHFHHQQQPTTTASSTSS--------MENLLREYMQK
         N   + Y G    QG+ +   S +                   Y+   P             Q  +  QQ TT    TS         +E+L++EYM +
Subjt:  LNTEIVAYGG----QGDHRGEASGS----------------HARYHNNRP-------------QHFHHQQQPTTTASSTSS--------MENLLREYMQK

Query:  NDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQ
        NDA++QSQ  S+RNLEVQ+GQLA++   RP   LP++TE P +
Subjt:  NDALLQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCTTACATCTAGAAGGATGGAAAATAACAACAGAAATACCCCTCCGCCACATGCTGACTCAGAACCAAATGCTGCTTACATAGCACACGACTTGGATAGGCCGAT
TAGATCTTATGCGGCACCCAACCTCTACAACTTTAACCCAGGAATCGCCTACCCTGCATTCGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAGATGATTC
AGAACGCCAGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCACATGCTAGGCATTTCACCTGAAGAA
TTGAGATTCGCCCTCTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTTCAAAAAGATGGTCAAAGCATGTCCCCACAATGGCATTCCTAAATGCATATTGATGGAAGT
TTTCTATTTCGGACTAAACAAGGCTACACAGCAGACTGCTGACGCTGTGTTTGCAGACGGTATGCTAAAAAGCACATACAACCAGATTAAGACGATGTTAGATACGATGG
CCAGCAACAATGAGGAATGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGATCAAAAGGTGATGAATGCATGGATAAGAACGTCGTGGTGGCGTTACAGGGACAA
ATGACTACGATGAACAATCTACTCAAATCAATGGCAATATCGCAGGTCAACACCGCAGGAGGCTCTGTGCACGCGGCTAACCAAATTGATGAAATGGGATGCGTGGGATG
CGGTGGTCCTCATAACACCGACGCATGCCCACTCAATACTGAAATTGTCGCGTACGGTGGTCAAGGTGACCATCGCGGGGAAGCATCTGGCTCCCACGCGAGGTACCACA
ACAATAGACCCCAACATTTCCATCATCAACAACAACCCACCACCACCGCTTCGTCCACCTCGTCTATGGAAAACCTCCTCCGCGAATATATGCAGAAAAATGATGCTCTT
CTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGGCAGCTCGCCAGCGATTTCTCCGAAAGACCGCAAAGGTCCCTCCCAAGCAATACAGAAACACC
AAATCAGGCGGAGGGATCCAGAAAAGAGAAGTGTCACGCGGTGACACTACGCAGCGGAAGAAATTTAACTATCCGCGAGCCTGATGCTAAATGTAGCCACTCTAATTCTA
ACTCTACTGCCGAGATTGGCAGTTCAAGTAAAATTCCTTCTTCAATACATTTCCCTTTAACTGATAATGTTTCTTCCTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTGCTTACATCTAGAAGGATGGAAAATAACAACAGAAATACCCCTCCGCCACATGCTGACTCAGAACCAAATGCTGCTTACATAGCACACGACTTGGATAGGCCGAT
TAGATCTTATGCGGCACCCAACCTCTACAACTTTAACCCAGGAATCGCCTACCCTGCATTCGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAGATGATTC
AGAACGCCAGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCACATGCTAGGCATTTCACCTGAAGAA
TTGAGATTCGCCCTCTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTTCAAAAAGATGGTCAAAGCATGTCCCCACAATGGCATTCCTAAATGCATATTGATGGAAGT
TTTCTATTTCGGACTAAACAAGGCTACACAGCAGACTGCTGACGCTGTGTTTGCAGACGGTATGCTAAAAAGCACATACAACCAGATTAAGACGATGTTAGATACGATGG
CCAGCAACAATGAGGAATGGGATGAAGATGATTTCGGCAATCGCCGAGGAGGACGATCAAAAGGTGATGAATGCATGGATAAGAACGTCGTGGTGGCGTTACAGGGACAA
ATGACTACGATGAACAATCTACTCAAATCAATGGCAATATCGCAGGTCAACACCGCAGGAGGCTCTGTGCACGCGGCTAACCAAATTGATGAAATGGGATGCGTGGGATG
CGGTGGTCCTCATAACACCGACGCATGCCCACTCAATACTGAAATTGTCGCGTACGGTGGTCAAGGTGACCATCGCGGGGAAGCATCTGGCTCCCACGCGAGGTACCACA
ACAATAGACCCCAACATTTCCATCATCAACAACAACCCACCACCACCGCTTCGTCCACCTCGTCTATGGAAAACCTCCTCCGCGAATATATGCAGAAAAATGATGCTCTT
CTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGGCAGCTCGCCAGCGATTTCTCCGAAAGACCGCAAAGGTCCCTCCCAAGCAATACAGAAACACC
AAATCAGGCGGAGGGATCCAGAAAAGAGAAGTGTCACGCGGTGACACTACGCAGCGGAAGAAATTTAACTATCCGCGAGCCTGATGCTAAATGTAGCCACTCTAATTCTA
ACTCTACTGCCGAGATTGGCAGTTCAAGTAAAATTCCTTCTTCAATACATTTCCCTTTAACTGATAATGTTTCTTCCTCGTAG
Protein sequenceShow/hide protein sequence
MLLTSRRMENNNRNTPPPHADSEPNAAYIAHDLDRPIRSYAAPNLYNFNPGIAYPAFGENARFEIKPVMLQMIQNARQFGGHPGEDPHEHIRSFYSICASFHMLGISPEE
LRFALFPLTLRDEAKRFKKMVKACPHNGIPKCILMEVFYFGLNKATQQTADAVFADGMLKSTYNQIKTMLDTMASNNEEWDEDDFGNRRGGRSKGDECMDKNVVVALQGQ
MTTMNNLLKSMAISQVNTAGGSVHAANQIDEMGCVGCGGPHNTDACPLNTEIVAYGGQGDHRGEASGSHARYHNNRPQHFHHQQQPTTTASSTSSMENLLREYMQKNDAL
LQSQASSIRNLEVQLGQLASDFSERPQRSLPSNTETPNQAEGSRKEKCHAVTLRSGRNLTIREPDAKCSHSNSNSTAEIGSSSKIPSSIHFPLTDNVSSS