; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0028443 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0028443
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr07:10960291..10962863
RNA-Seq ExpressionPI0028443
SyntenyPI0028443
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERM93404.1 hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]3.0e-3531.23Show/hide
Query:  AAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLT
        A Q  NP  +  D  R IR YA+P     + G   P   +  QFE+K VM QM+Q VGQ+ G   EDPH H+RSF  +  SF +  +S + L   LFP +
Subjt:  AAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLT

Query:  LRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHE
        LRD A+ W                                                                 N+L    V  W  L +KF++K+FPP  
Subjt:  LRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHE

Query:  NVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVD-----AVFIKA------TLDSMTRNNEEWDEDDFSSCRGD
        N + R E++ FQQ + E+  DAW RFK +++ CPH+GIP C+ ME FY  LN  ++  +D     A+  K+       L+++  NN +W     S+ R  
Subjt:  NVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVD-----AVFIKA------TLDSMTRNNEEWDEDDFSSCRGD

Query:  RRRSEKG-LNKNVVVALQGQMTAMDNLLKSMAI
          R   G L  + + AL  QM +M N+LK+++I
Subjt:  RRRSEKG-LNKNVVVALQGQMTAMDNLLKSMAI

WP_217833177.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.5e-4535.42Show/hide
Query:  AAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLT
        A  + NP  + HD +RP+R YASPNLYNF+ G   P F+ N +FE+K VM+QM+Q  GQ+GG   EDPH H++SF  IC++F +  +    +   LFP +
Subjt:  AAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLT

Query:  LRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHE
        LRDEA++W                                                                A S E  E+ TW ++++KFM+K+FPP  
Subjt:  LRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHE

Query:  NVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDA-----------VFIKATLDSMTRNNEE
        + +RR+++V F+QKD E   +AW+RFK +V+ CPHNGIP C+ ME+FY  LNK +Q   DA              K  LD ++RN  +
Subjt:  NVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDA-----------VFIKATLDSMTRNNEE

XP_038880527.1 uncharacterized protein LOC120072192 [Benincasa hispida]2.7e-3932.21Show/hide
Query:  MTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLTLRDEAKRWA
        M ++  RP+R YASP LY+FS G  YP+ D   +FE+K VM+QM+Q   Q+GG   EDPH H++ F   C  F +P I+P+++   LFP +LRD+AK+W 
Subjt:  MTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLTLRDEAKRWA

Query:  NDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHENVRRRKELV
                                                                        +SLE +E+ TWE+L++KFM+K+FPP  N RRR+E++
Subjt:  NDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHENVRRRKELV

Query:  GFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDAVFIKATLDS-----------MTRNNEEWDEDDFSSCRGDRRRSEKGLN
         F+Q+D E L  A  RF  +VK CP++ +   + ME FY  LN+ +Q   DA   +  +D            + ++N EW +D +   R DRRR  + +N
Subjt:  GFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDAVFIKATLDS-----------MTRNNEEWDEDDFSSCRGDRRRSEKGLN

Query:  K---NVVVALQGQMTAMDNLLKSMAI
            N +  L  Q+  M +LL+++ +
Subjt:  K---NVVVALQGQMTAMDNLLKSMAI

XP_038887458.1 uncharacterized protein LOC120077591 [Benincasa hispida]3.1e-4834.86Show/hide
Query:  MTNNNNDRNVPPNQAAQE--QNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHV
        M +NNN+     NQ      Q+P ++  D + PIR+YA+PNLY+FS G + P+ +ENA+FEIK VM+QMIQN+ Q+     E+PH H+  F  +C++F +
Subjt:  MTNNNNDRNVPPNQAAQE--QNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHV

Query:  PDISPKELCFVLFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTW
        P I+P  +   LFP TLRD+AKRW                                                                A+SLE  E+ + 
Subjt:  PDISPKELCFVLFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTW

Query:  EQLIKKFMKKFFPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDAVFI-----------KATLDSMTR
        +QL++ FMKKFFPP  N RRRK ++ F++ D E L  AW RF+ +VK CPH GI +C+LME+FY  LN+ TQ   DA  +           K  LD ++ 
Subjt:  EQLIKKFMKKFFPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDAVFI-----------KATLDSMTR

Query:  NNEEWDEDDFSSCRGDRRRSEKGL-NKNVVVALQGQMTAMDNLLKSMAIS
        N ++W +D +     +RRR++  +   N +  L  QM  + +LL+ MA++
Subjt:  NNEEWDEDDFSSCRGDRRRSEKGL-NKNVVVALQGQMTAMDNLLKSMAIS

XP_038889363.1 uncharacterized protein LOC120079279 [Benincasa hispida]6.1e-3631.42Show/hide
Query:  DENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYED
        +EN +F+IK VM+QM+QN GQ+GG   ED H H+ SF  +C++F +  ++P+ +   LFP TLRDEA  W                              
Subjt:  DENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYED

Query:  GMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGI
                                          A+SLE  E+ +W+QL++ FMKKFFPP  N RRRK+++ F+Q + E L   W   + +VK C H GI
Subjt:  GMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGI

Query:  PECMLMEVFYFELNKGTQQTVDAVFIKATLDS-----------MTRNNEEWDEDDFSSCRGDRRRSEKGL-NKNVVVALQGQMTAMDNLLKSMAIS
        P+C+LM+ FY  LN+ TQ   DA   +  +D            ++RN ++  +D +     +RRR++  +   + +  L  QM A+ +LL++MA++
Subjt:  PECMLMEVFYFELNKGTQQTVDAVFIKATLDS-----------MTRNNEEWDEDDFSSCRGDRRRSEKGL-NKNVVVALQGQMTAMDNLLKSMAIS

TrEMBL top hitse value%identityAlignment
A0A5A7V1F3 Retrotrans_gag domain-containing protein2.7e-2932.31Show/hide
Query:  VPPNQAAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFV
        VP NQA      TY+ H+L RPIRSYA P+LY F+ G AYP F ENA +E K                                                
Subjt:  VPPNQAAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFV

Query:  LFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKF
               D+AKRW                                                                ANS+E  EV TWE LI+KFMKKF
Subjt:  LFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKF

Query:  FPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDAVF-----------IKATLDSMTRNNEE-------
        FP  +  +RR++L+ F+Q+DR+NLHDAWS FK MVKAC H+GI + +LME FYF L+K T+Q+ D++F           IKA LDSM  N+++       
Subjt:  FPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDAVF-----------IKATLDSMTRNNEE-------

Query:  WDEDDFSSCRGDRRRSEKGLNKNVV
         +E D + C G        LN  +V
Subjt:  WDEDDFSSCRGDRRRSEKGLNKNVV

A0A5A7V2U7 Putative disease resistance RPP13-like protein 14.4e-3263.96Show/hide
Query:  NNNNDRNVPPNQAAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDIS
        NN N++ +P NQAAQ  NPTY+  D DRPIRSYASPNLY+F+ G AYP F EN +FEIKLVM+Q+IQN GQ+  H  +DPH++IR+FYSICASFH+P IS
Subjt:  NNNNDRNVPPNQAAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDIS

Query:  PKELCFVLFPL
         +EL F  F L
Subjt:  PKELCFVLFPL

A0A6J1EEI2 uncharacterized protein LOC1114333942.7e-2928.61Show/hide
Query:  NNDRNVPPNQAAQEQ---NPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDI
        N +   P   A QE+   N  ++  D +R IR+YA P +   +     P   +   FE+K VM QM+Q +GQ+ G   EDPH H++SF  +  SF    +
Subjt:  NNDRNVPPNQAAQEQ---NPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDI

Query:  SPKELCFVLFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQL
            +   LFP +LRD AK W N          + LG                                                        + +W  L
Subjt:  SPKELCFVLFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQL

Query:  IKKFMKKFFPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDA----VFIKAT-------LDSMTRNNE
        ++KF+ K+FPP  N R R E+V FQQ + + L +AW RFK M++ CPH+G+P C+ ME FY  LN  T+Q VDA      +  T       L+ +  NN 
Subjt:  IKKFMKKFFPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDA----VFIKAT-------LDSMTRNNE

Query:  EWDEDDFSSCRGDRRRSEKG-LNKNVVVALQGQMTAMDNLLKSMAI
        +W     +  R +  R  +G L  + + ++  Q+ ++ N+L+++A+
Subjt:  EWDEDDFSSCRGDRRRSEKG-LNKNVVVALQGQMTAMDNLLKSMAI

A0A6J1H7E4 uncharacterized protein LOC1114611684.6e-2929.02Show/hide
Query:  MTNNNNDRNVPPNQAAQEQ---NPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFH
        M   N +   P   A QE+   N   +  D +R IR+YA P +   +     P   +   FE+K VM QM+Q +GQ+ G   EDPH H++SF  +  SF 
Subjt:  MTNNNNDRNVPPNQAAQEQ---NPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFH

Query:  VPDISPKELCFVLFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGT
           +    +   LFP +LRD AK W                                                                 N+L  + + +
Subjt:  VPDISPKELCFVLFPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGT

Query:  WEQLIKKFMKKFFPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDA----VFIKAT-------LDSMT
        W  L +KF+ K+FPP  N R R E+V FQQ + E L +AW RFK M++ CPH+G+P C+ ME FY  LN  T+Q VDA      +  T       L+ + 
Subjt:  WEQLIKKFMKKFFPPHENVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDA----VFIKAT-------LDSMT

Query:  RNNEEWDEDDFSSCRGDRRRSEKGLNKNVVVALQGQMTAMDNLLKSMA
         NN +W   D  S  G + R    L  + + ++  Q+ ++ N+L+++A
Subjt:  RNNEEWDEDDFSSCRGDRRRSEKGLNKNVVVALQGQMTAMDNLLKSMA

U5CUI2 Retrotrans_gag domain-containing protein1.5e-3531.23Show/hide
Query:  AAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLT
        A Q  NP  +  D  R IR YA+P     + G   P   +  QFE+K VM QM+Q VGQ+ G   EDPH H+RSF  +  SF +  +S + L   LFP +
Subjt:  AAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVLFPLT

Query:  LRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHE
        LRD A+ W                                                                 N+L    V  W  L +KF++K+FPP  
Subjt:  LRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHE

Query:  NVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVD-----AVFIKA------TLDSMTRNNEEWDEDDFSSCRGD
        N + R E++ FQQ + E+  DAW RFK +++ CPH+GIP C+ ME FY  LN  ++  +D     A+  K+       L+++  NN +W     S+ R  
Subjt:  NVRRRKELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVD-----AVFIKA------TLDSMTRNNEEWDEDDFSSCRGD

Query:  RRRSEKG-LNKNVVVALQGQMTAMDNLLKSMAI
          R   G L  + + AL  QM +M N+LK+++I
Subjt:  RRRSEKG-LNKNVVVALQGQMTAMDNLLKSMAI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAACAACAACAACGACCGAAACGTTCCTCCTAATCAAGCTGCCCAAGAACAGAACCCTACATATATGACCCACGACCTAGATAGACCAATTAGGTCGTATGCTTC
GCCAAATCTTTACAACTTTAGCTCAGGAAATGCCTACCCTGTGTTTGATGAAAATGCACAATTTGAGATTAAACTTGTTATGATACAGATGATACAGAACGTCGGACAGT
ATGGTGGCCACCTGTATGAAGATCCACACGAGCACATTCGAAGCTTTTACTCCATCTGTGCTTCTTTTCACGTGCCTGACATATCGCCTAAAGAGTTGTGTTTTGTCCTC
TTCCCATTGACATTGAGGGATGAGGCAAAAAGATGGGCAAATGATGGGATATTTATTGTTCTATTTAGTTTTGTTGTGTTGGGTGTTTGCATATTTGATTTACTTAAGCT
TAATCTATATGAAGATGGAATGGTATGCGATCTAACTTGTATGCGAAGAGGGAAGGAACATGCTACAATTAGGCGATCTGCTTGTATTGCTATAATAACTTGTATAAAAT
ACGAGTCAGCAAACTCTTTAGAGGACAAAGAGGTTGGAACATGGGAACAATTGATAAAAAAGTTTATGAAGAAGTTCTTCCCACCCCATGAAAATGTCAGAAGAAGAAAG
GAACTTGTAGGTTTTCAGCAGAAGGATAGGGAAAACCTGCACGACGCGTGGAGTAGGTTTAAATGCATGGTGAAAGCGTGCCCCCACAATGGCATTCCTGAATGCATGTT
GATGGAGGTATTCTATTTTGAGCTGAACAAAGGAACACAACAAACTGTTGACGCTGTGTTCATTAAGGCGACGCTGGACTCTATGACCAGAAATAATGAAGAATGGGATG
AAGATGATTTCAGCTCTTGCCGAGGAGATAGAAGAAGAAGCGAGAAAGGATTGAATAAGAACGTCGTGGTGGCGTTACAAGGCCAGATGACTGCAATGGATAATCTGCTG
AAATCGATGGCGATTTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCAACAACAACAACGACCGAAACGTTCCTCCTAATCAAGCTGCCCAAGAACAGAACCCTACATATATGACCCACGACCTAGATAGACCAATTAGGTCGTATGCTTC
GCCAAATCTTTACAACTTTAGCTCAGGAAATGCCTACCCTGTGTTTGATGAAAATGCACAATTTGAGATTAAACTTGTTATGATACAGATGATACAGAACGTCGGACAGT
ATGGTGGCCACCTGTATGAAGATCCACACGAGCACATTCGAAGCTTTTACTCCATCTGTGCTTCTTTTCACGTGCCTGACATATCGCCTAAAGAGTTGTGTTTTGTCCTC
TTCCCATTGACATTGAGGGATGAGGCAAAAAGATGGGCAAATGATGGGATATTTATTGTTCTATTTAGTTTTGTTGTGTTGGGTGTTTGCATATTTGATTTACTTAAGCT
TAATCTATATGAAGATGGAATGGTATGCGATCTAACTTGTATGCGAAGAGGGAAGGAACATGCTACAATTAGGCGATCTGCTTGTATTGCTATAATAACTTGTATAAAAT
ACGAGTCAGCAAACTCTTTAGAGGACAAAGAGGTTGGAACATGGGAACAATTGATAAAAAAGTTTATGAAGAAGTTCTTCCCACCCCATGAAAATGTCAGAAGAAGAAAG
GAACTTGTAGGTTTTCAGCAGAAGGATAGGGAAAACCTGCACGACGCGTGGAGTAGGTTTAAATGCATGGTGAAAGCGTGCCCCCACAATGGCATTCCTGAATGCATGTT
GATGGAGGTATTCTATTTTGAGCTGAACAAAGGAACACAACAAACTGTTGACGCTGTGTTCATTAAGGCGACGCTGGACTCTATGACCAGAAATAATGAAGAATGGGATG
AAGATGATTTCAGCTCTTGCCGAGGAGATAGAAGAAGAAGCGAGAAAGGATTGAATAAGAACGTCGTGGTGGCGTTACAAGGCCAGATGACTGCAATGGATAATCTGCTG
AAATCGATGGCGATTTCGTAA
Protein sequenceShow/hide protein sequence
MTNNNNDRNVPPNQAAQEQNPTYMTHDLDRPIRSYASPNLYNFSSGNAYPVFDENAQFEIKLVMIQMIQNVGQYGGHLYEDPHEHIRSFYSICASFHVPDISPKELCFVL
FPLTLRDEAKRWANDGIFIVLFSFVVLGVCIFDLLKLNLYEDGMVCDLTCMRRGKEHATIRRSACIAIITCIKYESANSLEDKEVGTWEQLIKKFMKKFFPPHENVRRRK
ELVGFQQKDRENLHDAWSRFKCMVKACPHNGIPECMLMEVFYFELNKGTQQTVDAVFIKATLDSMTRNNEEWDEDDFSSCRGDRRRSEKGLNKNVVVALQGQMTAMDNLL
KSMAIS