; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028535 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028535
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:24571654..24573716
RNA-Seq ExpressionLag0028535
SyntenyLag0028535
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.7e-2235.32Show/hide
Query:  PQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIV---KEDGF-QN----FPHAAYNE-----------MDVAP--SDEQLSDAVRDVGIEGAQWQLSK
        P F+   I  HGW  FC  P +    +VREFYAN++   +E  F QN    F   A N            +D A   +DEQL   + +V IEGA WQ+S 
Subjt:  PQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIV---KEDGF-QN----FPHAAYNE-----------MDVAP--SDEQLSDAVRDVGIEGAQWQLSK

Query:  TQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC--WKKVGKLFFPNTITMLCSKAGVLVDEGDVILFD
            T     LKR A  W  F+  R +P+TH  T++++RVLL ++IL  +S+N+  I   EI  C   +K G L+FP+ IT L  KA V   + + I+ +
Subjt:  TQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC--WKKVGKLFFPNTITMLCSKAGVLVDEGDVILFD

Query:  KGIIDKSNLARLQRMQEV
         G I   +++R+ + + V
Subjt:  KGIIDKSNLARLQRMQEV

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.2e-2033.08Show/hide
Query:  ASEEHDAIE-EQQLPFDRFANNFSIAKYAELLKRDFLFERGF-------SGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANI---VKEDGFQNF
        A + H A++ E +    R+ NN        +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   V+   +   
Subjt:  ASEEHDAIE-EQQLPFDRFANNFSIAKYAELLKRDFLFERGF-------SGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANI---VKEDGFQNF

Query:  PHAAYNEMDV-------APSDE-----------QLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAIL
           +++E  +        P DE            L   +  V + GA+W +S     T   + L   A  W  F++  +LPTTH  T+S++R+LL  ++L
Subjt:  PHAAYNEMDV-------APSDE-----------QLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAIL

Query:  RSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--SKAGVLVDEGDVILFDKGIIDKSNLARL
           SINVGR+I SEI  C  +K G LFFP+ IT LC  ++A  LV+E    L + G ID   +AR+
Subjt:  RSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--SKAGVLVDEGDVILFDKGIIDKSNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.2e-2930.1Show/hide
Query:  ASEEHDAIE-EQQLPFDRFANNFSIAKYAELLKRDFLFERGF-------SGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKED-------G
        A + H A++ E +    R+ NN        +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +       G
Subjt:  ASEEHDAIE-EQQLPFDRFANNFSIAKYAELLKRDFLFERGF-------SGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKED-------G

Query:  FQ--------------NFPHAAYNEMDVAPSDEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAIL
         Q                P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F++ R+LPTTH  T+S++R+LL  ++L
Subjt:  FQ--------------NFPHAAYNEMDVAPSDEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAIL

Query:  RSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--SKAGVLVDEGDVILFDKGIIDKSNLARL---------QRMQEVR---------QGGLIYNIN
           SINVGR+I SEI  C  +K G LFFP+ IT LC  ++A  LV+E    L + G ID   +AR+         Q+    R          G ++  + 
Subjt:  RSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--SKAGVLVDEGDVILFDKGIIDKSNLARL---------QRMQEVR---------QGGLIYNIN

Query:  MILEQLALSTSRQ-------EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEENDDE
         + ++L+    +Q       +   +Q   FW Y K RD +L++ALQ NF++P P  PAFP+++L         E +++  +E
Subjt:  MILEQLALSTSRQ-------EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEENDDE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.9e-2133.49Show/hide
Query:  LKRDFLFERGFSGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKEDGFQNFPHAAYNEMDVA----------PSDE-----------QLSDA
        ++++F+++     + P F+   I+ H W+LFCA PE     +VREFY N+   D    +       + V           P DE           +L   
Subjt:  LKRDFLFERGFSGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKEDGFQNFPHAAYNEMDVA----------PSDE-----------QLSDA

Query:  VRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--
        +  V I GA+W +S     T   + L   A  W  F++ R+LPTTH  T+S+E V L +++L   SINVGR+I  EI  C  +K G LFFP+ IT +C  
Subjt:  VRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--

Query:  SKAGVLVDE
        ++A  LV+E
Subjt:  SKAGVLVDE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]6.9e-2433.89Show/hide
Query:  SDEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC-WKKVGKLFFPN
        ++ +L   +  V   GA+W +S     T   + L   A  W  F++ R+LPTTH   +S++R+LL  ++L   SINVGR+I SEI  C  +K G LFFP+
Subjt:  SDEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC-WKKVGKLFFPN

Query:  TITMLCSKAGVLVDEGDVILFDKGIIDKSNLARL--------------QRMQEVRQGGLIYNINMILEQLALSTSRQEFAERQTLTFWNYVKNRDASLRR
         IT LC  A  LV+E    L + G ID   +AR+               R           ++   L+ L    S+QE   +Q   FW Y K RD +L++
Subjt:  TITMLCSKAGVLVDEGDVILFDKGIIDKSNLARL--------------QRMQEVRQGGLIYNINMILEQLALSTSRQEFAERQTLTFWNYVKNRDASLRR

Query:  ALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEENDDE
        ALQ NF++P P  PAFP+++L         E +++  +E
Subjt:  ALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEENDDE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)5.9e-2133.08Show/hide
Query:  ASEEHDAIE-EQQLPFDRFANNFSIAKYAELLKRDFLFERGF-------SGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANI---VKEDGFQNF
        A + H A++ E +    R+ NN        +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   V+   +   
Subjt:  ASEEHDAIE-EQQLPFDRFANNFSIAKYAELLKRDFLFERGF-------SGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANI---VKEDGFQNF

Query:  PHAAYNEMDV-------APSDE-----------QLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAIL
           +++E  +        P DE            L   +  V + GA+W +S     T   + L   A  W  F++  +LPTTH  T+S++R+LL  ++L
Subjt:  PHAAYNEMDV-------APSDE-----------QLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAIL

Query:  RSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--SKAGVLVDEGDVILFDKGIIDKSNLARL
           SINVGR+I SEI  C  +K G LFFP+ IT LC  ++A  LV+E    L + G ID   +AR+
Subjt:  RSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--SKAGVLVDEGDVILFDKGIIDKSNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)2.0e-2930.1Show/hide
Query:  ASEEHDAIE-EQQLPFDRFANNFSIAKYAELLKRDFLFERGF-------SGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKED-------G
        A + H A++ E +    R+ NN        +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +       G
Subjt:  ASEEHDAIE-EQQLPFDRFANNFSIAKYAELLKRDFLFERGF-------SGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKED-------G

Query:  FQ--------------NFPHAAYNEMDVAPSDEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAIL
         Q                P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F++ R+LPTTH  T+S++R+LL  ++L
Subjt:  FQ--------------NFPHAAYNEMDVAPSDEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAIL

Query:  RSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--SKAGVLVDEGDVILFDKGIIDKSNLARL---------QRMQEVR---------QGGLIYNIN
           SINVGR+I SEI  C  +K G LFFP+ IT LC  ++A  LV+E    L + G ID   +AR+         Q+    R          G ++  + 
Subjt:  RSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--SKAGVLVDEGDVILFDKGIIDKSNLARL---------QRMQEVR---------QGGLIYNIN

Query:  MILEQLALSTSRQ-------EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEENDDE
         + ++L+    +Q       +   +Q   FW Y K RD +L++ALQ NF++P P  PAFP+++L         E +++  +E
Subjt:  MILEQLALSTSRQ-------EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEENDDE

A0A2P5DAQ2 Uncharacterized protein9.1e-2233.49Show/hide
Query:  LKRDFLFERGFSGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKEDGFQNFPHAAYNEMDVA----------PSDE-----------QLSDA
        ++++F+++     + P F+   I+ H W+LFCA PE     +VREFY N+   D    +       + V           P DE           +L   
Subjt:  LKRDFLFERGFSGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKEDGFQNFPHAAYNEMDVA----------PSDE-----------QLSDA

Query:  VRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--
        +  V I GA+W +S     T   + L   A  W  F++ R+LPTTH  T+S+E V L +++L   SINVGR+I  EI  C  +K G LFFP+ IT +C  
Subjt:  VRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC-WKKVGKLFFPNTITMLC--

Query:  SKAGVLVDE
        ++A  LV+E
Subjt:  SKAGVLVDE

A0A2P5DXM3 Uncharacterized protein3.3e-2433.89Show/hide
Query:  SDEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC-WKKVGKLFFPN
        ++ +L   +  V   GA+W +S     T   + L   A  W  F++ R+LPTTH   +S++R+LL  ++L   SINVGR+I SEI  C  +K G LFFP+
Subjt:  SDEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC-WKKVGKLFFPN

Query:  TITMLCSKAGVLVDEGDVILFDKGIIDKSNLARL--------------QRMQEVRQGGLIYNINMILEQLALSTSRQEFAERQTLTFWNYVKNRDASLRR
         IT LC  A  LV+E    L + G ID   +AR+               R           ++   L+ L    S+QE   +Q   FW Y K RD +L++
Subjt:  TITMLCSKAGVLVDEGDVILFDKGIIDKSNLARL--------------QRMQEVRQGGLIYNINMILEQLALSTSRQEFAERQTLTFWNYVKNRDASLRR

Query:  ALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEENDDE
        ALQ NF++P P  PAFP+++L         E +++  +E
Subjt:  ALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEENDDE

W9QTD9 Uncharacterized protein8.2e-2335.32Show/hide
Query:  PQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIV---KEDGF-QN----FPHAAYNE-----------MDVAP--SDEQLSDAVRDVGIEGAQWQLSK
        P F+   I  HGW  FC  P +    +VREFYAN++   +E  F QN    F   A N            +D A   +DEQL   + +V IEGA WQ+S 
Subjt:  PQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIV---KEDGF-QN----FPHAAYNE-----------MDVAP--SDEQLSDAVRDVGIEGAQWQLSK

Query:  TQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC--WKKVGKLFFPNTITMLCSKAGVLVDEGDVILFD
            T     LKR A  W  F+  R +P+TH  T++++RVLL ++IL  +S+N+  I   EI  C   +K G L+FP+ IT L  KA V   + + I+ +
Subjt:  TQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGC--WKKVGKLFFPNTITMLCSKAGVLVDEGDVILFD

Query:  KGIIDKSNLARLQRMQEV
         G I   +++R+ + + V
Subjt:  KGIIDKSNLARLQRMQEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCATCTGATGAAGCCACGTGTCGCAGAGCAATCATTCGAAGGCTTCAATGATGGACAGCACATCAATTATTACCATGGGCGTTTGGAATCCACGATTTTACCGTT
GCTGGTTTCAAATTTTCAGGATTGCATTTTGCTGGAGCCTTATTTATTCACAACCGCTGACGATAGCGAGAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTTACCC
CCGAAGTTCAGAAGGTAAAAAAGATAACGCCAGAGGAAAATGAAGCCAAGAAAAGGAGAAGGCAGCAAAGGGCTGTAGAACAGGAAGAAGTTCAGGAGGTAGCAGAGGTT
GTTGCCACTGCAGCGGAAGAAGGAAATACTCAAGAACCTGAAGTGCAAAACCCGGTTACGGTTCAAGAAGAGAATGTTAGGGAAAATCAAGAAATAGAGACTGACGAAGT
TCGAGGCGAACAAACCGCAGAGGTGCCTGAAGAAGGGAATGAACAGGGATCGGTGCAAGAGGCTCGGGTTGAAGTCATCATACCTGAACCACCAAAGAAGCGCCGCATTA
AGTGGAAGGCAGGCCGCATCAGGGTGATTCGGAATACCCCATCGCCTCCATCATCGGACTCTGAGGAAGAAAGAAGGGAAAGAGAGAAAAAGAAGCTGAGGACAAAGGCG
GAAAAGGGCAAAAACTTTGCTGAAGCATCGGAGGAACACGATGCAATAGAAGAACAACAGTTACCATTTGATCGCTTCGCCAATAATTTTTCCATAGCAAAATACGCTGA
GCTTCTGAAAAGAGATTTCTTGTTTGAGCGTGGTTTCAGTGGTGATCTTCCACAGTTTCTGAGGACCGGTATTGTAGACCACGGCTGGGAGCTGTTTTGTGCGAAGCCTG
AGTCTGTAAACACACAGGTGGTGCGTGAATTTTATGCTAACATTGTCAAGGAGGATGGATTCCAGAACTTTCCCCATGCGGCATATAATGAGATGGATGTGGCGCCATCT
GATGAGCAACTAAGTGATGCTGTGCGAGATGTAGGAATTGAAGGGGCACAATGGCAGTTATCCAAAACTCAGAAGAGGACATTCCAGTCGGCTTATTTGAAAAGGGAAGC
GAACACGTGGATGAGATTTATTAGACAGAGGATGCTTCCAACAACACATGACTCGACAATCTCCAGGGAACGGGTTCTCCTAGCTTTTGCCATCTTGCGGTCTCTCAGTA
TTAACGTAGGGAGGATCATTGCGAGTGAAATTTCTGGTTGCTGGAAAAAGGTGGGGAAGCTGTTCTTCCCAAATACAATTACAATGCTTTGCAGTAAAGCAGGGGTTCTA
GTGGATGAGGGAGATGTGATCCTGTTTGACAAAGGGATCATCGACAAGTCCAATTTGGCACGGCTCCAGCGGATGCAGGAGGTCCGTCAAGGTGGGCTTATTTACAACAT
CAACATGATTCTAGAACAACTGGCACTTTCGACCAGTAGGCAGGAGTTTGCTGAAAGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGGGATGCCAGCTTAAGAA
GGGCGCTGCAAGAGAATTTTTCCAAACCGTATCCAGCCCTTCCTGCATTCCCTGAGGATCTGTTGAACCCCAGGATTCCGCCCCCACCTGTTGAAAGAGAAGAAGAGAAT
GATGATGAAGAGCAGTGTCGGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCATCTGATGAAGCCACGTGTCGCAGAGCAATCATTCGAAGGCTTCAATGATGGACAGCACATCAATTATTACCATGGGCGTTTGGAATCCACGATTTTACCGTT
GCTGGTTTCAAATTTTCAGGATTGCATTTTGCTGGAGCCTTATTTATTCACAACCGCTGACGATAGCGAGAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTTACCC
CCGAAGTTCAGAAGGTAAAAAAGATAACGCCAGAGGAAAATGAAGCCAAGAAAAGGAGAAGGCAGCAAAGGGCTGTAGAACAGGAAGAAGTTCAGGAGGTAGCAGAGGTT
GTTGCCACTGCAGCGGAAGAAGGAAATACTCAAGAACCTGAAGTGCAAAACCCGGTTACGGTTCAAGAAGAGAATGTTAGGGAAAATCAAGAAATAGAGACTGACGAAGT
TCGAGGCGAACAAACCGCAGAGGTGCCTGAAGAAGGGAATGAACAGGGATCGGTGCAAGAGGCTCGGGTTGAAGTCATCATACCTGAACCACCAAAGAAGCGCCGCATTA
AGTGGAAGGCAGGCCGCATCAGGGTGATTCGGAATACCCCATCGCCTCCATCATCGGACTCTGAGGAAGAAAGAAGGGAAAGAGAGAAAAAGAAGCTGAGGACAAAGGCG
GAAAAGGGCAAAAACTTTGCTGAAGCATCGGAGGAACACGATGCAATAGAAGAACAACAGTTACCATTTGATCGCTTCGCCAATAATTTTTCCATAGCAAAATACGCTGA
GCTTCTGAAAAGAGATTTCTTGTTTGAGCGTGGTTTCAGTGGTGATCTTCCACAGTTTCTGAGGACCGGTATTGTAGACCACGGCTGGGAGCTGTTTTGTGCGAAGCCTG
AGTCTGTAAACACACAGGTGGTGCGTGAATTTTATGCTAACATTGTCAAGGAGGATGGATTCCAGAACTTTCCCCATGCGGCATATAATGAGATGGATGTGGCGCCATCT
GATGAGCAACTAAGTGATGCTGTGCGAGATGTAGGAATTGAAGGGGCACAATGGCAGTTATCCAAAACTCAGAAGAGGACATTCCAGTCGGCTTATTTGAAAAGGGAAGC
GAACACGTGGATGAGATTTATTAGACAGAGGATGCTTCCAACAACACATGACTCGACAATCTCCAGGGAACGGGTTCTCCTAGCTTTTGCCATCTTGCGGTCTCTCAGTA
TTAACGTAGGGAGGATCATTGCGAGTGAAATTTCTGGTTGCTGGAAAAAGGTGGGGAAGCTGTTCTTCCCAAATACAATTACAATGCTTTGCAGTAAAGCAGGGGTTCTA
GTGGATGAGGGAGATGTGATCCTGTTTGACAAAGGGATCATCGACAAGTCCAATTTGGCACGGCTCCAGCGGATGCAGGAGGTCCGTCAAGGTGGGCTTATTTACAACAT
CAACATGATTCTAGAACAACTGGCACTTTCGACCAGTAGGCAGGAGTTTGCTGAAAGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGGGATGCCAGCTTAAGAA
GGGCGCTGCAAGAGAATTTTTCCAAACCGTATCCAGCCCTTCCTGCATTCCCTGAGGATCTGTTGAACCCCAGGATTCCGCCCCCACCTGTTGAAAGAGAAGAAGAGAAT
GATGATGAAGAGCAGTGTCGGGAAGATTGA
Protein sequenceShow/hide protein sequence
MHHLMKPRVAEQSFEGFNDGQHINYYHGRLESTILPLLVSNFQDCILLEPYLFTTADDSEKERDNEEEEVPVTPEVQKVKKITPEENEAKKRRRQQRAVEQEEVQEVAEV
VATAAEEGNTQEPEVQNPVTVQEENVRENQEIETDEVRGEQTAEVPEEGNEQGSVQEARVEVIIPEPPKKRRIKWKAGRIRVIRNTPSPPSSDSEEERREREKKKLRTKA
EKGKNFAEASEEHDAIEEQQLPFDRFANNFSIAKYAELLKRDFLFERGFSGDLPQFLRTGIVDHGWELFCAKPESVNTQVVREFYANIVKEDGFQNFPHAAYNEMDVAPS
DEQLSDAVRDVGIEGAQWQLSKTQKRTFQSAYLKREANTWMRFIRQRMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKVGKLFFPNTITMLCSKAGVL
VDEGDVILFDKGIIDKSNLARLQRMQEVRQGGLIYNINMILEQLALSTSRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPAFPEDLLNPRIPPPPVEREEEN
DDEEQCRED