CuGenDBv2

Lag0016128 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0016128
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag protease polyprotein
Genome location: chr12:33660873..33667792
RNA-Seq Expression: Lag0016128
Synteny: Lag0016128
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
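The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits that string and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the helper name `parse_location` is hypothetical rather than part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr12:33660873..33667792")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the span is 6920 bp; with half-open coordinates it would be one base shorter.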


Homology
GenBank top hits (accession and description; e value; % identity; alignment)
KAA0047193.1 gag protease polyprotein [Cucumis melo var. makuwa]; e value 3.3e-18; identity 36.41%
Query:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAKKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGM-PTVNATPAAKES
        + ER +      +TW QFKE+F+ K++S  +   K +EFL L Q  ++A+  K F+ GLR D+Q  V A  PA +A ALR A  + +    N++  A   
Subjt:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAKKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGM-PTVNATPAAKES

Query:  KPYAGQKRKHEQTSTNL-QRSQPSSESTRQKTQHDKQEGNGGE-KPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQGNVGA
           +GQKRK EQ    + QR+  S    R   Q   + G     KP C +CG+HH G+C+     CFKC QEGH  + CP + T    GN A++QG    
Subjt:  KPYAGQKRKHEQTSTNL-QRSQPSSESTRQKTQHDKQEGNGGE-KPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQGNVGA

Query:  LSAESGFRILDLGRYRG
            S  RI    R  G
Subjt:  LSAESGFRILDLGRYRG

XP_023537968.1 uncharacterized protein LOC111798850 [Cucurbita pepo subsp. pepo]; e value 2.7e-20; identity 35.38%
Query:  GLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSA----------------------KKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGMP
        G+++W QFKEAF+++YYS     +K++EF  L+Q  +S                       KK +RFI GL ++++  VGA+ P  Y  ALR+AT +   
Subjt:  GLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSA----------------------KKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGMP

Query:  TVNATPAAKESKPYAGQKRKHEQTSTNLQ-RSQPSSE------STRQKTQHDKQEGNGGEKPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNC
        +V          P  GQKR ++Q + + Q   QP  +        ++  Q  +Q G G  KP C  CG++HWGQC+AR G CF+C Q+GH++ NC
Subjt:  TVNATPAAKESKPYAGQKRKHEQTSTNLQ-RSQPSSE------STRQKTQHDKQEGNGGEKPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNC

XP_038880159.1 uncharacterized protein LOC120071839 [Benincasa hispida]; e value 8.4e-22; identity 37.89%
Query:  RSAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQG----------------------EKSAKKTKRFIAGLRDDVQRDVGALGPANYAS
        R AER I+ S G  TW QFKE F++KY+S  + Y K+ EF+ L QG                         AK+ +RF+ GLRD+V+  V AL P NYA+
Subjt:  RSAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQG----------------------EKSAKKTKRFIAGLRDDVQRDVGALGPANYAS

Query:  ALRAATFMGMPTVNATPAAKESKPYAGQKRKHEQTSTNLQRSQPSSESTRQKTQHDKQEGNG--GEKPKCNSCGRHHWGQCMARKGVCFK
        A RAA  +G P+      +   +P +GQKRK EQ +    +   S++ ++   Q       G   E+P C SCG+HHWG C+   G CFK
Subjt:  ALRAATFMGMPTVNATPAAKESKPYAGQKRKHEQTSTNLQRSQPSSESTRQKTQHDKQEGNG--GEKPKCNSCGRHHWGQCMARKGVCFK

XP_038882311.1 uncharacterized protein LOC120073551 [Benincasa hispida]; e value 2.5e-18; identity 35.59%
Query:  RSAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAK----------------------KTKRFIAGLRDDVQRDVGALGPANYAS
        RS E+ ID    L TW QFKE F++KY+S    Y K+ EFL   QG  S +                      +T  FI GL+  ++  V AL    YA+
Subjt:  RSAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAK----------------------KTKRFIAGLRDDVQRDVGALGPANYAS

Query:  ALRAATFMGMPTVNATPAAKESK--PYAGQKRKHEQTSTNLQRSQPSSESTRQKTQHDKQEGNGGEKPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANN
        AL AA  +   +       +  +    AGQKRK           Q +SES ++K    K+     EKP CNSC +HHWG+C+AR GVC++  Q+GHMA  
Subjt:  ALRAATFMGMPTVNATPAAKESK--PYAGQKRKHEQTSTNLQRSQPSSESTRQKTQHDKQEGNGGEKPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANN

Query:  CPQKKTRDEYGNVARSQGNVGA
        CP + T ++  N    Q  VG+
Subjt:  CPQKKTRDEYGNVARSQGNVGA

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]; e value 6.0e-20; identity 35.78%
Query:  AERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKS----------------------AKKTKRFIAGLRDDVQRDVGALGPANYASAL
        AER + V    VTW QFKE F+ KY+S  + Y K+REFL L QG +S                      A + +RFI GL++ ++  V A  P  +  AL
Subjt:  AERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKS----------------------AKKTKRFIAGLRDDVQRDVGALGPANYASAL

Query:  RAATFMGMPTVNATPAAKESKPYAGQKRKHEQTSTN---LQRSQPSSESTRQKTQHDKQEGNGGE-KPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANN
        R A  +   + +       + P +GQKRK +Q        Q+ Q S  S R   Q   + G     +P C+SCGR HWGQC+A  GVCF C Q+GH+ + 
Subjt:  RAATFMGMPTVNATPAAKESKPYAGQKRKHEQTSTN---LQRSQPSSESTRQKTQHDKQEGNGGE-KPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANN

Query:  CPQKKTRDEY--GN--VARSQGNVGALSAESG
        CP   TR  +  GN  V   Q + G L  + G
Subjt:  CPQKKTRDEY--GN--VARSQGNVGALSAESG

TrEMBL top hits (accession and description; e value; % identity; alignment)
A0A5A7TAQ5 Reverse transcriptase; e value 1.0e-17; identity 31.4%
Query:  MPPRGQGRGRGRGRGRGRGGRGLALPE------QVDP----------SMDQYDEDL----PEEQAPAPPTPAQT-------------VTLTLEALQALIN
        MPPR   R  GRG GRGRG  G   PE       +DP          +M+Q   DL     ++Q PAPP PA                 L L +L+ +  
Subjt:  MPPRGQGRGRGRGRGRGRGGRGLALPE------QVDP----------SMDQYDEDL----PEEQAPAPPTPAQT-------------VTLTLEALQALIN

Query:  NVVASQ----------------APQRSAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAKK----------------------T
         +   +                A   + ER +    G +TW QFKE+F+ K++S  +   K +EFL L QG+ + ++                      +
Subjt:  NVVASQ----------------APQRSAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAKK----------------------T

Query:  KRFIAGLRDDVQRDVGALGPANYASALRAATFMGM-PTVNATPAAKESKPYAGQKRKHEQTSTNL-QRSQPSSESTRQKTQHDKQEGNGG-EKPKCNSCG
         +F+ GLR  +Q  V A  PA +A ALR A  + +    N++  A      +GQKRK EQ   ++ QR+  S    R+  Q   + G     KP C +CG
Subjt:  KRFIAGLRDDVQRDVGALGPANYASALRAATFMGM-PTVNATPAAKESKPYAGQKRKHEQTSTNL-QRSQPSSESTRQKTQHDKQEGNGG-EKPKCNSCG

Query:  RHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQG
        +HH G+C+   G+CFKC QEGH A+ CP + T    GN A++QG
Subjt:  RHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQG

A0A5A7TP01 Reverse transcriptase; e value 1.8e-17; identity 34.15%
Query:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKS----------------------AKKTKRFIAGLRDDVQRDVGALGPANYASA
        +AER++   +  +TW QF+E+F+ K++S  + + K +EFL L QG+ S                      A +T+RF+ GLR D+Q  V AL P  +A A
Subjt:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKS----------------------AKKTKRFIAGLRDDVQRDVGALGPANYASA

Query:  LRAATFMGMPTVNATPAAKESKPYAGQKRKHEQTSTNLQRSQPSSESTRQKTQHDKQEGNGG----EKPKCNSCGRHHWGQCMARKGVCFKCHQEGHMAN
        LR A  + +           +   +GQKRK E     + +  P S    Q+  H ++    G    E P C +CG+ H GQC+A  GVCF+C Q GH A+
Subjt:  LRAATFMGMPTVNATPAAKESKPYAGQKRKHEQTSTNLQRSQPSSESTRQKTQHDKQEGNGG----EKPKCNSCGRHHWGQCMARKGVCFKCHQEGHMAN

Query:  NCPQK
         CP+K
Subjt:  NCPQK

A0A5A7TUE7 Gag protease polyprotein; e value 1.6e-18; identity 36.41%
Query:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAKKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGM-PTVNATPAAKES
        + ER +      +TW QFKE+F+ K++S  +   K +EFL L Q  ++A+  K F+ GLR D+Q  V A  PA +A ALR A  + +    N++  A   
Subjt:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAKKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGM-PTVNATPAAKES

Query:  KPYAGQKRKHEQTSTNL-QRSQPSSESTRQKTQHDKQEGNGGE-KPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQGNVGA
           +GQKRK EQ    + QR+  S    R   Q   + G     KP C +CG+HH G+C+     CFKC QEGH  + CP + T    GN A++QG    
Subjt:  KPYAGQKRKHEQTSTNL-QRSQPSSESTRQKTQHDKQEGNGGE-KPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQGNVGA

Query:  LSAESGFRILDLGRYRG
            S  RI    R  G
Subjt:  LSAESGFRILDLGRYRG

A0A5A7UJ81 Reverse transcriptase; e value 1.3e-17; identity 35.96%
Query:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKS----------------------AKKTKRFIAGLRDDVQRDVGALGPANYASA
        +AER +      +TW QFKE+F+ K++S  + + K +EFL L QG+ +                      A +T++F+ GLR D+Q  V AL PA +A A
Subjt:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKS----------------------AKKTKRFIAGLRDDVQRDVGALGPANYASA

Query:  LRAATFMGMPTVNATPAAKESKPYAGQKRKHE-QTSTNLQRSQPSSESTRQKTQHDKQEGNG-GEKPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNC
        LR A  + +P    +  A       GQKRK E Q     QR+  S    ++  +     G   GE P C +CGR H G+C+A  GVCF+C Q GH A+ C
Subjt:  LRAATFMGMPTVNATPAAKESKPYAGQKRKHE-QTSTNLQRSQPSSESTRQKTQHDKQEGNG-GEKPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNC

Query:  PQK
        P+K
Subjt:  PQK

A0A5D3E1H4 Gag-protease polyprotein; e value 1.6e-18; identity 36.41%
Query:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAKKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGM-PTVNATPAAKES
        + ER +      +TW QFKE+F+ K++S  +   K +EFL L Q  ++A+  K F+ GLR D+Q  V A  PA +A ALR A  + +    N++  A   
Subjt:  SAERSIDVSSGLVTWLQFKEAFFQKYYSTIISYRKEREFLTLSQGEKSAKKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGM-PTVNATPAAKES

Query:  KPYAGQKRKHEQTSTNL-QRSQPSSESTRQKTQHDKQEGNGGE-KPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQGNVGA
           +GQKRK EQ    + QR+  S    R   Q   + G     KP C +CG+HH G+C+     CFKC QEGH  + CP + T    GN A++QG    
Subjt:  KPYAGQKRKHEQTSTNL-QRSQPSSESTRQKTQHDKQEGNGGE-KPKCNSCGRHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQGNVGA

Query:  LSAESGFRILDLGRYRG
            S  RI    R  G
Subjt:  LSAESGFRILDLGRYRG

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found
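In the alignments above, the unlabeled line between each Query and Subjt pair is the match line: by the usual BLAST convention a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single alignment block. It is only an illustration: the function name `match_line_stats` is hypothetical, and the % identity reported for each hit is presumably computed over the full alignment, so a per-block count will not necessarily reproduce it.

```python
def match_line_stats(query, match_line):
    """Count identities and positives in one alignment block.

    The query string sets the block length, because trailing spaces on the
    match line are often lost when the page is copied as text.
    """
    length = len(query)
    padded = match_line.ljust(length)[:length]
    identical = sum(1 for ch in padded if ch.isalpha())
    positive = identical + padded.count("+")
    return identical, positive, length


# Last block of the first GenBank hit, with the 8-character label column removed.
query = "LSAESGFRILDLGRYRG"
match = "    S  RI    R  G"
identical, positive, length = match_line_stats(query, match)
print(f"{identical}/{length} identical, {positive}/{length} positive")
```

Summing the counts over every block of a hit and dividing by the total aligned length would give an overall figure comparable to the reported identity column.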

Sequences
CDS sequence
ATGCCTCCACGTGGACAAGGTCGAGGACGTGGACGAGGACGTGGTCGTGGTAGGGGTGGCAGAGGTTTGGCACTCCCGGAGCAAGTAGATCCCTCTATGGATCAATATGA
TGAAGATCTTCCTGAGGAGCAAGCTCCTGCACCTCCAACACCGGCACAGACTGTCACTTTGACTTTAGAGGCTCTTCAGGCGCTTATAAATAATGTCGTTGCCAGTCAGG
CGCCCCAAAGATCGGCTGAGAGATCTATCGATGTGAGTTCTGGTCTGGTCACTTGGTTGCAGTTCAAGGAGGCGTTCTTCCAGAAATATTACTCGACCATCATCAGTTAC
AGAAAGGAGAGGGAATTCCTAACCTTGTCACAAGGGGAGAAGTCAGCAAAGAAGACAAAGCGGTTCATCGCGGGCCTCAGGGATGACGTCCAAAGAGATGTTGGAGCCCT
TGGCCCAGCAAACTACGCATCGGCCCTTCGAGCGGCCACCTTTATGGGCATGCCAACTGTTAATGCAACTCCAGCAGCCAAGGAGTCGAAACCCTACGCAGGACAGAAGA
GGAAACACGAGCAGACATCAACCAACCTCCAGCGATCTCAACCCTCATCCGAAAGTACTAGACAGAAAACTCAGCATGACAAACAAGAGGGCAATGGAGGTGAGAAACCA
AAGTGCAACTCTTGTGGAAGACATCATTGGGGTCAGTGCATGGCGAGGAAAGGTGTGTGTTTCAAATGTCACCAGGAAGGGCATATGGCTAATAACTGCCCTCAAAAGAA
AACAAGAGACGAGTATGGTAACGTGGCCAGGTCGCAAGGGAATGTCGGGGCCTTAAGTGCTGAATCCGGATTCCGAATCCTGGACCTGGGGCGTTACAGAGGCCCTCCTC
CCTCACGGTTTCAGATTTGGAATCCCGATGCGCGCTTTGTATTTTGGCGTTCCTTTTGCTTCTTCTCCGCCGCCTCCAACACTGTCTCCGTCGTCAGCAGCCTCTTCAGC
TCCCCTGTAGCCCAAGAAAACGAAATTTACACATTATTGTCATGGATCTTCCAAAAACGAAGCAACCCCACATCAGCCGTGGCCGTCCTCTCGCCCAGCCGCCGAGCTCC
CGCCGTCGCCGTCGCCGCTCGCGCTACCGCCCACAGCCACGCCGTCCAGCCACCTCCTTCCTCCTCGCGTCGCCAACCGTGCCGCTGCCCCTCGCCTGAAATCAAAGAGC
CCAGACGTCGGTTGGTTTTGTCGCTGAATCGAAGAGCCCCTCGCCTTCTTCGTCTATTCTCCGCTGGAAATCGAAAACCCACGCAAGGATTTGCACGATTTTTGGTTTCG
AACGTCAAGAACTCGTGGCATCCATTTGGGCGTTTTTGGCATCGTTTAGCTATTTCGCGCCGTAAAAGTGTTCGATTGAGTTCGAATCACTTAAAACTTGAATACCCATT
GTCCAAGGAGTGTTCTAACACGTTGTTCGAGCCATTCGATGGATACCCATTGCCCGTTGTGCCTTCGACGTGGTTGCCCATGGTTGGAAGGATCTTGTTAGACTGCTTTG
GATCTCTAAGGTTGCTTTGGAAGCGTTTAGATTATGCAGGTGATGAGCTGTATGTGCCCGATGATGTAGACGAGGAGGTCCACGAGGGTGACCACAGGATTCCATATTGT
GGAGTGGATCGAATCAATTTAGAAGTGTTGTTGAGTTTAACAGGTGTTCTAAAATTTTGGGTTTGTCCTATGTACGCCGTTATGTTGCCAAAATTTCCGGAGCTCTCGGT
AATTTTGGACAACCATAATGTGCAAGGAGTTGACGAGGATAACCGGGCGGAGGTAGGACCAGGAAACGACCCAGAGAAAGACCAGACCAACGGGCCGGGCCAATGGTCGG
CCTCGGCCTGGGCCGACGCCAACGTCGACCACTCGGCCCGCTTGCGCGGGCCAAGTTCGTTTGCCTCCGCTCAGTCCCTACTGCCTCTAGCCGCCCCGGTTCCATCTGAA
GTGGATCTGAACTCTATTCTCGACTCTCTCTCTTGCTCTCTAACCCTGATACTTCCGTTTTCTGACTTAAGCATCGGAGACAGTGTGACTAGCACCACGCCATTGTGCAG
GTTTACCGTTTTGCAGGCCACGTCTTCCCCCTCATCCACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGTGAGTTCATCTGACCAAATTTGA
mRNA sequence
ATGCCTCCACGTGGACAAGGTCGAGGACGTGGACGAGGACGTGGTCGTGGTAGGGGTGGCAGAGGTTTGGCACTCCCGGAGCAAGTAGATCCCTCTATGGATCAATATGA
TGAAGATCTTCCTGAGGAGCAAGCTCCTGCACCTCCAACACCGGCACAGACTGTCACTTTGACTTTAGAGGCTCTTCAGGCGCTTATAAATAATGTCGTTGCCAGTCAGG
CGCCCCAAAGATCGGCTGAGAGATCTATCGATGTGAGTTCTGGTCTGGTCACTTGGTTGCAGTTCAAGGAGGCGTTCTTCCAGAAATATTACTCGACCATCATCAGTTAC
AGAAAGGAGAGGGAATTCCTAACCTTGTCACAAGGGGAGAAGTCAGCAAAGAAGACAAAGCGGTTCATCGCGGGCCTCAGGGATGACGTCCAAAGAGATGTTGGAGCCCT
TGGCCCAGCAAACTACGCATCGGCCCTTCGAGCGGCCACCTTTATGGGCATGCCAACTGTTAATGCAACTCCAGCAGCCAAGGAGTCGAAACCCTACGCAGGACAGAAGA
GGAAACACGAGCAGACATCAACCAACCTCCAGCGATCTCAACCCTCATCCGAAAGTACTAGACAGAAAACTCAGCATGACAAACAAGAGGGCAATGGAGGTGAGAAACCA
AAGTGCAACTCTTGTGGAAGACATCATTGGGGTCAGTGCATGGCGAGGAAAGGTGTGTGTTTCAAATGTCACCAGGAAGGGCATATGGCTAATAACTGCCCTCAAAAGAA
AACAAGAGACGAGTATGGTAACGTGGCCAGGTCGCAAGGGAATGTCGGGGCCTTAAGTGCTGAATCCGGATTCCGAATCCTGGACCTGGGGCGTTACAGAGGCCCTCCTC
CCTCACGGTTTCAGATTTGGAATCCCGATGCGCGCTTTGTATTTTGGCGTTCCTTTTGCTTCTTCTCCGCCGCCTCCAACACTGTCTCCGTCGTCAGCAGCCTCTTCAGC
TCCCCTGTAGCCCAAGAAAACGAAATTTACACATTATTGTCATGGATCTTCCAAAAACGAAGCAACCCCACATCAGCCGTGGCCGTCCTCTCGCCCAGCCGCCGAGCTCC
CGCCGTCGCCGTCGCCGCTCGCGCTACCGCCCACAGCCACGCCGTCCAGCCACCTCCTTCCTCCTCGCGTCGCCAACCGTGCCGCTGCCCCTCGCCTGAAATCAAAGAGC
CCAGACGTCGGTTGGTTTTGTCGCTGAATCGAAGAGCCCCTCGCCTTCTTCGTCTATTCTCCGCTGGAAATCGAAAACCCACGCAAGGATTTGCACGATTTTTGGTTTCG
AACGTCAAGAACTCGTGGCATCCATTTGGGCGTTTTTGGCATCGTTTAGCTATTTCGCGCCGTAAAAGTGTTCGATTGAGTTCGAATCACTTAAAACTTGAATACCCATT
GTCCAAGGAGTGTTCTAACACGTTGTTCGAGCCATTCGATGGATACCCATTGCCCGTTGTGCCTTCGACGTGGTTGCCCATGGTTGGAAGGATCTTGTTAGACTGCTTTG
GATCTCTAAGGTTGCTTTGGAAGCGTTTAGATTATGCAGGTGATGAGCTGTATGTGCCCGATGATGTAGACGAGGAGGTCCACGAGGGTGACCACAGGATTCCATATTGT
GGAGTGGATCGAATCAATTTAGAAGTGTTGTTGAGTTTAACAGGTGTTCTAAAATTTTGGGTTTGTCCTATGTACGCCGTTATGTTGCCAAAATTTCCGGAGCTCTCGGT
AATTTTGGACAACCATAATGTGCAAGGAGTTGACGAGGATAACCGGGCGGAGGTAGGACCAGGAAACGACCCAGAGAAAGACCAGACCAACGGGCCGGGCCAATGGTCGG
CCTCGGCCTGGGCCGACGCCAACGTCGACCACTCGGCCCGCTTGCGCGGGCCAAGTTCGTTTGCCTCCGCTCAGTCCCTACTGCCTCTAGCCGCCCCGGTTCCATCTGAA
GTGGATCTGAACTCTATTCTCGACTCTCTCTCTTGCTCTCTAACCCTGATACTTCCGTTTTCTGACTTAAGCATCGGAGACAGTGTGACTAGCACCACGCCATTGTGCAG
GTTTACCGTTTTGCAGGCCACGTCTTCCCCCTCATCCACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGTGAGTTCATCTGACCAAATTTGA
Protein sequence
MPPRGQGRGRGRGRGRGRGGRGLALPEQVDPSMDQYDEDLPEEQAPAPPTPAQTVTLTLEALQALINNVVASQAPQRSAERSIDVSSGLVTWLQFKEAFFQKYYSTIISY
RKEREFLTLSQGEKSAKKTKRFIAGLRDDVQRDVGALGPANYASALRAATFMGMPTVNATPAAKESKPYAGQKRKHEQTSTNLQRSQPSSESTRQKTQHDKQEGNGGEKP
KCNSCGRHHWGQCMARKGVCFKCHQEGHMANNCPQKKTRDEYGNVARSQGNVGALSAESGFRILDLGRYRGPPPSRFQIWNPDARFVFWRSFCFFSAASNTVSVVSSLFS
SPVAQENEIYTLLSWIFQKRSNPTSAVAVLSPSRRAPAVAVAARATAHSHAVQPPPSSSRRQPCRCPSPEIKEPRRRLVLSLNRRAPRLLRLFSAGNRKPTQGFARFLVS
NVKNSWHPFGRFWHRLAISRRKSVRLSSNHLKLEYPLSKECSNTLFEPFDGYPLPVVPSTWLPMVGRILLDCFGSLRLLWKRLDYAGDELYVPDDVDEEVHEGDHRIPYC
GVDRINLEVLLSLTGVLKFWVCPMYAVMLPKFPELSVILDNHNVQGVDEDNRAEVGPGNDPEKDQTNGPGQWSASAWADANVDHSARLRGPSSFASAQSLLPLAAPVPSE
VDLNSILDSLSCSLTLILPFSDLSIGDSVTSTTPLCRFTVLQATSSPSSTNLPLVAREGQVSSSDQI
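The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code and reading frame 1 (neither is stated in the record), the relationship can be checked as follows. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS listed above.
print(translate("ATGCCTCCACGTGGACAAGGTCGAGGACGTGGA"))
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result with the listed protein is a quick consistency check for a record like this one.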