CuGenDBv2

HG10020459 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020459
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: ATP-dependent helicase/deoxyribonuclease subunit B
Genome location: Chr04:32006324..32026668
RNA-Seq Expression: HG10020459
Synteny: HG10020459
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004137681.1 uncharacterized protein LOC101203220 [Cucumis sativus]; e value 9.8e-86; % identity 63.09
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        M +PPILSLL+ LLSL+PI LAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPI+GVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
        AEVPMIDVHLRYSGSDLHGVTAKV+DMPHI I              D    IS                         Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTLSFILS+YILQSSKDKLARFVRETVVESS PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

XP_008442331.1 PREDICTED: uncharacterized protein LOC103486235 isoform X1 [Cucumis melo]; e value 9.5e-89; % identity 65.1
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        M  PPILSLL+ LLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
        AEVPMIDVHLRYSGSDLHGVTAKV+DMPHI  +  G  TT  S   D    I+                         Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESS PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

XP_008442332.1 PREDICTED: uncharacterized protein LOC103486235 isoform X2 [Cucumis melo]; e value 8.9e-87; % identity 63.42
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        M  PPILSLL+ LLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
        AEVPMIDVHLRYSGSDLHGVTAKV+DMPHI                         +  H  + +              Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESS PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

XP_022157271.1 uncharacterized protein LOC111024020 isoform X1 [Momordica charantia]; e value 3.7e-85; % identity 62.75
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        MGEP ILSLLVLLL L+PIS+AYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIF VNREVLVPIPKPVGYTGADPYKISFQVGKEKFL+PWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
         EVPMIDVHLRYSGSDLHGVTAKV DMPHI I              D    IS                         Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTL+FILSIYILQSSKDKLARFVRETVVE+S PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

XP_038903796.1 uncharacterized protein LOC120090293 isoform X1 [Benincasa hispida]; e value 2.6e-86; % identity 63.09
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        MGEPPILSLLV LLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINR+S
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
        AEVPMIDVHLRYSGSDLHGVTAKV+DMPHI                         +  H  + +              Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDV SGLYVLFGSGLTLSFILSI+ILQSSKDKLARFVRETVVESS P  GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L9R0 Uncharacterized protein; e value 4.7e-86; % identity 63.09
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        M +PPILSLL+ LLSL+PI LAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPI+GVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
        AEVPMIDVHLRYSGSDLHGVTAKV+DMPHI I              D    IS                         Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTLSFILS+YILQSSKDKLARFVRETVVESS PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

A0A1S3B5G0 uncharacterized protein LOC103486235 isoform X1; e value 4.6e-89; % identity 65.1
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        M  PPILSLL+ LLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
        AEVPMIDVHLRYSGSDLHGVTAKV+DMPHI  +  G  TT  S   D    I+                         Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESS PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

A0A1S3B641 uncharacterized protein LOC103486235 isoform X2; e value 4.3e-87; % identity 63.42
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        M  PPILSLL+ LLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
        AEVPMIDVHLRYSGSDLHGVTAKV+DMPHI                         +  H  + +              Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESS PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

A0A5A7TLX6 Uncharacterized protein; e value 4.3e-87; % identity 63.42
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        M  PPILSLL+ LLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
        AEVPMIDVHLRYSGSDLHGVTAKV+DMPHI                         +  H  + +              Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESS PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

A0A6J1DXJ6 uncharacterized protein LOC111024020 isoform X1; e value 1.8e-85; % identity 62.75
Query:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS
        MGEP ILSLLVLLL L+PIS+AYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIF VNREVLVPIPKPVGYTGADPYKISFQVGKEKFL+PWLLVINRKS
Subjt:  MGEPPILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKS

Query:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS
         EVPMIDVHLRYSGSDLHGVTAKV DMPHI I              D    IS                         Q   P                 
Subjt:  AEVPMIDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSS

Query:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE
                              K ++V          WEEQSEIDVTSGLYVLFGSGLTL+FILSIYILQSSKDKLARFVRETVVE+S PG GVAKVE
Subjt:  VENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G51610.1 unknown protein; e value 2.0e-65; % identity 50
Query:  ILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKSAEVPM
        I +L+++ LS   +S AYRPGDIV MSKMGQYHSSRT WHD+IG+HCPIF VNREVL+PI KP+GYTG DPYKI FQVG EKFL+ WLLVINRKS+EVPM
Subjt:  ILSLLVLLLSLVPISLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKSAEVPM

Query:  IDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSSVENLT
        IDV+LRYSG DL GVTA+V+DMPH  +                        + H                I  Q   PQ                     
Subjt:  IDVHLRYSGSDLHGVTAKVMDMPHICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSSVENLT

Query:  AGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAG
                         K ++V          W+EQSEIDV+SG YVLFGS LT SF+LSIY+LQSS++KLARFVRETVVESS+   G
Subjt:  AGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWEEQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAG


Sequences
CDS sequence
ATGTTGAGTACTCTTCGGGGTGGCTTATGGCTATTGAACGTTATTTGTGACCATTTCCCACTGAGGAAATTATTCAGCCTGATTGAATTGGGCCAAGTTCGAGATGGACC
GAGAAGTAAAACCCAATTGGATCTGAAAGTAGCGGGGCCCATGGATAACTCTGGGCCCCGGGGGCCGAGCACCAAGGATCCCCGCCAGTCCAAAAGGGCGAGGAGGGAAA
CTGATTCCATAGAGCTCCAATTCTTGGAACGATTGGGCAAATTACCGTCGACGATGGGAGAACCTCCAATTCTATCTCTCCTCGTTTTGCTATTGTCGCTGGTTCCGATT
TCGCTGGCTTACAGGCCCGGAGACATCGTTCCGATGAGCAAGATGGGACAGTATCACTCGTCGAGAACAGTTTGGCACGATATGATAGGTCGACATTGTCCAATTTTTGG
CGTCAATCGCGAGGTTTTGGTTCCTATACCGAAACCGGTCGGCTACACAGGAGCTGATCCGTATAAAATATCCTTTCAAGTTGGAAAAGAGAAGTTTCTTGTCCCATGGC
TTCTTGTGATAAATCGAAAAAGTGCAGAAGTCCCAATGATTGATGTCCATTTGAGATACTCTGGAAGTGATCTTCATGGTGTGACTGCCAAAGTAATGGACATGCCTCAT
ATCTGCATATGGCAGGTTGGAGGATGTACGACAGTCGACAGCGGTTGTTGCGACGATAGTGGTAGTATAAGTGGTGGTTCTTCCAATCATAGTGTTTTGCTTCAAAATCG
TGGAGGAAAAATTTTGAGACCGGGAACAATCTTGAGCCAGAGCAAAGGACCTCAGGCTGCAGCCTCGTTAGGTTGCTATATGGTTTCAAACTTGTCGAGCTCAGTTGAAA
ACTTAACAGCTGGAGAAGGACTATTAGGCTCTTCTATCGAGCTAAGCTATGGAAGTATTAAGGATTTGATTGTGAGTGGATTCAACCGTGAATCGAACTGTGTCTGGGAG
GAGCAATCAGAGATTGATGTGACTTCTGGATTATATGTTTTGTTTGGATCAGGTCTTACGCTATCTTTTATTCTGTCGATTTACATCTTGCAATCATCGAAGGACAAGTT
AGCAAGGTTTGTAAGAGAAACAGTAGTTGAAAGCAGCAATCCTGGCGCAGGAGTGGCAAAAGTCGAATAG
mRNA sequence
ATGTTGAGTACTCTTCGGGGTGGCTTATGGCTATTGAACGTTATTTGTGACCATTTCCCACTGAGGAAATTATTCAGCCTGATTGAATTGGGCCAAGTTCGAGATGGACC
GAGAAGTAAAACCCAATTGGATCTGAAAGTAGCGGGGCCCATGGATAACTCTGGGCCCCGGGGGCCGAGCACCAAGGATCCCCGCCAGTCCAAAAGGGCGAGGAGGGAAA
CTGATTCCATAGAGCTCCAATTCTTGGAACGATTGGGCAAATTACCGTCGACGATGGGAGAACCTCCAATTCTATCTCTCCTCGTTTTGCTATTGTCGCTGGTTCCGATT
TCGCTGGCTTACAGGCCCGGAGACATCGTTCCGATGAGCAAGATGGGACAGTATCACTCGTCGAGAACAGTTTGGCACGATATGATAGGTCGACATTGTCCAATTTTTGG
CGTCAATCGCGAGGTTTTGGTTCCTATACCGAAACCGGTCGGCTACACAGGAGCTGATCCGTATAAAATATCCTTTCAAGTTGGAAAAGAGAAGTTTCTTGTCCCATGGC
TTCTTGTGATAAATCGAAAAAGTGCAGAAGTCCCAATGATTGATGTCCATTTGAGATACTCTGGAAGTGATCTTCATGGTGTGACTGCCAAAGTAATGGACATGCCTCAT
ATCTGCATATGGCAGGTTGGAGGATGTACGACAGTCGACAGCGGTTGTTGCGACGATAGTGGTAGTATAAGTGGTGGTTCTTCCAATCATAGTGTTTTGCTTCAAAATCG
TGGAGGAAAAATTTTGAGACCGGGAACAATCTTGAGCCAGAGCAAAGGACCTCAGGCTGCAGCCTCGTTAGGTTGCTATATGGTTTCAAACTTGTCGAGCTCAGTTGAAA
ACTTAACAGCTGGAGAAGGACTATTAGGCTCTTCTATCGAGCTAAGCTATGGAAGTATTAAGGATTTGATTGTGAGTGGATTCAACCGTGAATCGAACTGTGTCTGGGAG
GAGCAATCAGAGATTGATGTGACTTCTGGATTATATGTTTTGTTTGGATCAGGTCTTACGCTATCTTTTATTCTGTCGATTTACATCTTGCAATCATCGAAGGACAAGTT
AGCAAGGTTTGTAAGAGAAACAGTAGTTGAAAGCAGCAATCCTGGCGCAGGAGTGGCAAAAGTCGAATAG
Protein sequence
MLSTLRGGLWLLNVICDHFPLRKLFSLIELGQVRDGPRSKTQLDLKVAGPMDNSGPRGPSTKDPRQSKRARRETDSIELQFLERLGKLPSTMGEPPILSLLVLLLSLVPI
SLAYRPGDIVPMSKMGQYHSSRTVWHDMIGRHCPIFGVNREVLVPIPKPVGYTGADPYKISFQVGKEKFLVPWLLVINRKSAEVPMIDVHLRYSGSDLHGVTAKVMDMPH
ICIWQVGGCTTVDSGCCDDSGSISGGSSNHSVLLQNRGGKILRPGTILSQSKGPQAAASLGCYMVSNLSSSVENLTAGEGLLGSSIELSYGSIKDLIVSGFNRESNCVWE
EQSEIDVTSGLYVLFGSGLTLSFILSIYILQSSKDKLARFVRETVVESSNPGAGVAKVE