; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036648 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036648
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUPF0047 protein like
Genome locationchr2:143671..147352
RNA-Seq ExpressionLag0036648
SyntenyLag0036648
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001602 - Uncharacterised protein family UPF0047
IPR035917 - YjbQ-like superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591177.1 hypothetical protein SDJN03_13523, partial [Cucurbita argyrosperma subsp. sororia]1.5e-8090.74Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGAVLL AQPLRVKLLW +++S GN SATAG GN SLMAAAGPKWAQKTV LPPHRRGCHLITPKIMKEIG+DLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        DSDVRSDTETFLN IVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC LT+PITNGKFNMGTWQ
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

XP_008453044.1 PREDICTED: UPF0047 protein YjbQ [Cucumis melo]5.4e-7888.89Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGA++L AQPLRVKLL S+++S GN SATAG+ N SLMAAAGPKWAQKTV L PHRRGCHLITPKIMKEIGQDLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        DSDV+SDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSS+FGC LTIPITNGKFNMGTWQ
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

XP_022936473.1 uncharacterized protein LOC111443082 [Cucurbita moschata]4.0e-8191.36Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGAVLL AQPLRVKLLWS+++S GN SATAG GN SLMAAAGPKWAQKTV LPPHRRGCHLITPKIMKEIG+DLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        DSDVRSDTETFLN IVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC LT+PITNGKFNMGTWQ
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

XP_022976235.1 uncharacterized protein LOC111476694 [Cucurbita maxima]1.7e-7990.74Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGAVLL AQPLRVKLL S+++S GN SATAG GN SLMAAAGPKWAQKTV LPPHRRGCHLITPKIMKEIG DLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        DSDVRSDTETFLN IVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC LT+PITNGKFNMGTWQ
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

XP_038897717.1 UPF0047 protein C4A8.02c [Benincasa hispida]3.4e-8090.12Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGA+L  AQPLR+KLLW+S++S GN SATA +GN+SLMAAAGPKWAQKTV LPPHRRGCHLITPKIMKEIG DLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC LTIPITNGKFNMGTWQ
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

TrEMBL top hitse value%identityAlignment
A0A0A0L4Q6 Uncharacterized protein4.9e-7786.5Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGA++L AQPLRVKLL S+++S GN SATAG+ N S MAAAGPKWAQKTV L PHRRGCHLITPKIMKEIGQDLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQE
        DSDV++DTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSS+FGC LTIPITNGK NMGTWQ+
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQE

A0A1S3BUP9 UPF0047 protein YjbQ2.6e-7888.89Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGA++L AQPLRVKLL S+++S GN SATAG+ N SLMAAAGPKWAQKTV L PHRRGCHLITPKIMKEIGQDLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        DSDV+SDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSS+FGC LTIPITNGKFNMGTWQ
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

A0A6J1DDZ9 uncharacterized protein LOC111019630 isoform X11.5e-7385.54Show/hide
Query:  MQGAVLLT--AQPLRVKLLWS--SDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTI
        MQGAVLL   AQP+RVKL  S   +TS G+ SA+AG+G  S MAAAGPKWAQKTV LPPHRRGCHLITPKIMKEIGQDL+EFKCG+AHIFLQHTSASLTI
Subjt:  MQGAVLLT--AQPLRVKLLWS--SDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTI

Query:  NENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        NENYDSDVRSDTETFLN+IVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC LTIPITNG FNMGTWQ
Subjt:  NENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

A0A6J1FDC5 uncharacterized protein LOC1114430821.9e-8191.36Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGAVLL AQPLRVKLLWS+++S GN SATAG GN SLMAAAGPKWAQKTV LPPHRRGCHLITPKIMKEIG+DLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        DSDVRSDTETFLN IVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC LT+PITNGKFNMGTWQ
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

A0A6J1IIX0 uncharacterized protein LOC1114766948.1e-8090.74Show/hide
Query:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY
        MQGAVLL AQPLRVKLL S+++S GN SATAG GN SLMAAAGPKWAQKTV LPPHRRGCHLITPKIMKEIG DLSEFKCG+AHIFLQHTSASLTINENY
Subjt:  MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENY

Query:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        DSDVRSDTETFLN IVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC LT+PITNGKFNMGTWQ
Subjt:  DSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ

SwissProt top hitse value%identityAlignment
O14155 UPF0047 protein C4A8.02c2.4e-2849.19Show/hide
Query:  QKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDL
        Q+ + L    +G ++IT  ++K++  +L  F  G  + F+QHTSA+LTINEN+D+D R+D    L+KIVPE  SA ++HT EG DDMPAH+KSS+ G  L
Subjt:  QKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDL

Query:  TIPITNGKFNMGTWQEPSFLAKRR
        T+PITNGK ++GTWQ+      RR
Subjt:  TIPITNGKFNMGTWQEPSFLAKRR

P0A2L1 UPF0047 protein YjbQ5.1e-2346.15Show/hide
Query:  WAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC
        W Q+T+ L    RG HLIT +I  ++   L   + G+ H+ L HTSASLT+NEN D  VR+D E    K VP+  +A ++H  EG DDMP+HIKSS+ G 
Subjt:  WAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC

Query:  DLTIPITNGKFNMGTWQ
         L +P+  G+  +GTWQ
Subjt:  DLTIPITNGKFNMGTWQ

P0A2L2 UPF0047 protein YjbQ5.1e-2346.15Show/hide
Query:  WAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC
        W Q+T+ L    RG HLIT +I  ++   L   + G+ H+ L HTSASLT+NEN D  VR+D E    K VP+  +A ++H  EG DDMP+HIKSS+ G 
Subjt:  WAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC

Query:  DLTIPITNGKFNMGTWQ
         L +P+  G+  +GTWQ
Subjt:  DLTIPITNGKFNMGTWQ

P0AF48 UPF0047 protein YjbQ5.5e-2547.01Show/hide
Query:  WAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC
        W QKT+ L    RG HL+T +I+ ++  D+     G+ H+ LQHTSASLT+NEN D  VR D E F  + VP+  +  ++H  EG DDMP+HIKSSM G 
Subjt:  WAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC

Query:  DLTIPITNGKFNMGTWQ
         L +P+  G+   GTWQ
Subjt:  DLTIPITNGKFNMGTWQ

P0AF49 UPF0047 protein YjbQ5.5e-2547.01Show/hide
Query:  WAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC
        W QKT+ L    RG HL+T +I+ ++  D+     G+ H+ LQHTSASLT+NEN D  VR D E F  + VP+  +  ++H  EG DDMP+HIKSSM G 
Subjt:  WAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGC

Query:  DLTIPITNGKFNMGTWQ
         L +P+  G+   GTWQ
Subjt:  DLTIPITNGKFNMGTWQ

Arabidopsis top hitse value%identityAlignment
AT1G21065.1 unknown protein1.2e-5975.37Show/hide
Query:  ATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTL
        A   T  +S+ +++G KWAQKT+ LPP RRGCHLITPKI+KEI +DLS+F CG+AH+FLQHTSASLTINENYD DV++DTETFLN+IVPEG SAPW+HT+
Subjt:  ATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTETFLNKIVPEGTSAPWKHTL

Query:  EGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ
        EGPDDMPAHIKSSMFGC LTIPIT GK +MGTWQ
Subjt:  EGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGGCGCTGTTCTCTTGACTGCACAACCACTCCGCGTAAAGTTGTTGTGGAGCAGCGATACCAGTAGAGGTAATCAGTCCGCCACCGCAGGCACAGGCAATTCAAG
CTTGATGGCGGCCGCTGGTCCGAAGTGGGCTCAGAAGACTGTTATGCTGCCTCCTCATAGGCGGGGCTGTCATCTCATCACACCGAAGATAATGAAGGAAATTGGGCAGG
ACTTGTCAGAATTCAAATGTGGCATTGCTCATATCTTCTTGCAGCATACGAGTGCCTCTCTTACCATCAATGAAAACTATGATTCGGATGTTCGGAGTGATACTGAAACC
TTCCTCAACAAGATAGTGCCTGAAGGGACATCTGCTCCTTGGAAGCATACACTTGAAGGCCCGGATGACATGCCAGCACACATTAAATCATCAATGTTTGGCTGTGATCT
CACGATTCCAATTACAAATGGAAAGTTTAATATGGGTACTTGGCAGGAACCTTCCTTTCTTGCGAAAAGGAGGGTGAAGAAGGAACTCCTCGATAATTTCCTTGTAGTTC
CTATATCTAGCAAGGTTGAAACCAAAACTCCTCGAAAAAATAATTCCAAACAGTTTGGGCTAAACCACAACTCCAAAAGATATGATCAACAAGGCAGACCAATCAACATA
TCCGTTAACTCGGAGGGAGCGTCGGGCGGCATTTTAGTGCTGTGGAAAGAGAAGGAAGTCAAAGCCTTGGAGGTGGTATTGGGTATTTACTCGCTGTCCGTTTTGTTTGA
ATGTGGGAATCAAAATCAGTTTTTGGTGACTGGAGTTTATGGGCCTAGCCGTCCTAAGGGTAGAAATTTTTTTTGGAGGGAGCTTTATGACCTCTGTGGTTTGTGCAGTG
GAGTTTGGTGCATTGGAGGGGACTTCAATGTGGTGAGAAGTATTGAAGAAAAATCTTCCAGAGGTAGAGTTACTAAAAGTATGCGTGCTTTTAACTGTTGGGTGGAAAAT
TGTAGTCTTTCGGAGGTCGTTTTGGTTAATGCTAGATACACGTGGTATGAAAAAAGAGAGTCCCCAGTGTTTACTAAAATTGACAGATTCTTTGTGACCAAAGAATGGTT
GGACCTCTTCTCCAACGTGAATTCCAATCGGTTACAACGTATTACATCGAATCACTTCCCAATTTTACTTCAAGCTGGGAATTTCTCTTGGGGACTGATGCCTTTTAGAT
TTGAAAATGCTTGGCTATCCCATCTGAATTTCAGGAACTTGATAGATGGTTGGTGGAAGGAGCAAAATGTGGAAGGGTGGGCAGGGTATCAATTTATGGCCAAACTGAAA
GACATTAAAGAAAACTTAAAAGGGTGGAACAAGGAAACCTTTGGTGCTATAAGAGATGAAAAAATCAAAGAGATGAAAAAACCAGAATTTGCCAGAGAATTGAGCAGCTA
G
mRNA sequenceShow/hide mRNA sequence
ATGCAAGGCGCTGTTCTCTTGACTGCACAACCACTCCGCGTAAAGTTGTTGTGGAGCAGCGATACCAGTAGAGGTAATCAGTCCGCCACCGCAGGCACAGGCAATTCAAG
CTTGATGGCGGCCGCTGGTCCGAAGTGGGCTCAGAAGACTGTTATGCTGCCTCCTCATAGGCGGGGCTGTCATCTCATCACACCGAAGATAATGAAGGAAATTGGGCAGG
ACTTGTCAGAATTCAAATGTGGCATTGCTCATATCTTCTTGCAGCATACGAGTGCCTCTCTTACCATCAATGAAAACTATGATTCGGATGTTCGGAGTGATACTGAAACC
TTCCTCAACAAGATAGTGCCTGAAGGGACATCTGCTCCTTGGAAGCATACACTTGAAGGCCCGGATGACATGCCAGCACACATTAAATCATCAATGTTTGGCTGTGATCT
CACGATTCCAATTACAAATGGAAAGTTTAATATGGGTACTTGGCAGGAACCTTCCTTTCTTGCGAAAAGGAGGGTGAAGAAGGAACTCCTCGATAATTTCCTTGTAGTTC
CTATATCTAGCAAGGTTGAAACCAAAACTCCTCGAAAAAATAATTCCAAACAGTTTGGGCTAAACCACAACTCCAAAAGATATGATCAACAAGGCAGACCAATCAACATA
TCCGTTAACTCGGAGGGAGCGTCGGGCGGCATTTTAGTGCTGTGGAAAGAGAAGGAAGTCAAAGCCTTGGAGGTGGTATTGGGTATTTACTCGCTGTCCGTTTTGTTTGA
ATGTGGGAATCAAAATCAGTTTTTGGTGACTGGAGTTTATGGGCCTAGCCGTCCTAAGGGTAGAAATTTTTTTTGGAGGGAGCTTTATGACCTCTGTGGTTTGTGCAGTG
GAGTTTGGTGCATTGGAGGGGACTTCAATGTGGTGAGAAGTATTGAAGAAAAATCTTCCAGAGGTAGAGTTACTAAAAGTATGCGTGCTTTTAACTGTTGGGTGGAAAAT
TGTAGTCTTTCGGAGGTCGTTTTGGTTAATGCTAGATACACGTGGTATGAAAAAAGAGAGTCCCCAGTGTTTACTAAAATTGACAGATTCTTTGTGACCAAAGAATGGTT
GGACCTCTTCTCCAACGTGAATTCCAATCGGTTACAACGTATTACATCGAATCACTTCCCAATTTTACTTCAAGCTGGGAATTTCTCTTGGGGACTGATGCCTTTTAGAT
TTGAAAATGCTTGGCTATCCCATCTGAATTTCAGGAACTTGATAGATGGTTGGTGGAAGGAGCAAAATGTGGAAGGGTGGGCAGGGTATCAATTTATGGCCAAACTGAAA
GACATTAAAGAAAACTTAAAAGGGTGGAACAAGGAAACCTTTGGTGCTATAAGAGATGAAAAAATCAAAGAGATGAAAAAACCAGAATTTGCCAGAGAATTGAGCAGCTA
G
Protein sequenceShow/hide protein sequence
MQGAVLLTAQPLRVKLLWSSDTSRGNQSATAGTGNSSLMAAAGPKWAQKTVMLPPHRRGCHLITPKIMKEIGQDLSEFKCGIAHIFLQHTSASLTINENYDSDVRSDTET
FLNKIVPEGTSAPWKHTLEGPDDMPAHIKSSMFGCDLTIPITNGKFNMGTWQEPSFLAKRRVKKELLDNFLVVPISSKVETKTPRKNNSKQFGLNHNSKRYDQQGRPINI
SVNSEGASGGILVLWKEKEVKALEVVLGIYSLSVLFECGNQNQFLVTGVYGPSRPKGRNFFWRELYDLCGLCSGVWCIGGDFNVVRSIEEKSSRGRVTKSMRAFNCWVEN
CSLSEVVLVNARYTWYEKRESPVFTKIDRFFVTKEWLDLFSNVNSNRLQRITSNHFPILLQAGNFSWGLMPFRFENAWLSHLNFRNLIDGWWKEQNVEGWAGYQFMAKLK
DIKENLKGWNKETFGAIRDEKIKEMKKPEFARELSS