; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022055 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022055
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationChr05:20298855..20300507
RNA-Seq ExpressionHG10022055
SyntenyHG10022055
Gene Ontology termsNA
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141578.1 uncharacterized protein LOC101212716 isoform X2 [Cucumis sativus]8.0e-9474.49Show/hide
Query:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE
        ML S T+P +CL       FQRE SSLKK K K+WKCFA+ P++QK   H+N LSVS   FSDLPLY+SPGKASFDEYLEDKPRLVKATFPGK+QQLNQE
Subjt:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE

Query:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD
        EWRIETPKIQLLFLKI PTI MKII KTN GE YP HVPHYI K+L  +MT WEINGIHK Y PSSANVCS G IY +KIG R+ LKFQL+I+LSFLVPD
Subjt:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD

Query:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK
        AL+FVPNDVLRGIIETV+KAM+EDLKHKT+HKLVEDY++FR E +K+
Subjt:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK

XP_022961709.1 uncharacterized protein LOC111462397 [Cucurbita moschata]6.1e-8668.29Show/hide
Query:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ
        M+ S   QP LC  V+NGV+ Q+  S LK  K   WKCFAV K+Q    K  N LSVSL  FSD+PLY+  GKASFD+YLEDKPRLVKATFPGKS+QLNQ
Subjt:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ

Query:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD
        EEWRIETPKI+ LFLKIWPTI +KII KT+GEGYPS VPH ITKVL L+MT WE+NGIH+ Y PSSANVCSRGAIY+EK GIR+ LKFQL INLSF +PD
Subjt:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD

Query:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK
        AL FVP DV + I+E  LK M+ED+K K + +LVEDY  FRKE KK
Subjt:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK

XP_023517004.1 uncharacterized protein LOC111780797 isoform X1 [Cucurbita pepo subsp. pepo]8.0e-8667.89Show/hide
Query:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ
        M+ S   QP LC  V+NGV+ Q+  S LK  K   WKCFAV K+Q    K  N LSVSL  FSD+PLY+  GKASFD+YLEDKPR+VKATFPGKS+QLNQ
Subjt:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ

Query:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD
        EEWRIETPKI+ LFLKIWPTI +KII KT+GEGYPS VPH ITKVL L+MT WE+NGIH+ Y PSSANVCSRGAIY++K GIR+ LKFQL INLSF +PD
Subjt:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD

Query:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK
        AL FVP DV + I+E  LKAM+ED+K K + +LVEDY  FRKE KK
Subjt:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK

XP_031741979.1 uncharacterized protein LOC101212716 isoform X1 [Cucumis sativus]2.0e-9274.19Show/hide
Query:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE
        ML S T+P +CL       FQRE SSLKK K K+WKCFA+ P++QK   H+N LSVS   FSDLPLY+SPGKASFDEYLEDKPRLVKATFPGK+QQLNQE
Subjt:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE

Query:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEM-TYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVP
        EWRIETPKIQLLFLKI PTI MKII KTN GE YP HVPHYI K+L  +M T WEINGIHK Y PSSANVCS G IY +KIG R+ LKFQL+I+LSFLVP
Subjt:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEM-TYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVP

Query:  DALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK
        DAL+FVPNDVLRGIIETV+KAM+EDLKHKT+HKLVEDY++FR E +K+
Subjt:  DALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK

XP_038891182.1 uncharacterized protein LOC120080556 [Benincasa hispida]2.2e-10478.8Show/hide
Query:  MMLSSRTQPILCLQVENGVIFQRENS----SLKKHKFKSWKCFAVPKTQK-----HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKS
        MML  RT  +  LQVENGV+ QRE+S    +LKK K K WKCFAV KTQK     HHN LSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGK 
Subjt:  MMLSSRTQPILCLQVENGVIFQRENS----SLKKHKFKSWKCFAVPKTQK-----HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKS

Query:  QQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLS
        QQLNQEEWRIE PKI+LLFLKIWPT+ +KI CKTNGE YPS VPHYITKVL LEMT WEINGIHK Y PS ANVCSRGAIY+EKIG R+HLKF+LLINLS
Subjt:  QQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLS

Query:  FLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK
        FLVP  LNFV NDVL+ I++T LKAMIEDLKHK+IHKLVEDY EFRKENK
Subjt:  FLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK

TrEMBL top hitse value%identityAlignment
A0A0A0KSD5 Uncharacterized protein3.9e-9474.49Show/hide
Query:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE
        ML S T+P +CL       FQRE SSLKK K K+WKCFA+ P++QK   H+N LSVS   FSDLPLY+SPGKASFDEYLEDKPRLVKATFPGK+QQLNQE
Subjt:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE

Query:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD
        EWRIETPKIQLLFLKI PTI MKII KTN GE YP HVPHYI K+L  +MT WEINGIHK Y PSSANVCS G IY +KIG R+ LKFQL+I+LSFLVPD
Subjt:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD

Query:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK
        AL+FVPNDVLRGIIETV+KAM+EDLKHKT+HKLVEDY++FR E +K+
Subjt:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK

A0A1S4E357 uncharacterized protein LOC1034987442.8e-7675.63Show/hide
Query:  FQRENSSLKKHKFKSWKCFAVPKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWRIETPKIQLLFLKIWPTI
        FQRE SSLKK K K W+CFA+P++QK   H N LSVS   FSDL L++SPGKASFDEYLEDKPRL+KATFPGK QQLNQEEWRIETPKIQLLFLKIWPT+
Subjt:  FQRENSSLKKHKFKSWKCFAVPKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWRIETPKIQLLFLKIWPTI

Query:  HMKIICKTN-GEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETV
         MKII KTN GE YP  VP+YI KVL  EMT WEINGI+K Y PSSANVCS G IY EKIG R+ LKF+L+I+LSFLVPDAL+FVPNDVLRG+I TV
Subjt:  HMKIICKTN-GEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETV

A0A6J1C174 uncharacterized protein LOC111006493 isoform X13.0e-7863.16Show/hide
Query:  MLSSRTQPILCLQVENGVIFQRENSSL-----KKHK-FKSWKCFAVPKT-QKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQL
        M+S  +  +L   VENG   QR N+       KK K  +S K  AV KT Q+H N LS S+ FFSD+PL +SPGKASFD+YLEDKPR++KATFPGKSQQL
Subjt:  MLSSRTQPILCLQVENGVIFQRENSSL-----KKHK-FKSWKCFAVPKT-QKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQL

Query:  NQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLV
        NQEEWRIETPK++LL LKIWP I MKII KT+G+ YP HVPH+ITK+L LEMT WEINGIH+ Y PSSANV S+GAIY+EK G  + LKFQ  +N +F+V
Subjt:  NQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLV

Query:  PDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK
        P AL+F+P D+ R I ETVLK M+EDL +K I KLVEDY++FRKE K
Subjt:  PDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK

A0A6J1HEU2 uncharacterized protein LOC1114623973.0e-8668.29Show/hide
Query:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ
        M+ S   QP LC  V+NGV+ Q+  S LK  K   WKCFAV K+Q    K  N LSVSL  FSD+PLY+  GKASFD+YLEDKPRLVKATFPGKS+QLNQ
Subjt:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ

Query:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD
        EEWRIETPKI+ LFLKIWPTI +KII KT+GEGYPS VPH ITKVL L+MT WE+NGIH+ Y PSSANVCSRGAIY+EK GIR+ LKFQL INLSF +PD
Subjt:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPD

Query:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK
        AL FVP DV + I+E  LK M+ED+K K + +LVEDY  FRKE KK
Subjt:  ALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK

A0A6J1JUJ3 uncharacterized protein LOC1114876273.6e-8467.77Show/hide
Query:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWR
        M+ S   QP LC  V+NGV+ Q+  S LK  K   WKCFAV K       LSVSL  FSD+PLY+  GKASFD+YLEDKPRLVKA FPGKS+QLNQEEWR
Subjt:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWR

Query:  IETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNF
        IETPKI+ LFLKIWPTI +KII KT+GEGYPS VPH IT+VL L+MT WE+NGI + YMPSSANVCSRGAIY+EK GIR+ LKFQL INLSF +PDAL F
Subjt:  IETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNF

Query:  VPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK
        +P DV + I+ET LKAM+ED+K K + +LVEDY  FRKE KK
Subjt:  VPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G39520.1 Protein of unknown function (DUF1997)6.4e-4141.12Show/hide
Query:  KSW--KCFAVP-KTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPG--KSQQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGE
        K W  KC   P K+ K+ + +S      +D+ L++SP +A FDEYLEDK R+ +A FP   K+ +LN+EEWRI+   I+  FL   P + M+I CK+NG+
Subjt:  KSW--KCFAVP-KTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPG--KSQQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGE

Query:  GYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHK
         YPS VP +ITKVL+L MT WE+ G+ ++  P+   +  +GA+Y ++ G    LK +L   +SF++P  L  VP DV R +   +L  +++++KH+ I  
Subjt:  GYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHK

Query:  LVEDYTEFRKENKK
        LV DY++F+ E KK
Subjt:  LVEDYTEFRKENKK

AT5G39530.1 Protein of unknown function (DUF1997)7.5e-4243.78Show/hide
Query:  SDLPLYDSPGKASFDEYLEDKPRLVKATFPGK--SQQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHK
        +D+PL +SP +A FDEYLEDK R+ +A FP K  S +LN+EEWRI+   I  LFL +WP + M++ CK+NG+ YP  VP  ITKVL+L M  W++ G+ +
Subjt:  SDLPLYDSPGKASFDEYLEDKPRLVKATFPGK--SQQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHK

Query:  LYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK
        +  P+  ++  +GA+Y ++ G    L+ QL +N+SF++P  L  VP DV R +   VL  ++E++KHK    L+ DY+ F+ E K
Subjt:  LYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCTGTCTTCAAGAACACAGCCAATATTGTGTTTGCAAGTTGAAAATGGTGTTATTTTTCAAAGAGAAAACAGCAGTTTGAAGAAGCACAAATTTAAGAGCTGGAA
GTGCTTTGCAGTGCCAAAAACACAAAAACATCATAACTTCTTATCTGTTTCTTTGACATTTTTCAGTGATTTACCACTTTATGACTCTCCAGGGAAAGCTTCTTTTGATG
AATACTTGGAAGATAAACCCAGATTGGTCAAAGCAACATTTCCTGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGAGAATCGAGACCCCAAAAATTCAGCTGTTGTTC
CTCAAGATATGGCCAACAATTCATATGAAAATAATCTGCAAAACTAATGGAGAAGGTTATCCATCTCATGTTCCTCATTATATAACAAAAGTTCTCCAACTTGAAATGAC
ATACTGGGAGATCAATGGAATCCATAAACTCTATATGCCATCTTCAGCCAATGTTTGTTCTAGAGGAGCTATTTACACTGAAAAAATAGGAATTAGAAACCACCTTAAGT
TTCAACTCCTAATCAATCTCAGCTTTCTTGTACCCGACGCTCTCAATTTCGTTCCGAACGACGTTTTACGGGGCATCATCGAGACGGTTTTGAAGGCAATGATTGAGGAC
CTGAAGCATAAAACTATACATAAATTGGTTGAGGATTATACTGAGTTTAGGAAAGAGAACAAGAAGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGCTGTCTTCAAGAACACAGCCAATATTGTGTTTGCAAGTTGAAAATGGTGTTATTTTTCAAAGAGAAAACAGCAGTTTGAAGAAGCACAAATTTAAGAGCTGGAA
GTGCTTTGCAGTGCCAAAAACACAAAAACATCATAACTTCTTATCTGTTTCTTTGACATTTTTCAGTGATTTACCACTTTATGACTCTCCAGGGAAAGCTTCTTTTGATG
AATACTTGGAAGATAAACCCAGATTGGTCAAAGCAACATTTCCTGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGAGAATCGAGACCCCAAAAATTCAGCTGTTGTTC
CTCAAGATATGGCCAACAATTCATATGAAAATAATCTGCAAAACTAATGGAGAAGGTTATCCATCTCATGTTCCTCATTATATAACAAAAGTTCTCCAACTTGAAATGAC
ATACTGGGAGATCAATGGAATCCATAAACTCTATATGCCATCTTCAGCCAATGTTTGTTCTAGAGGAGCTATTTACACTGAAAAAATAGGAATTAGAAACCACCTTAAGT
TTCAACTCCTAATCAATCTCAGCTTTCTTGTACCCGACGCTCTCAATTTCGTTCCGAACGACGTTTTACGGGGCATCATCGAGACGGTTTTGAAGGCAATGATTGAGGAC
CTGAAGCATAAAACTATACATAAATTGGTTGAGGATTATACTGAGTTTAGGAAAGAGAACAAGAAGAAGTAA
Protein sequenceShow/hide protein sequence
MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWRIETPKIQLLF
LKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIED
LKHKTIHKLVEDYTEFRKENKKK