CuGenDBv2

Lsi02G014640 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G014640
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Protein of unknown function (DUF1997)
Genome location: chr02:20339935..20341765
RNA-Seq Expression: Lsi02G014640
Synteny: Lsi02G014640
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997
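
The genome location string above can be split into a chromosome name and two coordinates. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; the format is inferred from
    the single location string shown in this record.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr02:20339935..20341765")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1831 bp on chr02.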


Homology
GenBank top hits (e value, % identity, alignment)
XP_004141578.1 uncharacterized protein LOC101212716 isoform X2 [Cucumis sativus] (e value: 2.2e-91; % identity: 72.16)
Query:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE
        ML S T+P +CL       FQRE SSLKK K K+WKCFA+ P++QK   H+N LSVS   FSDLPLY+SPGKASFDEYLEDKPRLVKATFPGK+QQLNQE
Subjt:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE

Query:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI
        EWRIETPKIQLLFLKI PTI MKII KTN GE YP HVPHYI K+L        +F+ T WEINGIHK Y PSSANVCS G IY +KIG R+ LKFQL+I
Subjt:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI

Query:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK
        +LSFLVPDAL+FVPNDVLRGIIETV+KAM+EDLKHKT+HKLVEDY++FR E +K+
Subjt:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK

XP_022961709.1 uncharacterized protein LOC111462397 [Cucurbita moschata] (e value: 1.7e-83; % identity: 66.14)
Query:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ
        M+ S   QP LC  V+NGV+ Q+  S LK  K   WKCFAV K+Q    K  N LSVSL  FSD+PLY+  GKASFD+YLEDKPRLVKATFPGKS+QLNQ
Subjt:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ

Query:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI
        EEWRIETPKI+ LFLKIWPTI +KII KT+GEGYPS VPH ITKVL L+M        T WE+NGIH+ Y PSSANVCSRGAIY+EK GIR+ LKFQL I
Subjt:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI

Query:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK
        NLSF +PDAL FVP DV + I+E  LK M+ED+K K + +LVEDY  FRKE KK
Subjt:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK

XP_023517004.1 uncharacterized protein LOC111780797 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 2.2e-83; % identity: 65.75)
Query:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ
        M+ S   QP LC  V+NGV+ Q+  S LK  K   WKCFAV K+Q    K  N LSVSL  FSD+PLY+  GKASFD+YLEDKPR+VKATFPGKS+QLNQ
Subjt:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ

Query:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI
        EEWRIETPKI+ LFLKIWPTI +KII KT+GEGYPS VPH ITKVL L+M        T WE+NGIH+ Y PSSANVCSRGAIY++K GIR+ LKFQL I
Subjt:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI

Query:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK
        NLSF +PDAL FVP DV + I+E  LKAM+ED+K K + +LVEDY  FRKE KK
Subjt:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK

XP_031741979.1 uncharacterized protein LOC101212716 isoform X1 [Cucumis sativus] (e value: 2.2e-91; % identity: 72.16)
Query:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE
        ML S T+P +CL       FQRE SSLKK K K+WKCFA+ P++QK   H+N LSVS   FSDLPLY+SPGKASFDEYLEDKPRLVKATFPGK+QQLNQE
Subjt:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE

Query:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI
        EWRIETPKIQLLFLKI PTI MKII KTN GE YP HVPHYI K+L  +M        T WEINGIHK Y PSSANVCS G IY +KIG R+ LKFQL+I
Subjt:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI

Query:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK
        +LSFLVPDAL+FVPNDVLRGIIETV+KAM+EDLKHKT+HKLVEDY++FR E +K+
Subjt:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK

XP_038891182.1 uncharacterized protein LOC120080556 [Benincasa hispida] (e value: 6.3e-102; % identity: 76.36)
Query:  MMLSSRTQPILCLQVENGVIFQRENS----SLKKHKFKSWKCFAVPKTQK-----HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKS
        MML  RT  +  LQVENGV+ QRE+S    +LKK K K WKCFAV KTQK     HHN LSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGK 
Subjt:  MMLSSRTQPILCLQVENGVIFQRENS----SLKKHKFKSWKCFAVPKTQK-----HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKS

Query:  QQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLK
        QQLNQEEWRIE PKI+LLFLKIWPT+ +KI CKTNGE YPS VPHYITKVL LEM        T WEINGIHK Y PS ANVCSRGAIY+EKIG R+HLK
Subjt:  QQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLK

Query:  FQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK
        F+LLINLSFLVP  LNFV NDVL+ I++T LKAMIEDLKHK+IHKLVEDY EFRKENK
Subjt:  FQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSD5 Uncharacterized protein (e value: 1.1e-91; % identity: 72.16)
Query:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE
        ML S T+P +CL       FQRE SSLKK K K+WKCFA+ P++QK   H+N LSVS   FSDLPLY+SPGKASFDEYLEDKPRLVKATFPGK+QQLNQE
Subjt:  MLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAV-PKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQE

Query:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI
        EWRIETPKIQLLFLKI PTI MKII KTN GE YP HVPHYI K+L        +F+ T WEINGIHK Y PSSANVCS G IY +KIG R+ LKFQL+I
Subjt:  EWRIETPKIQLLFLKIWPTIHMKIICKTN-GEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI

Query:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK
        +LSFLVPDAL+FVPNDVLRGIIETV+KAM+EDLKHKT+HKLVEDY++FR E +K+
Subjt:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKKK

A0A1S4E357 uncharacterized protein LOC103498744 (e value: 1.0e-73; % identity: 72.68)
Query:  FQRENSSLKKHKFKSWKCFAVPKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWRIETPKIQLLFLKIWPTI
        FQRE SSLKK K K W+CFA+P++QK   H N LSVS   FSDL L++SPGKASFDEYLEDKPRL+KATFPGK QQLNQEEWRIETPKIQLLFLKIWPT+
Subjt:  FQRENSSLKKHKFKSWKCFAVPKTQK---HHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWRIETPKIQLLFLKIWPTI

Query:  HMKIICKTN-GEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRG
         MKII KTN GE YP  VP+YI KVL        +FE T WEINGI+K Y PSSANVCS G IY EKIG R+ LKF+L+I+LSFLVPDAL+FVPNDVLRG
Subjt:  HMKIICKTN-GEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRG

Query:  IIETV
        +I TV
Subjt:  IIETV

A0A6J1C174 uncharacterized protein LOC111006493 isoform X1 (e value: 8.3e-76; % identity: 61.18)
Query:  MLSSRTQPILCLQVENGVIFQRENSSL-----KKHK-FKSWKCFAVPKT-QKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQL
        M+S  +  +L   VENG   QR N+       KK K  +S K  AV KT Q+H N LS S+ FFSD+PL +SPGKASFD+YLEDKPR++KATFPGKSQQL
Subjt:  MLSSRTQPILCLQVENGVIFQRENSSL-----KKHK-FKSWKCFAVPKT-QKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQL

Query:  NQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQL
        NQEEWRIETPK++LL LKIWP I MKII KT+G+ YP HVPH+ITK+L LEM        T WEINGIH+ Y PSSANV S+GAIY+EK G  + LKFQ 
Subjt:  NQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQL

Query:  LINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK
         +N +F+VP AL+F+P D+ R I ETVLK M+EDL +K I KLVEDY++FRKE K
Subjt:  LINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK

A0A6J1HEU2 uncharacterized protein LOC111462397 (e value: 8.3e-84; % identity: 66.14)
Query:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ
        M+ S   QP LC  V+NGV+ Q+  S LK  K   WKCFAV K+Q    K  N LSVSL  FSD+PLY+  GKASFD+YLEDKPRLVKATFPGKS+QLNQ
Subjt:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQ----KHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQ

Query:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI
        EEWRIETPKI+ LFLKIWPTI +KII KT+GEGYPS VPH ITKVL L+M        T WE+NGIH+ Y PSSANVCSRGAIY+EK GIR+ LKFQL I
Subjt:  EEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLI

Query:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK
        NLSF +PDAL FVP DV + I+E  LK M+ED+K K + +LVEDY  FRKE KK
Subjt:  NLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK

A0A6J1JUJ3 uncharacterized protein LOC111487627 (e value: 7.8e-82; % identity: 65.6)
Query:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWR
        M+ S   QP LC  V+NGV+ Q+  S LK  K   WKCFAV K       LSVSL  FSD+PLY+  GKASFD+YLEDKPRLVKA FPGKS+QLNQEEWR
Subjt:  MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWR

Query:  IETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSF
        IETPKI+ LFLKIWPTI +KII KT+GEGYPS VPH IT+VL L+M        T WE+NGI + YMPSSANVCSRGAIY+EK GIR+ LKFQL INLSF
Subjt:  IETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSF

Query:  LVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK
         +PDAL F+P DV + I+ET LKAM+ED+K K + +LVEDY  FRKE KK
Subjt:  LVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENKK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G39520.1 Protein of unknown function (DUF1997) (e value: 1.4e-38; % identity: 39.64)
Query:  KSW--KCFAVP-KTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPG--KSQQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGE
        K W  KC   P K+ K+ + +S      +D+ L++SP +A FDEYLEDK R+ +A FP   K+ +LN+EEWRI+   I+  FL   P + M+I CK+NG+
Subjt:  KSW--KCFAVP-KTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPG--KSQQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGE

Query:  GYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIED
         YPS VP +ITKVL+L M        T WE+ G+ ++  P+   +  +GA+Y ++ G    LK +L   +SF++P  L  VP DV R +   +L  ++++
Subjt:  GYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIED

Query:  LKHKTIHKLVEDYTEFRKENKK
        +KH+ I  LV DY++F+ E KK
Subjt:  LKHKTIHKLVEDYTEFRKENKK

AT5G39530.1 Protein of unknown function (DUF1997) (e value: 9.5e-40; % identity: 41.97)
Query:  SDLPLYDSPGKASFDEYLEDKPRLVKATFPGK--SQQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTY
        +D+PL +SP +A FDEYLEDK R+ +A FP K  S +LN+EEWRI+   I  LFL +WP + M++ CK+NG+ YP  VP  ITKVL+L M+         
Subjt:  SDLPLYDSPGKASFDEYLEDKPRLVKATFPGK--SQQLNQEEWRIETPKIQLLFLKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTY

Query:  WEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK
        W++ G+ ++  P+  ++  +GA+Y ++ G    L+ QL +N+SF++P  L  VP DV R +   VL  ++E++KHK    L+ DY+ F+ E K
Subjt:  WEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIETVLKAMIEDLKHKTIHKLVEDYTEFRKENK


Sequences
CDS sequence
ATGATGCTGTCTTCAAGAACACAGCCAATATTGTGTTTGCAAGTTGAAAATGGTGTTATTTTTCAAAGAGAAAACAGCAGTTTGAAGAAGCACAAATTTAAGAGCTGGAA
GTGCTTTGCAGTGCCAAAAACACAAAAACATCATAACTTCTTATCTGTTTCTTTGACATTTTTCAGTGATTTACCACTTTATGACTCTCCAGGGAAAGCTTCTTTTGATG
AATACTTGGAAGATAAACCCAGATTGGTCAAAGCAACATTTCCTGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGAGAATCGAGACCCCAAAAATTCAGCTGTTGTTC
CTCAAGATATGGCCAACAATTCATATGAAAATAATCTGCAAAACTAATGGAGAAGGTTATCCATCTCATGTTCCTCATTATATAACAAAAGTTCTCCAACTTGAAATGGT
AATTCAATATTATTTCGAATATACATACTGGGAGATCAATGGAATCCATAAACTCTATATGCCATCTTCAGCCAATGTTTGTTCTAGAGGAGCTATTTACACTGAAAAAA
TAGGAATTAGAAACCACCTTAAGTTTCAACTCCTAATCAATCTCAGCTTTCTTGTACCCGACGCTCTCAATTTCGTTCCGAACGACGTTTTACGGGGCATCATCGAGACG
GTTTTGAAGGCAATGATTGAGGACCTGAAGCATAAAACTATACATAAATTGGTTGAGGATTATACTGAGTTTAGGAAAGAGAACAAGAAGAAGTAA
mRNA sequence
TGTTGAGAGAAGTAGTTTTGAAAAAAGCTATAGAGAGCCCCATGATGCTGTCTTCAAGAACACAGCCAATATTGTGTTTGCAAGTTGAAAATGGTGTTATTTTTCAAAGA
GAAAACAGCAGTTTGAAGAAGCACAAATTTAAGAGCTGGAAGTGCTTTGCAGTGCCAAAAACACAAAAACATCATAACTTCTTATCTGTTTCTTTGACATTTTTCAGTGA
TTTACCACTTTATGACTCTCCAGGGAAAGCTTCTTTTGATGAATACTTGGAAGATAAACCCAGATTGGTCAAAGCAACATTTCCTGGAAAAAGTCAACAGCTCAACCAGG
AAGAGTGGAGAATCGAGACCCCAAAAATTCAGCTGTTGTTCCTCAAGATATGGCCAACAATTCATATGAAAATAATCTGCAAAACTAATGGAGAAGGTTATCCATCTCAT
GTTCCTCATTATATAACAAAAGTTCTCCAACTTGAAATGGTAATTCAATATTATTTCGAATATACATACTGGGAGATCAATGGAATCCATAAACTCTATATGCCATCTTC
AGCCAATGTTTGTTCTAGAGGAGCTATTTACACTGAAAAAATAGGAATTAGAAACCACCTTAAGTTTCAACTCCTAATCAATCTCAGCTTTCTTGTACCCGACGCTCTCA
ATTTCGTTCCGAACGACGTTTTACGGGGCATCATCGAGACGGTTTTGAAGGCAATGATTGAGGACCTGAAGCATAAAACTATACATAAATTGGTTGAGGATTATACTGAG
TTTAGGAAAGAGAACAAGAAGAAGTAATTAGAGCAAAATGATATATAATTTTGTTATATATAATGAAAGTGACCACATCAAAATGATTCAACTGCTGTGTTCAGTGAAAA
AGTTTGGGCTTCTCTCCTTAGCCAAGAAAATCATATGGGATATAACTTATAGTA
Protein sequence
MMLSSRTQPILCLQVENGVIFQRENSSLKKHKFKSWKCFAVPKTQKHHNFLSVSLTFFSDLPLYDSPGKASFDEYLEDKPRLVKATFPGKSQQLNQEEWRIETPKIQLLF
LKIWPTIHMKIICKTNGEGYPSHVPHYITKVLQLEMVIQYYFEYTYWEINGIHKLYMPSSANVCSRGAIYTEKIGIRNHLKFQLLINLSFLVPDALNFVPNDVLRGIIET
VLKAMIEDLKHKTIHKLVEDYTEFRKENKKK
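
The CDS and protein sequences above can be related by translating the CDS codon by codon. The sketch below is an illustration only and is not part of the database record: the `translate` function is a hypothetical helper, it assumes the standard genetic code, and it is applied here to the first and last few codons of the CDS rather than to the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS string, stopping at the first stop codon.

    Hypothetical helper for illustration; any trailing partial codon
    is ignored.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS shown above.
print(translate("ATGATGCTGTCTTCAAGAACACAGCCA"))
# Last six codons of the CDS shown above, including the stop codon.
print(translate("GAGAACAAGAAGAAGTAA"))
```

The two fragments translate to MMLSSRTQP and ENKKK, which agree with the start and end of the protein sequence listed above.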