CuGenDBv2

Lag0021052 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021052
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr7:4264569..4267359
RNA-Seq Expression: Lag0021052
Synteny: Lag0021052
Gene Ontology terms: NA
InterPro domains: NA
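
The genome location string above can be split into a chromosome name and coordinates. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr7:4264569..4267359")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 2791 bp on chr7.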


Homology
GenBank top hits    e value    % identity    Alignment
XP_022937402.1 uncharacterized protein LOC111443687 isoform X3 [Cucurbita moschata]    7.5e-73    56.54
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MV+NPTPICRIS       VP+ M+D+ +IYP+VKVR+E +LDDHP V EQKRSYLL LKDLESL L+DSS+S +  R   RASC  LGT + R ++WSK
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK
        FPPNLTENE                                                       GKEHRVSPSSTAK+PK  SPNV+KPSTSESQ  +  
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK

Query:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTASRVGSF
             N+VN RA SIPMPRAV+SSPEND+MIGKKNRK+T+K SVLKNHNSVQSRHSQCK VA HSGNENPI   KSKET  NSKCRS GK+G A    +F
Subjt:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTASRVGSF

Query:  VSKKAP
         SKK P
Subjt:  VSKKAP

XP_023538643.1 uncharacterized protein LOC111799527 isoform X1 [Cucurbita pepo subsp. pepo]    3.7e-72    58.31
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MV+NPTPICRIS       VPK M+D+ +IYP+VKVREE +LDDHP V EQKRSYLL LKDLESL L+DSSNS +  R   RASC DLGT + R K+WSK
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK
        FPPNLTENE                                                       GKEHRVSPSSTAK+PK  SPNV+KPSTSESQ  +  
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK

Query:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTAS
             NKVN RA SIPMPRAV+SSPEND+MIGKKNRK T+K SVLKNHNSVQSRHSQCK VA HSGNENPI   KSKET  +SKCRS GK+ + S
Subjt:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTAS

XP_023538644.1 uncharacterized protein LOC111799527 isoform X2 [Cucurbita pepo subsp. pepo]    2.0e-73    58.64
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MV+NPTPICRIS       VPK M+D+ +IYP+VKVREE +LDDHP V EQKRSYLL LKDLESL L+DSSNS +  R   RASC DLGT + R K+WSK
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK
        FPPNLTENE                                                       GKEHRVSPSSTAK+PK  SPNV+KPSTSESQ++R  
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK

Query:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTAS
             NKVN RA SIPMPRAV+SSPEND+MIGKKNRK T+K SVLKNHNSVQSRHSQCK VA HSGNENPI   KSKET  +SKCRS GK+ + S
Subjt:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTAS

XP_038895103.1 uncharacterized protein LOC120083415 isoform X1 [Benincasa hispida]    1.7e-72    56.77
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MVRN TPICRISVSST EAVP+KM+D+++ YPRVKVREEK LDDHP VYEQKRSYLL LKDLESL+LQDSSNSP                          
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQ-ERRS
                                                                        GKEH VSPS +AKIPKA  PN IKPSTSESQ ERR 
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQ-ERRS

Query:  KMADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRVGSF
        +M DEDNK NIRA SIPMPRA++SSPEND+MIGKKNRK+T+K SVLKNHNSVQSRHSQCKI+ASHS NENPIS  +SKETA+SKCRSIGK+GTA R  SF
Subjt:  KMADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRVGSF

Query:  VSK
        +SK
Subjt:  VSK

XP_038895110.1 uncharacterized protein LOC120083415 isoform X2 [Benincasa hispida]    6.7e-74    56.95
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MVRN TPICRISVSST EAVP+KM+D+++ YPRVKVREEK LDDHP VYEQKRSYLL LKDLESL+LQDSSNSP                          
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK
                                                                        GKEH VSPS +AKIPKA  PN IKPSTSESQERR +
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK

Query:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRVGSFV
        M DEDNK NIRA SIPMPRA++SSPEND+MIGKKNRK+T+K SVLKNHNSVQSRHSQCKI+ASHS NENPIS  +SKETA+SKCRSIGK+GTA R  SF+
Subjt:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRVGSFV

Query:  SK
        SK
Subjt:  SK

TrEMBL top hits    e value    % identity    Alignment
A0A0A0L861 Uncharacterized protein    3.1e-64    53.42
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MVRN  PICRISVSST EAVP+KM+D+S+ YP+VKVREE++LDD PVVYEQKRSYLL LKDLESL LQDSSN+P                          
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQ--ERR
                                      G+V                                KEHRVS  S AKIPKA S N IKPSTSE Q  ER 
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQ--ERR

Query:  SKMADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRVGS
          + DEDNK NIRA SIPMPRAVVSSPEND MIGKKNRK+T+K SVLKN NSVQSRHSQCKI+A HS NEN IS  +SK+T +SKCRS+GK+GT  R GS
Subjt:  SKMADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRVGS

Query:  FVSKKAP
        F+SK  P
Subjt:  FVSKKAP

A0A6J1DUS8 uncharacterized protein LOC111024560    4.9e-70    54.37
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDL-DDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWS
        MV+NPTPICRISVSSTA+AVPKKM+D+S++YP+VKVREEKD  DDHP VYEQKRSYLL LKD ESL L+DSSNSP                         
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDL-DDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWS

Query:  KFPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRS
                                                                         GKEHRVSPSS+A+IPKA  PNVI+PS SESQERR 
Subjt:  KFPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRS

Query:  KMAD---EDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRV
        K  D   EDN +N RA SIPMPRAVVSSPENDIMIGKKNRK+T+K SVLKNHNSVQSRH+QCKI+ASHSGNENPI+  KSK+ A++K R +GKSGT +R 
Subjt:  KMAD---EDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRV

Query:  GSFVSKKAP
         SF+SKK P
Subjt:  GSFVSKKAP

A0A6J1FA75 uncharacterized protein LOC111443687 isoform X1    9.8e-71    55.78
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MV+NPTPICRIS       VP+ M+D+ +IYP+VKVR+E +LDDHP V EQKRSYLL LKDLESL L+DSS+S +  R   RASC  LGT + R ++WSK
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK
        FPPNLTENE                                                       GKEHRVSPSSTAK+PK  SPNV+KPSTSESQ  +  
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK

Query:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTASRVGSF
             N+VN RA SIPMPRAV+SSPEND+MIGKKNRK+T+K SVLKNHNSVQSRHSQCK VA HSGNENPI   KSKET  NSKCRS GK+   S  G  
Subjt:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTASRVGSF

Query:  VSK
        + K
Subjt:  VSK

A0A6J1FB22 uncharacterized protein LOC111443687 isoform X2    5.2e-72    56.11
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MV+NPTPICRIS       VP+ M+D+ +IYP+VKVR+E +LDDHP V EQKRSYLL LKDLESL L+DSS+S +  R   RASC  LGT + R ++WSK
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK
        FPPNLTENE                                                       GKEHRVSPSSTAK+PK  SPNV+KPSTSESQ++R  
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK

Query:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTASRVGSF
             N+VN RA SIPMPRAV+SSPEND+MIGKKNRK+T+K SVLKNHNSVQSRHSQCK VA HSGNENPI   KSKET  NSKCRS GK+   S  G  
Subjt:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTASRVGSF

Query:  VSK
        + K
Subjt:  VSK

A0A6J1FB38 uncharacterized protein LOC111443687 isoform X3    3.6e-73    56.54
Query:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK
        MV+NPTPICRIS       VP+ M+D+ +IYP+VKVR+E +LDDHP V EQKRSYLL LKDLESL L+DSS+S +  R   RASC  LGT + R ++WSK
Subjt:  MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSK

Query:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK
        FPPNLTENE                                                       GKEHRVSPSSTAK+PK  SPNV+KPSTSESQ  +  
Subjt:  FPPNLTENEGSRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSK

Query:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTASRVGSF
             N+VN RA SIPMPRAV+SSPEND+MIGKKNRK+T+K SVLKNHNSVQSRHSQCK VA HSGNENPI   KSKET  NSKCRS GK+G A    +F
Subjt:  MADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKET-ANSKCRSIGKSGTASRVGSF

Query:  VSKKAP
         SKK P
Subjt:  VSKKAP

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT4G21865.1 unknown protein    7.1e-05    43.82
Query:  AKIPKADSPNVIKPSTSESQERRSKM--ADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSV-LKNHNSVQSRHSQCKIV
        AKIPK   P+V+  S SES+E    +  AD + K   +A     PRAVVSSP+ND MIG  N    +K+   LK+ + ++SR SQ K V
Subjt:  AKIPKADSPNVIKPSTSESQERRSKM--ADEDNKVNIRACSIPMPRAVVSSPENDIMIGKKNRKSTQKSSV-LKNHNSVQSRHSQCKIV


Sequences
CDS sequence
ATGGTTCGAAACCCAACGCCCATATGCCGCATCTCTGTTTCTTCGACGGCCGAGGCTGTTCCTAAGAAGATGGAGGATCGATCCTCAATCTATCCGAGGGTGAAGGTGAG
GGAGGAGAAAGACCTCGATGATCACCCTGTTGTGTATGAGCAGAAGAGAAGTTATCTGTTGCCTTTGAAAGATTTGGAATCACTGATCCTTCAAGACTCCTCCAATTCTC
CAGATTCGCAAAGACGGACTAGAAGGGCATCTTGCCCTGATCTAGGGACAGCAAGCTGCAGACCTAAAATGTGGAGTAAGTTTCCACCAAATCTGACAGAAAATGAAGGT
TCGAGGAATCAGTGGTTGTCATCTTCAAAAGAACTCTTCCAAATGGAACACCAATTTGGTGGGAGGGTAGTACAAAATTGCCAGACACTCAAGATATCTCAAACTCTTCC
AAGGATAAAATCCCTCCAACTTCACCAATACCTTGGTTACCTCTCTTTTAGAGGAAAGGAGCATCGTGTTTCTCCATCGTCTACTGCTAAAATTCCGAAAGCTGACAGTC
CTAATGTAATCAAGCCATCCACTTCTGAATCTCAAGAAAGGAGGAGTAAAATGGCTGACGAGGACAACAAAGTAAATATTAGAGCTTGTTCAATCCCAATGCCACGTGCC
GTCGTATCCAGCCCAGAAAATGATATAATGATTGGGAAGAAAAACAGAAAATCAACTCAAAAATCATCAGTTTTGAAGAATCACAATTCAGTTCAATCCAGACACTCACA
GTGTAAGATCGTCGCGAGCCACAGTGGCAACGAAAATCCCATTTCCTTGATGAAATCCAAGGAAACTGCCAATAGTAAATGTCGATCCATTGGAAAGAGTGGTACGGCCT
CCAGGGTTGGTAGTTTTGTGTCTAAGAAGGCTCCTTAG
mRNA sequence
ATGGTTCGAAACCCAACGCCCATATGCCGCATCTCTGTTTCTTCGACGGCCGAGGCTGTTCCTAAGAAGATGGAGGATCGATCCTCAATCTATCCGAGGGTGAAGGTGAG
GGAGGAGAAAGACCTCGATGATCACCCTGTTGTGTATGAGCAGAAGAGAAGTTATCTGTTGCCTTTGAAAGATTTGGAATCACTGATCCTTCAAGACTCCTCCAATTCTC
CAGATTCGCAAAGACGGACTAGAAGGGCATCTTGCCCTGATCTAGGGACAGCAAGCTGCAGACCTAAAATGTGGAGTAAGTTTCCACCAAATCTGACAGAAAATGAAGGT
TCGAGGAATCAGTGGTTGTCATCTTCAAAAGAACTCTTCCAAATGGAACACCAATTTGGTGGGAGGGTAGTACAAAATTGCCAGACACTCAAGATATCTCAAACTCTTCC
AAGGATAAAATCCCTCCAACTTCACCAATACCTTGGTTACCTCTCTTTTAGAGGAAAGGAGCATCGTGTTTCTCCATCGTCTACTGCTAAAATTCCGAAAGCTGACAGTC
CTAATGTAATCAAGCCATCCACTTCTGAATCTCAAGAAAGGAGGAGTAAAATGGCTGACGAGGACAACAAAGTAAATATTAGAGCTTGTTCAATCCCAATGCCACGTGCC
GTCGTATCCAGCCCAGAAAATGATATAATGATTGGGAAGAAAAACAGAAAATCAACTCAAAAATCATCAGTTTTGAAGAATCACAATTCAGTTCAATCCAGACACTCACA
GTGTAAGATCGTCGCGAGCCACAGTGGCAACGAAAATCCCATTTCCTTGATGAAATCCAAGGAAACTGCCAATAGTAAATGTCGATCCATTGGAAAGAGTGGTACGGCCT
CCAGGGTTGGTAGTTTTGTGTCTAAGAAGGCTCCTTAG
Protein sequence
MVRNPTPICRISVSSTAEAVPKKMEDRSSIYPRVKVREEKDLDDHPVVYEQKRSYLLPLKDLESLILQDSSNSPDSQRRTRRASCPDLGTASCRPKMWSKFPPNLTENEG
SRNQWLSSSKELFQMEHQFGGRVVQNCQTLKISQTLPRIKSLQLHQYLGYLSFRGKEHRVSPSSTAKIPKADSPNVIKPSTSESQERRSKMADEDNKVNIRACSIPMPRA
VVSSPENDIMIGKKNRKSTQKSSVLKNHNSVQSRHSQCKIVASHSGNENPISLMKSKETANSKCRSIGKSGTASRVGSFVSKKAP
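
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch for checking that, assuming the standard nuclear genetic code (the page does not say which table was used). The function name `translate` is hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 18 nt of the CDS listed above.
print(translate("ATGGTTCGAAACCCAACGCCCATATGCCGC"))  # MVRNPTPICR
print(translate("TCTAAGAAGGCTCCTTAG"))              # SKKAP
```

Joining the CDS lines into one string and passing the result to `translate` allows the full comparison against the listed protein sequence.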