CuGenDBv2

Lsi05G017720 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G017720
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Unknown protein
Genome location: chr05:25048619..25054885
RNA-Seq Expression: Lsi05G017720
Synteny: Lsi05G017720
Gene Ontology terms: NA
InterPro domains: NA
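
The genome location string above can be split into its parts with a few lines of Python. This is a minimal sketch: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr05:25048619..25054885")
# Assumption: coordinates are 1-based and inclusive.
span = end - start + 1
print(chrom, start, end, span)
```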


Homology
GenBank top hits (e value, % identity, alignment)
XP_004134039.1 uncharacterized protein LOC101203442 [Cucumis sativus] | e value: 4.4e-80 | identity: 63.93%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS
        M R++IPICRISVSST EAVPEKMKDQSANYP+VKVR E++ DD P VYEQKRSYL SLKDLESL LQD+SN+PGK                        
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS

Query:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP
                                              KEH VS  S AKIPKACS NEIK  TSE QE ER C +VDEDNKANIRASSIPMPRA++S+P
Subjt:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP

Query:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP
        ENDQMIGKKNRKTTEKPSVLKN NSVQ RHSQCKIIA HS NEN ISSRRSKDT DSKCRS+GKNGT  RGGSFMSKTTP
Subjt:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP

XP_008438459.1 PREDICTED: uncharacterized protein LOC103483545 [Cucumis melo] | e value: 2.1e-82 | identity: 63.93%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS
        M R++IPICRISVSST EAVPEKMKD+SANYP+VKV+ E++ DD P VYEQKRSYLFSLKDLESL LQD+SN+PGK                        
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS

Query:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP
                                              KEHCVSP   AKIPKACS NEIK  TSE QE ERRCK+VDE+NKANIRASSIPMPRA+IS+P
Subjt:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP

Query:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP
        ENDQMIGKKNRKTT++PSVLKN NSVQ RHS CKIIASH GNENPISSRRSKDT DSKCRS+GKNGT   GGSFMSKTTP
Subjt:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP

XP_022157963.1 uncharacterized protein LOC111024560 [Momordica charantia] | e value: 6.8e-73 | identity: 58.36%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKD-SDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKP
        M ++  PICRISVSST +AVP+KMKDQS  YP+VKVR EKD  DD+PAVYEQKRSYL SLKD ESL L+D+SNSPGKEHRVSPS SA+IPKAC PN I+P
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKD-SDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKP

Query:  STFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISN
        S  ESQ  ERRCK VD                                                            +V+EDN  N RA+SIPMPRA++S+
Subjt:  STFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISN

Query:  PENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP
        PEND MIGKKNRKTTEKPSVLKN NSVQ RH+QCKI+ASHSGNENPI++R+SKD AD+K R +GK+GT  RG SFMSK TP
Subjt:  PENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP

XP_038895103.1 uncharacterized protein LOC120083415 isoform X1 [Benincasa hispida] | e value: 4.6e-85 | identity: 67.74%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS
        M R+S PICRISVSST EAVPEKMKDQ+ANYPRVKVR EK  DD+PAVYEQKRSYL SLKDLESLLLQD+SNSPGKEH VSPS SAKIPKA  PNEIKPS
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS

Query:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP
        T ESQ                                                                 ERRC+MVDEDNKANIRASSIPMPRAIIS+P
Subjt:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP

Query:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTT
        END MIGKKNRKTTEKPSVLKN NSVQ RHSQCKIIASHS NENPISSRRSK+TADSKCRSIGKNGTA RG SFMSKTT
Subjt:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTT

XP_038895110.1 uncharacterized protein LOC120083415 isoform X2 [Benincasa hispida] | e value: 7.8e-85 | identity: 67.74%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS
        M R+S PICRISVSST EAVPEKMKDQ+ANYPRVKVR EK  DD+PAVYEQKRSYL SLKDLESLLLQD+SNSPGKEH VSPS SAKIPKA  PNEIKPS
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS

Query:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP
        T ESQ                                                                 ERRC+MVDEDNKANIRASSIPMPRAIIS+P
Subjt:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP

Query:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTT
        END MIGKKNRKTTEKPSVLKN NSVQ RHSQCKIIASHS NENPISSRRSK+TADSKCRSIGKNGTA RG SFMSKTT
Subjt:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L861 Uncharacterized protein | e value: 2.1e-80 | identity: 63.93%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS
        M R++IPICRISVSST EAVPEKMKDQSANYP+VKVR E++ DD P VYEQKRSYL SLKDLESL LQD+SN+PGK                        
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS

Query:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP
                                              KEH VS  S AKIPKACS NEIK  TSE QE ER C +VDEDNKANIRASSIPMPRA++S+P
Subjt:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP

Query:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP
        ENDQMIGKKNRKTTEKPSVLKN NSVQ RHSQCKIIA HS NEN ISSRRSKDT DSKCRS+GKNGT  RGGSFMSKTTP
Subjt:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP

A0A1S3AW33 uncharacterized protein LOC103483545 | e value: 1.0e-82 | identity: 63.93%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS
        M R++IPICRISVSST EAVPEKMKD+SANYP+VKV+ E++ DD P VYEQKRSYLFSLKDLESL LQD+SN+PGK                        
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS

Query:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP
                                              KEHCVSP   AKIPKACS NEIK  TSE QE ERRCK+VDE+NKANIRASSIPMPRA+IS+P
Subjt:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP

Query:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP
        ENDQMIGKKNRKTT++PSVLKN NSVQ RHS CKIIASH GNENPISSRRSKDT DSKCRS+GKNGT   GGSFMSKTTP
Subjt:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP

A0A6J1DUS8 uncharacterized protein LOC111024560 | e value: 3.3e-73 | identity: 58.36%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKD-SDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKP
        M ++  PICRISVSST +AVP+KMKDQS  YP+VKVR EKD  DD+PAVYEQKRSYL SLKD ESL L+D+SNSPGKEHRVSPS SA+IPKAC PN I+P
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKD-SDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKP

Query:  STFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISN
        S  ESQ  ERRCK VD                                                            +V+EDN  N RA+SIPMPRA++S+
Subjt:  STFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISN

Query:  PENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP
        PEND MIGKKNRKTTEKPSVLKN NSVQ RH+QCKI+ASHSGNENPI++R+SKD AD+K R +GK+GT  RG SFMSK TP
Subjt:  PENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP

A0A6J1FB38 uncharacterized protein LOC111443687 isoform X3 | e value: 4.6e-59 | identity: 53.55%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS
        M ++  PICRIS       VPE MKDQ A YP+VKVR E + DD+PAV EQKRSYL SLKDLESL L+D+S+S  + HR   S                 
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS

Query:  TFESQEAERRCKMVDEDN-KANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISN
                  C ++   N +A + +     P  +  + GKEH VSPSS+AK+PK  SPN +K  TSESQ  ++RC      N+ N RASSIPMPRA++S+
Subjt:  TFESQEAERRCKMVDEDN-KANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISN

Query:  PENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDT-ADSKCRSIGKNGTACRGGSFMSKTTP
        PEND MIGKKNRKTTEKPSVLKN NSVQ RHSQCK +A HSGNENPI SR+SK+T  +SKCRS GKNG A    +F SK TP
Subjt:  PENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDT-ADSKCRSIGKNGTACRGGSFMSKTTP

A0A6J1FGL2 uncharacterized protein LOC111443687 isoform X6 | e value: 8.2e-56 | identity: 52.26%
Query:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS
        M ++  PICRIS       VPE MKDQ A YP+VKVR E + DD+PAV EQKRSYL SLKDLESL L+D+S+S GKEHRVSPS +AK+PK  SPN +KPS
Subjt:  MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPS

Query:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP
        T ESQ A++RC                                                                     N+ N RASSIPMPRA++S+P
Subjt:  TFESQEAERRCKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNP

Query:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDT-ADSKCRSIGKN
        END MIGKKNRKTTEKPSVLKN NSVQ RHSQCK +A HSGNENPI SR+SK+T  +SKCRS GKN
Subjt:  ENDQMIGKKNRKTTEKPSVLKNRNSVQPRHSQCKIIASHSGNENPISSRRSKDT-ADSKCRSIGKN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G21865.1 unknown protein | e value: 1.1e-04 | identity: 37.93%
Query:  AKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNPENDQMIGKKNRKTTEKPSV-LKNRNSVQPRHSQCK
        AKIPK   P+ + S  SES+E+ +  +  D + K   +AS    PRA++S+P+ND MIG  N     K    LK+++ ++ R SQ K
Subjt:  AKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNPENDQMIGKKNRKTTEKPSV-LKNRNSVQPRHSQCK
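
The reported identity for this Arabidopsis hit (37.93) can be reproduced from the alignment above. The sketch below assumes that identity is the number of letters on the middle line (exact matches; `+` marks a conservative substitution and is not counted) divided by the number of alignment columns. The function name `percent_identity` is hypothetical and is not part of any CuGenDB or BLAST tooling.

```python
def percent_identity(query_line, midline):
    """Percent identity: letters on the midline / alignment columns."""
    matches = sum(ch.isalpha() for ch in midline)
    return 100.0 * matches / len(query_line)


# Query and midline copied from the AT4G21865.1 alignment.
# Only the letters on the midline are counted, so its exact spacing does not matter.
query = ("AKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNPENDQMIGKK"
         "NRKTTEKPSV-LKNRNSVQPRHSQCK")
midline = ("AKIPK   P+ + S  SES+E+ +  +  D + K   +AS    PRA++S+P+ND MIG  "
           "N     K    LK+++ ++ R SQ K")

print(round(percent_identity(query, midline), 2))
```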


Sequences
CDS sequence
ATGGATCGAAGCTCAATACCCATATGCCGCATCTCTGTTTCGTCGACGGGTGAAGCTGTTCCTGAGAAGATGAAGGATCAATCCGCAAACTATCCGAGGGTGAAGGTGAG
AGGGGAGAAAGACTCGGATGATTATCCTGCTGTATATGAGCAGAAGAGAAGTTATTTGTTCTCTCTGAAAGATTTGGAATCACTCTTACTTCAAGACACCTCCAATTCTC
CAGGAAAGGAACATCGTGTCTCTCCATCATTCTCTGCTAAAATTCCGAAAGCTTGTAGTCCCAATGAAATCAAGCCATCCACCTTTGAATCTCAAGAAGCAGAAAGGAGG
TGTAAAATGGTTGATGAGGATAACAAAGCAAATATTAGAGCTAGTTCAATCCCAATGCCCCGTGTTGTCATATCCAGCCCTGGAAAGGAACATTGTGTTTCTCCATCATC
CTCTGCTAAAATTCCGAAAGCTTGCAGTCCCAATGAAATCAAGTCATTCACTTCTGAATCTCAAGAAGTAGAAAGGAGGTGTAAAATGGTTGATGAGGATAACAAAGCAA
ATATTAGAGCTAGTTCAATCCCAATGCCCCGTGCCATCATATCCAACCCTGAAAATGATCAAATGATTGGGAAGAAAAACAGAAAAACAACCGAAAAACCGTCAGTTTTG
AAGAATCGCAATTCAGTCCAGCCTAGACACTCACAATGTAAGATCATCGCTAGCCACAGTGGCAATGAGAATCCCATTTCCTCGAGGAGATCCAAGGATACTGCTGATAG
CAAATGTCGAAGCATTGGAAAGAATGGTACGGCCTGCAGGGGCGGCAGCTTCATGTCAAAGACGACTCCCTAG
mRNA sequence
CCTCCAATCACTTTTCACCATCTCAAATGCAAAACAATATTAAATAACAAACAAAATCAAAGATGATTTTGCAGATTCATTGAAACAGTCCAAACAAATAGGGAAAAGAA
GTGTATCAGTTGTGGGAAAACTCCTCATGAAGATCAAAAGCAACTGAACCCTACCAATCAAGAAAGCAAAAACAAAACCAAACTTCACCCAAATGCAGAAATATAAAAAG
GAATGGAGGAGATGAAAATGAAATGGCATCGTTGAAAAATTCCATTTTGATATTCAATAATCTTCTCTTCTGATTCCTGAACTCAACAGCAGCAAAAGCTTAAGGTTAAT
GGGGTCCGCCACTGCGAAGCTTTCAGGCAAATGCGCCCTTTGGCATTCGTTGACCCAACCATCACTTACACGGCCGATTTGAATCGTCATTCCTTTTTTTGAAAACCCCA
AAAGCCATTTCTCCCATAACTTGCTTCACTACTCTTTTTTCTGCCATTTCCTTTTTATCGCTTTGAAGATACGCTTTGAATACTTGATTTCTGGGTTTCTTCTTCTACTG
ATTCATTTTGATGGATCGAAGCTCAATACCCATATGCCGCATCTCTGTTTCGTCGACGGGTGAAGCTGTTCCTGAGAAGATGAAGGATCAATCCGCAAACTATCCGAGGG
TGAAGGTGAGAGGGGAGAAAGACTCGGATGATTATCCTGCTGTATATGAGCAGAAGAGAAGTTATTTGTTCTCTCTGAAAGATTTGGAATCACTCTTACTTCAAGACACC
TCCAATTCTCCAGGAAAGGAACATCGTGTCTCTCCATCATTCTCTGCTAAAATTCCGAAAGCTTGTAGTCCCAATGAAATCAAGCCATCCACCTTTGAATCTCAAGAAGC
AGAAAGGAGGTGTAAAATGGTTGATGAGGATAACAAAGCAAATATTAGAGCTAGTTCAATCCCAATGCCCCGTGTTGTCATATCCAGCCCTGGAAAGGAACATTGTGTTT
CTCCATCATCCTCTGCTAAAATTCCGAAAGCTTGCAGTCCCAATGAAATCAAGTCATTCACTTCTGAATCTCAAGAAGTAGAAAGGAGGTGTAAAATGGTTGATGAGGAT
AACAAAGCAAATATTAGAGCTAGTTCAATCCCAATGCCCCGTGCCATCATATCCAACCCTGAAAATGATCAAATGATTGGGAAGAAAAACAGAAAAACAACCGAAAAACC
GTCAGTTTTGAAGAATCGCAATTCAGTCCAGCCTAGACACTCACAATGTAAGATCATCGCTAGCCACAGTGGCAATGAGAATCCCATTTCCTCGAGGAGATCCAAGGATA
CTGCTGATAGCAAATGTCGAAGCATTGGAAAGAATGGTACGGCCTGCAGGGGCGGCAGCTTCATGTCAAAGACGACTCCCTAGAAACAAGAGACCAATTCTTGTGCTGAT
GATTTCGGGCATGTTTACCTTCCAATAGGCCAAGTATATAACACTTGAGTCAAGGCTTGGACTGGCTTTTAGTATCATTGTGGTGCTTTGTCCAATTGCTCAACAACCGA
TTCCTTGTTCATATCGTATCTTGGAATCAGCTTCCTCGCTAACATGAAGTACTTTATACCCATGTAGCCTGAGATGTTTTGAACTTGAAAACTTGAGACAAATGAAGTTC
TATACGAAACTGATCTCGACATGCATTTTTTTATCACTATGTTGTTTGCAACCAAGCTTCTGTCATTTTAGAATTCCATTTATCTCCATATTTAATAACTTTACAACATA
ACAGATTCTATCAAAATGAGGATTTGAACGATCTCCAGAAAGAAAGATGTGCCCACTATTGTTTACTTCAATCCAACTACACCCTGGCTGTTTCTTAACTCCATTATTCT
TCATTCTTGCCAAAATCCTTTCAGCATCTTCTCTTTTTCCACTTCTCAAATACATTTCTGCTAAAATCAAATATACCCCCGAGTTATAAGGCTCTTTTTCCAGGACCTTT
TCACCTGCAATCACCCCTACATCATAATTCTTATGGATTCTGCAAGCTCCAAGCAAAGCCCCCCAGGCACTTGGAGGAACTTCAATTTCCTCTTCTTTCATTTCGACTAG
AAAACTTAATGCCTCATCAATAAGCCCAAATCTCCCAAACAAGTCAACTAAGCATGTATAATGCTCAATCAATGGCTGAAGACAACATTCATTTTTCATAAAATCGAAAT
AATATCTACCTTGGTCCACCAAACCTTTATGGCTACAGGCAGACAGTACACCAATAAAGGTTATATGATTAGGTTCTATGTTAGCTAATCTCATTTTTTCAAACATGTCC
AGAGCTTCTTCGCCATTTCCATGGTGAGCAAACCCACAAATGATAGAATTCCAAGAAATTACATCTCTGTTCAACATGGAAGAGAACTCCATCAAGGCACAATCCATGTT
TCCACATCTTGCATACATATTAACCATAGCATTTGAGACCGCAACAAAGCCATTGAATCCTTCTTTTAGAATAAGTGCATGTGTTTGTCTACCAAGCTGCA
Protein sequence
MDRSSIPICRISVSSTGEAVPEKMKDQSANYPRVKVRGEKDSDDYPAVYEQKRSYLFSLKDLESLLLQDTSNSPGKEHRVSPSFSAKIPKACSPNEIKPSTFESQEAERR
CKMVDEDNKANIRASSIPMPRVVISSPGKEHCVSPSSSAKIPKACSPNEIKSFTSESQEVERRCKMVDEDNKANIRASSIPMPRAIISNPENDQMIGKKNRKTTEKPSVL
KNRNSVQPRHSQCKIIASHSGNENPISSRRSKDTADSKCRSIGKNGTACRGGSFMSKTTP
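
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; `translate` is a hypothetical helper, and only the first 21 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and the last five codons of the CDS listed above.
print(translate("ATGGATCGAAGCTCAATACCC"))  # start of the protein
print(translate("AAGACGACTCCCTAG"))        # end of the protein, before the stop codon
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would extend the same check to the whole gene model.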