; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005985 (gene) of Snake gourd v1 genome

Gene IDTan0005985
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransmembrane protein
Genome locationLG09:57446173..57451216
RNA-Seq ExpressionTan0005985
SyntenyTan0005985
Gene Ontology termsGO:0036503 - ERAD pathway (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587744.1 hypothetical protein SDJN03_16309, partial [Cucurbita argyrosperma subsp. sororia]5.7e-13488.93Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPEAIPSSSQ SSALKWQDDNLE+   QVPKFSSF TVP+G VQMIPIMYPALVPGSA  +NQNRGAGIYAVPAFPSMGGP+IGMSTNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        S +SNRTS EGGT  EENGRVEGQQ+ QQQQPAPQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEA--QNAAFAEGENEPGNEGNRAVENENVAEPGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFHNHMD
        GMQRAAAPP PPRPGVRAE A VA PAAGQEA    AAFAEGEN+PGNE NRAVENENVAEPGA NGGLNWWGVVKEIQMIVFGFITSLLPGFHNHMD
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEA--QNAAFAEGENEPGNEGNRAVENENVAEPGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFHNHMD

TYK28583.1 uncharacterized protein E5676_scaffold629G002330 [Cucumis melo var. makuwa]1.5e-13486.84Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPE+IPSSSQSSSALKWQDDNLEQT  QVPK SSF T PNG+VQMIPIMYPALVPGSA SENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRTSPEGG+ VEENGRVEGQQ+ QQQQPAPQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
        GMQRAAAPP PPRPGVRAE A +A PAA QE QNAAFAEG      EN+P NE NR  ENENVAE   GAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH

Query:  NHMD
        NHMD
Subjt:  NHMD

XP_004146332.1 uncharacterized protein LOC101222970 isoform X1 [Cucumis sativus]1.2e-13486.84Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPE+IPSSSQSSSALKWQDDNLEQ+  QVPK  SF T PNG+VQMIPIMYPALVPGSA SENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRTSPEGG+ VEENGRVEGQQ+ QQQQP PQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
        GMQRAAAPP PPRPGVRA+ ALVA PAA QE QNAAFAEG      EN+P NE NR VENENVAE  PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH

Query:  NHMD
        NHMD
Subjt:  NHMD

XP_008453568.1 PREDICTED: uncharacterized protein LOC103494241 isoform X1 [Cucumis melo]2.2e-13386.51Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPE+IPSSSQSSSALKWQDDNLEQT  QVPK SSF T PNG+VQMIPIMYPALVPGSA SENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRTSPEGG+ VEENGRVEGQQ+ QQQQ APQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
        GMQRAAAPP PPRPGVRAE A +A PAA QE QNAAFAEG      EN+P NE NR  ENENVAE   GAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH

Query:  NHMD
        NHMD
Subjt:  NHMD

XP_038880730.1 uncharacterized protein LOC120072333 isoform X1 [Benincasa hispida]2.9e-13888.74Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPEAIPSSSQSSS+LKWQ DNLEQT  QVPK SSF T PNG+VQMIPIMYPAL PGSA  ENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRT+PEGGTTVEENGRVEGQQ+QQQQQPAPQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAEPGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFHNH
        GMQRAAAPP PPRPGVRAE A VA P A QEAQNAAFAEG      EN+P NEGNRAVENENVAEPGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFHNH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAEPGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFHNH

Query:  MD
        MD
Subjt:  MD

TrEMBL top hitse value%identityAlignment
A0A0A0LU80 Uncharacterized protein5.6e-13586.84Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPE+IPSSSQSSSALKWQDDNLEQ+  QVPK  SF T PNG+VQMIPIMYPALVPGSA SENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRTSPEGG+ VEENGRVEGQQ+ QQQQP PQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
        GMQRAAAPP PPRPGVRA+ ALVA PAA QE QNAAFAEG      EN+P NE NR VENENVAE  PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH

Query:  NHMD
        NHMD
Subjt:  NHMD

A0A1S3BW01 uncharacterized protein LOC103494241 isoform X11.1e-13386.51Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPE+IPSSSQSSSALKWQDDNLEQT  QVPK SSF T PNG+VQMIPIMYPALVPGSA SENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRTSPEGG+ VEENGRVEGQQ+ QQQQ APQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
        GMQRAAAPP PPRPGVRAE A +A PAA QE QNAAFAEG      EN+P NE NR  ENENVAE   GAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH

Query:  NHMD
        NHMD
Subjt:  NHMD

A0A1S3BWN3 uncharacterized protein LOC103494241 isoform X29.9e-13286.18Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPE+IPSSSQSSSALKWQDDNLEQT  QVPK SSF T PNG+VQMIPIMYPALVPGSA SENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI T 
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRTSPEGG+ VEENGRVEGQQ+ QQQQ APQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
        GMQRAAAPP PPRPGVRAE A +A PAA QE QNAAFAEG      EN+P NE NR  ENENVAE   GAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH

Query:  NHMD
        NHMD
Subjt:  NHMD

A0A5A7UX77 Uncharacterized protein1.1e-13386.51Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPE+IPSSSQSSSALKWQDDNLEQT  QVPK SSF T PNG+VQMIPIMYPALVPGSA SENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRTSPEGG+ VEENGRVEGQQ+ QQQQ APQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
        GMQRAAAPP PPRPGVRAE A +A PAA QE QNAAFAEG      EN+P NE NR  ENENVAE   GAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH

Query:  NHMD
        NHMD
Subjt:  NHMD

A0A5D3DXX7 Uncharacterized protein7.3e-13586.84Show/hide
Query:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR
        GPE+IPSSSQSSSALKWQDDNLEQT  QVPK SSF T PNG+VQMIPIMYPALVPGSA SENQNRGAGIYAVP+FPSMGGPIIGM+TNNLIPLTYSI TR
Subjt:  GPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTR

Query:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ
        SD SNRTSPEGG+ VEENGRVEGQQ+ QQQQPAPQRQVV RRFQIAIQIDL LILKLAAVIFLVHQDGSRQRLIVLVICAS+VYLYQTGALTPLIRWLSQ
Subjt:  SDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQ

Query:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
        GMQRAAAPP PPRPGVRAE A +A PAA QE QNAAFAEG      EN+P NE NR  ENENVAE   GAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH
Subjt:  GMQRAAAPPQPPRPGVRAEIALVAAPAAGQEAQNAAFAEG------ENEPGNEGNRAVENENVAE--PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFH

Query:  NHMD
        NHMD
Subjt:  NHMD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29960.1 unknown protein2.5e-6654.52Show/hide
Query:  PEAIPSS--SQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGS---ALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYS
        PE + SS   Q S   K +D  ++  P      S F   PNGD  M P+ YP LVPGS      E  NRGAGIYAVP     GG + G+ +N LIPLTY+
Subjt:  PEAIPSS--SQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGS---ALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYS

Query:  ISTRSDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIR
        + T       T P          + +  Q QQQQ PA QR VV RRFQIA Q+DLFLILKLAAVIFL +QDGSRQRL VLVI A+I+YLYQTGAL P +R
Subjt:  ISTRSDASNRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIR

Query:  WLSQGMQRAAAPP-QPPRPGVRAEIALVAAPAAGQEAQNAAFAEGENEPGNEGNRAVENENVAE-PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFHN
        WLSQGM RAA PP +P RP VRA+      PAA       A  EGE    + GNRA  N N  E   AGN G  WWG+VKEIQMIVFGFITSLLPGFHN
Subjt:  WLSQGMQRAAAPP-QPPRPGVRAEIALVAAPAAGQEAQNAAFAEGENEPGNEGNRAVENENVAE-PGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFHN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTCCACTGGCGGAGGACCGGAAGCGATCCCTTCAAGCTCTCAATCCTCTTCTGCTCTGAAATGGCAGGATGACAATCTTGAACAGACCCCGGTGCAGGTTCCTAA
ATTTTCCAGTTTCTCCACTGTTCCGAATGGTGATGTGCAAATGATTCCAATCATGTATCCTGCACTTGTTCCTGGATCCGCTCTTTCGGAAAATCAAAATCGTGGAGCTG
GTATCTATGCAGTTCCTGCTTTTCCATCAATGGGGGGACCCATTATTGGAATGTCAACTAACAATCTTATTCCTTTGACTTACAGCATATCCACTAGGTCTGATGCCAGT
AATAGAACAAGTCCTGAGGGTGGCACAACAGTCGAGGAGAATGGGCGAGTTGAAGGACAACAACGGCAACAGCAGCAGCAACCAGCACCTCAAAGACAAGTTGTTGCGAG
AAGATTTCAAATTGCAATACAGATTGATTTGTTCCTCATATTGAAGCTTGCTGCTGTAATTTTTCTTGTTCATCAAGATGGTTCAAGACAAAGGCTTATTGTTCTGGTGA
TTTGTGCTTCAATAGTCTATCTATATCAAACTGGGGCGCTTACACCATTGATACGATGGCTATCGCAAGGCATGCAAAGGGCAGCTGCACCACCCCAACCTCCAAGGCCA
GGAGTTCGTGCAGAGATTGCTTTAGTAGCTGCTCCAGCTGCAGGGCAAGAGGCTCAGAATGCTGCTTTTGCAGAGGGTGAGAATGAACCTGGAAATGAAGGGAACCGAGC
GGTTGAAAATGAGAATGTGGCAGAGCCTGGTGCTGGAAATGGTGGTCTCAACTGGTGGGGAGTGGTGAAGGAAATCCAGATGATAGTGTTCGGCTTTATTACTTCCTTGC
TCCCAGGCTTCCATAACCACATGGACTAG
mRNA sequenceShow/hide mRNA sequence
CTTGCTCACAGTACAAACACGAACCTCTCTCTCACTTTTCCCCTATTTTTTCTCTGTCTCTCTCGATTTCTATTCACTTTTCGCCCTTTTCTCTTTCGCTTGAAGGATCT
ACAGTCTTCTCCCTTCTCCGGCTCCGGCGACTCCTCAAGTACCGCCGATCATCACCGACTCCACGTATGACTTCCACTGGCGGAGGACCGGAAGCGATCCCTTCAAGCTC
TCAATCCTCTTCTGCTCTGAAATGGCAGGATGACAATCTTGAACAGACCCCGGTGCAGGTTCCTAAATTTTCCAGTTTCTCCACTGTTCCGAATGGTGATGTGCAAATGA
TTCCAATCATGTATCCTGCACTTGTTCCTGGATCCGCTCTTTCGGAAAATCAAAATCGTGGAGCTGGTATCTATGCAGTTCCTGCTTTTCCATCAATGGGGGGACCCATT
ATTGGAATGTCAACTAACAATCTTATTCCTTTGACTTACAGCATATCCACTAGGTCTGATGCCAGTAATAGAACAAGTCCTGAGGGTGGCACAACAGTCGAGGAGAATGG
GCGAGTTGAAGGACAACAACGGCAACAGCAGCAGCAACCAGCACCTCAAAGACAAGTTGTTGCGAGAAGATTTCAAATTGCAATACAGATTGATTTGTTCCTCATATTGA
AGCTTGCTGCTGTAATTTTTCTTGTTCATCAAGATGGTTCAAGACAAAGGCTTATTGTTCTGGTGATTTGTGCTTCAATAGTCTATCTATATCAAACTGGGGCGCTTACA
CCATTGATACGATGGCTATCGCAAGGCATGCAAAGGGCAGCTGCACCACCCCAACCTCCAAGGCCAGGAGTTCGTGCAGAGATTGCTTTAGTAGCTGCTCCAGCTGCAGG
GCAAGAGGCTCAGAATGCTGCTTTTGCAGAGGGTGAGAATGAACCTGGAAATGAAGGGAACCGAGCGGTTGAAAATGAGAATGTGGCAGAGCCTGGTGCTGGAAATGGTG
GTCTCAACTGGTGGGGAGTGGTGAAGGAAATCCAGATGATAGTGTTCGGCTTTATTACTTCCTTGCTCCCAGGCTTCCATAACCACATGGACTAGAACATTACTGGAAGG
TCATGAACCGATCTGAAGTTCGAAACAGTTTTTCATTTTTCTTTTTCTTTGAACAAACAACTCAAAAAGTTATGAAGCATGATAACGGATCTATCTGTATAAACTGGAAT
TTAGCAATTAATTATGGGTTTTGCTTCATTTTAAAAAGGACATTTTTGTCTTTTCATTCCCTCCTTCCGTGAGGTTTCTTTGTAAATGAGTTTTTTTTTTAATGAATTTC
TTCATTTTTCCCCCCTAAAA
Protein sequenceShow/hide protein sequence
MTSTGGGPEAIPSSSQSSSALKWQDDNLEQTPVQVPKFSSFSTVPNGDVQMIPIMYPALVPGSALSENQNRGAGIYAVPAFPSMGGPIIGMSTNNLIPLTYSISTRSDAS
NRTSPEGGTTVEENGRVEGQQRQQQQQPAPQRQVVARRFQIAIQIDLFLILKLAAVIFLVHQDGSRQRLIVLVICASIVYLYQTGALTPLIRWLSQGMQRAAAPPQPPRP
GVRAEIALVAAPAAGQEAQNAAFAEGENEPGNEGNRAVENENVAEPGAGNGGLNWWGVVKEIQMIVFGFITSLLPGFHNHMD