CuGenDBv2

Tan0003015 (gene) of Snake gourd v1 genome

Gene ID: Tan0003015
Organism: Trichosanthes anguina (Snake gourd v1)
Description: ABC transporter ABCE
Genome location: LG06:79806068..79810801
RNA-Seq Expression: Tan0003015
Synteny: Tan0003015
Gene Ontology terms:
GO:0016020 - membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
GO:0051536 - iron-sulfur cluster binding (molecular function)
InterPro domains: NA
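
The genome location above is written in a `scaffold:start..end` form. The following is a minimal Python sketch of how such a string could be parsed. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` and `Location` are hypothetical names used only for illustration, not part of any CuGenDB interface.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval parsed from a 'seqid:start..end' string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'LG06:79806068..79810801' (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG06:79806068..79810801")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval on LG06 covers 4734 bp, which includes introns and untranslated regions as well as the coding sequence.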


Homology
GenBank top hits (hit, e value, % identity, alignment)
KAG6575112.1 hypothetical protein SDJN03_25751, partial [Cucurbita argyrosperma subsp. sororia] | e value: 5.7e-142 | % identity: 88.42
Query:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        M LSLSCHAAL LQHQVAS N+SNKNLD+VR LV RIGIASVQSSPLESLR+GNW+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL
        IQAAR IV VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISL EE M E S+VAS+ G LKGGVITERCYGCGRCSPVCPYDKI L
Subjt:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL

Query:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV
         TYVRDAATTA+LIK  DVDALEIHTNGRQTTPFQELWDKLGDSSKYLRL+AVSLPNIGDLT+STMKTM+S+MES+L C NLWQ+
Subjt:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV

KAG7013681.1 hypothetical protein SDJN02_23848 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 6.8e-143 | % identity: 88.77
Query:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        M LSLSCHAAL LQHQVAS N+SNKNLD+VR LV RIGIASVQSSPLESLR+GNW+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL
        IQAAR IV VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISL EE M E S+VAS+ G LKGGVITERCYGCGRCSPVCPYDKI L
Subjt:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL

Query:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV
         TYVRDAATTA+LIKR DVDALEIHTNGRQTTPFQELWDKLGDSSKYLRL+AVSLPNIGDLT+STMKTM+S+MES+L C NLWQ+
Subjt:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV

XP_023547405.1 uncharacterized protein LOC111806365 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 2.0e-142 | % identity: 88.77
Query:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        M LSLSCHAAL LQHQVAS N+SNKNLD+VR LV RIGIASVQSSPLESLR+GNW+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL
        IQAAR IV VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISL EE M E S+VAS+ G LKGGVITERCYGCGRCSPVCPYDKI L
Subjt:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL

Query:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV
         TYVRDAATTAELIKR DVDALEIHTNGRQTT FQELWDKLGDSSKYLRL+AVSLPNIGDLT+STMKTM+S+MES+L C NLWQ+
Subjt:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV

XP_023547406.1 uncharacterized protein LOC111806365 isoform X2 [Cucurbita pepo subsp. pepo] | e value: 2.0e-142 | % identity: 88.77
Query:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        M LSLSCHAAL LQHQVAS N+SNKNLD+VR LV RIGIASVQSSPLESLR+GNW+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL
        IQAAR IV VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISL EE M E S+VAS+ G LKGGVITERCYGCGRCSPVCPYDKI L
Subjt:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL

Query:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV
         TYVRDAATTAELIKR DVDALEIHTNGRQTT FQELWDKLGDSSKYLRL+AVSLPNIGDLT+STMKTM+S+MES+L C NLWQ+
Subjt:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV

XP_038906651.1 uncharacterized protein LOC120092590 [Benincasa hispida] | e value: 5.3e-148 | % identity: 91.58
Query:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        M LSLSCHA LHLQHQVAS+NNSNKNL++VR+LV RIGIASVQSSPL+SLRNG+WVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL
        IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISL EETM+E SQVASV G LKGGV+TERCYGCGRCSPVCPYDKI L
Subjt:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL

Query:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV
         TYVRDAATTA+LIKR DVDALEIHTNGRQTTPFQELWDKLGDSSKYLRL+AVSLPNIGDLTVSTMKTM+S+MES+LHCLNLWQ+
Subjt:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV

TrEMBL top hits (hit, e value, % identity, alignment)
A0A1S3C7M9 uncharacterized protein LOC103497790 isoform X2 | e value: 1.8e-141 | % identity: 88.89
Query:  MTLSLS-CHAALHL-QHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
        M LSLS CHA LHL QHQVAS+NNS+KNLD+VR+LV RIGIASVQSS L+SL+NGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
Subjt:  MTLSLS-CHAALHL-QHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN

Query:  EGIQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKI
        EGIQAARGI+GVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISL EE ++ELSQVA V G LKGGVITERCYGCGRCSPVCPYDKI
Subjt:  EGIQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKI

Query:  KLATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIG-DLTVSTMKTMYSMMESRLHCLNLWQV
         L TYVRDAATT +LIKR DVDALEIHTNGRQTT FQELWDKLGDSSKYLRL+AVSLPNIG DLTVSTMKTM+S+MES+LHCLNLWQ+
Subjt:  KLATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIG-DLTVSTMKTMYSMMESRLHCLNLWQV

A0A5D3BV06 Uncharacterized protein | e value: 1.8e-141 | % identity: 88.89
Query:  MTLSLS-CHAALHL-QHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
        M LSLS CHA LHL QHQVAS+NNS+KNLD+VR+LV RIGIASVQSS L+SL+NGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
Subjt:  MTLSLS-CHAALHL-QHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN

Query:  EGIQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKI
        EGIQAARGI+GVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISL EE ++ELSQVA V G LKGGVITERCYGCGRCSPVCPYDKI
Subjt:  EGIQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKI

Query:  KLATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIG-DLTVSTMKTMYSMMESRLHCLNLWQV
         L TYVRDAATT +LIKR DVDALEIHTNGRQTT FQELWDKLGDSSKYLRL+AVSLPNIG DLTVSTMKTM+S+MES+LHCLNLWQ+
Subjt:  KLATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIG-DLTVSTMKTMYSMMESRLHCLNLWQV

A0A6J1H6Q4 uncharacterized protein LOC111460121 | e value: 3.6e-142 | % identity: 88.42
Query:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        M LSLSCHAAL LQHQVAS N+SNKNLD+VR LV RIGIASVQSSPLESLR+GNW+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL
        IQAAR IV VRRPWVMISVND QDLHFRKAEFDPENCP+DCSRPCEIVCPANAISL EE M E S+VAS+ G LKGGVITERCYGCGRCSPVCPYDKI L
Subjt:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL

Query:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV
         TYVRDAATTA+LIKR DVDALEIHTNGRQTTPFQELWDKLGDSSKYLRL+AVSLPNIGDLT+STMKTM+S+MES+L C NLWQ+
Subjt:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV

A0A6J1KVU8 uncharacterized protein LOC111499161 isoform X1 | e value: 4.7e-142 | % identity: 87.37
Query:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        M LSLSCHAAL LQHQVAS+N+SNKNLD+VR LV RIGI+SVQSSPLESLR+GNW+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL
        IQAAR I+ VRRPWVMISVNDDQDLHFRKA FDPENCP+DCSRPCEIVCPANAISL +E M E S+VAS+ G LKGGVITERCYGCGRCSPVCPYDKI L
Subjt:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL

Query:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV
         TYVRDAATTA+LIKR DVDALEIHTNGRQTTPFQELW+KLGDSSKYLRL+AVSLPNIGDLT+STMKTM+S+MES+L CLNLWQ+
Subjt:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV

A0A6J1L4W8 uncharacterized protein LOC111499161 isoform X2 | e value: 4.7e-142 | % identity: 87.37
Query:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        M LSLSCHAAL LQHQVAS+N+SNKNLD+VR LV RIGI+SVQSSPLESLR+GNW+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL
        IQAAR I+ VRRPWVMISVNDDQDLHFRKA FDPENCP+DCSRPCEIVCPANAISL +E M E S+VAS+ G LKGGVITERCYGCGRCSPVCPYDKI L
Subjt:  IQAARGIVGVRRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKL

Query:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV
         TYVRDAATTA+LIKR DVDALEIHTNGRQTTPFQELW+KLGDSSKYLRL+AVSLPNIGDLT+STMKTM+S+MES+L CLNLWQ+
Subjt:  ATYVRDAATTAELIKRSDVDALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
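
The alignments above follow the usual BLAST-style layout, in which the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space otherwise. As a rough sketch, and assuming the % identity column is identical columns divided by total aligned columns (gaps included), the value could be recomputed from the query and middle lines as below. `percent_identity` is a hypothetical helper, and the input shown is a short made-up segment rather than a full alignment from this page.

```python
from typing import Iterable, Tuple


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Return % identity from (query_line, midline) pairs of one alignment.

    Assumption: identity = identical columns / aligned columns, where a
    letter in the midline marks an identical column and the query line
    (gaps included) gives the number of aligned columns.
    """
    aligned = 0
    identical = 0
    for query_line, midline in blocks:
        aligned += len(query_line)
        # Letters are identities; '+' and spaces are not counted.
        identical += sum(1 for ch in midline if ch.isalpha())
    if aligned == 0:
        raise ValueError("no aligned columns supplied")
    return round(100.0 * identical / aligned, 2)


# Illustrative segment only: 10 aligned columns, 8 identical, one '+'.
example = [("MTLSLSCHAA", "M LSLSCH+A")]
print(percent_identity(example))
```

The query line is used for the column count because trailing spaces in the middle line are often lost when a page is copied as text.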

Sequences
CDS sequence
ATGACTCTGAGCCTGTCCTGCCATGCCGCTCTTCACCTTCAACATCAAGTGGCTTCCAAGAACAACAGCAACAAGAACCTCGACAGCGTAAGAAACCTCGTAAAACGAAT
TGGAATTGCTTCAGTTCAATCTTCTCCTCTTGAATCTCTCCGAAATGGCAACTGGGTCAAGCTTATTTGCGGTGCAAGTTTCGAGGATGTGGTTGATATTAGGAATCTCT
CACTTGTTTACACCCTTGCTGGGGTTGATTGTATTGATTGTGCTGCTGATGCATCGGTTGTTAGTGCGGTGAATGAGGGAATTCAAGCAGCAAGAGGGATTGTTGGTGTT
CGTAGGCCTTGGGTGATGATTAGTGTTAATGATGATCAAGATCTTCACTTCCGCAAAGCTGAGTTTGATCCTGAGAATTGTCCAATTGACTGTTCAAGGCCTTGTGAAAT
TGTTTGCCCTGCTAATGCAATCTCACTACATGAAGAAACCATGCAAGAGCTTTCACAAGTAGCTAGTGTATTTGGAGGATTGAAGGGCGGAGTAATCACGGAGCGCTGTT
ATGGTTGTGGTCGTTGCTCTCCCGTCTGCCCATATGATAAAATAAAGCTAGCCACATATGTAAGAGATGCAGCTACTACTGCTGAACTTATAAAACGGAGCGATGTCGAT
GCATTGGAGATTCACACCAATGGAAGGCAAACCACTCCTTTTCAAGAACTTTGGGATAAATTAGGGGACTCATCCAAATATCTAAGGCTAATAGCAGTAAGCCTACCTAA
TATTGGGGATTTAACAGTATCTACAATGAAAACGATGTACTCGATGATGGAATCTCGGCTCCATTGTTTGAACTTATGGCAGGTCTGCCTTGAACTTCATAAAGCCAAAC
AAATTACTTTCACTTATATATCTCTATGA
mRNA sequence
GTTCATTTCTGTTGCTTGTATTCCAACATCCTTCTCTTCTCCAACAAAAGAAAGAAAAAATCAAAATGACTCTGAGCCTGTCCTGCCATGCCGCTCTTCACCTTCAACAT
CAAGTGGCTTCCAAGAACAACAGCAACAAGAACCTCGACAGCGTAAGAAACCTCGTAAAACGAATTGGAATTGCTTCAGTTCAATCTTCTCCTCTTGAATCTCTCCGAAA
TGGCAACTGGGTCAAGCTTATTTGCGGTGCAAGTTTCGAGGATGTGGTTGATATTAGGAATCTCTCACTTGTTTACACCCTTGCTGGGGTTGATTGTATTGATTGTGCTG
CTGATGCATCGGTTGTTAGTGCGGTGAATGAGGGAATTCAAGCAGCAAGAGGGATTGTTGGTGTTCGTAGGCCTTGGGTGATGATTAGTGTTAATGATGATCAAGATCTT
CACTTCCGCAAAGCTGAGTTTGATCCTGAGAATTGTCCAATTGACTGTTCAAGGCCTTGTGAAATTGTTTGCCCTGCTAATGCAATCTCACTACATGAAGAAACCATGCA
AGAGCTTTCACAAGTAGCTAGTGTATTTGGAGGATTGAAGGGCGGAGTAATCACGGAGCGCTGTTATGGTTGTGGTCGTTGCTCTCCCGTCTGCCCATATGATAAAATAA
AGCTAGCCACATATGTAAGAGATGCAGCTACTACTGCTGAACTTATAAAACGGAGCGATGTCGATGCATTGGAGATTCACACCAATGGAAGGCAAACCACTCCTTTTCAA
GAACTTTGGGATAAATTAGGGGACTCATCCAAATATCTAAGGCTAATAGCAGTAAGCCTACCTAATATTGGGGATTTAACAGTATCTACAATGAAAACGATGTACTCGAT
GATGGAATCTCGGCTCCATTGTTTGAACTTATGGCAGGTCTGCCTTGAACTTCATAAAGCCAAACAAATTACTTTCACTTATATATCTCTATGATCTAAATTCTCAAACC
TATATTTGTCTTCACAATCATCGTCTCGTTTAACCATTGAATGAAGTTAGATGGACGGCCGATGAGTGGAGATATCGGACGAGGTGCCACGAGAGAAACAATTGCTTTTG
CTGCTCAATTAGCTCTTTCTAGTGACCGTCCTCCCGGTTCGTGTACAGGCTTATCCCATGAGGCTCATATTCATGTCATTTCACCAACAAGCTTCTTGTTTCCTCTGCAG
GCTTCTTTCAACTGGCTGGTGGCACAAATTTTCACACTGTTGATGGCTTGAAGAAAGAAAAACTTTTTCAAACCACCTCAATTCTCAAGAATTCGATGATCGAAGAATTA
TCAGAAAAATCACCCAGTTCATTACACGCGTTGATCGGTGGTATCGCTTACGGGGGCTATGCCCGAAAAATAATTGGAAGGGTTTTGAATTCAATGCAAACACAAAATGG
AGATGCTAACATTGAAGATTATCCGGATTATCTCTTGGCTGCACTTGTGGAAGCCTTGGCTTTGGTGGGAACTGTCAAATGTTATGATCCTTCTGTGATCAGCTCAGCAA
AAGCTAATTGATCCTTAAGTTTGTAGTTCGTTTCAGACCACACTTTCGATAGCCTGGTTGTCGAAGTTGTCGTTTTTATGTCGACAACGTTGGCAAATAAGTTCGAGAAT
CCTTAGGGAGCGACTTAGACCTCTTCGACAAACTCAATAGATCAATTTTCTGCCATGGAAGATCAGTCGAAGAAACAACCGTGGAGCATGCTTAGAATTCGAAGAATCGT
CAAAGCATCGCCTTGATACTTTGTTCTGAATTCAAGAGAATAATTTGAAGAATACAGCATTTTAACGGCTAGATAATGTAGCTTGATTGAACTAAAGTTTTGTGTAGCCC
ATTTACCACAAAAAAAAATCAATTTTTTTTTTTCCAAAAAAAAAATCAATTTTTTCCCATACCCCAACCACTTCAACAGTCGTCCTACAATTACCGAGCACT
Protein sequence
MTLSLSCHAALHLQHQVASKNNSNKNLDSVRNLVKRIGIASVQSSPLESLRNGNWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEGIQAARGIVGV
RRPWVMISVNDDQDLHFRKAEFDPENCPIDCSRPCEIVCPANAISLHEETMQELSQVASVFGGLKGGVITERCYGCGRCSPVCPYDKIKLATYVRDAATTAELIKRSDVD
ALEIHTNGRQTTPFQELWDKLGDSSKYLRLIAVSLPNIGDLTVSTMKTMYSMMESRLHCLNLWQVCLELHKAKQITFTYISL
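
The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal Python sketch that assumes the standard nuclear genetic code applies to this gene. `translate` is a hypothetical helper written for illustration, and only the first 27 nucleotides and the last 15 nucleotides of the CDS are used as input.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGACTCTGAGCCTGTCCTGCCATGCC"  # first 27 nt of the CDS above
cds_end = "TATATATCTCTATGA"                # last 15 nt of the CDS above
print(translate(cds_start))  # MTLSLSCHA
print(translate(cds_end))    # YISL
```

Both fragments agree with the start (MTLSLSCHA) and end (YISL) of the protein sequence listed above. Passing the full CDS, with its line breaks, to the same function would allow the whole protein to be compared.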