; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015595 (gene) of Snake gourd v1 genome

Gene IDTan0015595
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationLG09:67978104..67983512
RNA-Seq ExpressionTan0015595
SyntenyTan0015595
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134834.1 uncharacterized protein LOC111007009 [Momordica charantia]1.7e-6078.92Show/hide
Query:  MAVLSSPLCSWSP---HRP-SCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRV
        M+VLSS LC+  P   HRP SCSSSSS SISWL+  SS SFSFLR Q SVP  SCFL R  I VSNV THQ+TI VDKSKLRVSEGTS+ ELWAAACLRV
Subjt:  MAVLSSPLCSWSP---HRP-SCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRV

Query:  RTFNQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        RTFN+FRP+SYGI+DHK+YLAEHEYEAIKER AGKRV FKRVSCINATLP AEISTLADDLC+TCK
Subjt:  RTFNQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

XP_022929037.1 uncharacterized protein LOC111435752 isoform X1 [Cucurbita moschata]8.7e-6584.66Show/hide
Query:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF
        MAVLSS PLC+W  HRPS SSSSSSS+SWL+ FSS SFSF  TQL VP  +CF    PI VSNVFTHQRTITVDKSKLRVSE TSKDELWAAACLRVRTF
Subjt:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF

Query:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        NQFRPDSY IDDHKRYLAE+EYEAI+ERIAGKRVSFKRVSCINATLP AEISTLADDLCSTCK
Subjt:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

XP_022929046.1 uncharacterized protein LOC111435752 isoform X2 [Cucurbita moschata]2.5e-5678.53Show/hide
Query:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF
        MAVLSS PLC+W  HRPS SSSSS              SF  TQL VP  +CF    PI VSNVFTHQRTITVDKSKLRVSE TSKDELWAAACLRVRTF
Subjt:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF

Query:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        NQFRPDSY IDDHKRYLAE+EYEAI+ERIAGKRVSFKRVSCINATLP AEISTLADDLCSTCK
Subjt:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

XP_022969109.1 uncharacterized protein LOC111468198 isoform X1 [Cucurbita maxima]1.9e-5678.53Show/hide
Query:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF
        MAVLSS PLC+W  HRPS SSSSS               F  TQL VP  +CF  R PI VSNVFTHQRTITVDKSKLRVSE TSKDELWAAACLRVRTF
Subjt:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF

Query:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        NQFRPDSY IDDHKRYLAE+EYEAI+ERIAGKRVSFKRVSCINATLP AEISTLADDLCSTCK
Subjt:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

XP_023531978.1 uncharacterized protein LOC111794081 isoform X1 [Cucurbita pepo subsp. pepo]2.8e-6384.05Show/hide
Query:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF
        MAVLSS PLC+W  HRPS SSSSSSS SWL+ FSS SFSF  TQL VP  +CF    PI VSNV THQRTITVDKSKLRVSE TSKDELWAAACLRVRTF
Subjt:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF

Query:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        NQFRPDSY IDDHKRYLAE+EYEAI+ERIAGKRVSFKRVSCINATLP AEISTLADDLCSTCK
Subjt:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

TrEMBL top hitse value%identityAlignment
A0A1S3BJ79 uncharacterized protein LOC103490280 isoform X11.6e-4870.73Show/hide
Query:  MAVLSSPLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGI-SCFLSRRPITVSNVFTH-QRTITVDKSKLRVSEGTSKDELWAAACLRVRT
        M VLSSP     P     SSSSSSSIS L+ FSS SFS LRT+ SVP   SCFL+R  I +SN+FT+ Q+TIT+  S  RVSEGTS DELWAAA LRVRT
Subjt:  MAVLSSPLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGI-SCFLSRRPITVSNVFTH-QRTITVDKSKLRVSEGTSKDELWAAACLRVRT

Query:  FNQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        FNQF PDS+ I DHK+YLAEHE+EA+KERIAGKRV FKRVSCINATLP +EISTLA+DLCSTCK
Subjt:  FNQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

A0A6J1BZW7 uncharacterized protein LOC1110070098.2e-6178.92Show/hide
Query:  MAVLSSPLCSWSP---HRP-SCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRV
        M+VLSS LC+  P   HRP SCSSSSS SISWL+  SS SFSFLR Q SVP  SCFL R  I VSNV THQ+TI VDKSKLRVSEGTS+ ELWAAACLRV
Subjt:  MAVLSSPLCSWSP---HRP-SCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRV

Query:  RTFNQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        RTFN+FRP+SYGI+DHK+YLAEHEYEAIKER AGKRV FKRVSCINATLP AEISTLADDLC+TCK
Subjt:  RTFNQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

A0A6J1ELM7 uncharacterized protein LOC111435752 isoform X14.2e-6584.66Show/hide
Query:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF
        MAVLSS PLC+W  HRPS SSSSSSS+SWL+ FSS SFSF  TQL VP  +CF    PI VSNVFTHQRTITVDKSKLRVSE TSKDELWAAACLRVRTF
Subjt:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF

Query:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        NQFRPDSY IDDHKRYLAE+EYEAI+ERIAGKRVSFKRVSCINATLP AEISTLADDLCSTCK
Subjt:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

A0A6J1EM00 uncharacterized protein LOC111435752 isoform X21.2e-5678.53Show/hide
Query:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF
        MAVLSS PLC+W  HRPS SSSSS              SF  TQL VP  +CF    PI VSNVFTHQRTITVDKSKLRVSE TSKDELWAAACLRVRTF
Subjt:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF

Query:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        NQFRPDSY IDDHKRYLAE+EYEAI+ERIAGKRVSFKRVSCINATLP AEISTLADDLCSTCK
Subjt:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

A0A6J1I1L4 uncharacterized protein LOC111468198 isoform X19.4e-5778.53Show/hide
Query:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF
        MAVLSS PLC+W  HRPS SSSSS               F  TQL VP  +CF  R PI VSNVFTHQRTITVDKSKLRVSE TSKDELWAAACLRVRTF
Subjt:  MAVLSS-PLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTF

Query:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK
        NQFRPDSY IDDHKRYLAE+EYEAI+ERIAGKRVSFKRVSCINATLP AEISTLADDLCSTCK
Subjt:  NQFRPDSYGIDDHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G28030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.5e-3042.33Show/hide
Query:  CSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTFNQFRPDSYGIDDHKRYLA
        CSS  SSS S        +    R+ LS+P +   L  RP+  S   +H     +DKS   +SE  S+DELWAAACLRVRTFN+  P +Y I DH+RYLA
Subjt:  CSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTFNQFRPDSYGIDDHKRYLA

Query:  EHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCKVC------LFLYCSNMNSVSWAPFLDHFRNFKPLTVNVQLVMIF
        E E+EA+KER +GKR  F RV+CINATLP +++S+  +DLCS CK        + +   ++N   W P  D     KP  + V     +
Subjt:  EHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCKVC------LFLYCSNMNSVSWAPFLDHFRNFKPLTVNVQLVMIF

AT4G28030.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.5e-3042.33Show/hide
Query:  CSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTFNQFRPDSYGIDDHKRYLA
        CSS  SSS S        +    R+ LS+P +   L  RP+  S   +H     +DKS   +SE  S+DELWAAACLRVRTFN+  P +Y I DH+RYLA
Subjt:  CSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTFNQFRPDSYGIDDHKRYLA

Query:  EHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCKVC------LFLYCSNMNSVSWAPFLDHFRNFKPLTVNVQLVMIF
        E E+EA+KER +GKR  F RV+CINATLP +++S+  +DLCS CK        + +   ++N   W P  D     KP  + V     +
Subjt:  EHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCKVC------LFLYCSNMNSVSWAPFLDHFRNFKPLTVNVQLVMIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGTCCTCTCATCTCCTCTTTGCAGTTGGAGCCCTCATCGGCCTTCTTGTTCATCATCGTCATCTTCCTCCATTTCTTGGCTTACATTGTTCTCTTCCAAATCTTT
CTCCTTCCTTAGAACCCAACTATCTGTGCCTGGAATCTCTTGTTTTCTTAGTCGTCGACCGATTACAGTCTCAAATGTTTTCACCCACCAACGGACGATCACAGTCGACA
AATCTAAGTTGAGGGTCTCCGAAGGTACCTCCAAGGATGAGCTCTGGGCTGCTGCTTGTCTCCGCGTTCGCACCTTTAATCAGTTCCGCCCCGATTCCTATGGCATCGAC
GATCATAAGAGGTACTTGGCAGAGCATGAATATGAAGCAATTAAAGAGCGTATTGCTGGAAAAAGGGTTAGCTTTAAAAGGGTATCTTGCATAAATGCTACTCTTCCATC
AGCCGAAATATCAACCCTAGCTGATGATTTATGTTCAACATGTAAGGTCTGTTTATTTCTTTATTGTTCCAATATGAATAGTGTGTCATGGGCTCCATTCCTGGATCACT
TTAGGAATTTCAAACCTTTAACTGTCAATGTTCAGTTGGTTATGATATTTGAAGGATCAGATAAAATCATTATCTGA
mRNA sequenceShow/hide mRNA sequence
GGACGAGGAGCCTAGGAATATCCTAAAACACCGAACGCGGAGTGAACGAGAGATGGCTGTCCTCTCATCTCCTCTTTGCAGTTGGAGCCCTCATCGGCCTTCTTGTTCAT
CATCGTCATCTTCCTCCATTTCTTGGCTTACATTGTTCTCTTCCAAATCTTTCTCCTTCCTTAGAACCCAACTATCTGTGCCTGGAATCTCTTGTTTTCTTAGTCGTCGA
CCGATTACAGTCTCAAATGTTTTCACCCACCAACGGACGATCACAGTCGACAAATCTAAGTTGAGGGTCTCCGAAGGTACCTCCAAGGATGAGCTCTGGGCTGCTGCTTG
TCTCCGCGTTCGCACCTTTAATCAGTTCCGCCCCGATTCCTATGGCATCGACGATCATAAGAGGTACTTGGCAGAGCATGAATATGAAGCAATTAAAGAGCGTATTGCTG
GAAAAAGGGTTAGCTTTAAAAGGGTATCTTGCATAAATGCTACTCTTCCATCAGCCGAAATATCAACCCTAGCTGATGATTTATGTTCAACATGTAAGGTCTGTTTATTT
CTTTATTGTTCCAATATGAATAGTGTGTCATGGGCTCCATTCCTGGATCACTTTAGGAATTTCAAACCTTTAACTGTCAATGTTCAGTTGGTTATGATATTTGAAGGATC
AGATAAAATCATTATCTGAATTCTGGCAGTTCTTAAGCTTTTTTATATGAAAAATGATGTCGAGAAGTATGCGATAATAAAAAATTGAAGAGGCTGTTTAAAGAAATAGC
CTTCATTTCAGAAAATTAAAAACATTAGGTATTTTCTTCATTTGAATTGAAAAACATTAGGTATTGCAGGAAAGGCAAGGCACATGTATTAATGATGCTGCCTCGCCTCA
CATGAGGAGAGTTGCCTTTGAGGTACTAGACATGTTAAGATAGTGTATGTGTTGGCAACAATGTTGTTTGATAATGTTATGTATAGGCATGCTAAAATTATGAAAAAAAA
ATCTATATAACTAAAAAAATGATGTGGGTTTACCAAAAGCATATTTTGAAGATGGAAAAGGAATGAAACAGATTGAGACAAAATATATTGACTATTTTTCCCTCTTTATG
TAAATGCAGGGTCTTCTAATGCAAAAAAGTTATAATATTGATTGTAATGCTCTGGCCATGTTTTTTCCTTACCTGCTGCTGTAGATTTTGAGTTGATTGAAGCAGATTAT
GCAACCTTTTATGTCTCCTCTCTGAGCTTCGACTATATCACTTCTATAAGTTCTACTTTGTTTTGATCTGCAATCTTTTATTTATATTTTATTATCCTTTTCTCTTTTAG
TTTTCTGATGATGGAGAAGACCGAGTTGTTGTTGGCTCACTTGACATTAATCTGTGTCTAAGGCTTCCAGATGAAATAGCGGGAATGAAACCTGAGAATTTGGTAGACTC
CATGTGTTTCTATAATTCAGAATTTCAGATTATTGGACAGTATGGTACTATTCTTTTTGTTTTATTATTGTTTTTAAAATAAAATAGTAATTAGAAAGGGACTCCAGCCT
TTAGCCTGTCGAGATTTATACTTGGTCATACTATTAAGCTGGTTGTTATGTAAATAGGCCGTTGTATGTGCTATTATTCTTAAATGTTGTTATTGAAAAACCAGGCTGGT
TAGTTTGAATGTGTTATATAATGGAGAACTGCATATGTGTAAATAGCAAAGTAGGCCTTGAAGTTGAACAACTTTCGCAGAATATTATTCTATTTTTCCCTCTGAATGGA
AGTTGCATGCTCTAGCAGCTAACATTAGTAAAGACTGTGTCATCCTCAATTTTGTCCAATCCACGATCTTGTAATCTGGCAGCACCTAAAACACACATCGTCAGTTGTGT
AAAGCTAAGGTTATGCGATGCAAGAGCTCTTTCAAATCAGAGTAAAGGATATTTCTCCATGAAGGAATCTTTATAAGTTGAAGACCTCTCTGATTTGCTAGCCTCATAGG
TCACATAACGGAGAAAGACAACATTGTAAAGAGCTCTTTCAAATCAGAGTAAAGGATATTTCTCCATGAAGGAATCTTTATAAGTTGAAGACCTCTCTGATTTGCTAGCC
TCATAGGTCACATAACGGAGAAAGACAACATTGTAAAGAGTGTCTTAACTGATCTTGAAGCTGAATATTAATTTCTGTGATGACAATTATCACCTCAAAAATCAAGTCTT
TAACCATTCAAAAGACAAGTATCTTCCTCCTTGCATATGAGAGTTGCGTGCAACAAAATTTTCCAAACTTCGATGGATCTCAGATTCTCAACCTTGTGTAAATATGACCT
ATCTAGGTGAATCAAAGAAAAACAGTGAAAGTCAACGTCCTGTCGAGGAGGAAGCCAAAATGCTAATGATAGAGGAGAAAAGGATACAATGGAGAAAATTTATGGTCAAG
GAATTCTAAACCGGATAGCTCTATGGAAAGTTTTGACATAATGTATTCAAATGTTACTTCCACTTTGATTAATTGTTCAATGGGTTCCAAGTTCCACTAAGAATCCTTCT
ATATGTTGCTGTGCGAATAGCTGCTTCAGGCCACTTCATCGAATACCTTTGCTGATTTACCCTAATCAGAGAATGAGAGTTGAAAAGAGAACAATAATCTGCAAAGTTTA
TGTTCAACTTGAATTTCAACTCTACTATTTGAACATATGATAGATTGATAGTTTACTTAATAGAACTATTGCCGGGAGTCTTCTAGATTGTCCATAAACTTCTGAGGTTA
CGTCTTCCAGTCTCTATTTCATTCTAATAGCTAACTAATAGGGATTCCGTTTTTTTTTTGGTGTGTGTAACTGTGGTTGGTTTCTTCCATTCCTATTGTAGTCCATTCAT
TTGAGAGTTGTTTCATGTCCTTTCTTTTAAAGAAAACACATGCGGGCAGGCACACATATTTGTTTGCAATTGCACATCTTTCTGTATCATCTGTTGCTTTGTCGAATCTT
CAATGATAAGTGAGGTTTATCTCCAATTAAGTATATCCTTGTATTAACAAGATTTGATTCTGGATCATGAGTTGATTTTTTCATGATAAAGGATATCTAAAATATTTTTC
CAACAGGGAATTGGGGCCGATTTCACAAGGGCATACCTGAGTAATGTATGTGTTGCCAAGGAACTCCACAGAAATGGGTTGGGTTATGCACTTGTTGCAAAGGCAAAGAC
AATTGCAGAAAATTGGGGTAACAGTTTTGAGATACTATGTTTGTCCAATTCATTCATGTCAAATATGGTCATGATCTTGGTGCATTATTAGCATCTCCATTTTTATGACT
TCTGTTTTAGAATGTGGTCTTGGTGTTTATTGTGGAACTCTACTTCTAAGAACTATATGGATAATTTGTGCTACAATGTTCTAATGTTTTAGGCATCAGCGATCTGTACG
TCCATGTAGCTTTCGACAACGAACCCGCAAAGAATCTCTACTTGAAAAGTGGTTTTGTCTATGAAAGCGATGAACCCGCTTGGCAAGCCAGGTTTCTAGATCGACCTCGC
AGGATTCTCCTGTGGACTGCTCTCTCAGAACTTCTCTGATATTGCACTGAATTGATTTTTGCTCATAATTTTACTTGCAGTTTTTTCTCTTCACTGGTACACAGATCATT
TTCTTCCTTTAATATACTAAAATTCTATTCATTGTAAAATTATGTGAATATGCTGTCCATCTTCATAATTTATTATATAGCATGCAAAAATAGAATTGACGTCTCTCTAT
AGTCTATGGTGTGTTGGTGTCTCTCTCTGAAACTCCTCTTGCAAGCAATTTTCTTGATTTGCAAGCAAAGCCACCAGCAAACTCAGTTGTCTTCTTCCCAAAAAAGAAAA
AAGTTGTTTGGTGTATAAATATAGTAATAAAGGTTGTGGCTTGGGAATGGTCATACTTGAAACCAGTAATATTCTCTACAAAGTTTGTTGATGGAAAAATAAAAGCAATC
TTCATTGAAAAAAAGTTGAAGTTGCTGAGATTACAGTCATTGGTTGGTCAAGATACATAAAACCACCAAAGATTGAAGATCACAACCCACCATGAAGTGAGCATTGAAGA
ACCTTTTATCCTGCATTTGATGCAAGTCTGGTAGTTTTGGTCTTTCTGCTAAAAAACTTATCTATTTCTGAGATCTGAATTTGTTAAGCAAGGGGCAAGAAATCTGGTAG
TTTTGGCCTTTTTATGTCTAAAGCAGGCTAAGTGGTTTTGAAATTACCAACAGATTGATCCCAGAACCTGCTTTTTCCGTCCTTATTGTGTCAACATTTTGTGATTGGCC
TGTTAATTATCTGCAGGAAGCCTTATACGAGAATGTGTTTGCAACTTACAGAAATCAGTCTGCATCCCCCCTTTTTTTTAATAAAAAGGCTGGAGCTCTTGATCTTGCTA
CACAGCAGACAACTATGTCTTAATATGTATCAAGAAATAAACTATCTTATTCTGACCAAACATTGTACAAGCATTGAGAGTGATAGGATGTGGTAGTTTAGGTAAAACGA
TGAACTAATTTAACAATAAATAGGTAAAAGCTCGGATATATTATTTGATGAACTCATTTATAATGATTCAGTGTATATTGTGTATCATATATACATCTATATATACATCT
ATCGATCTTAAGAGAGAAATTATTAATTCAGTCGATTTCATATGACATTAATACAGTTCAATGTTGTTTAGGTATCTC
Protein sequenceShow/hide protein sequence
MAVLSSPLCSWSPHRPSCSSSSSSSISWLTLFSSKSFSFLRTQLSVPGISCFLSRRPITVSNVFTHQRTITVDKSKLRVSEGTSKDELWAAACLRVRTFNQFRPDSYGID
DHKRYLAEHEYEAIKERIAGKRVSFKRVSCINATLPSAEISTLADDLCSTCKVCLFLYCSNMNSVSWAPFLDHFRNFKPLTVNVQLVMIFEGSDKIII