; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g2055 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g2055
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionReplicase polyprotein 1ab
Genome locationMC06:28038188..28048095
RNA-Seq ExpressionMC06g2055
SyntenyMC06g2055
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008467324.1 PREDICTED: uncharacterized protein LOC103504702 [Cucumis melo]7.66e-15489.8Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RPSI-PLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPPL APLRLRR RP I P R  +GFVQAYRRG  N+D FGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSS AQS A+RA
Subjt:  MIASTALPPWQPPLQAPLRLRR-RPSI-PLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREF IGLRWRNFTLDFSRNWPRYRRQLNEF+DTPLGKSFVTIFFLWFALSGWLFRFLIF TWILPFAGP+L+GTFANSL+IKG CPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGEYNSRKG+SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

XP_022159671.1 uncharacterized protein LOC111026015 [Momordica charantia]3.39e-174100Show/hide
Query:  MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR
        MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR
Subjt:  MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR

Query:  EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS
        EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS
Subjt:  EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS

Query:  CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
Subjt:  CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

XP_022957921.1 uncharacterized protein LOC111459308 [Cucurbita moschata]8.31e-15391.43Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RPS-IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPPL+APLRL R RP  IPLRR VGFVQAYRRG  N+D FGE W+KVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRLRR-RPS-IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGK FVTIFFLWFALSGWLFR LIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGE  +RKG SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

XP_023533650.1 uncharacterized protein LOC111795447 [Cucurbita pepo subsp. pepo]1.38e-15191.02Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RPS-IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQP L+APLRL R RP  IPLRR VGFVQAYRRG  N+D FGE W+KVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRLRR-RPS-IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGK FVTIFFLWFALSGWLFR LIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGE  +RKG SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

XP_038874339.1 uncharacterized protein LOC120067037 [Benincasa hispida]2.14e-16093.06Show/hide
Query:  MIASTALPPWQPPLQAPLRLRRRPS--IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPP+QAPLRLRR  S  IP RR +GFVQAYRRG  NSD FGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRLRRRPS--IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIG+FANSL+IKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGEYNSRKG+SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

TrEMBL top hitse value%identityAlignment
A0A0A0KPH9 Uncharacterized protein3.66e-15086.64Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RPSI-PLRR-VGFVQAYRRG----NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAE
        MIAST LPPWQPPLQAP RLRR RP I P R  +GFVQAYRRG    N+D FG+AWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRV S AQS A+
Subjt:  MIASTALPPWQPPLQAPLRLRR-RPSI-PLRR-VGFVQAYRRG----NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAE

Query:  RAREIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAG
        RAREIDREF IG+RWRNFTLDFSRNWPRYRRQLNEF+DTPLGKS VTIFFLWFALSGWLFRFLIF TWILPFAGP+LIGTFANSL+IKG CPACNREFAG
Subjt:  RAREIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAG

Query:  YKNQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        YKNQIISC+GCGN+VWQPK  GEYNSRKG+SGSKSQPNVIDVEFEEK
Subjt:  YKNQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

A0A1S3CT98 uncharacterized protein LOC1035047023.71e-15489.8Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RPSI-PLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPPL APLRLRR RP I P R  +GFVQAYRRG  N+D FGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSS AQS A+RA
Subjt:  MIASTALPPWQPPLQAPLRLRR-RPSI-PLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREF IGLRWRNFTLDFSRNWPRYRRQLNEF+DTPLGKSFVTIFFLWFALSGWLFRFLIF TWILPFAGP+L+GTFANSL+IKG CPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGEYNSRKG+SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

A0A6J1E4L9 uncharacterized protein LOC1110260151.64e-174100Show/hide
Query:  MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR
        MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR
Subjt:  MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDR

Query:  EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS
        EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS
Subjt:  EFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIIS

Query:  CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
Subjt:  CSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

A0A6J1H0K1 uncharacterized protein LOC1114593084.03e-15391.43Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RPS-IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQPPL+APLRL R RP  IPLRR VGFVQAYRRG  N+D FGE W+KVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRLRR-RPS-IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGK FVTIFFLWFALSGWLFR LIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGE  +RKG SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

A0A6J1K7K5 uncharacterized protein LOC1114913571.91e-15190.61Show/hide
Query:  MIASTALPPWQPPLQAPLRLRR-RPS-IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA
        MIAST LPPWQP L+APLRL R RP  IPLRR VGFVQAYRRG  N+D FGE W+KVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAA+RA
Subjt:  MIASTALPPWQPPLQAPLRLRR-RPS-IPLRR-VGFVQAYRRG--NSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERA

Query:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
        REIDREFGIGLRWRNFTLDFSRNWPRYRRQLN+FMDTPLGK FVTIFFLWFALSGWLFR LIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK
Subjt:  REIDREFGIGLRWRNFTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYK

Query:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK
        NQIISC+GCGNIVWQPKGQGE  +RKG SGSKSQPNVIDVEFEEK
Subjt:  NQIISCSGCGNIVWQPKGQGEYNSRKGNSGSKSQPNVIDVEFEEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G44870.1 unknown protein4.3e-7461.71Show/hide
Query:  VQAYRRGNSDDF------GEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDREFGIGLRWRNFTLDFSRNWPRYRRQLN
        V+A++RG+ D        G+AW   WR ANDGFE+FVFEA+KTAER+DR+Y+VSRR SS A SAA+RAREIDREFGI  R R  + DFSRN+P+YR+Q +
Subjt:  VQAYRRGNSDDF------GEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDREFGIGLRWRNFTLDFSRNWPRYRRQLN

Query:  EFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIISCSGCGNIVWQPKG---------QGEYN
         F++TPLG SF TIFFLWFALSGWLFR +I ATW+LP AGPLLIG  AN+ +IKG CPAC R+F GYKNQII C GCGNIVWQP+G             N
Subjt:  EFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIISCSGCGNIVWQPKG---------QGEYN

Query:  SRKGNSGSKSQPNVIDVEFEEK
        + KGNS    +  +IDV+FEEK
Subjt:  SRKGNSGSKSQPNVIDVEFEEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGCTTCAACAGCTCTGCCGCCATGGCAGCCACCGCTTCAAGCTCCGTTAAGGCTGAGGAGGAGGCCCTCAATCCCTCTGCGGAGGGTTGGCTTCGTTCAGGCGTA
CCGTCGCGGGAACAGCGATGATTTTGGCGAGGCCTGGAATAAAGTGTGGCGAGGCGCCAACGATGGTTTCGAGAAATTCGTATTCGAGGCGAGGAAAACCGCGGAGCGCC
TCGACAGGCGCTACTCTGTATCGCGCCGTGTTAGCTCTGTTGCCCAATCAGCGGCCGAGCGGGCGCGCGAGATTGATAGGGAGTTCGGGATTGGATTGCGTTGGCGTAAT
TTTACATTGGATTTTAGCAGAAATTGGCCAAGGTATAGGAGGCAACTCAATGAATTTATGGACACGCCATTAGGGAAAAGTTTTGTGACAATATTCTTCCTTTGGTTTGC
ATTATCTGGATGGCTTTTCCGATTCTTAATATTTGCAACGTGGATACTACCTTTTGCCGGTCCACTTCTCATTGGGACTTTTGCCAATAGCCTTATAATAAAGGGTACTT
GTCCTGCCTGTAATAGGGAGTTTGCTGGGTACAAGAACCAAATTATTTCCTGCAGTGGCTGTGGAAACATAGTGTGGCAGCCTAAAGGCCAAGGGGAATACAATTCAAGA
AAAGGTAATTCTGGTTCTAAGTCACAACCCAATGTTATTGACGTGGAGTTTGAGGAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATTGGTTTAAGTTTATTTTTCGTGAAAAAACATTCTGACCGCAGAGACGAGCTGCCGAAGCGATAGAGACATTTCCAGCAAGCGAACACGCATAACTTGCAGTGGAAGAA
GCGGCCATCCAAACAACTGCAATACAACTTGGAATGATAGCTTCAACAGCTCTGCCGCCATGGCAGCCACCGCTTCAAGCTCCGTTAAGGCTGAGGAGGAGGCCCTCAAT
CCCTCTGCGGAGGGTTGGCTTCGTTCAGGCGTACCGTCGCGGGAACAGCGATGATTTTGGCGAGGCCTGGAATAAAGTGTGGCGAGGCGCCAACGATGGTTTCGAGAAAT
TCGTATTCGAGGCGAGGAAAACCGCGGAGCGCCTCGACAGGCGCTACTCTGTATCGCGCCGTGTTAGCTCTGTTGCCCAATCAGCGGCCGAGCGGGCGCGCGAGATTGAT
AGGGAGTTCGGGATTGGATTGCGTTGGCGTAATTTTACATTGGATTTTAGCAGAAATTGGCCAAGGTATAGGAGGCAACTCAATGAATTTATGGACACGCCATTAGGGAA
AAGTTTTGTGACAATATTCTTCCTTTGGTTTGCATTATCTGGATGGCTTTTCCGATTCTTAATATTTGCAACGTGGATACTACCTTTTGCCGGTCCACTTCTCATTGGGA
CTTTTGCCAATAGCCTTATAATAAAGGGTACTTGTCCTGCCTGTAATAGGGAGTTTGCTGGGTACAAGAACCAAATTATTTCCTGCAGTGGCTGTGGAAACATAGTGTGG
CAGCCTAAAGGCCAAGGGGAATACAATTCAAGAAAAGGTAATTCTGGTTCTAAGTCACAACCCAATGTTATTGACGTGGAGTTTGAGGAGAAATGACAAAGCTAGAACGG
ATGGTGAATCGCTCGACCACCTTTATCGACAGGCACGCCGTGACCAAAATGAAGATAGAGTTACCAACGACCATAACTTGGAGTCATTGAAGTTTCATGTTCGTGCGTCT
TCTCTTCTCCCTTGATAGTTACTGATGTAGTTTGGCAGCATTTGTTTGTAGTATCAATACATGATATGGTACATAAATGCGAAGTCCACAGGTGCTGGATCATTTCTGAA
ATATACTATCCTCACTTCTATTGGAGTGATAGAAACGGTCTTTCTTTGGAATTTCTTCATTTTTATCTTGTAACAAATGTCACTAGAGCCGATGAATTAGTATTATTGTC
CAACTAGAGCCTCGAGTTGTAAATTGGATTAATTAATGACCCGTTTAGTAACATTCATATTTCTTGTTTTTTTTTCTTTCTTATCAAAGAAGAAATAGAACTGCATTATG
GCAAT
Protein sequenceShow/hide protein sequence
MIASTALPPWQPPLQAPLRLRRRPSIPLRRVGFVQAYRRGNSDDFGEAWNKVWRGANDGFEKFVFEARKTAERLDRRYSVSRRVSSVAQSAAERAREIDREFGIGLRWRN
FTLDFSRNWPRYRRQLNEFMDTPLGKSFVTIFFLWFALSGWLFRFLIFATWILPFAGPLLIGTFANSLIIKGTCPACNREFAGYKNQIISCSGCGNIVWQPKGQGEYNSR
KGNSGSKSQPNVIDVEFEEK