CuGenDBv2

Moc04g20700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g20700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr4:15033443..15040665
RNA-Seq Expression: Moc04g20700
Synteny: Moc04g20700
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001969 - Aspartic peptidase, active site
  IPR021109 - Aspartic peptidase domain superfamily
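
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page itself does not state this), the field can be split into its parts and the span length derived. The names `Location` and `parse_location` are illustrative and not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string as shown in the record."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end, whatever order was written.
        start, end = end, start
    return Location(chrom, start, end)


loc = parse_location("chr4:15033443..15040665")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span of this gene on chr4 is 7223 bp.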


Homology
GenBank top hits (e value, % identity, alignment)
XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]  e value: 1.1e-65  % identity: 58.36
Query:  KVWAYETISTLSLRVATRLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPAPPVVPDPPAVPDPAVVPAP
        +VWAYETIST        LSDDAIPRLLRWSC YS GF  L  +VFDN  SKVKE+L++T+A+ +HMV ++ PPE R      V+PDPPAVPD AVVP P
Subjt:  KVWAYETISTLSLRVATRLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPAPPVVPDPPAVPDPAVVPAP

Query:  AAVRNPTVVADPPADLERGTQERRVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPSGNDSEALQKRSKRKKFKNNISRRLKRLDDRVGAIEATLTGV
         A      V DPPAD+E G             +EDPV +A           A+D+A PS ND E L+KR K+ KFK  ISRRLKRLD+ VGAIE  L   
Subjt:  AAVRNPTVVADPPADLERGTQERRVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPSGNDSEALQKRSKRKKFKNNISRRLKRLDDRVGAIEATLTGV

Query:  GVAMKGIQRYLKKLSKGKFPDPTKYFARGGGPDDDDPSDQRPDEAPTPDGGPKSMDEDRRPKEVTKTDE
        GVA+KGIQ YLKKL+KGKFPD +KYF  GGGPDDD PSDQRPDE+P PDGG KSMDED+R  E  +TDE
Subjt:  GVAMKGIQRYLKKLSKGKFPDPTKYFARGGGPDDDDPSDQRPDEAPTPDGGPKSMDEDRRPKEVTKTDE

XP_022154299.1 uncharacterized protein LOC111021593 [Momordica charantia]  e value: 6.3e-58  % identity: 79.33
Query:  GTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC
        GTVLV N PAYVLFDSGSS TFISTAFVRQ  LEL PLGFLL VSTPSGSV+I+SQMV+ G LSFD Q L ARLIQLD+RDFDVILGMDWLATNQA+INC
Subjt:  GTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC

Query:  SRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVD
        S++EVSFQLP G SF FKG++GGVPR VSAL+AR LLQ G WG+LA+VVD
Subjt:  SRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVD

XP_022155163.1 uncharacterized protein LOC111022304 [Momordica charantia]  e value: 1.1e-57  % identity: 85.81
Query:  RLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPA------PPVVPDPPAVPDPAVVPAPAAVRNPTVVAD
        RLSDDAIPRL RWSCTYSRGFLT+QRDVFDN +SKVKEYLVSTNAE EHMV IMRPPEARAIP       PP VPDPPAVPDPAVVPAPAAV N   VAD
Subjt:  RLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPA------PPVVPDPPAVPDPAVVPAPAAVRNPTVVAD

Query:  PPADLERGTQERRVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPS
         PADLERGTQERRVKDKGKNIIEDPVEEAETLD+DALQ PALDDAGPS
Subjt:  PPADLERGTQERRVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPS

XP_022156985.1 uncharacterized protein LOC111023814 [Momordica charantia]  e value: 7.5e-59  % identity: 76.73
Query:  MSLPLHYLPGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWL
        MSL L YL   VLVHNVPAY LFDSGSSHTFISTAFV QA L LEPLGFLLSVSTPSGS +  SQMVR G+LS  + TL ARLIQLDM+DFD+ILGMDWL
Subjt:  MSLPLHYLPGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWL

Query:  ATNQANINCSRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVD
        ATNQA+IN  RREVSFQLPSG+ FTFKG++G VP+ VSALKAR+LLQ+G WGYL +VVD
Subjt:  ATNQANINCSRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVD

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]  e value: 1.7e-71  % identity: 94.7
Query:  GTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC
        GT LVHNVPAYVLFD GSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC
Subjt:  GTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC

Query:  SRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVDI
        S+REVSFQLPSGRSFTFKG+SGGVPRAVSALKARRLL NG W YLA+VVDI
Subjt:  SRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVDI
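
Each alignment block above pairs a `Query:` row with a middle row in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of how a percent identity could be recomputed from those rows, assuming that BLAST-style convention holds for this page and that the label column is eight characters wide. The helper names are hypothetical, and the result need not match the reported figure exactly, because the page may compute identity over a different alignment length.

```python
from typing import Iterable, Tuple


def strip_label(line: str, width: int = 8) -> str:
    """Drop the fixed-width 'Query:  ' label (or the blank pad on the middle row)."""
    return line[width:]


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment row pair."""
    # Trailing spaces on the middle row are often lost in extraction; pad them back.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent_identity(rows: Iterable[Tuple[str, str]]) -> float:
    """Pool all (query, midline) row pairs of one hit into a single percentage."""
    total_ident = 0
    total_cols = 0
    for query, midline in rows:
        ident, _, cols = midline_stats(query, midline)
        total_ident += ident
        total_cols += cols
    return 100.0 * total_ident / total_cols if total_cols else 0.0


# Toy rows, not taken from the record: 6 identities over 10 columns.
example = [("ACDEFG", "AC+ FG"), ("KLMN", "KL")]
print(round(percent_identity(example), 2))
```

To apply this to a hit on this page, pass each `Query:` line and the line directly below it through `strip_label` first, then pool the row pairs of that hit.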

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DJX9 uncharacterized protein LOC111020757  e value: 5.2e-66  % identity: 58.36
Query:  KVWAYETISTLSLRVATRLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPAPPVVPDPPAVPDPAVVPAP
        +VWAYETIST        LSDDAIPRLLRWSC YS GF  L  +VFDN  SKVKE+L++T+A+ +HMV ++ PPE R      V+PDPPAVPD AVVP P
Subjt:  KVWAYETISTLSLRVATRLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPAPPVVPDPPAVPDPAVVPAP

Query:  AAVRNPTVVADPPADLERGTQERRVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPSGNDSEALQKRSKRKKFKNNISRRLKRLDDRVGAIEATLTGV
         A      V DPPAD+E G             +EDPV +A           A+D+A PS ND E L+KR K+ KFK  ISRRLKRLD+ VGAIE  L   
Subjt:  AAVRNPTVVADPPADLERGTQERRVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPSGNDSEALQKRSKRKKFKNNISRRLKRLDDRVGAIEATLTGV

Query:  GVAMKGIQRYLKKLSKGKFPDPTKYFARGGGPDDDDPSDQRPDEAPTPDGGPKSMDEDRRPKEVTKTDE
        GVA+KGIQ YLKKL+KGKFPD +KYF  GGGPDDD PSDQRPDE+P PDGG KSMDED+R  E  +TDE
Subjt:  GVAMKGIQRYLKKLSKGKFPDPTKYFARGGGPDDDDPSDQRPDEAPTPDGGPKSMDEDRRPKEVTKTDE

A0A6J1DLN2 uncharacterized protein LOC111021593  e value: 3.1e-58  % identity: 79.33
Query:  GTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC
        GTVLV N PAYVLFDSGSS TFISTAFVRQ  LEL PLGFLL VSTPSGSV+I+SQMV+ G LSFD Q L ARLIQLD+RDFDVILGMDWLATNQA+INC
Subjt:  GTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC

Query:  SRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVD
        S++EVSFQLP G SF FKG++GGVPR VSAL+AR LLQ G WG+LA+VVD
Subjt:  SRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVD

A0A6J1DM86 uncharacterized protein LOC111022304  e value: 5.2e-58  % identity: 85.81
Query:  RLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPA------PPVVPDPPAVPDPAVVPAPAAVRNPTVVAD
        RLSDDAIPRL RWSCTYSRGFLT+QRDVFDN +SKVKEYLVSTNAE EHMV IMRPPEARAIP       PP VPDPPAVPDPAVVPAPAAV N   VAD
Subjt:  RLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPA------PPVVPDPPAVPDPAVVPAPAAVRNPTVVAD

Query:  PPADLERGTQERRVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPS
         PADLERGTQERRVKDKGKNIIEDPVEEAETLD+DALQ PALDDAGPS
Subjt:  PPADLERGTQERRVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPS

A0A6J1DRW8 uncharacterized protein LOC111023814  e value: 3.6e-59  % identity: 76.73
Query:  MSLPLHYLPGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWL
        MSL L YL   VLVHNVPAY LFDSGSSHTFISTAFV QA L LEPLGFLLSVSTPSGS +  SQMVR G+LS  + TL ARLIQLDM+DFD+ILGMDWL
Subjt:  MSLPLHYLPGTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWL

Query:  ATNQANINCSRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVD
        ATNQA+IN  RREVSFQLPSG+ FTFKG++G VP+ VSALKAR+LLQ+G WGYL +VVD
Subjt:  ATNQANINCSRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVD

A0A6J1DYU5 uncharacterized protein LOC111025517  e value: 8.3e-72  % identity: 94.7
Query:  GTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC
        GT LVHNVPAYVLFD GSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC
Subjt:  GTVLVHNVPAYVLFDSGSSHTFISTAFVRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINC

Query:  SRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVDI
        S+REVSFQLPSGRSFTFKG+SGGVPRAVSALKARRLL NG W YLA+VVDI
Subjt:  SRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLLQNGVWGYLANVVDI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACGGCTCTTGACATTTATTCCCAAGATACTGCCCCCCCTCCTTTTACAGCACTTGATGTTCGACCTTGCCATTTATTCCCAAGGACTAGCACCACTACTTCGCAAGC
ACCACTCTACTTTAATCCCTCAAGCCTAACTCATCGAGTCGCATTGACCCTTGACATTTATCCCCAAGGCCTCAATACTCCTCTACAGAAACTTATGTTCGGCCTTGCCA
TTTATTCCCAAGGCCTGAACTGTGTACCGGTCCTTGCCGTTTATCCCAAGCAGCGAGAAACGCCCAAAATGACCTCCAACGACTTCGATTTGATTCCCGTTACTACAAAA
CAACTTCGCTTAGTCGTTGGAGGTTGTCGGATCGAGGGAAAACAACGTGAACAGCCACCACGCGTCGCCCCGCAGCACCTGCTGCTGCCCGAGACCACGTGTCGCCGCCT
GCTGCTACCCGACGCCCGACGCTGCTGCTGCGCTGCCCGACGCCGCCTGCTACTGCCCGAGCCCGACGCTGCACCTACTGCTCCGATGTCGCCGCTGATCGTCGCGTCGC
CTGAACGGAGTTGCTCGTCGGAGAAGAAGAACCCGAGTGGGGGGTGGCCGGCGGCTCCAACAAGAGAGGGAAAGGTATGGGCTTACGAGACGATATCGACGTTGAGTCTG
CGCGTAGCCACGAGGCTGAGCGACGACGCCATTCCTCGACTCCTTAGGTGGTCGTGCACTTATTCTCGTGGGTTTCTTACTCTGCAGAGAGACGTGTTCGATAACATGAT
ATCCAAGGTTAAGGAATACTTGGTTTCGACGAATGCTGAGGCAGAACACATGGTCTGTATCATGCGTCCACCGGAAGCCCGCGCTATACCTGCCCCGCCGGTTGTACCTG
ACCCGCCTGCAGTACCTGACCCGGCTGTTGTACCTGCCCCGGCTGCAGTACGTAACCCGACTGTAGTAGCTGACCCGCCTGCAGATCTGGAAAGGGGTACTCAGGAAAGA
AGGGTGAAGGACAAAGGAAAGAATATCATAGAGGATCCGGTAGAAGAGGCCGAGACATTGGACAATGATGCATTACAGGGTCCTGCATTAGACGATGCTGGACCCAGTGG
AAATGACAGCGAAGCGCTACAGAAGAGGTCGAAACGGAAAAAATTCAAAAATAATATCAGTAGACGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTATCGAGGCCACAC
TGACTGGCGTCGGGGTCGCCATGAAAGGTATCCAGAGATACCTGAAGAAACTGTCGAAGGGTAAATTCCCTGATCCGACCAAATATTTTGCACGTGGGGGTGGGCCCGAT
GATGATGATCCATCGGATCAAAGGCCTGATGAGGCCCCAACACCAGATGGAGGTCCGAAGAGTATGGACGAGGACCGGAGGCCGAAAGAGGTCACTAAGACTGACGAGTA
TCGGACCATGGAGCATGGTTCGAAGAATATGGACGGCCGCAAATACTCAGAGGCTAGGGTAGAGGGCTCCCCCAACAGTTTCGACGCAGGAAGATGTTTGTGTGCGGGGT
TTATGTCTTTACCTTTGCATTATCTCCCAGGTACGGTCTTAGTCCATAATGTGCCTGCTTATGTATTGTTTGACTCGGGATCGAGTCACACCTTCATCTCTACTGCGTTT
GTTCGTCAGGCAACCCTCGAACTAGAGCCGTTAGGGTTTCTGTTGTCAGTTTCTACACCTTCAGGGTCGGTTTTGATTGCTAGTCAAATGGTGAGAGCAGGTGAGTTATC
TTTTGACAATCAGACCCTAGAGGCACGTTTGATCCAATTGGACATGCGGGATTTTGACGTCATTTTAGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTAATTGCT
CGAGGAGGGAAGTCTCTTTCCAACTACCTTCGGGTCGGAGCTTTACGTTTAAAGGAATTTCAGGTGGAGTTCCAAGGGCAGTCTCAGCGTTGAAGGCAAGGCGCCTTTTA
CAAAATGGTGTCTGGGGATATCTGGCCAATGTCGTCGACATTATGTCTCGTAGACTAGTACTAGACGAGTTGGATCGTTCTGAGGTGGAGTTAGCAGTGGAGGATGTCTC
GGCAGTGTTATCTCGACTCTCGGTTGAACCCACTTTGAGACAGCGGGTCATCGCTGCACAGAAGGGAGATCCCAGCCTGAGCAAGGGTTTCAGTATGGTGGACGAAACCT
TGTGTTATAAGGAGGTACCCATTGAGATTGTAGCAAGAGAGACCAAGGTGCTGCGGAATCGGGCAATTGACTTGGTGAAGGTCTTGTGGAGAAATCACCAAGTGGAGGAA
GCCACCTGGGAAAGGGAGGATGAAATTAGAGCCCGGTGCGCGCGGGTGACGGCAACAGCAGTTCTCGGACAACGGGCAGTAGCGATGCGCGGGCGTCTGCAGCAGCGTTG
CGCAGGTGTCGGGCAGCGGCGGAGGTGCACGGCGCGTCGGGCAGCAGCAGTGGTGCGCGAGTTCCGATAG
mRNA sequence
ATGACGGCTCTTGACATTTATTCCCAAGATACTGCCCCCCCTCCTTTTACAGCACTTGATGTTCGACCTTGCCATTTATTCCCAAGGACTAGCACCACTACTTCGCAAGC
ACCACTCTACTTTAATCCCTCAAGCCTAACTCATCGAGTCGCATTGACCCTTGACATTTATCCCCAAGGCCTCAATACTCCTCTACAGAAACTTATGTTCGGCCTTGCCA
TTTATTCCCAAGGCCTGAACTGTGTACCGGTCCTTGCCGTTTATCCCAAGCAGCGAGAAACGCCCAAAATGACCTCCAACGACTTCGATTTGATTCCCGTTACTACAAAA
CAACTTCGCTTAGTCGTTGGAGGTTGTCGGATCGAGGGAAAACAACGTGAACAGCCACCACGCGTCGCCCCGCAGCACCTGCTGCTGCCCGAGACCACGTGTCGCCGCCT
GCTGCTACCCGACGCCCGACGCTGCTGCTGCGCTGCCCGACGCCGCCTGCTACTGCCCGAGCCCGACGCTGCACCTACTGCTCCGATGTCGCCGCTGATCGTCGCGTCGC
CTGAACGGAGTTGCTCGTCGGAGAAGAAGAACCCGAGTGGGGGGTGGCCGGCGGCTCCAACAAGAGAGGGAAAGGTATGGGCTTACGAGACGATATCGACGTTGAGTCTG
CGCGTAGCCACGAGGCTGAGCGACGACGCCATTCCTCGACTCCTTAGGTGGTCGTGCACTTATTCTCGTGGGTTTCTTACTCTGCAGAGAGACGTGTTCGATAACATGAT
ATCCAAGGTTAAGGAATACTTGGTTTCGACGAATGCTGAGGCAGAACACATGGTCTGTATCATGCGTCCACCGGAAGCCCGCGCTATACCTGCCCCGCCGGTTGTACCTG
ACCCGCCTGCAGTACCTGACCCGGCTGTTGTACCTGCCCCGGCTGCAGTACGTAACCCGACTGTAGTAGCTGACCCGCCTGCAGATCTGGAAAGGGGTACTCAGGAAAGA
AGGGTGAAGGACAAAGGAAAGAATATCATAGAGGATCCGGTAGAAGAGGCCGAGACATTGGACAATGATGCATTACAGGGTCCTGCATTAGACGATGCTGGACCCAGTGG
AAATGACAGCGAAGCGCTACAGAAGAGGTCGAAACGGAAAAAATTCAAAAATAATATCAGTAGACGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTATCGAGGCCACAC
TGACTGGCGTCGGGGTCGCCATGAAAGGTATCCAGAGATACCTGAAGAAACTGTCGAAGGGTAAATTCCCTGATCCGACCAAATATTTTGCACGTGGGGGTGGGCCCGAT
GATGATGATCCATCGGATCAAAGGCCTGATGAGGCCCCAACACCAGATGGAGGTCCGAAGAGTATGGACGAGGACCGGAGGCCGAAAGAGGTCACTAAGACTGACGAGTA
TCGGACCATGGAGCATGGTTCGAAGAATATGGACGGCCGCAAATACTCAGAGGCTAGGGTAGAGGGCTCCCCCAACAGTTTCGACGCAGGAAGATGTTTGTGTGCGGGGT
TTATGTCTTTACCTTTGCATTATCTCCCAGGTACGGTCTTAGTCCATAATGTGCCTGCTTATGTATTGTTTGACTCGGGATCGAGTCACACCTTCATCTCTACTGCGTTT
GTTCGTCAGGCAACCCTCGAACTAGAGCCGTTAGGGTTTCTGTTGTCAGTTTCTACACCTTCAGGGTCGGTTTTGATTGCTAGTCAAATGGTGAGAGCAGGTGAGTTATC
TTTTGACAATCAGACCCTAGAGGCACGTTTGATCCAATTGGACATGCGGGATTTTGACGTCATTTTAGGCATGGATTGGCTAGCTACCAACCAAGCCAACATTAATTGCT
CGAGGAGGGAAGTCTCTTTCCAACTACCTTCGGGTCGGAGCTTTACGTTTAAAGGAATTTCAGGTGGAGTTCCAAGGGCAGTCTCAGCGTTGAAGGCAAGGCGCCTTTTA
CAAAATGGTGTCTGGGGATATCTGGCCAATGTCGTCGACATTATGTCTCGTAGACTAGTACTAGACGAGTTGGATCGTTCTGAGGTGGAGTTAGCAGTGGAGGATGTCTC
GGCAGTGTTATCTCGACTCTCGGTTGAACCCACTTTGAGACAGCGGGTCATCGCTGCACAGAAGGGAGATCCCAGCCTGAGCAAGGGTTTCAGTATGGTGGACGAAACCT
TGTGTTATAAGGAGGTACCCATTGAGATTGTAGCAAGAGAGACCAAGGTGCTGCGGAATCGGGCAATTGACTTGGTGAAGGTCTTGTGGAGAAATCACCAAGTGGAGGAA
GCCACCTGGGAAAGGGAGGATGAAATTAGAGCCCGGTGCGCGCGGGTGACGGCAACAGCAGTTCTCGGACAACGGGCAGTAGCGATGCGCGGGCGTCTGCAGCAGCGTTG
CGCAGGTGTCGGGCAGCGGCGGAGGTGCACGGCGCGTCGGGCAGCAGCAGTGGTGCGCGAGTTCCGATAG
Protein sequence
MTALDIYSQDTAPPPFTALDVRPCHLFPRTSTTTSQAPLYFNPSSLTHRVALTLDIYPQGLNTPLQKLMFGLAIYSQGLNCVPVLAVYPKQRETPKMTSNDFDLIPVTTK
QLRLVVGGCRIEGKQREQPPRVAPQHLLLPETTCRRLLLPDARRCCCAARRRLLLPEPDAAPTAPMSPLIVASPERSCSSEKKNPSGGWPAAPTREGKVWAYETISTLSL
RVATRLSDDAIPRLLRWSCTYSRGFLTLQRDVFDNMISKVKEYLVSTNAEAEHMVCIMRPPEARAIPAPPVVPDPPAVPDPAVVPAPAAVRNPTVVADPPADLERGTQER
RVKDKGKNIIEDPVEEAETLDNDALQGPALDDAGPSGNDSEALQKRSKRKKFKNNISRRLKRLDDRVGAIEATLTGVGVAMKGIQRYLKKLSKGKFPDPTKYFARGGGPD
DDDPSDQRPDEAPTPDGGPKSMDEDRRPKEVTKTDEYRTMEHGSKNMDGRKYSEARVEGSPNSFDAGRCLCAGFMSLPLHYLPGTVLVHNVPAYVLFDSGSSHTFISTAF
VRQATLELEPLGFLLSVSTPSGSVLIASQMVRAGELSFDNQTLEARLIQLDMRDFDVILGMDWLATNQANINCSRREVSFQLPSGRSFTFKGISGGVPRAVSALKARRLL
QNGVWGYLANVVDIMSRRLVLDELDRSEVELAVEDVSAVLSRLSVEPTLRQRVIAAQKGDPSLSKGFSMVDETLCYKEVPIEIVARETKVLRNRAIDLVKVLWRNHQVEE
ATWEREDEIRARCARVTATAVLGQRAVAMRGRLQQRCAGVGQRRRCTARRAAAVVREFR
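
The protein sequence should follow from the CDS by codon-wise translation. Below is a minimal sketch of that check in plain Python, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate` is illustrative, and the record does not say which tool the database actually used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from the pasted sequence.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGACGGCTCTTGACATTTAT"))  # MTALDIY
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence above is a quick consistency check. The first codons give `MTALDIY` and the final codons `GAG TTC CGA TAG` give `EFR` followed by a stop, matching the start and end of the listed protein.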