; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g19700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g19700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:14613582..14614838
RNA-Seq ExpressionMoc02g19700
SyntenyMoc02g19700
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.4e-11590.28Show/hide
Query:  RAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKP DG+KDPKDYVEVFE LMDFQAASD
Subjt:  RAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASD

Query:  AIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTR QEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP RK  R R G+
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]2.8e-11685.07Show/hide
Query:  RGGTSKKGARGPAPAPTSCRRRAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGT
        RG + +KG + P+ +  S  ++AESSHN   PAG+ITREEFDQLRGKLDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKP DG+
Subjt:  RGGTSKKGARGPAPAPTSCRRRAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGT

Query:  KDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKV
        +DPKDYVEVFE LMDFQAASD IKC AFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTR QEEQLKV
Subjt:  KDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP RK  R R G+
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]8.3e-12973.89Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSCRRRAESSHNP--A
        MVQPANSTNTADRR LAA+  HQREVGA  VEGQGH+ L T+PL RSARIT P LPPAHP+ SK                          AESS+NP   
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSCRRRAESSHNP--A

Query:  GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALT
        G+ITREEFDQL+ K DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KP DG+KDPKDYVEVFE LMDFQAA+DAIKCCAFQIALT
Subjt:  GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR  EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGRSQ
        AEVLQK KKVIDGQELLRTKTGRP +   + R G+ +
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGRSQ

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]7.3e-11794.89Show/hide
Query:  GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALT
        GIITREEFDQLRG+LDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKP DGTKDPKDYVEVFE LMDFQAASDAIKC AFQIALT
Subjt:  GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR QEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR
        AEVLQKAKKVIDGQELLRTKTGRP RK  R R G+
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]3.3e-11762.22Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPT---------------
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL T+P  RSARIT P L PAHP+  KA RGRGG S++   G APAP+               
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPT---------------

Query:  -----------------------------------------------------SC-----RRRAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKC
                                                             SC      ++AESS+NP    G+ITREEFDQL+ K DAQVE LKA+C
Subjt:  -----------------------------------------------------SC-----RRRAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKC

Query:  EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFL
        E K  + +DGDLGESPFTSD+LEA IP KFK PT+KP DG+KDPKDYVEVFE LM FQAA+DAIK  AFQIALT SARLWYRRLPARSISTYSQLR+EF 
Subjt:  EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFL

Query:  AQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR
        +QFSSRHY++KTATHLATIRQKE ETLREYVT  QEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR
Subjt:  AQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088136.7e-11690.28Show/hide
Query:  RAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKP DG+KDPKDYVEVFE LMDFQAASD
Subjt:  RAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASD

Query:  AIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTR QEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP RK  R R G+
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR

A0A6J1DDS5 uncharacterized protein LOC1110198421.3e-11685.07Show/hide
Query:  RGGTSKKGARGPAPAPTSCRRRAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGT
        RG + +KG + P+ +  S  ++AESSHN   PAG+ITREEFDQLRGKLDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKP DG+
Subjt:  RGGTSKKGARGPAPAPTSCRRRAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGT

Query:  KDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKV
        +DPKDYVEVFE LMDFQAASD IKC AFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTR QEEQLKV
Subjt:  KDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR
         HCSDDSAMCYFLTGLADEA TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP RK  R R G+
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR

A0A6J1DHB3 uncharacterized protein LOC1110204794.0e-12973.89Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSCRRRAESSHNP--A
        MVQPANSTNTADRR LAA+  HQREVGA  VEGQGH+ L T+PL RSARIT P LPPAHP+ SK                          AESS+NP   
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSCRRRAESSHNP--A

Query:  GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALT
        G+ITREEFDQL+ K DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KP DG+KDPKDYVEVFE LMDFQAA+DAIKCCAFQIALT
Subjt:  GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR  EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGRSQ
        AEVLQK KKVIDGQELLRTKTGRP +   + R G+ +
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGRSQ

A0A6J1DS95 uncharacterized protein LOC1110234213.5e-11794.89Show/hide
Query:  GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALT
        GIITREEFDQLRG+LDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKP DGTKDPKDYVEVFE LMDFQAASDAIKC AFQIALT
Subjt:  GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR QEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR
        AEVLQKAKKVIDGQELLRTKTGRP RK  R R G+
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPGRKWKRCREGR

A0A6J1DZJ1 uncharacterized protein LOC1110257381.6e-11762.22Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPT---------------
        MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL T+P  RSARIT P L PAHP+  KA RGRGG S++   G APAP+               
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPT---------------

Query:  -----------------------------------------------------SC-----RRRAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKC
                                                             SC      ++AESS+NP    G+ITREEFDQL+ K DAQVE LKA+C
Subjt:  -----------------------------------------------------SC-----RRRAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKC

Query:  EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFL
        E K  + +DGDLGESPFTSD+LEA IP KFK PT+KP DG+KDPKDYVEVFE LM FQAA+DAIK  AFQIALT SARLWYRRLPARSISTYSQLR+EF 
Subjt:  EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLRREFL

Query:  AQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR
        +QFSSRHY++KTATHLATIRQKE ETLREYVT  QEEQLKVAH SDDSA+CYFLT L DE LTVKLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR
Subjt:  AQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACACAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGCGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCTGCAGGCGCAGGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTTGACCAGCTGAGG
GGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTAAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGA
AGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTCTGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGACCTCATGGACTTCCAAGCGG
CATCAGACGCAATCAAATGCTGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCAAGGTCGATCTCGACCTACTCTCAGCTGAGA
AGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACACTGCGGGAATATGTCACCAG
ATTGCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGG
CCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGTCGACCGGGGCGGAAGTGGAAAAGATGTAGA
GAGGGCAGATCCCAAGTCCAAGGGCAAGGGATCCTTTTCCAGCGGCCGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACACAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGCGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCTGCAGGCGCAGGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTTGACCAGCTGAGG
GGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTAAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGA
AGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTCTGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGACCTCATGGACTTCCAAGCGG
CATCAGACGCAATCAAATGCTGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCAAGGTCGATCTCGACCTACTCTCAGCTGAGA
AGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACACTGCGGGAATATGTCACCAG
ATTGCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGG
CCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGTCGACCGGGGCGGAAGTGGAAAAGATGTAGA
GAGGGCAGATCCCAAGTCCAAGGGCAAGGGATCCTTTTCCAGCGGCCGAGCTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATQPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSCRRRAESSHNPAGIITREEFDQLR
GKLDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPSDGTKDPKDYVEVFEDLMDFQAASDAIKCCAFQIALTGSARLWYRRLPARSISTYSQLR
REFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRLQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPGRKWKRCR
EGRSQVQGQGILFQRPS