; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g23010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g23010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:16681268..16684285
RNA-Seq ExpressionMoc04g23010
SyntenyMoc04g23010
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]7.3e-11279.72Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKA                                K+PTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IAL GSARLWYRRLPA SI TYSQLRREFL  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAP
        TVKLGE+APATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE A PKSKDKGSFSSGRAE+   E+ P    P
Subjt:  TVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAP

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]1.8e-11873.23Show/hide
Query:  IREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA-----
        + EQRG HLGPV++ HPE  E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+NP TP GVITR EFDQL+ K DAQVEALKA     
Subjt:  IREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA-----

Query:  ---------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQF
                                   K+PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQIAL GSARLWYRRLPARSI TYSQLR+EF++QF
Subjt:  ---------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQF

Query:  SSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG
        SSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL E+APATF EVLQKAKK+IDGQELLRTKT RPE+KI 
Subjt:  SSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG

Query:  RGRSGKDIEKAAPKSKDKG--SFSS
        +GR+ KD  K   K++DKG  SFSS
Subjt:  RGRSGKDIEKAAPKSKDKG--SFSS

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]1.1e-12080.13Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITR EFDQLRGKLDAQVEALKA                                K+PTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIAL  SARLWYRRLPARSI TYSQLRREFL QFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAH
         HCSDDSAMCYFLTGLADEA TVKLGE+APATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE+A  KSKDKGSFSS RA +   E+ P   
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAH

Query:  AP
         P
Subjt:  AP

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]1.9e-11273.63Show/hide
Query:  DNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA-----------------------
        + E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+NP TP  VITR EFDQL+ K DAQVEALKA                       
Subjt:  DNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA-----------------------

Query:  ---------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQ
                 K+PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCRAFQIA  GSARLWYRRLPARSI TYSQLR+EF+ QFSSR+YD+KTATHLATIRQ
Subjt:  ---------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQ

Query:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDK
        K+GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+ LTVKLGE+APATFAEVLQKAKKVIDGQELLRTKTGRPE+KI + R GKD  KA  KS+DK
Subjt:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDK

Query:  G-SFSSGRAEF
        G S SS RA++
Subjt:  G-SFSSGRAEF

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]3.7e-10879.55Show/hide
Query:  GVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALI
        G+ITR EFDQLRG+LDAQVEALKA                                K+PTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 
Subjt:  GVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALI

Query:  GSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATF
        GSARLWYRRLP RSI TYSQLRREFL QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+APATF
Subjt:  GSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAP
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+A PKSKDKGSFSSGRAE+   E+ P    P
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAP

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.5e-11279.72Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKA                                K+PTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IAL GSARLWYRRLPA SI TYSQLRREFL  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAP
        TVKLGE+APATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE A PKSKDKGSFSSGRAE+   E+ P    P
Subjt:  TVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAP

A0A6J1DDS5 uncharacterized protein LOC1110198425.4e-12180.13Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITR EFDQLRGKLDAQVEALKA                                K+PTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIAL  SARLWYRRLPARSI TYSQLRREFL QFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAH
         HCSDDSAMCYFLTGLADEA TVKLGE+APATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE+A  KSKDKGSFSS RA +   E+ P   
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAH

Query:  AP
         P
Subjt:  AP

A0A6J1DDW5 uncharacterized protein LOC1110196348.6e-11973.23Show/hide
Query:  IREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA-----
        + EQRG HLGPV++ HPE  E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+NP TP GVITR EFDQL+ K DAQVEALKA     
Subjt:  IREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA-----

Query:  ---------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQF
                                   K+PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQIAL GSARLWYRRLPARSI TYSQLR+EF++QF
Subjt:  ---------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQF

Query:  SSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG
        SSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADE LTVKL E+APATF EVLQKAKK+IDGQELLRTKT RPE+KI 
Subjt:  SSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG

Query:  RGRSGKDIEKAAPKSKDKG--SFSS
        +GR+ KD  K   K++DKG  SFSS
Subjt:  RGRSGKDIEKAAPKSKDKG--SFSS

A0A6J1DM55 uncharacterized protein LOC1110222679.2e-11373.63Show/hide
Query:  DNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA-----------------------
        + E E +T QRGDLREHLNRKR SSLRKGQSPS SHR+SNQQAESS+NP TP  VITR EFDQL+ K DAQVEALKA                       
Subjt:  DNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKA-----------------------

Query:  ---------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQ
                 K+PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCRAFQIA  GSARLWYRRLPARSI TYSQLR+EF+ QFSSR+YD+KTATHLATIRQ
Subjt:  ---------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQ

Query:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDK
        K+GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+ LTVKLGE+APATFAEVLQKAKKVIDGQELLRTKTGRPE+KI + R GKD  KA  KS+DK
Subjt:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDK

Query:  G-SFSSGRAEF
        G S SS RA++
Subjt:  G-SFSSGRAEF

A0A6J1DS95 uncharacterized protein LOC1110234211.8e-10879.55Show/hide
Query:  GVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALI
        G+ITR EFDQLRG+LDAQVEALKA                                K+PTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 
Subjt:  GVITRAEFDQLRGKLDAQVEALKA--------------------------------KSPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALI

Query:  GSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATF
        GSARLWYRRLP RSI TYSQLRREFL QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+APATF
Subjt:  GSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAP
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+A PKSKDKGSFSSGRAE+   E+ P    P
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGAC
CTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAAC
CCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATCTCCTACCGTGAAG
CCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATC
GCGCTTATCGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTTGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGTCCAGTTCTCTTCTCGG
CATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAAGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTTCAGGAGGAGCAATTGAAGGTC
GCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACTGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGAAGGCCCCGGCCACTTTCGCCGAG
GTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACAAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGACATA
GAAAAGGCAGCTCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCATGCGCCTAACACCACT
GTGGCGATGCACAACGTCTATGACAGATGGATCAAGGCTAATGACAAAGCAAAAGTTTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGCACGAGGAC
ACGGTCACCGCTAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGAC
CTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAAC
CCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATCTCCTACCGTGAAG
CCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATC
GCGCTTATCGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTTGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGTCCAGTTCTCTTCTCGG
CATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAAGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTTCAGGAGGAGCAATTGAAGGTC
GCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACTGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGAAGGCCCCGGCCACTTTCGCCGAG
GTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACAAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGACATA
GAAAAGGCAGCTCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCATGCGCCTAACACCACT
GTGGCGATGCACAACGTCTATGACAGATGGATCAAGGCTAATGACAAAGCAAAAGTTTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGAAGCACGAGGAC
ACGGTCACCGCTAAGTAG
Protein sequenceShow/hide protein sequence
MLVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKSPTVK
PYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSILTYSQLRREFLVQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
AHCSDDSAMCYFLTGLADEALTVKLGEKAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKAAPKSKDKGSFSSGRAEFVLQEDCPQAHAPNTT
VAMHNVYDRWIKANDKAKVYILASISDVLAKKHEDTVTAK