; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1053 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1053
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionFAM192A_Fyv6_N domain-containing protein
Genome locationMC04:18638069..18644175
RNA-Seq ExpressionMC04g1053
SyntenyMC04g1053
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR019331 - FAM192A/Fyv6, N-terminal
IPR039845 - PSME3-interacting protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ98134.1 protein FAM192A [Cucumis melo var. makuwa]3.16e-10181.22Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDES RAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILK+NKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEE+ELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-
        SFQAAVAAQS +L+E++E+TPPAP AQEK SVRRETP +R PSMII+VKPQAKKARIEPRSP+ A +          EPS+S KT ++  DRP EAA I 
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-

Query:  GLVSYSGESEDED
        GLVSYS ESEDED
Subjt:  GLVSYSGESEDED

XP_008466362.1 PREDICTED: protein FAM192A [Cucumis melo]1.13e-10181.22Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDES RAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILK+NKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEE+ELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-
        SFQAAVAAQS +L+E++E+TPPAP AQEK SVRRETP +R PSMII+VKPQAKKARIEPRSP+ A +          EPS+S KT ++  DRP EAA I 
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-

Query:  GLVSYSGESEDED
        GLVSYS ESEDED
Subjt:  GLVSYSGESEDED

XP_022146349.1 protein FAM192A [Momordica charantia]2.89e-129100Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDRPHEAATIGLVSYSGESED
        SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDRPHEAATIGLVSYSGESED
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDRPHEAATIGLVSYSGESED

Query:  ED
        ED
Subjt:  ED

XP_022993730.1 protein FAM192A [Cucurbita maxima]6.50e-10180.28Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDES RAIRLM+FVSEEQLDEAKKTRGERVEDGTAQRDRPL+EILKENKDKRDAEFNERFKHRPPKALDEDETEFLDK ETSKREYERQMADA+E+ELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKA----------SNEPSESEKTPNSIGDRPHEAAT-I
        SFQAAVAAQS +L+E++E+TPPAP AQEK  VR+ETP TR PSMII+VKPQAKKARIEPRSPEKA          + E SES +T ++  DRP EAAT I
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKA----------SNEPSESEKTPNSIGDRPHEAAT-I

Query:  GLVSYSGESEDED
        GLVSYS ESEDED
Subjt:  GLVSYSGESEDED

XP_023549588.1 protein FAM192A [Cucurbita pepo subsp. pepo]1.31e-10080.28Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDES RAIRLM+FVSEEQLDEAKKTRGERVEDGTAQRDRPL+EILKENKDKRDAEFNERFKHRPPKALDEDETEFLDK ETSKREYERQMADA+E+ELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAAT-I
        SFQAAVAAQS +L E++E+TPPAP AQEK  VR+ETP TR PSMII+VKPQAKKARIEPRSPEKA +          E SES +T ++  DRP EAAT I
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAAT-I

Query:  GLVSYSGESEDED
        GLVSYS ESEDED
Subjt:  GLVSYSGESEDED

TrEMBL top hitse value%identityAlignment
A0A1S3CR24 protein FAM192A5.46e-10281.22Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDES RAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILK+NKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEE+ELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-
        SFQAAVAAQS +L+E++E+TPPAP AQEK SVRRETP +R PSMII+VKPQAKKARIEPRSP+ A +          EPS+S KT ++  DRP EAA I 
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-

Query:  GLVSYSGESEDED
        GLVSYS ESEDED
Subjt:  GLVSYSGESEDED

A0A5A7U2Y6 Protein FAM192A5.46e-10281.22Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDES RAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILK+NKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEE+ELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-
        SFQAAVAAQS +L+E++E+TPPAP AQEK SVRRETP +R PSMII+VKPQAKKARIEPRSP+ A +          EPS+S KT ++  DRP EAA I 
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-

Query:  GLVSYSGESEDED
        GLVSYS ESEDED
Subjt:  GLVSYSGESEDED

A0A5D3BGC9 Protein FAM192A1.53e-10181.22Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDES RAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILK+NKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEE+ELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-
        SFQAAVAAQS +L+E++E+TPPAP AQEK SVRRETP +R PSMII+VKPQAKKARIEPRSP+ A +          EPS+S KT ++  DRP EAA I 
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASN----------EPSESEKTPNSIGDRPHEAATI-

Query:  GLVSYSGESEDED
        GLVSYS ESEDED
Subjt:  GLVSYSGESEDED

A0A6J1CXV7 protein FAM192A1.40e-129100Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDRPHEAATIGLVSYSGESED
        SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDRPHEAATIGLVSYSGESED
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDRPHEAATIGLVSYSGESED

Query:  ED
        ED
Subjt:  ED

A0A6J1JZC4 protein FAM192A3.15e-10180.28Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        MEDES RAIRLM+FVSEEQLDEAKKTRGERVEDGTAQRDRPL+EILKENKDKRDAEFNERFKHRPPKALDEDETEFLDK ETSKREYERQMADA+E+ELR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKA----------SNEPSESEKTPNSIGDRPHEAAT-I
        SFQAAVAAQS +L+E++E+TPPAP AQEK  VR+ETP TR PSMII+VKPQAKKARIEPRSPEKA          + E SES +T ++  DRP EAAT I
Subjt:  SFQAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKA----------SNEPSESEKTPNSIGDRPHEAAT-I

Query:  GLVSYSGESEDED
        GLVSYS ESEDED
Subjt:  GLVSYSGESEDED

SwissProt top hitse value%identityAlignment
Q91WE2 PSME3-interacting protein7.8e-0529.96Show/hide
Query:  EDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRD-----------RPLFEILKENKDKRDAEFNERFKHR-PPKALDEDETEFLDKLETSKREYER
        ED+S   I+   FVSE +LDE +K R E  E      D           R L+E L+E KD++  E+ E+FK +   + LDEDET FLD++   +   E+
Subjt:  EDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRD-----------RPLFEILKENKDKRDAEFNERFKHR-PPKALDEDETEFLDKLETSKREYER

Query:  QMADAEEEELRSFQA-----AVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEP------SESEKTPNS
        Q  + E EEL+ +++      ++A++  + E K    P  T  + +  +    A +  S   +     K+ + +P   +KA   P      S S   P S
Subjt:  QMADAEEEELRSFQA-----AVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEP------SESEKTPNS

Query:  IGDRPHEAATIGLV----SYSGESEDE
        I   P  A  IG++    +YSG S+ E
Subjt:  IGDRPHEAATIGLV----SYSGESEDE

Q9GZU8 PSME3-interacting protein8.6e-0427.23Show/hide
Query:  ESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRD-----------RPLFEILKENKDKRDAEFNERFKHR-PPKALDEDETEFLDKLETSKREYERQM
        + G  I    FVSE +LDE +K R E  E      D           R L+E L+E KD++  E+ E+FK +   + LDEDET FLD++   +   E+Q 
Subjt:  ESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRD-----------RPLFEILKENKDKRDAEFNERFKHR-PPKALDEDETEFLDKLETSKREYERQM

Query:  ADAEEEELRSF-----QAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDR----
         + E +EL+ +     +  ++ ++    E K    P  T  + +  +    A +  S   +     K+ + +P  P+  + EPS  +   N+        
Subjt:  ADAEEEELRSF-----QAAVAAQSTMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDR----

Query:  -PHEAATIGLV----SYSGESEDE
         P  A  IG++    +YSG S+ E
Subjt:  -PHEAATIGLV----SYSGESEDE

Arabidopsis top hitse value%identityAlignment
AT3G62140.1 CONTAINS InterPro DOMAIN/s: NEFA-interacting nuclear protein NIP30, N-terminal (InterPro:IPR019331); Has 398 Blast hits to 395 proteins in 139 species: Archae - 0; Bacteria - 6; Metazoa - 193; Fungi - 83; Plants - 36; Viruses - 0; Other Eukaryotes - 80 (source: NCBI BLink).2.5e-5460.28Show/hide
Query:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR
        M DE+ + IRL+NFVSEEQLDE+KK RGERVEDGT QRDR L+EILKENKDK+DAEFNERFKHRPPKALDEDETEFLDKLE SKREYERQ+A+ E+E+LR
Subjt:  MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELR

Query:  SFQAAVAAQSTMLHELKE--VTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIE-------PRSPEKASN-EPSESEKTPNSIGDRPHEAATIG
        +FQAAVAA+S +LHE KE  + PPAP  +E+  + +  PATR    IIKVKPQ KKA+         P + + AS+ + +  +    ++  +  E    G
Subjt:  SFQAAVAAQSTMLHELKE--VTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIE-------PRSPEKASN-EPSESEKTPNSIGDRPHEAATIG

Query:  --LVSYSGESEDED
          LVSYS ESED+D
Subjt:  --LVSYSGESEDED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGACGAATCTGGGCGAGCAATTAGGCTGATGAATTTCGTCTCTGAGGAACAGCTGGATGAAGCCAAGAAAACAAGGGGCGAGCGAGTTGAAGACGGCACTGCTCA
AAGAGACAGACCCCTCTTCGAGATCCTAAAGGAAAATAAAGACAAGCGTGATGCTGAATTTAATGAACGGTTCAAGCACAGACCACCAAAAGCTCTGGATGAGGATGAGA
CAGAATTTCTGGATAAATTGGAAACGTCAAAGAGAGAATATGAAAGGCAAATGGCTGATGCAGAAGAAGAAGAGCTCCGCAGTTTTCAAGCAGCAGTAGCAGCACAATCT
ACTATGTTGCATGAACTGAAGGAAGTAACCCCGCCTGCTCCCACAGCTCAGGAAAAGACATCAGTTAGGAGAGAAACTCCAGCCACTCGAGCACCGAGTATGATTATTAA
AGTAAAGCCACAGGCAAAGAAAGCGAGAATCGAACCCAGAAGCCCAGAAAAAGCTTCAAATGAGCCTTCAGAGTCAGAAAAAACACCTAATAGCATTGGTGATAGACCTC
ATGAAGCTGCTACGATAGGACTCGTTTCATACAGTGGTGAAAGTGAAGACGAAGATTAG
mRNA sequenceShow/hide mRNA sequence
AGACATTGCCCACGGCCTCAGTATATAGCAGCCAGAGATATCATAATACTCAATCATTTCTCCACTGACAACCACCTAGACAAACTAGAAAAAGAATTATTCAAGTCACT
GAAAAGAACCCGTTCATATTATTACTGGACTGCTTCTTCCAAAAAAGAAATATCTTCAAATATTCTTTTACCATTTTATCTGTTTTACATAAGAAATTATTTCCTTGATC
AATGACATTTTACAACAATAAAATAAAAAAGGGCAAAGAAGTTCTAGAAATCGATGAGAATGATGAGAATGCAAACTCAACTCGTACAAATAAATTTACCCAACACAAAT
TTGGAACACAAAAATCAAGCAAGAAACAGCACCTCAGAATACCACTCTCCAAAGTTTTCATCCTTCTTATTGGTGAGACCTAAACCTGTCTCTTTCTTCACCTCCTTCTT
TTTCCCGCCTGCCCAGTAAACAACAAAGAAATAGTAAAAACAGAACCAAACAAAGGAATGAGAAACAAATAGTGTATTAGAGGAACAATATTTACCAGCTTTTGGATTAG
TAGCAGAGCTACCAGGCTTAGGACCAGCCATGCTTTCAATTTATAACCTCTGCACGATAATCATCTAAAAAAATTCAATTGGAGACAATGAAATGCAATAAAATAACAAT
CAAGTGGACATTGGGAGAAGTTCCAACAATCTACTATGTAAGAATGACCAGCCCACAGGAAGAAATCTAAGTCAACACTATGAAATTTGGACTGGATTGGGTCTGGTTGT
CCTTTTTTTTTTCCTTTCAAATCGAAAATCGGCTACAAATGCCACAAAATAAACAAAAAAACTTAAATCATCAATCCATGCTAACCATCCCGACAAGGAAAAAATAACAA
AGAACGAAAAAAACGAAGAGTAAATAACAGAGGAGCACTTAAGAACTAGATCTGAGTAGTTAGTGATTCTTATCAAAAAGGAAAATCGTATTCAGATTTGATATCCTTCC
CAGCAACAAATAGCAAAAATGGAGAAAATATGAGAAAATAAAAAAATAAAAAAATAACACAATAATTCTGACTGTGTCATGGTTGAGATTCAGAAAATTAAAATGAACAC
AACGACGAACATTGAAGAGTTCACGAAGATGCTCAACAAGCTTCAAACCCACTGAATGTGAGCGTGAGTGTGAGTCAGAAAGAGAAAGAAGGGACTGTGCAATTGAAGTA
ACCAAAAAAAGATGCCACAATGAGTTCTACGATGGATCGACCTCTGCGATTCAGATGAGCACCGGACTACCCCGCGGCGGCGATGCGCAGGCGGCGCGAACAGTGATGAA
ATGCGAGGAGGAATTTGAGGGTTGAGGACAGAACAGAACAGAAATGGATCAATTTTTCAAAAAAGCTTTCTTTACAAAGAAAAAAAGAAATCCAAAACTAAAAAAGCCGT
TGACTGCATCTTTTGGGCGGACTGGAGCTGATCGTTGGGCTGGTCCACTGGGCTGCACGGGCCACAGGGTGGGATGGGATTGGGCTGCTGAAGGCACAAAAGGAACTTGG
GTGGTTCGGTTCGGTTTTTCTATCAGAAAATTTTCTGTTTTTTTGATCAGCGGCGGCGAAGGAACGGAGGGAGGGAGAGACTCGGACTCGGAGACTGGAAAGAACATTGT
AAGGAAGTGACTTGTAAAATTGAAATCTTGGAAGTGTTTGAAGTGGAGCGCGAGCGAGAATGGAGGACGAATCTGGGCGAGCAATTAGGCTGATGAATTTCGTCTCTGAG
GAACAGCTGGATGAAGCCAAGAAAACAAGGGGCGAGCGAGTTGAAGACGGCACTGCTCAAAGAGACAGACCCCTCTTCGAGATCCTAAAGGAAAATAAAGACAAGCGTGA
TGCTGAATTTAATGAACGGTTCAAGCACAGACCACCAAAAGCTCTGGATGAGGATGAGACAGAATTTCTGGATAAATTGGAAACGTCAAAGAGAGAATATGAAAGGCAAA
TGGCTGATGCAGAAGAAGAAGAGCTCCGCAGTTTTCAAGCAGCAGTAGCAGCACAATCTACTATGTTGCATGAACTGAAGGAAGTAACCCCGCCTGCTCCCACAGCTCAG
GAAAAGACATCAGTTAGGAGAGAAACTCCAGCCACTCGAGCACCGAGTATGATTATTAAAGTAAAGCCACAGGCAAAGAAAGCGAGAATCGAACCCAGAAGCCCAGAAAA
AGCTTCAAATGAGCCTTCAGAGTCAGAAAAAACACCTAATAGCATTGGTGATAGACCTCATGAAGCTGCTACGATAGGACTCGTTTCATACAGTGGTGAAAGTGAAGACG
AAGATTAGCAAGTCGAGTCGAGTCAACTATTAGAATTTTCTAAACAATCACCATATTGTATTTAATTTTTTTCGGTGTCAAGGTTGTGTCTTCTATGGGATTAAGCAATG
AATCTTTTCTTCTTCCATTTGTTCATCGTTAAAATGTTTTAAACAATCCACGATTTGATTTACAGTTCCGTGACTCGGTTATAGATGTATATTTGACCAAGCGGATTGTT
GCAATTTTCACTTCTAGAATTCTTATGATTTAGAATTCCTATTTAATCAATTTATAATTTGT
Protein sequenceShow/hide protein sequence
MEDESGRAIRLMNFVSEEQLDEAKKTRGERVEDGTAQRDRPLFEILKENKDKRDAEFNERFKHRPPKALDEDETEFLDKLETSKREYERQMADAEEEELRSFQAAVAAQS
TMLHELKEVTPPAPTAQEKTSVRRETPATRAPSMIIKVKPQAKKARIEPRSPEKASNEPSESEKTPNSIGDRPHEAATIGLVSYSGESEDED