CuGenDBv2

Moc05g24800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g24800
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr5:17661338..17662399
RNA-Seq Expression: Moc05g24800
Synteny: Moc05g24800
Gene Ontology terms: NA
InterPro domains:
IPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal
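
The genome location string can be split into its parts when working with this record programmatically. The following is a minimal sketch, assuming the `chrom:start..end` layout shown above and 1-based, inclusive coordinates (an assumption, since the page does not state its coordinate convention). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:17661338..17662399")
print(chrom, span_length(start, end))  # prints: chr5 1062
```

Under the inclusive-coordinate assumption, the locus spans 1062 bp on chr5.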


Homology
GenBank top hits (e-value, % identity, alignment)
GFS39075.1 hypothetical protein Acr_00g0061040 [Actinidia rufa] (e-value: 1.8e-70; identity: 45.54%)
Query:  SNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIF
        S+  + S PK   T    S+      QIT  KL+G+N+LQ SRS  L+I G  R GY++G+I +P   DPSF +WD QNSMVMAWL+NSM++ I E ++ 
Subjt:  SNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIF

Query:  YSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLL
        Y TAK +W+A+T+A+SD ++S+Q+F+LR+++R+LRQ E  VTQY+SSL +LW ELDL     W  + + E +RK + KER Y+FL GL P LDDVRGR+L
Subjt:  YSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLL

Query:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRP-------QRKNLSGDY
        + KP+P++D IF+EVR E  R+R+M+G        S+  ++SA+AAR P S   R      LWCDHC R +HTK+ CW+LHG+P       QRK  S   
Subjt:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRP-------QRKNLSGDY

Query:  RPPPPTNTPSSRTSSSGY------QVGPSVSNSQDL
        +PP    + ++ +SS  +      Q+G   S+S  L
Subjt:  RPPPPTNTPSSRTSSSGY------QVGPSVSNSQDL

RVW27477.1 hypothetical protein CK203_092018 [Vitis vinifera] (e-value: 2.3e-70; identity: 46.31%)
Query:  MTDVRKDESSDGSNTTSISLPKSTSTTPHISI-SENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI
        M  + + + S  SN TS   P ++   P + I + +TSLQI   KLNGKN+LQ   SA +VI G+ ++ Y++G I +P  +D S+  WD QNSMVMAWLI
Subjt:  MTDVRKDESSDGSNTTSISLPKSTSTTPHISI-SENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI

Query:  NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG
        +SME++I ++++FY TAK++W+ +++A+SDF+ S+Q+FELRNKA++L+Q +SDVTQY+++L++LW ELDL    EW++ +D   F+K V+K R+YDFLAG
Subjt:  NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG

Query:  LRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSS--RSTRRNNLWCDHCKRTNHTKDRCWELHGRP
        L   LD+VRGR+   +P+  IDE+F+EVR E  R++V++G+  T P++   E+SAL  +      +     +R+  WCDHC+++ HTKD+CW+LHG+P
Subjt:  LRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSS--RSTRRNNLWCDHCKRTNHTKDRCWELHGRP

RVW74112.1 hypothetical protein CK203_052218 [Vitis vinifera] (e-value: 3.1e-70; identity: 46.31%)
Query:  MTDVRKDESSDGSNTTSISLPKSTSTTPHISI-SENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI
        M  + + + S  SN TS   P ++   P + I + +TSLQI   KLNGKN+LQ   SA +VI G+ ++ Y++G I +P  +D S+  WD QNSMVMAWLI
Subjt:  MTDVRKDESSDGSNTTSISLPKSTSTTPHISI-SENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI

Query:  NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG
        +SME++I ++++FY TAK++W+ +++A+SDF+ S+Q+FELRNKA++L+Q +S+VTQY+++L++LW ELDL    EW++ +D   F+K V+K R+YDFLAG
Subjt:  NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG

Query:  LRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSS--RSTRRNNLWCDHCKRTNHTKDRCWELHGRP
        L   LD+VRGR+   +P+  IDE+F+EVR E  R++VM+G+  T P++   E+SAL  +      +     +R+  WCDHC+++ HTKD+CW+LHG+P
Subjt:  LRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSS--RSTRRNNLWCDHCKRTNHTKDRCWELHGRP

XP_022154801.1 uncharacterized protein LOC111021967 [Momordica charantia] (e-value: 1.7e-89; identity: 96.57%)
Query:  MEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLR
        MEEDIKESFIFYSTAKDLWNALTM FSDFDNSAQLFELRNKA SLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG+R
Subjt:  MEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLR

Query:  PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCD
        PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSL LESSALAARGPP PSSRSTRRNNLW D
Subjt:  PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCD

XP_022159153.1 uncharacterized protein LOC111025577 [Momordica charantia] (e-value: 2.8e-140; identity: 97.75%)
Query:  MVKTAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        MVKTAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQIT PKLNGKNFLQ SRSALLVI GRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
Subjt:  MVKTAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPP
        DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV WESSRKRVMMGDTHTKPLSL LESSALAARGPP
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPP

TrEMBL top hits (e-value, % identity, alignment)
A0A438CW90 Uncharacterized protein (e-value: 1.1e-70; identity: 46.31%)
Query:  MTDVRKDESSDGSNTTSISLPKSTSTTPHISI-SENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI
        M  + + + S  SN TS   P ++   P + I + +TSLQI   KLNGKN+LQ   SA +VI G+ ++ Y++G I +P  +D S+  WD QNSMVMAWLI
Subjt:  MTDVRKDESSDGSNTTSISLPKSTSTTPHISI-SENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI

Query:  NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG
        +SME++I ++++FY TAK++W+ +++A+SDF+ S+Q+FELRNKA++L+Q +SDVTQY+++L++LW ELDL    EW++ +D   F+K V+K R+YDFLAG
Subjt:  NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG

Query:  LRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSS--RSTRRNNLWCDHCKRTNHTKDRCWELHGRP
        L   LD+VRGR+   +P+  IDE+F+EVR E  R++V++G+  T P++   E+SAL  +      +     +R+  WCDHC+++ HTKD+CW+LHG+P
Subjt:  LRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSS--RSTRRNNLWCDHCKRTNHTKDRCWELHGRP

A0A438GPJ5 Uncharacterized protein (e-value: 1.5e-70; identity: 46.31%)
Query:  MTDVRKDESSDGSNTTSISLPKSTSTTPHISI-SENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI
        M  + + + S  SN TS   P ++   P + I + +TSLQI   KLNGKN+LQ   SA +VI G+ ++ Y++G I +P  +D S+  WD QNSMVMAWLI
Subjt:  MTDVRKDESSDGSNTTSISLPKSTSTTPHISI-SENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLI

Query:  NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG
        +SME++I ++++FY TAK++W+ +++A+SDF+ S+Q+FELRNKA++L+Q +S+VTQY+++L++LW ELDL    EW++ +D   F+K V+K R+YDFLAG
Subjt:  NSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG

Query:  LRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSS--RSTRRNNLWCDHCKRTNHTKDRCWELHGRP
        L   LD+VRGR+   +P+  IDE+F+EVR E  R++VM+G+  T P++   E+SAL  +      +     +R+  WCDHC+++ HTKD+CW+LHG+P
Subjt:  LRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSS--RSTRRNNLWCDHCKRTNHTKDRCWELHGRP

A0A6J1DPT5 uncharacterized protein LOC111021967 (e-value: 8.4e-90; identity: 96.57%)
Query:  MEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLR
        MEEDIKESFIFYSTAKDLWNALTM FSDFDNSAQLFELRNKA SLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAG+R
Subjt:  MEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLR

Query:  PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCD
        PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSL LESSALAARGPP PSSRSTRRNNLW D
Subjt:  PELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCD

A0A6J1DY12 uncharacterized protein LOC111025577 (e-value: 1.4e-140; identity: 97.75%)
Query:  MVKTAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
        MVKTAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQIT PKLNGKNFLQ SRSALLVI GRSRLGYINGTIAEPDEADPSFSVWDAQNSMV
Subjt:  MVKTAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMV

Query:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
        MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFEL NKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY
Subjt:  MAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIY

Query:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPP
        DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEV WESSRKRVMMGDTHTKPLSL LESSALAARGPP
Subjt:  DFLAGLRPELDDVRGRLLATKPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPP

A0A7J0DNJ6 Uncharacterized protein (e-value: 8.7e-71; identity: 45.54%)
Query:  SNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIF
        S+  + S PK   T    S+      QIT  KL+G+N+LQ SRS  L+I G  R GY++G+I +P   DPSF +WD QNSMVMAWL+NSM++ I E ++ 
Subjt:  SNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIF

Query:  YSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLL
        Y TAK +W+A+T+A+SD ++S+Q+F+LR+++R+LRQ E  VTQY+SSL +LW ELDL     W  + + E +RK + KER Y+FL GL P LDDVRGR+L
Subjt:  YSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLL

Query:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRP-------QRKNLSGDY
        + KP+P++D IF+EVR E  R+R+M+G        S+  ++SA+AAR P S   R      LWCDHC R +HTK+ CW+LHG+P       QRK  S   
Subjt:  ATKPIPAIDEIFAEVRWESSRKRVMMGD-THTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRP-------QRKNLSGDY

Query:  RPPPPTNTPSSRTSSSGY------QVGPSVSNSQDL
        +PP    + ++ +SS  +      Q+G   S+S  L
Subjt:  RPPPPTNTPSSRTSSSGY------QVGPSVSNSQDL

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e-value: 1.6e-21; identity: 33.87%)
Query:  RSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESD
        R RS L V     + G+I+GT+ +PD   P +  W+  N+MVM WL+NSM + + ES ++  TA  +W  L   F    +  ++++LR +  +LRQG   
Subjt:  RSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEEDIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESD

Query:  VTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLATKPIPAIDEIFAEVR
        V +Y+  L ++W EL          C     E +K AE  R   EKE+ Y+FL GL+     + V  +++  KP P++ E FA V+
Subjt:  VTQYYSSLRRLWAELD--------LCLNLEWENSKDAERFRKHVEKERIYDFLAGLR--PELDDVRGRLLATKPIPAIDEIFAEVR


Sequences
CDS sequence
ATGGTTAAAACAGCCATGATGACTGATGTGCGCAAGGATGAGAGTTCCGACGGATCAAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCACATCTC
TATCTCTGAAAACACTTCGCTCCAAATTACCTGTCCAAAACTCAACGGAAAAAATTTTCTGCAACGGTCTCGATCGGCCCTCCTAGTTATTCCTGGCCGCAGTCGACTCG
GGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCCTTCTTTTTCCGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAG
GACATTAAAGAATCCTTCATCTTCTACTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCAGCTCAATTGTTTGAATTACGCAA
TAAGGCACGTTCCTTACGACAAGGTGAATCTGATGTCACCCAATACTACAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACT
CGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAACTAGATGATGTGCGTGGCCGCTTACTTGCCACA
AAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGTGATACACACACAAAACCTCTGTCCCTCCCACTGGA
ATCATCAGCTCTGGCGGCACGAGGTCCACCATCACCTTCATCCCGATCTACTCGTCGAAACAACCTATGGTGTGATCATTGTAAGCGCACAAACCATACAAAAGATCGGT
GTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTCCACCTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTG
GGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCATCTCTCTCCCTCCATTTTCGAAGGCACAACTTGAATAG
mRNA sequence
ATGGTTAAAACAGCCATGATGACTGATGTGCGCAAGGATGAGAGTTCCGACGGATCAAATACGACCTCAATTTCTCTCCCTAAAAGCACATCCACGACACCTCACATCTC
TATCTCTGAAAACACTTCGCTCCAAATTACCTGTCCAAAACTCAACGGAAAAAATTTTCTGCAACGGTCTCGATCGGCCCTCCTAGTTATTCCTGGCCGCAGTCGACTCG
GGTACATCAATGGCACGATAGCAGAACCAGATGAAGCTGACCCTTCTTTTTCCGTGTGGGATGCACAAAACTCAATGGTCATGGCATGGCTCATTAACTCGATGGAGGAG
GACATTAAAGAATCCTTCATCTTCTACTCAACAGCAAAGGATCTTTGGAATGCGCTCACTATGGCTTTTTCTGATTTTGATAACTCAGCTCAATTGTTTGAATTACGCAA
TAAGGCACGTTCCTTACGACAAGGTGAATCTGATGTCACCCAATACTACAGCTCATTACGTAGGTTGTGGGCTGAACTTGATTTATGTCTCAATCTCGAATGGGAGAACT
CGAAGGATGCTGAACGCTTCCGCAAACACGTCGAGAAGGAACGAATTTATGATTTTCTTGCAGGTCTTCGTCCGGAACTAGATGATGTGCGTGGCCGCTTACTTGCCACA
AAGCCAATCCCAGCCATTGACGAAATCTTCGCAGAAGTTCGCTGGGAGTCAAGCCGTAAACGTGTGATGATGGGTGATACACACACAAAACCTCTGTCCCTCCCACTGGA
ATCATCAGCTCTGGCGGCACGAGGTCCACCATCACCTTCATCCCGATCTACTCGTCGAAACAACCTATGGTGTGATCATTGTAAGCGCACAAACCATACAAAAGATCGGT
GTTGGGAACTCCATGGTCGTCCTCAACGGAAGAATTTATCAGGGGATTATCGGCCACCTCCACCTACAAATACTCCATCTTCTCGAACCAGCTCCTCCGGCTATCAAGTG
GGCCCTAGTGTGTCAAACTCCCAAGATTTGGCCATCTCTCTCCCTCCATTTTCGAAGGCACAACTTGAATAG
Protein sequence
MVKTAMMTDVRKDESSDGSNTTSISLPKSTSTTPHISISENTSLQITCPKLNGKNFLQRSRSALLVIPGRSRLGYINGTIAEPDEADPSFSVWDAQNSMVMAWLINSMEE
DIKESFIFYSTAKDLWNALTMAFSDFDNSAQLFELRNKARSLRQGESDVTQYYSSLRRLWAELDLCLNLEWENSKDAERFRKHVEKERIYDFLAGLRPELDDVRGRLLAT
KPIPAIDEIFAEVRWESSRKRVMMGDTHTKPLSLPLESSALAARGPPSPSSRSTRRNNLWCDHCKRTNHTKDRCWELHGRPQRKNLSGDYRPPPPTNTPSSRTSSSGYQV
GPSVSNSQDLAISLPPFSKAQLE
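
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base. The `translate` function and `CODON_TABLE` are illustrative names, not part of any CuGenDB tool. The example input is the first 30 nucleotides of the CDS listed above.

```python
# Standard genetic code, written compactly with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS sequence shown above.
print(translate("ATGGTTAAAACAGCCATGATGACTGATGTG"))  # prints: MVKTAMMTDV
```

The printed peptide matches the first ten residues of the protein sequence listed above. Passing the full CDS, with line breaks removed, should reproduce the complete protein under the same assumptions.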