; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g14170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g14170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:10852143..10857299
RNA-Seq ExpressionMoc04g14170
SyntenyMoc04g14170
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN71748.1 hypothetical protein VITISV_019194 [Vitis vinifera]6.7e-5338.27Show/hide
Query:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------
        ++SKI EHKLNGSNY  W   ++++LRS+  DDH+ E+PP D ++K W++DDARL LQ+KNSI  DI+GL++                            
Subjt:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------

Query:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS
                                         +S D +V+ AQREQ+AV+SFL GLP  F+ AK Q LSGS+I +L+E +++VLR E    V SSQ  +
Subjt:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS

Query:  ALVGRSTNA----------------NRGN----------TYQGNSTTSNAQRSSAPRGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLT
         LV +  NA                NRGN           ++   T  N  R    R ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L 
Subjt:  ALVGRSTNA----------------NRGN----------TYQGNSTTSNAQRSSAPRGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLT

Query:  ASSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT
        AS+  P+ A+AESG T  CL+SS++KW+IDSGAT+HM+GN   F+  F + S P VT+A+G+T  + GSGT
Subjt:  ASSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT

KAB5516879.1 hypothetical protein DKX38_027527 [Salix brachista]3.2e-5534.42Show/hide
Query:  ISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDES-KKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------
        +SKI +HKLNG+NY  W   +R++LRS++ DDH+ E+PP DE+ KK W+RDDARL LQI+NSI+ +I+GL+N                            
Subjt:  ISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDES-KKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------

Query:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS
                                         +S D KV+  QRE++AV+SFL GLP   +  K Q LS  EI +L+E ++++LR E +  +  + +  
Subjt:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS

Query:  ALVGRSTNANRGNTYQGNSTTSNAQRSSA----------------------PRGQKTQSAHVASTPD----NAGKLVTIPAEEFAKFQQYQESLTASSSN
           GR  +  R    +G S T ++  + +                       R Q+ Q A+V +T      ++ K + + A++FAKF  YQ+SL  S+  
Subjt:  ALVGRSTNANRGNTYQGNSTTSNAQRSSA----------------------PRGQKTQSAHVASTPD----NAGKLVTIPAEEFAKFQQYQESLTASSSN

Query:  PIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT-----------GESSQEDDDFLVYFIVSPST---
        P  A  E+G T  CLISS+++WVIDSGAT+HM+GNP  F+NF    +   VTIA+G+TS ++G GT            ++  ED+++L+Y +  P+T   
Subjt:  PIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT-----------GESSQEDDDFLVYFIVSPST---

Query:  -EELPSNTSSYVPDPSIPTITQVYSRRQPPTNSCPIPAASSPEDPGISDDLPIALRKGP
            P +     P  + P I QVYSRRQ  T++CP P      DP    DLPI LRK P
Subjt:  -EELPSNTSSYVPDPSIPTITQVYSRRQPPTNSCPIPAASSPEDPGISDDLPIALRKGP

KAF9661591.1 hypothetical protein SADUNF_Sadunf19G0084700 [Salix dunnii]7.2e-5536.29Show/hide
Query:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKD-ESKKAWLRDDARLVLQIKNSIEGDIIGLVNE--------------------------
        M +KI EHKLN +NY +W   VR++LRSID DDH+++DPP D  +KKAWLRDDAR+ LQI+NSI+ ++I LVN                           
Subjt:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKD-ESKKAWLRDDARLVLQIKNSIEGDIIGLVNE--------------------------

Query:  ----------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVE--KSQPVLS--
                                          +S D K + +QREQ+AV+SFL GLPP FD A+ Q LS  E+  L + +T++LR E  +S P+++  
Subjt:  ----------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVE--KSQPVLS--

Query:  --SQSNSALVGRSTNANRGN----TYQGNSTTSNAQRSSAP---------------------RGQKTQSA-----------HVASTPDNAGKLVTIPAEE
          S+  S   GR +   RG+    ++QG+ +     R++                        G+  Q A           HV++   +  + V + A+E
Subjt:  --SQSNSALVGRSTNANRGN----TYQGNSTTSNAQRSSAP---------------------RGQKTQSA-----------HVASTPDNAGKLVTIPAEE

Query:  FAKFQQYQESLTASSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFP---SASLPDVTIANGTTSPVLGSGTGESSQEDDDFLVYF
        FA+F QYQ SL    SNP   I ESG  + CL+SS+SKWVIDSGAT+HM+G+  + + F P     +LP VT+A+G+T+ V               +VY 
Subjt:  FAKFQQYQESLTASSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFP---SASLPDVTIANGTTSPVLGSGTGESSQEDDDFLVYF

Query:  IVSPSTEELPSNTSSYVPDPSIPTITQVYSRRQPPTNSCPIPAASSPEDPGISDDLPIALRKG
        I +       ++ SS  P P  P ITQVYSRR PP NSCP P A +  DP +  DLPIA+RKG
Subjt:  IVSPSTEELPSNTSSYVPDPSIPTITQVYSRRQPPTNSCPIPAASSPEDPGISDDLPIALRKG

KAF9681460.1 hypothetical protein SADUNF_Sadunf05G0003800 [Salix dunnii]9.4e-5534.31Show/hide
Query:  ISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDES-KKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------
        +SKI +HKLNG+NY  W   +R++LRS++ DDH+ E+PP DE+ KK W+RDDARL LQI+NSI+ +I+GL+N                            
Subjt:  ISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDES-KKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------

Query:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS
                                         +S D KV+  QRE++AV+SFL GLP   + AK Q LS  EI +L+E ++++LR E +  +  + +  
Subjt:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS

Query:  ALVGRSTNANRGNTYQGNST----------------------TSNAQRSSAPRGQKTQSAHV----ASTPDNAGKLVTIPAEEFAKFQQYQESLTASSSN
           GRS +  R    +G S                       T    +    R Q+ Q A+V    ++T  ++ K + + A+EFAKF QYQESL  S+  
Subjt:  ALVGRSTNANRGNTYQGNST----------------------TSNAQRSSAPRGQKTQSAHV----ASTPDNAGKLVTIPAEEFAKFQQYQESLTASSSN

Query:  PIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT------------------------GESSQEDDDF
        P     E+G T  CLISS+++WVIDSGAT+HM+GNP  F+NF    +   VTI +G+TS ++G GT                         +S  ED+++
Subjt:  PIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT------------------------GESSQEDDDF

Query:  LVYFIVSPST----EELPSNTSSYVPDPSIPTITQVYSRRQPPTNSCPIPAASSP--EDPGISDDLPIALRKGPSARC
        L+Y +   +T       P +     P P+ P I QVYSRRQ  T++CP   A +P   DP    DLPI LRKG    C
Subjt:  LVYFIVSPST----EELPSNTSSYVPDPSIPTITQVYSRRQPPTNSCPIPAASSP--EDPGISDDLPIALRKGPSARC

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]3.2e-5542.15Show/hide
Query:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKD-ESKKAWLRDDARLVLQIKNSIEGDIIGLVNE--------------------------
        + SKI EHKLNGSNY  W+  +  +LRS DMDDH+ EDPPKD + KK WLRDDARL LQIKNSIE +IIGLV+                           
Subjt:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKD-ESKKAWLRDDARLVLQIKNSIEGDIIGLVNE--------------------------

Query:  ----------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLS-SQS
                                          +S D KV+  QRE++AV+ FL GL P F +AK Q LS S+IP+L++A+T+VLR+E S   +S  Q 
Subjt:  ----------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLS-SQS

Query:  NSALVGRSTNANRGNTYQGNSTTSNAQRSSA---------------------PRGQKTQSAHVASTPDNAGKLVTIPAEEFAKFQQYQESLTASSSNPII
        +SAL  ++ N       Q NST      S                          Q++Q A +AST D     VTI A+EFAKFQ YQESL ASSS+  I
Subjt:  NSALVGRSTNANRGNTYQGNSTTSNAQRSSA---------------------PRGQKTQSAHVASTPDNAGKLVTIPAEEFAKFQQYQESLTASSSNPII

Query:  AIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT
        A   +    KCL++S++KWVIDSGAT HM+GN +LF+     A  P VT+A+G+TS VLGSGT
Subjt:  AIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT

TrEMBL top hitse value%identityAlignment
A0A438DT29 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-5338.11Show/hide
Query:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------
        ++SKI EHKLNGSNY  W   ++++LRS+  DDH+ E+PP D ++K W++DDARL LQ+KNSI  DI+GL++                            
Subjt:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------

Query:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS
                                         +S D +V+ AQREQ+AV+SFL GLP  F+ AK Q LSGS+I +L+E +++VLR E    V SSQ  +
Subjt:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS

Query:  ALVGRSTNA----------------NRGN---------TYQGNSTTSNAQRSSAPRGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLTA
         LV +  NA                NRGN          ++   T  N ++    R ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L A
Subjt:  ALVGRSTNA----------------NRGN---------TYQGNSTTSNAQRSSAPRGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLTA

Query:  SSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT
        S+  P+ A+AESG T  CL+SS++KW+IDSGAT+HM+GN   F+  F + S P VT+A+G+T  + GSGT
Subjt:  SSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT

A0A438H537 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-5337.94Show/hide
Query:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------
        ++SKI EHKLNGSNY  W   ++++LRS+  DDH+ E+PP D ++K W++DDARL LQ+KNSI  DI+GL++                            
Subjt:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------

Query:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS
                                         +S D +V+ AQREQ+AV+SFL GLP  F+ AK Q LSGS+I +L+E +++VLR E    V SSQ  +
Subjt:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS

Query:  ALVGR---STNANRGNTYQGNSTTSNAQRSSAP---------------------RGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLTAS
         LV +   + NA R N   GN    N    S+                      R ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L AS
Subjt:  ALVGR---STNANRGNTYQGNSTTSNAQRSSAP---------------------RGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLTAS

Query:  SSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT
        +  P+ A+ ESG T  CL+SS++KW+IDSGAT+HM+GN   F+  F + S P VT+A+G+T  + GSGT
Subjt:  SSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT

A0A5N5JC74 Uncharacterized protein1.6e-5534.42Show/hide
Query:  ISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDES-KKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------
        +SKI +HKLNG+NY  W   +R++LRS++ DDH+ E+PP DE+ KK W+RDDARL LQI+NSI+ +I+GL+N                            
Subjt:  ISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDES-KKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------

Query:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS
                                         +S D KV+  QRE++AV+SFL GLP   +  K Q LS  EI +L+E ++++LR E +  +  + +  
Subjt:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS

Query:  ALVGRSTNANRGNTYQGNSTTSNAQRSSA----------------------PRGQKTQSAHVASTPD----NAGKLVTIPAEEFAKFQQYQESLTASSSN
           GR  +  R    +G S T ++  + +                       R Q+ Q A+V +T      ++ K + + A++FAKF  YQ+SL  S+  
Subjt:  ALVGRSTNANRGNTYQGNSTTSNAQRSSA----------------------PRGQKTQSAHVASTPD----NAGKLVTIPAEEFAKFQQYQESLTASSSN

Query:  PIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT-----------GESSQEDDDFLVYFIVSPST---
        P  A  E+G T  CLISS+++WVIDSGAT+HM+GNP  F+NF    +   VTIA+G+TS ++G GT            ++  ED+++L+Y +  P+T   
Subjt:  PIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT-----------GESSQEDDDFLVYFIVSPST---

Query:  -EELPSNTSSYVPDPSIPTITQVYSRRQPPTNSCPIPAASSPEDPGISDDLPIALRKGP
            P +     P  + P I QVYSRRQ  T++CP P      DP    DLPI LRK P
Subjt:  -EELPSNTSSYVPDPSIPTITQVYSRRQPPTNSCPIPAASSPEDPGISDDLPIALRKGP

A5AWD0 Uncharacterized protein3.3e-5338.27Show/hide
Query:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------
        ++SKI EHKLNGSNY  W   ++++LRS+  DDH+ E+PP D ++K W++DDARL LQ+KNSI  DI+GL++                            
Subjt:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------

Query:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS
                                         +S D +V+ AQREQ+AV+SFL GLP  F+ AK Q LSGS+I +L+E +++VLR E    V SSQ  +
Subjt:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS

Query:  ALVGRSTNA----------------NRGN----------TYQGNSTTSNAQRSSAPRGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLT
         LV +  NA                NRGN           ++   T  N  R    R ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L 
Subjt:  ALVGRSTNA----------------NRGN----------TYQGNSTTSNAQRSSAPRGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLT

Query:  ASSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT
        AS+  P+ A+AESG T  CL+SS++KW+IDSGAT+HM+GN   F+  F + S P VT+A+G+T  + GSGT
Subjt:  ASSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT

B0FBS2 Uncharacterized protein5.6e-5338.01Show/hide
Query:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------
        ++SKI EHKLNGSNY  W   ++++LRS+  DDH+ E+PP D ++K W++DDARL LQ+KNSI  DI+GL++                            
Subjt:  MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNE---------------------------

Query:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS
                                         +S D +V+ AQREQ+AV+SFL GLP  F+ AK Q LSGS+I +L+E +++VLR E    V SSQ  +
Subjt:  ---------------------------------YSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLSGSEIPALEEAYTQVLRVEKSQPVLSSQSNS

Query:  ALVGRSTNA----------------NRGN----------TYQGNSTTSNAQRSSAPRGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLT
         L+ +  NA                NRGN           ++   T  N  R    R ++ Q+A+VA++      D++ K+VT+ AEEF+K+ QYQ++L 
Subjt:  ALVGRSTNA----------------NRGN----------TYQGNSTTSNAQRSSAPRGQKTQSAHVAST-----PDNAGKLVTIPAEEFAKFQQYQESLT

Query:  ASSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT
        AS+  P+ A+AESG T  CL+SS++KW+IDSGAT+HM+GN   F+  F + S P VT+A+G+T  + GSGT
Subjt:  ASSSNPIIAIAESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATCAAAAATCATAGAACACAAGTTGAATGGATCAAACTATTCTTCATGGAAAATCAATGTTCGTCTTTTTTTGCGAAGTATTGATATGGACGATCACATAATTGA
GGATCCGCCTAAGGATGAGAGCAAGAAAGCTTGGTTACGGGATGATGCTCGACTGGTTCTGCAGATCAAGAATTCGATCGAGGGTGACATTATTGGCTTGGTCAATGAAT
ATAGTAATGATGCAAAAGTTCGGCTTGCCCAACGCGAACAACTAGCAGTTATCAGTTTTCTTCTTGGTCTTCCACCTAGATTTGATGTGGCCAAAGACCAATTTCTCTCT
GGTTCGGAAATTCCAGCTTTAGAGGAGGCATACACTCAAGTACTTCGAGTTGAGAAGTCACAACCCGTCTTGTCATCTCAGTCTAATAGTGCTTTGGTTGGACGTAGTAC
AAATGCAAACAGAGGTAATACCTACCAAGGGAATTCCACGACTTCGAACGCTCAACGTTCTAGTGCCCCAAGAGGTCAGAAAACACAGTCTGCCCATGTTGCATCTACTC
CTGATAATGCTGGCAAGTTAGTTACAATTCCTGCGGAAGAATTTGCTAAGTTCCAACAGTATCAAGAGTCATTGACGGCATCGTCCTCTAATCCGATTATCGCCATCGCT
GAGTCAGGTAACACCAGTAAATGTCTTATTTCCTCCACATCAAAATGGGTCATTGACTCTGGTGCGACAAATCATATGTCAGGTAATCCTAACTTATTTGCTAATTTCTT
CCCATCTGCATCTTTGCCTGATGTTACCATAGCAAATGGCACAACTTCTCCTGTTCTTGGCTCTGGCACAGGGGAGAGTTCACAAGAAGATGATGACTTTCTTGTATATT
TCATTGTCTCTCCTTCCACTGAAGAGCTTCCTAGCAATACATCTTCCTATGTGCCTGATCCTTCTATTCCCACCATTACTCAAGTTTATTCTCGTCGGCAACCTCCTACG
AACTCATGCCCTATACCAGCAGCTTCTTCGCCCGAGGATCCAGGAATAAGTGATGACCTTCCAATTGCTCTTAGAAAAGGCCCAAGTGCAAGGTGTTGCCGTGTGTGTTT
AGGAGGTGCTGGTTCTTACAAAGGTTGCCTAAGCCACTTGTGGAAGTGCTACCAAGGTTATGAGAGTGTTGCTGGAGCGTGGTCGAAAGAGGCATCTATACTGGAGCCTA
ATGTCGCTCGGATGCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATATCAAAAATCATAGAACACAAGTTGAATGGATCAAACTATTCTTCATGGAAAATCAATGTTCGTCTTTTTTTGCGAAGTATTGATATGGACGATCACATAATTGA
GGATCCGCCTAAGGATGAGAGCAAGAAAGCTTGGTTACGGGATGATGCTCGACTGGTTCTGCAGATCAAGAATTCGATCGAGGGTGACATTATTGGCTTGGTCAATGAAT
ATAGTAATGATGCAAAAGTTCGGCTTGCCCAACGCGAACAACTAGCAGTTATCAGTTTTCTTCTTGGTCTTCCACCTAGATTTGATGTGGCCAAAGACCAATTTCTCTCT
GGTTCGGAAATTCCAGCTTTAGAGGAGGCATACACTCAAGTACTTCGAGTTGAGAAGTCACAACCCGTCTTGTCATCTCAGTCTAATAGTGCTTTGGTTGGACGTAGTAC
AAATGCAAACAGAGGTAATACCTACCAAGGGAATTCCACGACTTCGAACGCTCAACGTTCTAGTGCCCCAAGAGGTCAGAAAACACAGTCTGCCCATGTTGCATCTACTC
CTGATAATGCTGGCAAGTTAGTTACAATTCCTGCGGAAGAATTTGCTAAGTTCCAACAGTATCAAGAGTCATTGACGGCATCGTCCTCTAATCCGATTATCGCCATCGCT
GAGTCAGGTAACACCAGTAAATGTCTTATTTCCTCCACATCAAAATGGGTCATTGACTCTGGTGCGACAAATCATATGTCAGGTAATCCTAACTTATTTGCTAATTTCTT
CCCATCTGCATCTTTGCCTGATGTTACCATAGCAAATGGCACAACTTCTCCTGTTCTTGGCTCTGGCACAGGGGAGAGTTCACAAGAAGATGATGACTTTCTTGTATATT
TCATTGTCTCTCCTTCCACTGAAGAGCTTCCTAGCAATACATCTTCCTATGTGCCTGATCCTTCTATTCCCACCATTACTCAAGTTTATTCTCGTCGGCAACCTCCTACG
AACTCATGCCCTATACCAGCAGCTTCTTCGCCCGAGGATCCAGGAATAAGTGATGACCTTCCAATTGCTCTTAGAAAAGGCCCAAGTGCAAGGTGTTGCCGTGTGTGTTT
AGGAGGTGCTGGTTCTTACAAAGGTTGCCTAAGCCACTTGTGGAAGTGCTACCAAGGTTATGAGAGTGTTGCTGGAGCGTGGTCGAAAGAGGCATCTATACTGGAGCCTA
ATGTCGCTCGGATGCCCTAG
Protein sequenceShow/hide protein sequence
MISKIIEHKLNGSNYSSWKINVRLFLRSIDMDDHIIEDPPKDESKKAWLRDDARLVLQIKNSIEGDIIGLVNEYSNDAKVRLAQREQLAVISFLLGLPPRFDVAKDQFLS
GSEIPALEEAYTQVLRVEKSQPVLSSQSNSALVGRSTNANRGNTYQGNSTTSNAQRSSAPRGQKTQSAHVASTPDNAGKLVTIPAEEFAKFQQYQESLTASSSNPIIAIA
ESGNTSKCLISSTSKWVIDSGATNHMSGNPNLFANFFPSASLPDVTIANGTTSPVLGSGTGESSQEDDDFLVYFIVSPSTEELPSNTSSYVPDPSIPTITQVYSRRQPPT
NSCPIPAASSPEDPGISDDLPIALRKGPSARCCRVCLGGAGSYKGCLSHLWKCYQGYESVAGAWSKEASILEPNVARMP