CuGenDBv2

Moc05g28090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g28090
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: chr5:20608025..20610844
RNA-Seq Expression: Moc05g28090
Synteny: Moc05g28090
Gene Ontology terms: NA
InterPro domains: NA
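
The genome location in the record is written as a chrom:start..end string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split and the span length computed as below. The helper names parse_location and span_length are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr5:20608025..20610844")
print(chrom, start, end, span_length(start, end))  # chr5 20608025 20610844 2820
```

Under that assumption the gene spans 2820 bp on chr5.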


Homology
GenBank top hits (e value, % identity, alignment)
CAN74381.1 hypothetical protein VITISV_007944 [Vitis vinifera] (e value: 6.7e-68; % identity: 54.76)
Query:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG
        MA  E  LSIQ+FHQCSSL+SIKL+ SN LLW+SQV+PLVRSLG+ HHL  +          + + T +   +TW +NDGLLTSWLLG++TEEV+  ++G
Subjt:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG

Query:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA
        TE+AY +W SL E+LL MTKE E+ L   +  +KKG+ SLDEYL++ K +CD LAA +K V DL K F +A+GLGTKY  F+ AMLSK PYP+YN+F+LA
Subjt:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA

Query:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK
        L+ HE MI  +  +EN    +H QA++  RGR R RG  F SRGRGF P G+
Subjt:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK

RVW19921.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 2.3e-68; % identity: 54.55)
Query:  TMAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIE
        +MA  E  LSIQ+FHQCSSL+SIKL+ SN LLW+SQV+PLVRSLG+ HHL  +    +     + + T +   +TW +NDGLLTSWLLG++TEEV+  ++
Subjt:  TMAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIE

Query:  GTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFIL
        GTE+AY +W SL E+LL MTKE E+ L   +  +KKG+ SLDEYL++ K +CD LAA +K V DL KVF +A+GLGTKY  F+ AMLSK PYP+YN+F+L
Subjt:  GTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFIL

Query:  ALKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK
        AL+ HE MI  +  +EN    +H QA++  RGR R +G  F SRGRGF P G+
Subjt:  ALKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e value: 2.7e-69; % identity: 55.34)
Query:  TMAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIE
        +MA  E  LSIQ+FHQCSSL+SIKL+ SN LLW+SQV+PLVRSLG+ HHL  +    K     + + T +   +TW +NDGLLTSWLLG++TEEV+  ++
Subjt:  TMAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIE

Query:  GTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFIL
        GTE+AY +W SL E+LL MTKE E+ L   +  +KKG+ SLDEYL++ K +CD LAA +K V DL KVF +A+GLGTKY  F+ AMLSK PYP+YN+F+L
Subjt:  GTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFIL

Query:  ALKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK
        AL+ HE MI  +  +EN    +H QA++  RGR R RG  F SRGRGF P G+
Subjt:  ALKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK

RVW93768.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 8.8e-68; % identity: 54.37)
Query:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG
        MA  E  LSIQ+FHQCSSL+SIKL+ SN LLW+SQV+PLVRSLG+ HHL  +    +     + + T +   +TW +NDGLLTSWLLG++T+EV+  ++G
Subjt:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG

Query:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA
        TE+AY +W SL E+LL MTKE E+ L   +  +KKG+ SLDEYL++ K +CD LAA +K V DL KVF +A+GLGTKY  F+ AMLSK PYP+YN+F+LA
Subjt:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA

Query:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK
        L+ HE MI  +  +EN    +H QA++  RGR R RG  F S+GRGF P G+
Subjt:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK

XP_022154021.1 uncharacterized protein LOC111021379 [Momordica charantia] (e value: 2.1e-85; % identity: 64.06)
Query:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG
        MA  E  L++QSFHQCSSLIS+KL++SNYLLWKSQV+PL+R+LG+EHHL   +        K+GE+    Q  TW+NNDGLLTSWLLGII E+VL  +EG
Subjt:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG

Query:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA
        TE+A ++W SLEE LLTMTKENEIHLNE +L+LKKGSLS+DEY++K K+LCD+L A KK +DDLTKVFH+ARGLG KY+ F+TAMLSKAPYP+YN+F+LA
Subjt:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA

Query:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGFPQGKQALSQ
        LK H+  +  D E+E +SQ + NQAF+++RGRSRGRGR F SRGRGF   K   +Q
Subjt:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGFPQGKQALSQ

TrEMBL top hits (e value, % identity, alignment)
A0A438C9J9 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.1e-68; % identity: 54.55)
Query:  TMAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIE
        +MA  E  LSIQ+FHQCSSL+SIKL+ SN LLW+SQV+PLVRSLG+ HHL  +    +     + + T +   +TW +NDGLLTSWLLG++TEEV+  ++
Subjt:  TMAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIE

Query:  GTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFIL
        GTE+AY +W SL E+LL MTKE E+ L   +  +KKG+ SLDEYL++ K +CD LAA +K V DL KVF +A+GLGTKY  F+ AMLSK PYP+YN+F+L
Subjt:  GTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFIL

Query:  ALKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK
        AL+ HE MI  +  +EN    +H QA++  RGR R +G  F SRGRGF P G+
Subjt:  ALKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK

A0A438E6Z5 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.3e-69; % identity: 55.34)
Query:  TMAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIE
        +MA  E  LSIQ+FHQCSSL+SIKL+ SN LLW+SQV+PLVRSLG+ HHL  +    K     + + T +   +TW +NDGLLTSWLLG++TEEV+  ++
Subjt:  TMAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIE

Query:  GTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFIL
        GTE+AY +W SL E+LL MTKE E+ L   +  +KKG+ SLDEYL++ K +CD LAA +K V DL KVF +A+GLGTKY  F+ AMLSK PYP+YN+F+L
Subjt:  GTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFIL

Query:  ALKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK
        AL+ HE MI  +  +EN    +H QA++  RGR R RG  F SRGRGF P G+
Subjt:  ALKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK

A0A438IAM2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.3e-68; % identity: 54.37)
Query:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG
        MA  E  LSIQ+FHQCSSL+SIKL+ SN LLW+SQV+PLVRSLG+ HHL  +    +     + + T +   +TW +NDGLLTSWLLG++T+EV+  ++G
Subjt:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG

Query:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA
        TE+AY +W SL E+LL MTKE E+ L   +  +KKG+ SLDEYL++ K +CD LAA +K V DL KVF +A+GLGTKY  F+ AMLSK PYP+YN+F+LA
Subjt:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA

Query:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK
        L+ HE MI  +  +EN    +H QA++  RGR R RG  F S+GRGF P G+
Subjt:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK

A0A6J1DMG5 uncharacterized protein LOC111021379 (e value: 1.0e-85; % identity: 64.06)
Query:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG
        MA  E  L++QSFHQCSSLIS+KL++SNYLLWKSQV+PL+R+LG+EHHL   +        K+GE+    Q  TW+NNDGLLTSWLLGII E+VL  +EG
Subjt:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG

Query:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA
        TE+A ++W SLEE LLTMTKENEIHLNE +L+LKKGSLS+DEY++K K+LCD+L A KK +DDLTKVFH+ARGLG KY+ F+TAMLSKAPYP+YN+F+LA
Subjt:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA

Query:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGFPQGKQALSQ
        LK H+  +  D E+E +SQ + NQAF+++RGRSRGRGR F SRGRGF   K   +Q
Subjt:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGFPQGKQALSQ

A5BC00 Uncharacterized protein (e value: 3.3e-68; % identity: 54.76)
Query:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG
        MA  E  LSIQ+FHQCSSL+SIKL+ SN LLW+SQV+PLVRSLG+ HHL  +          + + T +   +TW +NDGLLTSWLLG++TEEV+  ++G
Subjt:  MAKSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEG

Query:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA
        TE+AY +W SL E+LL MTKE E+ L   +  +KKG+ SLDEYL++ K +CD LAA +K V DL K F +A+GLGTKY  F+ AMLSK PYP+YN+F+LA
Subjt:  TESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILA

Query:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK
        L+ HE MI  +  +EN    +H QA++  RGR R RG  F SRGRGF P G+
Subjt:  LKTHEVMINADIEQENTSQPDHNQAFYAHRGRSRGRGRTFCSRGRGF-PQGK

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 9.3e-12; % identity: 27.66)
Query:  KLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEGTESAYQMWKSLEEQLLTMTKEN
        KL+++NYL+W  QV  L     +   L GS+         D     NP Y  W   D L+ S +LG I+  V   +    +A Q+W++L +     +  +
Subjt:  KLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEGTESAYQMWKSLEEQLLTMTKEN

Query:  EIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILALKTHEVMINA
           L   +    KG+ ++D+Y++ + +  DQLA   K +D   +V  V   L  +Y+     + +K   PT  E    L  HE  I A
Subjt:  EIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILALKTHEVMINA

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 9.5e-04; % identity: 22.99)
Query:  NPQYDTWLNNDGLLTSWLLGIITEEVLATIEGTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLA
        +P Y  W   + ++  WL+  +T+++L ++   E+A++MW+ L    +         L   + +L++G  S++EY  K+  +  +L+
Subjt:  NPQYDTWLNNDGLLTSWLLGIITEEVLATIEGTESAYQMWKSLEEQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLA

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 3.5e-14; % identity: 26.03)
Query:  ISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYD-TWLNNDGLLTSWLLGIITEEVLATIEGTE-SAYQMWKSLEEQLLT
        +++ L+  NY +W+     L  S G+  H+             DG +TP P  +  W   DGL+  W+ G IT+ +L TI     +A  +W SLE     
Subjt:  ISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYD-TWLNNDGLLTSWLLGIITEEVLATIEGTE-SAYQMWKSLEEQLLT

Query:  MTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILALKTHE------------
          +   +     + +     LS+ EY +K+KSL D L      + D   V H+  GL  KY      +  K+P+P++ E    L   E            
Subjt:  MTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILALKTHE------------

Query:  ---------VMINADIEQENTSQPDHNQAFYAHRGRSRGRGR
                 V+     +QE   Q  HN      RGRS+ + R
Subjt:  ---------VMINADIEQENTSQPDHNQAFYAHRGRSRGRGR
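
In the alignment blocks above, the unlabeled middle line carries a letter wherever query and subject residues are identical, a "+" for a conservative substitution, and a space otherwise. As a rough sketch, percent identity for a single block can be estimated by counting letters in that middle line and dividing by the number of aligned columns. The function name percent_identity is hypothetical, and a per-block figure is not expected to reproduce the page's values, which cover the whole alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a letter (exact match)."""
    if not query:
        raise ValueError("empty alignment block")
    # Trailing spaces are often trimmed from the midline, so pad it back out.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline[:len(query)] if ch.isalpha())
    return 100.0 * identical / len(query)


# First ten columns of the first GenBank alignment block.
print(percent_identity("MAKSETTLSI", "MA  E  LSI"))  # 60.0
```

Gap columns ("-") in the query count toward the denominator here, matching the usual convention of dividing identities by alignment length.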


Sequences
CDS sequence
ATGATGTCTTGTGTTCGAGTCTTGCATTGGTCATATTGTTGCTCAGTCCAGACAACAAGTTCCGATTATAAAGATACACCCTGCATAAGCTCTAAGCTATATGACATCGC
CTCGGGAGAGAAAAAAAATAAAGATGGACAGCTCTCCCTTTTGCTTGTTGTGCAGAAGAAGCAGCAGCCCCTGAGGGCAGGCCAAGGCAGTTATGAGAGGATGAGGATGA
ATATTATTATGTGTCTAATAAGTGTTAAGGACAACAATATTCCCAAGTACCGCAATGAGCTTGCTCGAAGGAGTGTACTGCTGATCGATCTGCAACAATTCACTATGGCT
AAATCAGAAACAACACTTTCGATTCAATCCTTCCACCAATGCTCAAGTTTAATCTCCATCAAATTGAGTGCCTCCAATTATCTATTGTGGAAGTCACAAGTTATACCTTT
GGTGAGAAGTCTTGGGATTGAACATCATCTTAGAGGAAGTAGTGAACCAGAGAAGCTCTTGACAGACAAAGATGGAGAAACTACTCCAAATCCTCAATACGACACTTGGC
TCAATAATGATGGCCTTTTAACATCATGGCTTCTAGGCATCATAACCGAGGAAGTATTGGCAACAATTGAAGGAACAGAATCAGCATATCAAATGTGGAAATCACTAGAA
GAACAACTCCTCACTATGACGAAGGAGAACGAAATCCACTTGAATGAAGTCATTCTAAGTCTGAAAAAGGGAAGTCTCTCGTTGGATGAATACTTAAAGAAAATTAAATC
TCTATGTGATCAACTTGCAGCTACGAAGAAACTAGTGGATGACCTCACAAAAGTATTCCATGTAGCTAGAGGTTTGGGGACAAAATACCAAGGATTCAAGACAGCAATGC
TATCCAAAGCACCATACCCAACCTACAACGAATTCATTCTAGCTCTCAAGACACATGAGGTAATGATCAATGCTGATATTGAACAAGAAAACACATCACAACCAGACCAC
AACCAAGCATTCTATGCTCACAGGGGAAGGAGCAGAGGTAGAGGAAGAACCTTCTGTTCTAGAGGAAGAGGGTTTCCACAAGGGAAACAAGCTCTTTCTCAGTCTATCAA
CTACACAAGCCCGTAG
mRNA sequence
ATGATGTCTTGTGTTCGAGTCTTGCATTGGTCATATTGTTGCTCAGTCCAGACAACAAGTTCCGATTATAAAGATACACCCTGCATAAGCTCTAAGCTATATGACATCGC
CTCGGGAGAGAAAAAAAATAAAGATGGACAGCTCTCCCTTTTGCTTGTTGTGCAGAAGAAGCAGCAGCCCCTGAGGGCAGGCCAAGGCAGTTATGAGAGGATGAGGATGA
ATATTATTATGTGTCTAATAAGTGTTAAGGACAACAATATTCCCAAGTACCGCAATGAGCTTGCTCGAAGGAGTGTACTGCTGATCGATCTGCAACAATTCACTATGGCT
AAATCAGAAACAACACTTTCGATTCAATCCTTCCACCAATGCTCAAGTTTAATCTCCATCAAATTGAGTGCCTCCAATTATCTATTGTGGAAGTCACAAGTTATACCTTT
GGTGAGAAGTCTTGGGATTGAACATCATCTTAGAGGAAGTAGTGAACCAGAGAAGCTCTTGACAGACAAAGATGGAGAAACTACTCCAAATCCTCAATACGACACTTGGC
TCAATAATGATGGCCTTTTAACATCATGGCTTCTAGGCATCATAACCGAGGAAGTATTGGCAACAATTGAAGGAACAGAATCAGCATATCAAATGTGGAAATCACTAGAA
GAACAACTCCTCACTATGACGAAGGAGAACGAAATCCACTTGAATGAAGTCATTCTAAGTCTGAAAAAGGGAAGTCTCTCGTTGGATGAATACTTAAAGAAAATTAAATC
TCTATGTGATCAACTTGCAGCTACGAAGAAACTAGTGGATGACCTCACAAAAGTATTCCATGTAGCTAGAGGTTTGGGGACAAAATACCAAGGATTCAAGACAGCAATGC
TATCCAAAGCACCATACCCAACCTACAACGAATTCATTCTAGCTCTCAAGACACATGAGGTAATGATCAATGCTGATATTGAACAAGAAAACACATCACAACCAGACCAC
AACCAAGCATTCTATGCTCACAGGGGAAGGAGCAGAGGTAGAGGAAGAACCTTCTGTTCTAGAGGAAGAGGGTTTCCACAAGGGAAACAAGCTCTTTCTCAGTCTATCAA
CTACACAAGCCCGTAG
Protein sequence
MMSCVRVLHWSYCCSVQTTSSDYKDTPCISSKLYDIASGEKKNKDGQLSLLLVVQKKQQPLRAGQGSYERMRMNIIMCLISVKDNNIPKYRNELARRSVLLIDLQQFTMA
KSETTLSIQSFHQCSSLISIKLSASNYLLWKSQVIPLVRSLGIEHHLRGSSEPEKLLTDKDGETTPNPQYDTWLNNDGLLTSWLLGIITEEVLATIEGTESAYQMWKSLE
EQLLTMTKENEIHLNEVILSLKKGSLSLDEYLKKIKSLCDQLAATKKLVDDLTKVFHVARGLGTKYQGFKTAMLSKAPYPTYNEFILALKTHEVMINADIEQENTSQPDH
NQAFYAHRGRSRGRGRTFCSRGRGFPQGKQALSQSINYTSP
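
The protein sequence can be related back to the CDS by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1); the page does not state which table was used, and the names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the CDS listed above.
print(translate("ATGATGTCTTGTGTTCGAGTC"))  # MMSCVRV
print(translate("AGCCCGTAG"))              # SP
```

Both fragments agree with the start (MMSCVRV) and end (SP) of the protein sequence listed above.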