; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g12310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g12310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:9380102..9394133
RNA-Seq ExpressionMoc06g12310
SyntenyMoc06g12310
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_020417987.1 uncharacterized protein LOC109948697 [Prunus persica]4.8e-4047.92Show/hide
Query:  CDGRCHQTLAISI---SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNF
        C+G  +  + + +   SLK++AK+WLNS P  SI TW+DLA KF+AKFFPP KT K++ +I +FAQ + E   EAW++YK++LRKC HH L  WMQ+  F
Subjt:  CDGRCHQTLAISI---SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNF

Query:  YNGLANVMRTTVDVASGGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQ-DQFQNQADSIRNLEVQVGELNNMMVDRLPGLNLHC
        YNGL N  +TT+D A+GGA MGKT++EAY LLEEM  NNYQW   +      G ++ D        I  L  Q+G LN   V+ +   NL C
Subjt:  YNGLANVMRTTVDVASGGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQ-DQFQNQADSIRNLEVQVGELNNMMVDRLPGLNLHC

XP_024021836.1 uncharacterized protein LOC112091747 [Morus notabilis]1.8e-4253.66Show/hide
Query:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS
        SL+D+AKNWLNSLP   I  W +L  KF++KFFPP KT K++ EI NFAQ EGE   EAWE+Y+ELLR+C HHGL KWMQ+HNFYNGL+   RT +D ++
Subjt:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS

Query:  GGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMM
        GGA M KT+ EAYELLE+M  NNYQW   +       R +    ++ D+I NL  Q+  L+  +
Subjt:  GGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMM

XP_024025398.1 uncharacterized protein LOC112092750 [Morus notabilis]1.6e-4348.26Show/hide
Query:  NEALVGAKKILETVGAVGVKPCDGRCHQTLAISISLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKEL
        N  ++   +I +TV   GV   +    +      SL+D+AK+WLNSLP  SI +W DL  KF++KFFPP KT K+ +EI NFAQ EGE   EAWE+YKEL
Subjt:  NEALVGAKKILETVGAVGVKPCDGRCHQTLAISISLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKEL

Query:  LRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVASGGAFMGKTKSEAYELLEEMTLNNYQW----HIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNN
        +R+C HHGL KWMQ+HNFYNGL+   RT +DV++GGA M KT+ EAYELLE+M  NNYQW     I K    +H         + ++I NL  Q+  L+ 
Subjt:  LRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVASGGAFMGKTKSEAYELLEEMTLNNYQW----HIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNN

Query:  M
        +
Subjt:  M

XP_024031895.1 uncharacterized protein LOC112094656 [Morus notabilis]1.1e-4152.98Show/hide
Query:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS
        SL+D+AKNWLNSLP  SI +W DL  KF++KFFPP KT K+ +EI NFAQ E E   EAWE+YKELLR+C H+GL KW Q+HNFYNGL+   RT +D ++
Subjt:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS

Query:  GGAFMGKTKSEAYELLEEMTLNNYQW----HIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMM
        GGA M KT+ EAYELLE+M  NNYQW     I K    +H         + D+I NL  Q+   +  +
Subjt:  GGAFMGKTKSEAYELLEEMTLNNYQW----HIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMM

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]1.8e-4254.88Show/hide
Query:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS
        SL+DRAKNWLNS+P +S+ +W DLA KF+AKFFPP KT K+++EI NFAQLE E   E WE+YK+LLR+C HHGL KWMQ+ NFYNGL    RT +D AS
Subjt:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS

Query:  GGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMM
        GGAFM K++ +AY LLEEM +NNYQW      N    + +    ++ D+I  L  QV  L   +
Subjt:  GGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMM

TrEMBL top hitse value%identityAlignment
A0A3S3N117 Retrotrans_gag domain-containing protein2.9e-3849.46Show/hide
Query:  ISISLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVD
        +  SL+D+AK WLNSLP  +I TW++LA KF+AKFFPP KT K++ +I  FAQ E E   EAWE+YKELLRKC HHGL  W+Q+  FYNGL +  RT++D
Subjt:  ISISLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVD

Query:  VASGGAFMGKTKSEAYELLEEMTLNNYQW---HI-GKHYNSVHGRNQDQFQNQADSIRNLEVQVGELN----NMMVDRLPGLNLHC
         A+GG  M K+  EAYEL+EEM  NNYQW   H+  K    VH         + DSI  L  QV  L+    +M V  +   N+ C
Subjt:  VASGGAFMGKTKSEAYELLEEMTLNNYQW---HI-GKHYNSVHGRNQDQFQNQADSIRNLEVQVGELN----NMMVDRLPGLNLHC

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129459.2e-3747.16Show/hide
Query:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS
        SL+D+AK+WLNSLP  SI TW DLA KF+AKFFPP KT K++ +I +F Q +GE   EAWE++KELLR+C HHG+  W+Q+  FYNGL   ++T +D A+
Subjt:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS

Query:  GGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMMVDRLPGLNLHCM
        GGA M K   +AY LLEEM  NNYQW      +   G  +     + D++  L  QV  L+     +L  L +H +
Subjt:  GGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMMVDRLPGLNLHCM

A0A6J0ZYV0 uncharacterized protein LOC1104134139.2e-3747.16Show/hide
Query:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS
        SL+D+AK+WLNSLP  SI TW DLA KF+AKFFPP KT K++ +I +F Q +GE   EAWE++KELLR+C HHG+  W+Q+  FYNGL   ++T +D A+
Subjt:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS

Query:  GGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMMVDRLPGLNLHCM
        GGA M K   +AY LLEEM  NNYQW      +   G  +     + D++  L  QV  L+     +L  L +H +
Subjt:  GGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMMVDRLPGLNLHCM

A0A6J1AZQ8 uncharacterized protein LOC1104227871.1e-3450.31Show/hide
Query:  NEALVGAKKILETVGAVGVKPCDGRCHQTLAISISLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKEL
        N  ++   KI +T  A GV   D      L    SL+D+AKNWLNSLP  SI TW++LA K++AKFFP  K  K++ +I +F Q + E   EAWE+YK+L
Subjt:  NEALVGAKKILETVGAVGVKPCDGRCHQTLAISISLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKEL

Query:  LRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVASGGAFMGKTKSEAYELLEEMTLNNYQ
        +R+C HHGL KW+Q+  FYNGL    RTT+D A+ GA M K+  EAY+LLEE+  NNYQ
Subjt:  LRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVASGGAFMGKTKSEAYELLEEMTLNNYQ

A0A6P5SHM2 uncharacterized protein LOC1107561913.9e-3559.52Show/hide
Query:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS
        SLKD+AK WL S P +SI TW+DL+ KF AKFFPP KT K + +I +FAQ + E   EAWE++K+LLRKC HH L  W+Q+  FYNGL+   RT VD A+
Subjt:  SLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGE---EAWEKYKELLRKCSHHGLSKWMQMHNFYNGLANVMRTTVDVAS

Query:  GGAFMGKTKSEAYELLEEMTLNNYQW
        GGA M KT +EA+ELLE M  NNYQW
Subjt:  GGAFMGKTKSEAYELLEEMTLNNYQW

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-0727.75Show/hide
Query:  SEWVVDSTASVDVSLDRSWFTSFTAGNHGTVRIANRRLSKIRGLGDVRLRTDSGIELVLHGVRAADSSGGDLKELAALTIKIDQENLPSVQVQQLGIVRR
        SEWVVD+ AS   +  R  F  + AG+ GTV++ N   SKI G+GD+ ++T+ G  LVL  VR       DL+      I +D++   S    Q   + +
Subjt:  SEWVVDSTASVDVSLDRSWFTSFTAGNHGTVRIANRRLSKIRGLGDVRLRTDSGIELVLHGVRAADSSGGDLKELAALTIKIDQENLPSVQVQQLGIVRR

Query:  INELMKSRRRKGALRKT---IVGAEVDGGVSRVATDLSGSARLLSLTAITNEALVGAKKILETVGAVGVKPCD
         + ++     +G L +T   I   E++     ++ DL    + +   +     ++  K ++       VKPCD
Subjt:  INELMKSRRRKGALRKT---IVGAEVDGGVSRVATDLSGSARLLSLTAITNEALVGAKKILETVGAVGVKPCD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCCAGTGGAACGTCACATCATTACTTCAAGGCAGTTTTGAGATTTGACGGGAAAAATTTTGGGTTTTGGAAAATGCAAGTGACAGACCTTCTTACATACAAGAA
GACACACAAAACCTTGAAGGAACGACAAGATGGGATGACAGACAAGGAATGGAGCAACAAGGTTGAATGTAACACAACACCTGTAGACCGATCATCAGAGTGGGTAGTGG
ACAGTACAGCTTCGGTTGATGTATCTTTAGACAGGAGCTGGTTCACGTCCTTTACTGCAGGAAATCATGGCACGGTGAGGATAGCAAATAGGAGACTCTCCAAGATCAGA
GGACTTGGGGATGTACGTCTGAGGACTGACAGTGGGATCGAGTTAGTTTTGCATGGTGTCAGGGCTGCAGATAGTTCAGGGGGAGACTTGAAAGAACTAGCAGCATTGAC
AATCAAGATAGATCAGGAGAATCTGCCATCAGTTCAAGTACAACAGCTGGGAATTGTCAGACGGATTAACGAGCTGATGAAGTCGCGTAGGCGAAAAGGTGCATTGAGAA
AGACTATAGTTGGTGCCGAGGTCGATGGTGGTGTCTCTAGAGTGGCAACAGACTTGAGTGGGAGTGCAAGATTGCTCAGTCTCACGGCAATCACAAATGAAGCCTTGGTT
GGTGCTAAGAAGATACTAGAAACTGTCGGTGCAGTGGGAGTTAAGCCTTGTGATGGACGATGCCATCAGACTCTGGCTATTTCCATTTCTTTAAAGGATCGGGCTAAAAA
TTGGTTGAACTCCTTGCCGACAGAATCAATTGAAACTTGGAATGATTTGGCTCATAAGTTTGTAGCAAAGTTCTTCCCACCTGGCAAAACAACAAAGATAAAACTTGAGA
TTGGCAATTTTGCTCAACTTGAGGGAGAAGAGGCATGGGAAAAGTATAAGGAGTTATTGAGAAAATGTTCTCACCATGGGCTATCTAAATGGATGCAAATGCATAATTTC
TATAATGGGTTGGCTAATGTGATGAGAACTACAGTTGATGTAGCATCAGGAGGGGCATTCATGGGTAAAACTAAAAGTGAAGCATATGAGTTGTTGGAAGAAATGACATT
GAACAACTACCAATGGCATATTGGAAAACACTATAACTCAGTTCATGGCCGAAACCAAGACCAATTCCAAAATCAAGCAGATTCTATTCGTAACTTAGAGGTACAAGTTG
GTGAGTTGAACAATATGATGGTGGATAGGCTTCCAGGTTTAAATCTCCATTGCATGTCTTTGCTCGAGAACGAGCAAAATGATAAGTTTGGGGAGCTGAGTGACTCATGG
ACCGGCGTACCTCAACAATGGGAGAGGAGGGAAGGTGGGCCCGGCCAATGGGCCGAGCCCTACTTGGTGCCAAGCAGGGGTCAGGACCAAGCCCTGGTCGGCTCGGTCTT
CTCTTTGTCGTTGCTTGTGTTTCAACAAGCCCGTGACCGACCCGTCCTCATCACACGGCGTGTAGTCCCATCTTTTGTCAAACCCATGGTGATAGGTCTCGACCTGACGG
TCTTTAACCAGTGGTTCGAGGCATGCTCTTTTGATTCTAGGGCTATAAAAATCGATGACTTCCTTTTGAATCCTTGCTCATTTTCTTTGTCTGAGATGACGAGTTGTCAT
CAAACCCATCTTCTGAGACTGGGTCCTATCGACCTTCCTAGTTGGATAATTAGCCTCACTGCGAGGGAGCTACGCCGTCTTCGTGTGAAGTATCGCATCCCGAATTCTAT
TAGTCTCCGGCTTCCACGTCCCAGTAAAAGCATCGACGGGTCTTCCCCTGGGGAGTTCCTCGTAATTGCAAGCTTACCACTAAAGCGTTTCTACAATACCATCATCAAGT
TACCAGACACTGAGTGGTGGGGTCCTGACATGGTGAAGGAGCAAAATCTGGTGGATTTTTGTTGGCACCTTCTCATTGGACTCCAGCATCAAGCTTGGCCTTCGATGCCT
CCTACCGCACCTCGTCGTTCTACTCGCCTCCAAGAGCTATCAACCTCGTCCTCTTCTCGTGGTCACCTAGGGTCCTCTTCTCATGCGTTGGTTTTCCCATTTTCTGGCCC
ATACACAACCGACACTGATGGGGCAGCTTATCTAGAGAGGGCGTTTGACCTTTGGCAAGGACTTGGTGAATCATCTAGTGGATCGACCTTTGGATACTCGAATCCCAAGC
AGAGCATCAACACTTGTCTCAGTAACCTCCACCAGGCAAACAGCTTGCTAGAGAATGCCCAACGTGACCTGGAGGCTGAAAGGAGGATGTCTTGGGTTGAGTTGGCCGAC
TTGCAGAGGAAGCTTGACGAGGCCAAGGCTCAACTTTTCAACTCTGAATTTTTGGCCGAAGAGTTTAAGAAGACCCAAGAGAGTGCATTAGAGCTGAGTGTAATGCGCCT
GATGACATGGATGATGACAATGATGAATAGGGCAATAGCGAGTCCAGTTATCTGTATTGCTTTCTTTGCTTTCCATGTTTTGGAACATGGTCTGTTGTTGTTACTGTTCC
TTTCTGTTATCCTGTTGCCTAGGGCTCCAATCTTAGCGAAGCTAATTCGCCTGCCTCCAACTTCCATAGGCTCGGTGGCCCCCTTTTGGTTTGGCCCTGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCCAGTGGAACGTCACATCATTACTTCAAGGCAGTTTTGAGATTTGACGGGAAAAATTTTGGGTTTTGGAAAATGCAAGTGACAGACCTTCTTACATACAAGAA
GACACACAAAACCTTGAAGGAACGACAAGATGGGATGACAGACAAGGAATGGAGCAACAAGGTTGAATGTAACACAACACCTGTAGACCGATCATCAGAGTGGGTAGTGG
ACAGTACAGCTTCGGTTGATGTATCTTTAGACAGGAGCTGGTTCACGTCCTTTACTGCAGGAAATCATGGCACGGTGAGGATAGCAAATAGGAGACTCTCCAAGATCAGA
GGACTTGGGGATGTACGTCTGAGGACTGACAGTGGGATCGAGTTAGTTTTGCATGGTGTCAGGGCTGCAGATAGTTCAGGGGGAGACTTGAAAGAACTAGCAGCATTGAC
AATCAAGATAGATCAGGAGAATCTGCCATCAGTTCAAGTACAACAGCTGGGAATTGTCAGACGGATTAACGAGCTGATGAAGTCGCGTAGGCGAAAAGGTGCATTGAGAA
AGACTATAGTTGGTGCCGAGGTCGATGGTGGTGTCTCTAGAGTGGCAACAGACTTGAGTGGGAGTGCAAGATTGCTCAGTCTCACGGCAATCACAAATGAAGCCTTGGTT
GGTGCTAAGAAGATACTAGAAACTGTCGGTGCAGTGGGAGTTAAGCCTTGTGATGGACGATGCCATCAGACTCTGGCTATTTCCATTTCTTTAAAGGATCGGGCTAAAAA
TTGGTTGAACTCCTTGCCGACAGAATCAATTGAAACTTGGAATGATTTGGCTCATAAGTTTGTAGCAAAGTTCTTCCCACCTGGCAAAACAACAAAGATAAAACTTGAGA
TTGGCAATTTTGCTCAACTTGAGGGAGAAGAGGCATGGGAAAAGTATAAGGAGTTATTGAGAAAATGTTCTCACCATGGGCTATCTAAATGGATGCAAATGCATAATTTC
TATAATGGGTTGGCTAATGTGATGAGAACTACAGTTGATGTAGCATCAGGAGGGGCATTCATGGGTAAAACTAAAAGTGAAGCATATGAGTTGTTGGAAGAAATGACATT
GAACAACTACCAATGGCATATTGGAAAACACTATAACTCAGTTCATGGCCGAAACCAAGACCAATTCCAAAATCAAGCAGATTCTATTCGTAACTTAGAGGTACAAGTTG
GTGAGTTGAACAATATGATGGTGGATAGGCTTCCAGGTTTAAATCTCCATTGCATGTCTTTGCTCGAGAACGAGCAAAATGATAAGTTTGGGGAGCTGAGTGACTCATGG
ACCGGCGTACCTCAACAATGGGAGAGGAGGGAAGGTGGGCCCGGCCAATGGGCCGAGCCCTACTTGGTGCCAAGCAGGGGTCAGGACCAAGCCCTGGTCGGCTCGGTCTT
CTCTTTGTCGTTGCTTGTGTTTCAACAAGCCCGTGACCGACCCGTCCTCATCACACGGCGTGTAGTCCCATCTTTTGTCAAACCCATGGTGATAGGTCTCGACCTGACGG
TCTTTAACCAGTGGTTCGAGGCATGCTCTTTTGATTCTAGGGCTATAAAAATCGATGACTTCCTTTTGAATCCTTGCTCATTTTCTTTGTCTGAGATGACGAGTTGTCAT
CAAACCCATCTTCTGAGACTGGGTCCTATCGACCTTCCTAGTTGGATAATTAGCCTCACTGCGAGGGAGCTACGCCGTCTTCGTGTGAAGTATCGCATCCCGAATTCTAT
TAGTCTCCGGCTTCCACGTCCCAGTAAAAGCATCGACGGGTCTTCCCCTGGGGAGTTCCTCGTAATTGCAAGCTTACCACTAAAGCGTTTCTACAATACCATCATCAAGT
TACCAGACACTGAGTGGTGGGGTCCTGACATGGTGAAGGAGCAAAATCTGGTGGATTTTTGTTGGCACCTTCTCATTGGACTCCAGCATCAAGCTTGGCCTTCGATGCCT
CCTACCGCACCTCGTCGTTCTACTCGCCTCCAAGAGCTATCAACCTCGTCCTCTTCTCGTGGTCACCTAGGGTCCTCTTCTCATGCGTTGGTTTTCCCATTTTCTGGCCC
ATACACAACCGACACTGATGGGGCAGCTTATCTAGAGAGGGCGTTTGACCTTTGGCAAGGACTTGGTGAATCATCTAGTGGATCGACCTTTGGATACTCGAATCCCAAGC
AGAGCATCAACACTTGTCTCAGTAACCTCCACCAGGCAAACAGCTTGCTAGAGAATGCCCAACGTGACCTGGAGGCTGAAAGGAGGATGTCTTGGGTTGAGTTGGCCGAC
TTGCAGAGGAAGCTTGACGAGGCCAAGGCTCAACTTTTCAACTCTGAATTTTTGGCCGAAGAGTTTAAGAAGACCCAAGAGAGTGCATTAGAGCTGAGTGTAATGCGCCT
GATGACATGGATGATGACAATGATGAATAGGGCAATAGCGAGTCCAGTTATCTGTATTGCTTTCTTTGCTTTCCATGTTTTGGAACATGGTCTGTTGTTGTTACTGTTCC
TTTCTGTTATCCTGTTGCCTAGGGCTCCAATCTTAGCGAAGCTAATTCGCCTGCCTCCAACTTCCATAGGCTCGGTGGCCCCCTTTTGGTTTGGCCCTGATTAA
Protein sequenceShow/hide protein sequence
MGSSGTSHHYFKAVLRFDGKNFGFWKMQVTDLLTYKKTHKTLKERQDGMTDKEWSNKVECNTTPVDRSSEWVVDSTASVDVSLDRSWFTSFTAGNHGTVRIANRRLSKIR
GLGDVRLRTDSGIELVLHGVRAADSSGGDLKELAALTIKIDQENLPSVQVQQLGIVRRINELMKSRRRKGALRKTIVGAEVDGGVSRVATDLSGSARLLSLTAITNEALV
GAKKILETVGAVGVKPCDGRCHQTLAISISLKDRAKNWLNSLPTESIETWNDLAHKFVAKFFPPGKTTKIKLEIGNFAQLEGEEAWEKYKELLRKCSHHGLSKWMQMHNF
YNGLANVMRTTVDVASGGAFMGKTKSEAYELLEEMTLNNYQWHIGKHYNSVHGRNQDQFQNQADSIRNLEVQVGELNNMMVDRLPGLNLHCMSLLENEQNDKFGELSDSW
TGVPQQWERREGGPGQWAEPYLVPSRGQDQALVGSVFSLSLLVFQQARDRPVLITRRVVPSFVKPMVIGLDLTVFNQWFEACSFDSRAIKIDDFLLNPCSFSLSEMTSCH
QTHLLRLGPIDLPSWIISLTARELRRLRVKYRIPNSISLRLPRPSKSIDGSSPGEFLVIASLPLKRFYNTIIKLPDTEWWGPDMVKEQNLVDFCWHLLIGLQHQAWPSMP
PTAPRRSTRLQELSTSSSSRGHLGSSSHALVFPFSGPYTTDTDGAAYLERAFDLWQGLGESSSGSTFGYSNPKQSINTCLSNLHQANSLLENAQRDLEAERRMSWVELAD
LQRKLDEAKAQLFNSEFLAEEFKKTQESALELSVMRLMTWMMTMMNRAIASPVICIAFFAFHVLEHGLLLLLFLSVILLPRAPILAKLIRLPPTSIGSVAPFWFGPD