; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g26010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g26010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:18481236..18482081
RNA-Seq ExpressionMoc05g26010
SyntenyMoc05g26010
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG69253.1 hypothetical protein EZV62_004188 [Acer yangbiense]5.9e-2434.25Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQ-----------RSSGSRYRGRGKWNNNDVNRQIC
        +I+ R   +  EI   LL ++ +LE  N+         +P+ ++A +K  N       +N QN NQ           R  G R+RGRG  NNN  +R  C
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQ-----------RSSGSRYRGRGKWNNNDVNRQIC

Query:  QVCGKSGHSIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYK-MVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEY
        QVCGK GHS  VC      N            VPT   +     V     + +     Y                    DSGA+NHVT D  ++    +Y
Subjt:  QVCGKSGHSIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYK-MVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEY

Query:  GGKETVTIGNGHKLFISHIG-KSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKLD
         G E++ +GNG +L ISH+G KS        + L+ VL VP   KNL+ VS+LV DN++++EFHA+ C VKD  TG  VL+G  K+GLY+L+
Subjt:  GGKETVTIGNGHKLFISHIG-KSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKLD

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.9e-3639.79Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTP---TLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGH
        +I+G+  IS  ++Q ELL FEKRLE Q++QKNT            N  +S +      Q   NN+NN+Q   G    GRG+      N+  CQVC K GH
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTP---TLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGH

Query:  SIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIG
        S  VC     K     + ++ R    ++      + V +  Q +       T+I L      W I     DSGA+NH+T +Y+++ NP EY G E + +G
Subjt:  SIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIG

Query:  NGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKLDTV
        NG  L IS+IG + L      LNL+NVLCVP   KNLV VSKL QDNN+Y+EFH   C +KD  TG  +L    KDGLY LDT+
Subjt:  NGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKLDTV

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]2.5e-3039.47Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQ-KNTVAFNHTPTLNMANSKYPNRGQRQHLNNN--QNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGH
        +I+G+  IS  ++Q +LL+FEKRL+ QN+Q KNT     +P LNMA  ++   GQR   N      N Q  SG     RG  NN       CQ+CGK GH
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQ-KNTVAFNHTPTLNMANSKYPNRGQRQHLNNN--QNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGH

Query:  SIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIG
        S  VC     K     + ++ R    ++        V +  Q   P     T++        W I     DSGA+NHVT + +++ NP EY G E VT+G
Subjt:  SIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIG

Query:  NGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTG
        NG++L IS++G +CL   +  L L+N+LCVP   KNL+ VSKL QDN+IY+EFH   C +KD  TG
Subjt:  NGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTG

XP_022158549.1 uncharacterized protein LOC111025011 isoform X1 [Momordica charantia]9.5e-9998.45Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF
        MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF

Query:  VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETV
        VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGG   V
Subjt:  VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETV

XP_022158550.1 uncharacterized protein LOC111025011 isoform X2 [Momordica charantia]1.6e-98100Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF
        MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF

Query:  VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGG
        VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGG
Subjt:  VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGG

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X11.2e-3039.47Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQ-KNTVAFNHTPTLNMANSKYPNRGQRQHLNNN--QNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGH
        +I+G+  IS  ++Q +LL+FEKRL+ QN+Q KNT     +P LNMA  ++   GQR   N      N Q  SG     RG  NN       CQ+CGK GH
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQ-KNTVAFNHTPTLNMANSKYPNRGQRQHLNNN--QNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGH

Query:  SIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIG
        S  VC     K     + ++ R    ++        V +  Q   P     T++        W I     DSGA+NHVT + +++ NP EY G E VT+G
Subjt:  SIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIG

Query:  NGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTG
        NG++L IS++G +CL   +  L L+N+LCVP   KNL+ VSKL QDN+IY+EFH   C +KD  TG
Subjt:  NGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTG

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-3639.79Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTP---TLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGH
        +I+G+  IS  ++Q ELL FEKRLE Q++QKNT            N  +S +      Q   NN+NN+Q   G    GRG+      N+  CQVC K GH
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTP---TLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGH

Query:  SIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIG
        S  VC     K     + ++ R    ++      + V +  Q +       T+I L      W I     DSGA+NH+T +Y+++ NP EY G E + +G
Subjt:  SIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIG

Query:  NGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKLDTV
        NG  L IS+IG + L      LNL+NVLCVP   KNLV VSKL QDNN+Y+EFH   C +KD  TG  +L    KDGLY LDT+
Subjt:  NGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKLDTV

A0A6J1DW60 uncharacterized protein LOC111025011 isoform X27.9e-99100Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF
        MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF

Query:  VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGG
        VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGG
Subjt:  VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGG

A0A6J1DZR1 uncharacterized protein LOC111025011 isoform X14.6e-9998.45Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF
        MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIF

Query:  VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETV
        VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGG   V
Subjt:  VCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETV

A0A803NU85 Uncharacterized protein2.9e-2935.74Show/hide
Query:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMAN----------SKYPNRGQRQHLNN--NQNNNQRSSGSRYR-GRGKWNNNDVNRQ
        +I+ R   +  ++Q  LL F+ RLE  N+        + P+ N A           S  P RG  Q  NN  N NN   + G  +R GRG+    + ++ 
Subjt:  MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMAN----------SKYPNRGQRQHLNN--NQNNNQRSSGSRYR-GRGKWNNNDVNRQ

Query:  ICQVCGKSGHSIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVE
         CQVCGK GHS  +C     ++             P +  +  K              Q   L+   ++       +   DSGASNH+T+D   I N  E
Subjt:  ICQVCGKSGHSIFVCGIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVE

Query:  YGGKETVTIGNGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKL
        YGGKE +TIG+G KL I H+G   L  +N  L L N+L VP   KNL+ VSKL  DNN+ +EF +D C+VK+  TG VVL+G  KDGLY+L
Subjt:  YGGKETVTIGNGHKLFISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.5e-1428.04Show/hide
Query:  SEIQVELLVFE-KRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQI---CQVCGKSGHSIFVCGIDL
        +EI   LL  E K L   ++    +  N     N   +   N G R +  +N+NNN  S   +        NN+ ++     CQ+CG  GHS   C    
Subjt:  SEIQVELLVFE-KRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQI---CQVCGKSGHSIFVCGIDL

Query:  IKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIGNGHKLFISH
                         + + H    V +   Q   P   ++     L +   +      +DSGA++H+T+D+N++     Y G + V + +G  + ISH
Subjt:  IKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIGNGHKLFISH

Query:  IGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYK
         G + L  ++  LNL N+L VP   KNL+ V +L   N + +EF   S  VKD+ TG  +L+G  KD LY+
Subjt:  IGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.4e-1228.36Show/hide
Query:  SSSEIQVELLVFEKRLEFQNSQK------NTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIFVC
        S +EI   L+  E +L   NS +      N V   +T T    N++  NR      NNN++N+ + S S  R   +     + R  CQ+C   GHS   C
Subjt:  SSSEIQVELLVFEKRLEFQNSQK------NTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIFVC

Query:  GIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIGNGHKL
                                +H ++   T + Q   P   ++     L +   +      +DSGA++H+T+D+N++     Y G + V I +G  +
Subjt:  GIDLIKNLQDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIGNGHKL

Query:  FISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYK
         I+H G + L   +  L+L  VL VP   KNL+ V +L   N + +EF   S  VKD+ TG  +L+G  KD LY+
Subjt:  FISHIGKSCLVFENGLLNLENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTAAAGGGAGAGCTGGTATTTCTTCGTCTGAAATACAAGTGGAACTACTCGTCTTTGAGAAAAGACTAGAATTTCAAAACTCACAGAAGAACACAGTAGCGTTCAA
TCACACTCCTACATTAAATATGGCAAATAGCAAATATCCCAATAGAGGTCAGCGCCAACATTTGAACAATAACCAGAACAATAACCAGAGGAGTAGTGGAAGTCGATATC
GTGGAAGAGGAAAATGGAACAACAATGATGTCAATCGTCAAATATGTCAAGTTTGTGGAAAATCTGGGCACTCTATATTTGTGTGTGGCATAGATTTGATAAAAAATTTA
CAGGACCTAGTTAGAATCAGACTAAGAGTAATGGTTCCAACTCATATAATGCACCTGTACAAAATGGTGGTAACTCTCAAGGTACAACTTCTCAAGCCTTTGTTACAATA
CAAAACACTAATTCGTTTGTTGCAAATCTGGAGACAATGGTTGATCCAAGCTGGTACGGTGGATAGCGGAGCCTCAAATCATGTTACAGCAGACTACAATGATATTATCA
ACCCAGTTGAATATGGAGGTAAGGAAACAGTAACTATTGGTAATGGACACAAGTTGTTTATTTCTCATATTGGTAAATCGTGCTTAGTCTTTGAAAATGGACTTCTTAAT
CTTGAGAATGTATTGTGCGTACCTTATTTTGTAAAGAATCTTGTGAGGGTATCTAAACTTGTTCAAGATAATAATATTTATCTTGAATTTCATGCTGATTCTTGTATTGT
TAAGGATATATGTACCGGCAATGTGGTGCTAAAGGGGGTCTTTAAAGATGGCCTTTACAAATTGGATACTGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTAAAGGGAGAGCTGGTATTTCTTCGTCTGAAATACAAGTGGAACTACTCGTCTTTGAGAAAAGACTAGAATTTCAAAACTCACAGAAGAACACAGTAGCGTTCAA
TCACACTCCTACATTAAATATGGCAAATAGCAAATATCCCAATAGAGGTCAGCGCCAACATTTGAACAATAACCAGAACAATAACCAGAGGAGTAGTGGAAGTCGATATC
GTGGAAGAGGAAAATGGAACAACAATGATGTCAATCGTCAAATATGTCAAGTTTGTGGAAAATCTGGGCACTCTATATTTGTGTGTGGCATAGATTTGATAAAAAATTTA
CAGGACCTAGTTAGAATCAGACTAAGAGTAATGGTTCCAACTCATATAATGCACCTGTACAAAATGGTGGTAACTCTCAAGGTACAACTTCTCAAGCCTTTGTTACAATA
CAAAACACTAATTCGTTTGTTGCAAATCTGGAGACAATGGTTGATCCAAGCTGGTACGGTGGATAGCGGAGCCTCAAATCATGTTACAGCAGACTACAATGATATTATCA
ACCCAGTTGAATATGGAGGTAAGGAAACAGTAACTATTGGTAATGGACACAAGTTGTTTATTTCTCATATTGGTAAATCGTGCTTAGTCTTTGAAAATGGACTTCTTAAT
CTTGAGAATGTATTGTGCGTACCTTATTTTGTAAAGAATCTTGTGAGGGTATCTAAACTTGTTCAAGATAATAATATTTATCTTGAATTTCATGCTGATTCTTGTATTGT
TAAGGATATATGTACCGGCAATGTGGTGCTAAAGGGGGTCTTTAAAGATGGCCTTTACAAATTGGATACTGTTTGA
Protein sequenceShow/hide protein sequence
MIKGRAGISSSEIQVELLVFEKRLEFQNSQKNTVAFNHTPTLNMANSKYPNRGQRQHLNNNQNNNQRSSGSRYRGRGKWNNNDVNRQICQVCGKSGHSIFVCGIDLIKNL
QDLVRIRLRVMVPTHIMHLYKMVVTLKVQLLKPLLQYKTLIRLLQIWRQWLIQAGTVDSGASNHVTADYNDIINPVEYGGKETVTIGNGHKLFISHIGKSCLVFENGLLN
LENVLCVPYFVKNLVRVSKLVQDNNIYLEFHADSCIVKDICTGNVVLKGVFKDGLYKLDTV