CuGenDBv2

Moc05g23640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g23640
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr5:16865171..16868014
RNA-Seq Expression: Moc05g23640
Synteny: Moc05g23640
Gene Ontology terms: GO:0004518 - nuclease activity (molecular function)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
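
The genome location field can be split into chromosome, start, and end with a few lines of Python. This is only a sketch: the names `Location` and `parse_location` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'chrom:start..end' string (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location written as 'chr5:16865171..16868014'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr5:16865171..16868014")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2844 bp on chr5.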


Homology
GenBank top hits | e value | % identity | Alignment
XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | 4.1e-34 | 50.93
Query:  QLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEHILKEGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGH
        +L+QC   L  WG  + GNF+ R+K AE  LQ AI+ LP A +R   QQA  ++  +LKE + +WRQRS+++W K GDRNTKWFH KAS+RR  NEI G 
Subjt:  QLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEHILKEGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGH

Query:  MDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLFNE
        +D  G  E++K KV GMIE  FT+LF+S+  S   IE VT CIE  VS   N +LL+ F E
Subjt:  MDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLFNE

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa] | 1.6e-33 | 31.83
Query:  KGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCN--------DSSVHHLDFRTSDHRPVKLHICGQPKRN-FSKARRIFRVQDT
        + D  +  F   +D C +      GD+FTW     N   + ERLD    N        +  + HLD+  SDHR + + I    + N   + +  FR +  
Subjt:  KGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCN--------DSSVHHLDFRTSDHRPVKLHICGQPKRN-FSKARRIFRVQDT

Query:  WLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRS-ESQQAEVHLEHILKEGKTYWRQRSQ
        WL+D  C +++SNCW  +  +     L+  L QCAS+L  W + + G  K+ I  A+  + +       + + S E Q AE  LE +L   + YW+QRS+
Subjt:  WLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRS-ESQQAEVHLEHILKEGKTYWRQRSQ

Query:  EVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF
          WL+ GDRNTK+FH+KAS R+  N I    D  G     K  ++ ++   FT LFT++    + + +V   I T +S +QN  LL  F
Subjt:  EVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF

XP_030502823.1 uncharacterized protein LOC115717993 [Cannabis sativa] | 3.5e-33 | 31.14
Query:  KGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCN---DSS-----VHHLDFRTSDHRPVKLHICGQPKRNFSKAR-RIFRVQDT
        + +K +  F K +D C + +T F  D +TW        T+ ERLD    N   DS+     + HLD+ +SDHR +         R     R  IFR +  
Subjt:  KGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCN---DSS-----VHHLDFRTSDHRPVKLHICGQPKRNFSKAR-RIFRVQDT

Query:  WLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSES-QQAEVHLEHILKEGKTYWRQRSQ
        WL D   ++++++CW+ S   +  +A+I  L +CA  L +W   + GN +++I +A+  ++        + D  ++ +++E  LE +L++ K YW QRS+
Subjt:  WLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSES-QQAEVHLEHILKEGKTYWRQRSQ

Query:  EVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF
          WL  GDRNTK+FH KAS R+  N I   ++  G     KS +   I   F+ +FT++ L E  +      I T V+ + N EL++ F
Subjt:  EVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF

XP_030509269.1 uncharacterized protein LOC115723947 [Cannabis sativa] | 9.1e-34 | 33.33
Query:  KKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCND--------SSVHHLDFRTSDHRPVKLHICGQ-PKRNFSKARRIFRVQD
        K  D+LI  F KA+DDCN+ D    G  +TW  G  +   + ERLDK L  +        SS+ +L+F TSDH P+ L + G  P  +F    R FR ++
Subjt:  KKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCND--------SSVHHLDFRTSDHRPVKLHICGQ-PKRNFSKARRIFRVQD

Query:  TWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEHILKEGKTYWRQRSQ
         W+ +P CE +V +CW    R S    +  ++  C   L RWG    GNF++RI + +  ++ + +G    + +   +  E  L  +L + + +W+QRS+
Subjt:  TWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEHILKEGKTYWRQRSQ

Query:  EVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLFNE
        + WL  GD+N+K+FH+ A+ R+  N +    D +G     +S +A +I   F +LF++   S  ++ +V   I+  VS EQN ELLR  +E
Subjt:  EVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLFNE

XP_030510497.1 uncharacterized protein LOC115725200 [Cannabis sativa] | 1.6e-33 | 33.11
Query:  LGGSDATVSKKG-----DKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCNDS--------SVHHLDFRTSDHRPVKLHICGQ-PK
        L    + + KKG     D LI+ F  A+ DCN+ D    G  FTW  G  +G  + ERLDK L N           +++L+F TSDH P++L   G  P 
Subjt:  LGGSDATVSKKG-----DKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCNDS--------SVHHLDFRTSDHRPVKLHICGQ-PK

Query:  RNFSKARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEH
         +F      FR ++ WL +P C+ +V  CW  S   S  R    ++ +C   L RWG    G+FK RI + ++ ++   +G    + ++  Q+A+  L  
Subjt:  RNFSKARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEH

Query:  ILKEGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELL
        +L + + +W+QRS++ WL  GD+N+K+FH+ AS RR  N I+   D++G     +S +  +I   F +LF S   S  +I  V   +   VS +QN ELL
Subjt:  ILKEGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELL

Query:  RLFNE
           +E
Subjt:  RLFNE

TrEMBL top hits | e value | % identity | Alignment
A0A2N9I921 Reverse transcriptase domain-containing protein | 1.4e-35 | 31.56
Query:  LKLGGSDATVSKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCN--------DSSVHHLDFRTSDHRPVKLHICGQPKRNFS
        ++L      +SK   +++S F +A+DDC   D G+ G  FTWCN   +G TV+ERLD+++ +         + V+HLD+  SDH+P+ L       R  +
Subjt:  LKLGGSDATVSKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCN--------DSSVHHLDFRTSDHRPVKLHICGQPKRNFS

Query:  KARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNL----QKAIYGLPMAADRSESQQAEVHLEH
        K    FR ++ W+ D  C   +S  W      ++   +I +L+ C ++L  W     G+ +++++E  + L    +K++ G   + D   S +AEV +  
Subjt:  KARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNL----QKAIYGLPMAADRSESQQAEVHLEH

Query:  ILKEGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELL
        +L + +  WRQRS+  WLK GDRNT +FH +A++R+ +N I G  D  G  + D  +V  ++   F ++F S+  S   I+ V  CI T +S   N  L 
Subjt:  ILKEGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELL

Query:  R
        R
Subjt:  R

A0A803NMB3 Uncharacterized protein | 1.6e-36 | 31.16
Query:  SKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLD--------KILCNDSSVHHLDFRTSDHRPVKLHICGQPKRNFSKARRI-FRVQ
        S + +K +  F K +D C++ +T F G+ +TW       +TV ERLD         I  N  + HHLD+ +SDHR + + +      +  + RR  FR +
Subjt:  SKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLD--------KILCNDSSVHHLDFRTSDHRPVKLHICGQPKRNFSKARRI-FRVQ

Query:  DTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGL--PMAADRSESQQAEVHLEHILKEGKTYWRQ
          WL DP C+ ++  CW+ S      + L+  LD CA+ L +W  ++ G+ K+ I E +  +   +  L      + +E ++AE  L+ +L++ + YW+Q
Subjt:  DTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGL--PMAADRSESQQAEVHLEHILKEGKTYWRQ

Query:  RSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF
        RS+  WL  GDRNTK+FH KAS R+  N+I   ++  GVR   K+ +A  +   F  +F +  + E  +      I   V+ + N EL + F
Subjt:  RSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF

A0A803PNH7 Uncharacterized protein | 6.2e-36 | 32.53
Query:  KGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCND--------SSVHHLDFRTSDHRPVKLHICGQPKR-NFSKARRIFRVQDT
        + ++ +  F K +D C + +T F GD FTW        T+  RLD    N           V+HLD+ +SDHR +         R   ++ R  FR +  
Subjt:  KGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCND--------SSVHHLDFRTSDHRPVKLHICGQPKR-NFSKARRIFRVQDT

Query:  WLEDPSCENVVSNCWSISPRDSNP-RALIGQLDQCASKLSRWGWYRIGNFKQRIKEAE---DNLQKAIYGLPMAADRSESQQAEVHLEHILKEGKTYWRQ
        WL D  C+++++  W  S  +SNP   ++G LD+CA  L +W + + GN K+RI +A+   + L  + +  P A   +  + +E  L+ +L++ + YW+Q
Subjt:  WLEDPSCENVVSNCWSISPRDSNP-RALIGQLDQCASKLSRWGWYRIGNFKQRIKEAE---DNLQKAIYGLPMAADRSESQQAEVHLEHILKEGKTYWRQ

Query:  RSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF
        RS+  WL+ GDRNTK+FH KAS R+  N I   +D +G R   KS++A  I   F ++F+++ L E  +      I + V+ E N  LL+ F
Subjt:  RSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF

A0A803PQM8 Uncharacterized protein | 3.1e-35 | 31
Query:  KLGGSDATVSKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCND--------SSVHHLDFRTSDHRPVK-LHICGQPKRNFS
        KLGG       + D+ +  F K +D C + +T F G++FTW       + + ERLD    N          SV HLDF  SDHR +  +      +    
Subjt:  KLGGSDATVSKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCND--------SSVHHLDFRTSDHRPVK-LHICGQPKRNFS

Query:  KARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKA-IYGLPMAADRSESQQAEVHLEHILK
        K R  FR +  WL D   + ++S CW+ S  +    A++  LD CA+KL +W   + GN K +I +A+  +++         A  ++ + +E  L+ +L+
Subjt:  KARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKA-IYGLPMAADRSESQQAEVHLEHILK

Query:  EGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF
        + + YW+Q S+  WL  GDRNTK+FH KAS R+  N I    +  G R   K++++ +I+  F  +FT++ + E  + +    I T ++ + N ELL+ F
Subjt:  EGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLF

A0A803PV25 Uncharacterized protein | 1.1e-35 | 32.8
Query:  KLGGSDATVSKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCND--------SSVHHLDFRTSDHR------PVKL--HICG
        K+GG D         ++  F + +DDC   D        TWCN H N   + ERLD+ LC +        + +  LD+  SDHR      PV+L    CG
Subjt:  KLGGSDATVSKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCND--------SSVHHLDFRTSDHR------PVKL--HICG

Query:  QPKRNFSKARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADR---SESQQA
        + KR   K+R  F  ++ W ++  C  +V   WS         +   ++++C   L  W   +    K ++    + L+KA++ L M          QQ 
Subjt:  QPKRNFSKARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADR---SESQQA

Query:  EVHLEHILKEGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPE
        E  L  +L++ + YWRQRS+ +WL++GDRNTK+FH+KAS RR +NEI G  DH GV + DK  V  ++E  +  LFTS+ ++E  +  V   ++  VS  
Subjt:  EVHLEHILKEGKTYWRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPE

Query:  QNRELLRLFNE
         N +LL  F E
Subjt:  QNRELLRLFNE

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.9e-08 | 23.48
Query:  ISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCNDS---------SVHHLDFRTSDHRPVKLHICGQPKRNFSKARRIFRVQDTWLEDP
        + +F   + D ++ D    G ++TW N H + + +  +LD+ + N           +V  L    SDH P  + +   PKR    +++ FR        P
Subjt:  ISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCNDS---------SVHHLDFRTSDHRPVKLHICGQPKRNFSKARRIFRVQDTWLEDP

Query:  SCENVVSNCWSIS-PRDSNPRAL---IGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEHILKEG--------KTY
        +    ++  W    P  S+  +L   +    +C   L+R G+   GN + + KEA D+L+     L      +    +   +EH+ ++         +++
Subjt:  SCENVVSNCWSIS-PRDSNPRAL---IGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEHILKEG--------KTY

Query:  WRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTS
        +RQ+S+  WL+ GD NT++FH      + +N I        VR ++ ++V  MI   +T L  S
Subjt:  WRQRSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTS


Sequences
CDS sequence
ATGTCGGACGGTGCTCTTTGTTCTCGTGTGTGTGATTACTCCATGGAGGACCTCACTGATCTATGGGAGAACTTCAAGTTAACGAATGATGAGAATGAGACCATTTGTAT
TGATGAAGGAAAGTCGATTATGACACTGGAGAATGTTCAGCTGTACGCCATAGGGAAACTGCACAAGCAAACTAATCAGTTCAGAGGCTATTCGTTCGGTGATGAAATCG
GCGGGAGTGCGATTACAAGGGGAGTGGTCATGTTGAAAAAACACGTGCATAGTGGTGAGGAAGAGTCGGCCTGTCGTGTAGGTAGATCAGGCACTAGTTTTCGCTACACT
GGAGGGTGTGGAGGATGCAGTGGACGAGGTGGTTGGGACAGGAGTGAGGAGGTTTGGAGGGATCTAAGCGAGCCGCCGGAAATGTCTCACTGGCGACTGACGGAGGTAGG
AGAATATCGGGCACAAACAGGGGAGAAGATTCTGGAGGTGGCGCAATCTCCGATGGGGATGGTGGAAATCACGGAAAAGAATTTATTATCCCAATCTTTTCCTAAAATCG
ATCATAGTTTAAAAGGAAAGTTGGAGGGTAGTGAGGAAGGGAAGAAAACCCCTGCCTTAAACGAGATCCCAGACTCTTTGGCCTCTCTGGCTTTCGGGAAAAGGAAGGAC
AGTGCTGAACAGGTGGAATTGCCTGAGGTTGAGGAGCAAATTCAGAACAGAAAGAAAAAACAACTGAAGTTGGGAGGGAGTGATGCAACAGTTTCGAAGAAGGGTGATAA
GCTTATTTCTGATTTTAATAAAGCTGTTGATGATTGTAATATTTTTGACACAGGTTTTCCTGGAGATAATTTTACCTGGTGTAATGGTCATCCAAACGGTCACACTGTTT
TTGAAAGACTTGATAAAATTCTTTGTAATGATAGTTCTGTTCATCATTTAGACTTCAGAACTTCTGATCATAGACCTGTGAAATTACACATATGCGGGCAGCCTAAAAGA
AATTTTTCGAAAGCTAGGCGAATTTTCAGAGTTCAAGATACGTGGCTGGAGGATCCTAGTTGTGAGAATGTTGTTTCCAATTGCTGGTCGATTTCCCCGCGGGATAGTAA
TCCTCGAGCTTTAATTGGGCAATTGGATCAGTGTGCATCGAAACTCTCAAGATGGGGATGGTATAGAATTGGAAACTTTAAACAACGTATAAAAGAAGCGGAGGATAATC
TCCAAAAAGCAATTTATGGCCTTCCTATGGCGGCTGATCGAAGTGAGTCCCAGCAAGCTGAGGTTCATCTTGAGCATATTTTAAAGGAAGGGAAAACTTATTGGCGCCAA
AGATCACAGGAGGTGTGGTTAAAATTTGGTGATCGTAATACAAAATGGTTCCACAATAAGGCTTCTTATAGAAGACATCAAAATGAGATCAATGGCCATATGGATCATCA
TGGAGTCCGGGAGAAGGATAAAAGCAAGGTGGCAGGTATGATTGAATATATTTTCACTGACTTGTTTACATCTACAGGGCTTAGTGAGTATGATATAGAGAATGTCACTT
GGTGTATAGAGACCCCGGTCTCCCCTGAACAAAACCGAGAGCTTCTAAGGCTCTTTAATGAGTAG
mRNA sequence
ATGTCGGACGGTGCTCTTTGTTCTCGTGTGTGTGATTACTCCATGGAGGACCTCACTGATCTATGGGAGAACTTCAAGTTAACGAATGATGAGAATGAGACCATTTGTAT
TGATGAAGGAAAGTCGATTATGACACTGGAGAATGTTCAGCTGTACGCCATAGGGAAACTGCACAAGCAAACTAATCAGTTCAGAGGCTATTCGTTCGGTGATGAAATCG
GCGGGAGTGCGATTACAAGGGGAGTGGTCATGTTGAAAAAACACGTGCATAGTGGTGAGGAAGAGTCGGCCTGTCGTGTAGGTAGATCAGGCACTAGTTTTCGCTACACT
GGAGGGTGTGGAGGATGCAGTGGACGAGGTGGTTGGGACAGGAGTGAGGAGGTTTGGAGGGATCTAAGCGAGCCGCCGGAAATGTCTCACTGGCGACTGACGGAGGTAGG
AGAATATCGGGCACAAACAGGGGAGAAGATTCTGGAGGTGGCGCAATCTCCGATGGGGATGGTGGAAATCACGGAAAAGAATTTATTATCCCAATCTTTTCCTAAAATCG
ATCATAGTTTAAAAGGAAAGTTGGAGGGTAGTGAGGAAGGGAAGAAAACCCCTGCCTTAAACGAGATCCCAGACTCTTTGGCCTCTCTGGCTTTCGGGAAAAGGAAGGAC
AGTGCTGAACAGGTGGAATTGCCTGAGGTTGAGGAGCAAATTCAGAACAGAAAGAAAAAACAACTGAAGTTGGGAGGGAGTGATGCAACAGTTTCGAAGAAGGGTGATAA
GCTTATTTCTGATTTTAATAAAGCTGTTGATGATTGTAATATTTTTGACACAGGTTTTCCTGGAGATAATTTTACCTGGTGTAATGGTCATCCAAACGGTCACACTGTTT
TTGAAAGACTTGATAAAATTCTTTGTAATGATAGTTCTGTTCATCATTTAGACTTCAGAACTTCTGATCATAGACCTGTGAAATTACACATATGCGGGCAGCCTAAAAGA
AATTTTTCGAAAGCTAGGCGAATTTTCAGAGTTCAAGATACGTGGCTGGAGGATCCTAGTTGTGAGAATGTTGTTTCCAATTGCTGGTCGATTTCCCCGCGGGATAGTAA
TCCTCGAGCTTTAATTGGGCAATTGGATCAGTGTGCATCGAAACTCTCAAGATGGGGATGGTATAGAATTGGAAACTTTAAACAACGTATAAAAGAAGCGGAGGATAATC
TCCAAAAAGCAATTTATGGCCTTCCTATGGCGGCTGATCGAAGTGAGTCCCAGCAAGCTGAGGTTCATCTTGAGCATATTTTAAAGGAAGGGAAAACTTATTGGCGCCAA
AGATCACAGGAGGTGTGGTTAAAATTTGGTGATCGTAATACAAAATGGTTCCACAATAAGGCTTCTTATAGAAGACATCAAAATGAGATCAATGGCCATATGGATCATCA
TGGAGTCCGGGAGAAGGATAAAAGCAAGGTGGCAGGTATGATTGAATATATTTTCACTGACTTGTTTACATCTACAGGGCTTAGTGAGTATGATATAGAGAATGTCACTT
GGTGTATAGAGACCCCGGTCTCCCCTGAACAAAACCGAGAGCTTCTAAGGCTCTTTAATGAGTAG
Protein sequence
MSDGALCSRVCDYSMEDLTDLWENFKLTNDENETICIDEGKSIMTLENVQLYAIGKLHKQTNQFRGYSFGDEIGGSAITRGVVMLKKHVHSGEEESACRVGRSGTSFRYT
GGCGGCSGRGGWDRSEEVWRDLSEPPEMSHWRLTEVGEYRAQTGEKILEVAQSPMGMVEITEKNLLSQSFPKIDHSLKGKLEGSEEGKKTPALNEIPDSLASLAFGKRKD
SAEQVELPEVEEQIQNRKKKQLKLGGSDATVSKKGDKLISDFNKAVDDCNIFDTGFPGDNFTWCNGHPNGHTVFERLDKILCNDSSVHHLDFRTSDHRPVKLHICGQPKR
NFSKARRIFRVQDTWLEDPSCENVVSNCWSISPRDSNPRALIGQLDQCASKLSRWGWYRIGNFKQRIKEAEDNLQKAIYGLPMAADRSESQQAEVHLEHILKEGKTYWRQ
RSQEVWLKFGDRNTKWFHNKASYRRHQNEINGHMDHHGVREKDKSKVAGMIEYIFTDLFTSTGLSEYDIENVTWCIETPVSPEQNRELLRLFNE