CuGenDBv2

Moc06g19050 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g19050
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr6:14831122..14840354
RNA-Seq Expression: Moc06g19050
Synteny: Moc06g19050
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
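
The genome location field can be turned into a chromosome name and a span length with a few lines of Python. This is only a sketch: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr6:14831122..14840354")
print(chrom, span_length(start, end))  # chr6 9233
```

Under that assumption the locus spans 9233 bp on chr6.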


Homology
GenBank top hits (e value, % identity, alignment):
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia], e value 2.0e-132, 98.79% identity
Query:  VNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNW
        + A+VALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNW
Subjt:  VNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNW

Query:  WDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYA
        WDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYA
Subjt:  WDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYA

Query:  EAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ
        EAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ
Subjt:  EAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia], e value 2.1e-76, 60.41% identity
Query:  PQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEAL
        P V  +V LL  ALQ L+DN+  AG AQ  QP +    Q E QFIRDF+R+GPP FNG SE+ T  EEW+RELEALY YLGCSD  KV+GAVFML+GEA+
Subjt:  PQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEAL

Query:  NWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTT
        NWW+ VA  EDH N P+TW   KDLLY+YYFP T+++EK  EFL LTQ +L+VAQYE+KFTE SRF +  IPTE  KI +F+ GL + IKG + L+ PTT
Subjt:  NWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTT

Query:  YAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSK
        YA A++ ALVMDK  +E+ Q QQ +G SSGVKRK    SSSQPS+
Subjt:  YAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSK

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia], e value 1.0e-83, 73.58% identity
Query:  GAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKD
        G  QA  P+     QSEA+FI+DF+RYGPPTF+GESE+AT VEEWIRELEALY YLGC DQ KVKGAVFMLRGEALNWWD VA  ED+ N PI W   K+
Subjt:  GAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKD

Query:  LLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQK
        LLYDYY+P+T+KD KE EFLHL Q TL VAQYE+KFTE SRFAL+LIPTEA KIKRFV+GL KGI+GP+DLQRPTTYAEA++GALVMDKDV  KA P  +
Subjt:  LLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQK

Query:  VGLSSGVKRKVP
        VG SSGVKRK P
Subjt:  VGLSSGVKRKVP

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia], e value 1.1e-85, 59.93% identity
Query:  RVVPQIPPATPQEGV-DPPAPPIGPRRRIVPPGPSTAPQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKA
        R+   + PA   E V DPP PPIG +  +VPP P  A Q             ALI+N+   G AQ   PR     QSEAQFI+DF+RYGPPTF G SE+A
Subjt:  RVVPQIPPATPQEGV-DPPAPPIGPRRRIVPPGPSTAPQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKA

Query:  TVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEF
        T+ EEW+RELEALY YLGC DQ KVKGAVFMLR EALNWWD VA  EDH N P+ W   K+LLYD+Y+ +T++D KE+EFLHL Q TL VAQYE+KFTE 
Subjt:  TVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEF

Query:  SRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ
        S FAL+LIPTEA KIKRFV+GL KGI+G +DLQRP TYAEA++G L+MDKDV  + QP  +VG S GVKRKVPP  + QP + +PQ+
Subjt:  SRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia], e value 5.2e-80, 60.07% identity
Query:  EGVDPPAPPIGPRRRIVPPGPSTA----PQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRE
        EG   P  P+    R+ PP P  A    PQVN QVALL  ALQ L+DN+  AG AQ  QPR+A   Q E QFIRDF+R+GPP FNG SE+ T  EEW+RE
Subjt:  EGVDPPAPPIGPRRRIVPPGPSTA----PQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRE

Query:  LEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIP
        LEALY YLGCSD  KV+GAVFMLRGEA+NWW+ VA  EDHTN P+TW   KDLLY+YYFP T+++EK  EFL LTQ +L VAQYE+KFTE SRF +  IP
Subjt:  LEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIP

Query:  TEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSK
        TE  KI +F+ GL   IKG + ++ PTTYA AI+ ALVMDK  +E+ Q QQ +G SSGVKRK    SSSQ S+
Subjt:  TEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSK

TrEMBL top hits (e value, % identity, alignment):
A0A6J1DCW8 uncharacterized protein LOC111019603, e value 9.8e-133, 98.79% identity
Query:  VNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNW
        + A+VALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNW
Subjt:  VNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNW

Query:  WDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYA
        WDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYA
Subjt:  WDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYA

Query:  EAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ
        EAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ
Subjt:  EAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ

A0A6J1DQB9 Reverse transcriptase, e value 9.9e-77, 56.63% identity
Query:  PPATPQEGVDPPAPPIGPRRRIVPPGPSTA----PQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVV
        P    +   DP   P+ P   +VPP P  A    PQVN QVALL  ALQ L+ N+  AG AQ  QPR+A   Q E QFIRDF+ +GPP FNG SE+ T  
Subjt:  PPATPQEGVDPPAPPIGPRRRIVPPGPSTA----PQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVV

Query:  EEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRF
        EEW+RELEALY YLGCSD  KV+GAVFMLRGEA+NWW+ VA  EDH N P+TW   KDLLY+YYFP   ++EK +EFL LTQ +L VAQYE+KFTE SRF
Subjt:  EEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRF

Query:  ALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSK
            +PTE  KI +F+ GL + IKG + L+ PTTYA A++ ALVMDK  +E+ Q QQ +G +SGVKRK    S+SQ S+
Subjt:  ALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSK

A0A6J1DTA8 uncharacterized protein LOC111024114, e value 2.5e-80, 60.07% identity
Query:  EGVDPPAPPIGPRRRIVPPGPSTA----PQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRE
        EG   P  P+    R+ PP P  A    PQVN QVALL  ALQ L+DN+  AG AQ  QPR+A   Q E QFIRDF+R+GPP FNG SE+ T  EEW+RE
Subjt:  EGVDPPAPPIGPRRRIVPPGPSTA----PQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRE

Query:  LEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIP
        LEALY YLGCSD  KV+GAVFMLRGEA+NWW+ VA  EDHTN P+TW   KDLLY+YYFP T+++EK  EFL LTQ +L VAQYE+KFTE SRF +  IP
Subjt:  LEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIP

Query:  TEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSK
        TE  KI +F+ GL   IKG + ++ PTTYA AI+ ALVMDK  +E+ Q QQ +G SSGVKRK    SSSQ S+
Subjt:  TEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSK

A0A6J1DUM2 uncharacterized protein LOC111023247, e value 4.9e-84, 73.58% identity
Query:  GAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKD
        G  QA  P+     QSEA+FI+DF+RYGPPTF+GESE+AT VEEWIRELEALY YLGC DQ KVKGAVFMLRGEALNWWD VA  ED+ N PI W   K+
Subjt:  GAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKD

Query:  LLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQK
        LLYDYY+P+T+KD KE EFLHL Q TL VAQYE+KFTE SRFAL+LIPTEA KIKRFV+GL KGI+GP+DLQRPTTYAEA++GALVMDKDV  KA P  +
Subjt:  LLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQK

Query:  VGLSSGVKRKVP
        VG SSGVKRK P
Subjt:  VGLSSGVKRKVP

A0A6J1DVA0 uncharacterized protein LOC111023424, e value 5.3e-86, 59.93% identity
Query:  RVVPQIPPATPQEGV-DPPAPPIGPRRRIVPPGPSTAPQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKA
        R+   + PA   E V DPP PPIG +  +VPP P  A Q             ALI+N+   G AQ   PR     QSEAQFI+DF+RYGPPTF G SE+A
Subjt:  RVVPQIPPATPQEGV-DPPAPPIGPRRRIVPPGPSTAPQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQSEAQFIRDFRRYGPPTFNGESEKA

Query:  TVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEF
        T+ EEW+RELEALY YLGC DQ KVKGAVFMLR EALNWWD VA  EDH N P+ W   K+LLYD+Y+ +T++D KE+EFLHL Q TL VAQYE+KFTE 
Subjt:  TVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFLHLTQRTLMVAQYEKKFTEF

Query:  SRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ
        S FAL+LIPTEA KIKRFV+GL KGI+G +DLQRP TYAEA++G L+MDKDV  + QP  +VG S GVKRKVPP  + QP + +PQ+
Subjt:  SRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQQ

SwissProt top hits (e value, % identity, alignment):
No hits found
Arabidopsis top hits (e value, % identity, alignment):
No hits found
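
The % identity figures listed for the hits can be related to the alignment text itself. The sketch below assumes the unlabeled middle line of each alignment follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is counted over all alignment columns. The function name `percent_identity` is hypothetical, and the input is just the first eight columns of the first GenBank alignment, not a full hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline letter matches the query."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment")
    # A column counts as an identity only when the midline repeats the residue;
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identities / len(query)


query = "VNAQVALL"    # first eight query columns of the first alignment
midline = "+ A+VALL"  # corresponding midline columns
print(percent_identity(query, midline))  # 62.5
```

For a complete hit, the query and midline segments would need to be concatenated across all alignment blocks before calling the function.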

Sequences
CDS sequence
ATGCCGGCACACCCGTGCCGTCCGACACTGTCCTTGAGATTCCCATTCTCGAAGGAAGACGCCCAAGGTCTTGGTCTTGCAACGCTCGCCCACGCCATGACACAC
GCATGTGCCGTGAGGCGTGTACGTGCAAAGTGGCGAGGAGTCACTCGCCAACGTAGAGCGCACGGTCATGGCCCGGGGACCCAACTGGCCCCCACTGAGCTTGTC
ACCGACACCACCGACATCGCCTTCGCCTTGCATTCTGCATATGCCTTCTCTAGGCATTCTTGCGTACATCCATGGGGTGTGATAGGTGATGCACCATCCTCCTTG
GTCGCATCACAGCACTTCTCTCTCGACCCTCAACATGGCCTGACGGGCGACATATGCATGTCGCCACCACTTGTCCACTATGTGGGCGTCACAACCTACTCTCTA
CTAGATTATTGGGGTGGACCTCTGAGGTCCGGAAATGTTGGGTCACACTTACGAGGAGTTGTTAACATGTCTACTTCCATTATTGCACTCTTAGCCGCGCAAAGA
CTTAATGGCGAAAATTACAAACAATGGAAGTCAAACCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCG
CCTAACGTTATTGTGGCGGTGCGCAACGCCTATGATAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAT
AAGCACGAGGACACGGTCACCGCTAAGGAGATCATGGACTTGCTGCAGAGCATGTCTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAATTCGTTTAC
AACTCCCGCATGAAGGAGGGTTCATCAGTGCGAGAACACGTTCTCAACCTGATGAGTTTCCTGCCATTCCGCAGCAATGCGGTTATGAATATGCTGGAGTACACT
CTTACCACGCTCCTTAACGAGCTGCAGACCTACCAGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGTTGCTAAGGGGTCTAAACCT
GACTCAGCTGCTGCTGCCCAGAAAAGCAAGGTCAAGGTTGCAGAGAAAGAAAGTGTTTCCACTGCAACATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACT
TGGCCGAAAAGAAGAAAGCCAACGAAGGGAATTAGTTCCTGGAGGCAGCTTGACGCCGAAGAGATGACTCTTAAGGTCGGAACGGGAGAGGTCGTCTCAGCTGTG
GCGGATTGCTCCCACCAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGACATAGTGGTGGTGTTCGAGGGAAAGTCGTTGAAGAAACGGGTAGTCCCTCAA
ATTCCTCCAGCAACCCCTCAAGAAGGGGTAGACCCTCCAGCTCCCCCTATAGGTCCTCGGAGAAGGATAGTTCCTCCAGGCCCCTCGACAGCCCCTCAAGTGAAT
GCTCAGGTGGCTTTACTGGGAAGAGCACTACAAGCATTAATTGACAATTCGATTGCAGCAGGTGCTGCTCAAGCCCTGCAACCTCGTCAAGCTCTGGCTCTTCAG
AGTGAAGCTCAGTTCATCAGGGACTTTAGGCGTTATGGACCCCCTACTTTTAATGGAGAAAGTGAGAAAGCTACAGTAGTGGAGGAGTGGATCAGGGAGTTGGAA
GCTTTATACACTTATCTAGGTTGCAGCGACCAACTTAAAGTCAAAGGTGCAGTATTTATGTTGAGAGGCGAAGCTCTAAATTGGTGGGATGTAGTAGCAACTGTA
GAAGACCATACAAATGAACCCATCACTTGGACAACGTCCAAAGATCTGCTTTACGATTATTACTTTCCGAAGACGATAAAAGATGAAAAAGAGATAGAGTTCCTT
CACCTCACTCAACGAACTTTGATGGTGGCTCAGTATGAGAAGAAGTTTACAGAATTCTCTCGTTTTGCTCTGGATCTAATCCCCACTGAGGCGAGGAAGATTAAA
AGGTTTGTTAGAGGTCTATGGAAAGGGATTAAGGGACCAATTGATCTTCAGCGGCCAACCACTTATGCGGAAGCAATTAAGGGTGCCTTGGTTATGGATAAGGAC
GTCATCGAAAAAGCTCAACCACAGCAGAAAGTCGGCTTATCCTCAGGAGTAAAAAGGAAGGTTCCTCCGATATCCTCTAGCCAACCTTCAAAAACCAGCCCTCAG
CAATAG
mRNA sequence
ATGCCGGCACACCCGTGCCGTCCGACACTGTCCTTGAGATTCCCATTCTCGAAGGAAGACGCCCAAGGTCTTGGTCTTGCAACGCTCGCCCACGCCATGACACAC
GCATGTGCCGTGAGGCGTGTACGTGCAAAGTGGCGAGGAGTCACTCGCCAACGTAGAGCGCACGGTCATGGCCCGGGGACCCAACTGGCCCCCACTGAGCTTGTC
ACCGACACCACCGACATCGCCTTCGCCTTGCATTCTGCATATGCCTTCTCTAGGCATTCTTGCGTACATCCATGGGGTGTGATAGGTGATGCACCATCCTCCTTG
GTCGCATCACAGCACTTCTCTCTCGACCCTCAACATGGCCTGACGGGCGACATATGCATGTCGCCACCACTTGTCCACTATGTGGGCGTCACAACCTACTCTCTA
CTAGATTATTGGGGTGGACCTCTGAGGTCCGGAAATGTTGGGTCACACTTACGAGGAGTTGTTAACATGTCTACTTCCATTATTGCACTCTTAGCCGCGCAAAGA
CTTAATGGCGAAAATTACAAACAATGGAAGTCAAACCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCG
CCTAACGTTATTGTGGCGGTGCGCAACGCCTATGATAGGTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAT
AAGCACGAGGACACGGTCACCGCTAAGGAGATCATGGACTTGCTGCAGAGCATGTCTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAATTCGTTTAC
AACTCCCGCATGAAGGAGGGTTCATCAGTGCGAGAACACGTTCTCAACCTGATGAGTTTCCTGCCATTCCGCAGCAATGCGGTTATGAATATGCTGGAGTACACT
CTTACCACGCTCCTTAACGAGCTGCAGACCTACCAGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGTTGCTAAGGGGTCTAAACCT
GACTCAGCTGCTGCTGCCCAGAAAAGCAAGGTCAAGGTTGCAGAGAAAGAAAGTGTTTCCACTGCAACATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACT
TGGCCGAAAAGAAGAAAGCCAACGAAGGGAATTAGTTCCTGGAGGCAGCTTGACGCCGAAGAGATGACTCTTAAGGTCGGAACGGGAGAGGTCGTCTCAGCTGTG
GCGGATTGCTCCCACCAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGACATAGTGGTGGTGTTCGAGGGAAAGTCGTTGAAGAAACGGGTAGTCCCTCAA
ATTCCTCCAGCAACCCCTCAAGAAGGGGTAGACCCTCCAGCTCCCCCTATAGGTCCTCGGAGAAGGATAGTTCCTCCAGGCCCCTCGACAGCCCCTCAAGTGAAT
GCTCAGGTGGCTTTACTGGGAAGAGCACTACAAGCATTAATTGACAATTCGATTGCAGCAGGTGCTGCTCAAGCCCTGCAACCTCGTCAAGCTCTGGCTCTTCAG
AGTGAAGCTCAGTTCATCAGGGACTTTAGGCGTTATGGACCCCCTACTTTTAATGGAGAAAGTGAGAAAGCTACAGTAGTGGAGGAGTGGATCAGGGAGTTGGAA
GCTTTATACACTTATCTAGGTTGCAGCGACCAACTTAAAGTCAAAGGTGCAGTATTTATGTTGAGAGGCGAAGCTCTAAATTGGTGGGATGTAGTAGCAACTGTA
GAAGACCATACAAATGAACCCATCACTTGGACAACGTCCAAAGATCTGCTTTACGATTATTACTTTCCGAAGACGATAAAAGATGAAAAAGAGATAGAGTTCCTT
CACCTCACTCAACGAACTTTGATGGTGGCTCAGTATGAGAAGAAGTTTACAGAATTCTCTCGTTTTGCTCTGGATCTAATCCCCACTGAGGCGAGGAAGATTAAA
AGGTTTGTTAGAGGTCTATGGAAAGGGATTAAGGGACCAATTGATCTTCAGCGGCCAACCACTTATGCGGAAGCAATTAAGGGTGCCTTGGTTATGGATAAGGAC
GTCATCGAAAAAGCTCAACCACAGCAGAAAGTCGGCTTATCCTCAGGAGTAAAAAGGAAGGTTCCTCCGATATCCTCTAGCCAACCTTCAAAAACCAGCCCTCAG
CAATAG
Protein sequence
MPAHPCRPTLSLRFPFSKEDAQGLGLATLAHAMTHACAVRRVRAKWRGVTRQRRAHGHGPGTQLAPTELVTDTTDIAFALHSAYAFSRHSCVHPWGVIGDAPSSL
VASQHFSLDPQHGLTGDICMSPPLVHYVGVTTYSLLDYWGGPLRSGNVGSHLRGVVNMSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPA
PNVIVAVRNAYDRWIKANDKAKVYILASISDVLANKHEDTVTAKEIMDLLQSMSGQPSSQARHEALKFVYNSRMKEGSSVREHVLNLMSFLPFRSNAVMNMLEYT
LTTLLNELQTYQSAPSSSGSKTFKKKKAVAKGSKPDSAAAAQKSKVKVAEKESVSTATWTGIGSATAQSTWPKRRKPTKGISSWRQLDAEEMTLKVGTGEVVSAV
ADCSHQHDPETQEDSEEDIVVVFEGKSLKKRVVPQIPPATPQEGVDPPAPPIGPRRRIVPPGPSTAPQVNAQVALLGRALQALIDNSIAAGAAQALQPRQALALQ
SEAQFIRDFRRYGPPTFNGESEKATVVEEWIRELEALYTYLGCSDQLKVKGAVFMLRGEALNWWDVVATVEDHTNEPITWTTSKDLLYDYYFPKTIKDEKEIEFL
HLTQRTLMVAQYEKKFTEFSRFALDLIPTEARKIKRFVRGLWKGIKGPIDLQRPTTYAEAIKGALVMDKDVIEKAQPQQKVGLSSGVKRKVPPISSSQPSKTSPQ
Q
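
The relationship between the CDS and the protein sequence can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code applies to this gene. The names `CODON_TABLE` and `translate` are illustrative, and the input is only the first 30 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCCGGCACACCCGTGCCGTCCGACACTG"))  # MPAHPCRPTL
```

The result matches the first ten residues of the protein sequence. Passing the full CDS, with its line breaks, would translate the whole open reading frame up to the terminal TAG.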