; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000719 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000719
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:14006360..14012608
RNA-Seq ExpressionLag0000719
SyntenyLag0000719
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.3e-4943.49Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        + ESM+V  T  KS SK K   +   H        TL+ERQKK+YPFP+ D++DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI HL E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSN--------------------------------LATIKKK---SKHQRKKDPKQLQPKR-----------------
        LK+LI KLA+E KIELD+DEV Q+N                                L   ++K   S  Q K++P + + +                  
Subjt:  LKDLILKLAKEGKIELDLDEVTQSN--------------------------------LATIKKK---SKHQRKKDPKQLQPKR-----------------

Query:  -------------------KRKEVDNSKKGEQRTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVID
                             +E+DNS + +QRTSVFD IK  TT  SVFQR+SM T +EENQC   T  + SAF+RLS+S SKK RPST  FDRLK+ +
Subjt:  -------------------KRKEVDNSKKGEQRTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVID

Query:  DQPQRKMDNLEFDVF
        DQ QR+M  L+   F
Subjt:  DQPQRKMDNLEFDVF

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.1e-4738.62Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        + ESM+V  T  KS SK K + +   H        TL+ERQKK+YPFP+ D++DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H  E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSNLATI-----------------------------------------------------------------------
        LK+LILKL +E KIELD+DEV Q+N   +                                                                       
Subjt:  LKDLILKLAKEGKIELDLDEVTQSNLATI-----------------------------------------------------------------------

Query:  ---------------KKKSKHQRK---------KDPKQLQPKR---------------------------------------KRKEVDNSKKGEQRTSVF
                       KKK +  +K         KD   LQP++                                         +EVDNS + +QRTSVF
Subjt:  ---------------KKKSKHQRK---------KDPKQLQPKR---------------------------------------KRKEVDNSKKGEQRTSVF

Query:  DLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVF
        D IK  TT  SVFQR+S+ T EEENQC  ST TR SAF+ LS+STSKK RPSTS FDRLK+I+DQ QR+M +L+   F
Subjt:  DLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVF

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]5.2e-5251.27Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        I+ESM+V+ T  KS SK K       H        TLKERQ+K+YPFP+ D++DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H  E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKR---------------------------------------KRKEVDNSKKGEQ
        LK+LILKLA+E KIELD+DEV Q+N A I+  S   + KD   LQ +R                                         KEV+NS +  Q
Subjt:  LKDLILKLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKR---------------------------------------KRKEVDNSKKGEQ

Query:  RTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKM
        RTSVFD IK STT  SVFQR+S+ T EEENQC     TR S  +RLS+ST KK RPSTS FDRLK+ +DQ QR+M
Subjt:  RTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKM

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.9e-5148.41Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        + ESM+V  T  KS SK K   +   H        TL+ERQKK+YPFP+ D++DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI H  E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKR---------------------------------------KRKEVDNSKKGEQ
        LK+LILKLA+E KI+LD+DE  +               KD   LQP+R                                         +EVDNS + +Q
Subjt:  LKDLILKLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKR---------------------------------------KRKEVDNSKKGEQ

Query:  RTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVF
        RTSVFD IK  TT   VFQR+SM T EEENQC  ST TR SAF+RLS+STSKK RPSTS FDRLK+ +DQ QR+M +L+   F
Subjt:  RTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVF

TYK04576.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.5e-4637.7Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        I+ESM+V+ T  KS SK K       H        TLKERQKK+YPFP+ D++DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H  E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSNLATI-----------------------------------------------------------------------
        LK+LILKLA+E KIELD+DEV Q+N   I                                                                       
Subjt:  LKDLILKLAKEGKIELDLDEVTQSNLATI-----------------------------------------------------------------------

Query:  ---------------KKKSKHQRK---------KDPKQLQPKR---------------------------------------KRKEVDNSKKGEQRTSVF
                       KKK +  +K         KD   LQ +R                                         KEV+NS +  QRTSVF
Subjt:  ---------------KKKSKHQRK---------KDPKQLQPKR---------------------------------------KRKEVDNSKKGEQRTSVF

Query:  DLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVFTH----PKFVVPSPSLKVLRSC
        D IK STT  SVFQR+SM T EEENQC     TR S F+RLS+STSKK RPSTS FDRLK+ +DQ QR+M +L+   F+      K     PS    +  
Subjt:  DLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVFTH----PKFVVPSPSLKVLRSC

Query:  VDPSPSLKFEGSDSALLPSPSSKVLTL
        VD    +  EG+ S LL +   + L L
Subjt:  VDPSPSLKFEGSDSALLPSPSSKVLTL

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein4.5e-4943.49Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        + ESM+V  T  KS SK K   +   H        TL+ERQKK+YPFP+ D++DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI HL E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSN--------------------------------LATIKKK---SKHQRKKDPKQLQPKR-----------------
        LK+LI KLA+E KIELD+DEV Q+N                                L   ++K   S  Q K++P + + +                  
Subjt:  LKDLILKLAKEGKIELDLDEVTQSN--------------------------------LATIKKK---SKHQRKKDPKQLQPKR-----------------

Query:  -------------------KRKEVDNSKKGEQRTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVID
                             +E+DNS + +QRTSVFD IK  TT  SVFQR+SM T +EENQC   T  + SAF+RLS+S SKK RPST  FDRLK+ +
Subjt:  -------------------KRKEVDNSKKGEQRTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVID

Query:  DQPQRKMDNLEFDVF
        DQ QR+M  L+   F
Subjt:  DQPQRKMDNLEFDVF

A0A5A7TGM1 Retrotransposon gag protein2.5e-4738.62Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        + ESM+V  T  KS SK K + +   H        TL+ERQKK+YPFP+ D++DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H  E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSNLATI-----------------------------------------------------------------------
        LK+LILKL +E KIELD+DEV Q+N   +                                                                       
Subjt:  LKDLILKLAKEGKIELDLDEVTQSNLATI-----------------------------------------------------------------------

Query:  ---------------KKKSKHQRK---------KDPKQLQPKR---------------------------------------KRKEVDNSKKGEQRTSVF
                       KKK +  +K         KD   LQP++                                         +EVDNS + +QRTSVF
Subjt:  ---------------KKKSKHQRK---------KDPKQLQPKR---------------------------------------KRKEVDNSKKGEQRTSVF

Query:  DLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVF
        D IK  TT  SVFQR+S+ T EEENQC  ST TR SAF+ LS+STSKK RPSTS FDRLK+I+DQ QR+M +L+   F
Subjt:  DLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVF

A0A5A7URH1 Ty3-gypsy retrotransposon protein2.5e-5251.27Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        I+ESM+V+ T  KS SK K       H        TLKERQ+K+YPFP+ D++DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H  E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKR---------------------------------------KRKEVDNSKKGEQ
        LK+LILKLA+E KIELD+DEV Q+N A I+  S   + KD   LQ +R                                         KEV+NS +  Q
Subjt:  LKDLILKLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKR---------------------------------------KRKEVDNSKKGEQ

Query:  RTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKM
        RTSVFD IK STT  SVFQR+S+ T EEENQC     TR S  +RLS+ST KK RPSTS FDRLK+ +DQ QR+M
Subjt:  RTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKM

A0A5A7VFA5 Ty3-gypsy retrotransposon protein4.8e-5148.41Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        + ESM+V  T  KS SK K   +   H        TL+ERQKK+YPFP+ D++DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI H  E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKR---------------------------------------KRKEVDNSKKGEQ
        LK+LILKLA+E KI+LD+DE  +               KD   LQP+R                                         +EVDNS + +Q
Subjt:  LKDLILKLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKR---------------------------------------KRKEVDNSKKGEQ

Query:  RTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVF
        RTSVFD IK  TT   VFQR+SM T EEENQC  ST TR SAF+RLS+STSKK RPSTS FDRLK+ +DQ QR+M +L+   F
Subjt:  RTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVF

A0A5D3C2C8 Retrotransposon gag protein1.2e-4637.7Show/hide
Query:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI
        I+ESM+V+ T  KS SK K       H        TLKERQKK+YPFP+ D++DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI H  E+CF+
Subjt:  IEESMIVNTTLPKSSSKEKRQTNGTYH-------LTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFI

Query:  LKDLILKLAKEGKIELDLDEVTQSNLATI-----------------------------------------------------------------------
        LK+LILKLA+E KIELD+DEV Q+N   I                                                                       
Subjt:  LKDLILKLAKEGKIELDLDEVTQSNLATI-----------------------------------------------------------------------

Query:  ---------------KKKSKHQRK---------KDPKQLQPKR---------------------------------------KRKEVDNSKKGEQRTSVF
                       KKK +  +K         KD   LQ +R                                         KEV+NS +  QRTSVF
Subjt:  ---------------KKKSKHQRK---------KDPKQLQPKR---------------------------------------KRKEVDNSKKGEQRTSVF

Query:  DLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVFTH----PKFVVPSPSLKVLRSC
        D IK STT  SVFQR+SM T EEENQC     TR S F+RLS+STSKK RPSTS FDRLK+ +DQ QR+M +L+   F+      K     PS    +  
Subjt:  DLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSRPSTSVFDRLKVIDDQPQRKMDNLEFDVFTH----PKFVVPSPSLKVLRSC

Query:  VDPSPSLKFEGSDSALLPSPSSKVLTL
        VD    +  EG+ S LL +   + L L
Subjt:  VDPSPSLKFEGSDSALLPSPSSKVLTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAAGAAGGAAGGAACGGCGAAGAGACTATAGAAGAATCTATGATTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAACGACAAACAAATGGAACATA
TCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGAGGTCGACATCTCTGATATGTTGGAACAACTACTGGAAGCGCAACTGATAGAACTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCATGTATTGCAAGTATCATCGAGTTATTGGTCATCTAGAGGAAAGATGCTTCATCCTAAAGGACTTAATTCTA
AAGCTAGCTAAGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTAACTCAATCAAATCTTGCTACAATAAAAAAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGCA
ACTTCAACCCAAGAGGAAGAGGAAAGAAGTTGACAATTCCAAGAAGGGTGAACAAAGGACTTCCGTCTTCGATCTCATCAAGTCTTCAACTACTCCTCCTTCGGTATTCC
AAAGAGTGAGTATGGTCACGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTCCACTCGACCTTCAGCTTTCCGAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGA
CCTTCAACATCCGTTTTTGATCGTCTCAAAGTAATAGACGATCAACCTCAAAGAAAGATGGATAACTTGGAGTTCGATGTTTTCACTCACCCTAAGTTCGTTGTTCCTTC
TCCAAGTTTGAAGGTTCTTCGCTCTTGCGTTGATCCTTCTCCAAGTCTGAAGTTTGAAGGTTCTGACTCTGCGCTGCTTCCTTCACCAAGTTCGAAGGTTCTCACGCTGC
GCTGTTGCGCTGCTTTCTTCTCCAAGTTTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNAAAGTTTTCACGTTGTGCTGCTGCGCGCTGCTCTCTTCTCTCTCCAAGTTTGAAGGTTCTTACGCTACACTGCTTCCTTCACCAAGTTCA
AAGGTTCTCACGCTGCGCTGCTTCGCTATTCCTTCTCCAAGTTCGAAGGTTCTCACGTTGCGCTGCTTCCTTCTCCAAGTTTGAAGGTTCTGACGTTGCGCTGCTTCCTT
CTCCAAGTTCGAAGGTTCTCATGCCATGCTGCTTCGCTGTTCCTTTTCCAAGTTTGAAGGTTCTCATATGGCGCTGTTATGCTGCTTCCTTCTTCAAGTTTGAAGATTCT
CATGTCACGTTGCTTCGTTGTTCCTTCTCCAAGTTTGAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCATGTGCTTCACTGAAGTTCCTTCTCTCAAAGTTTGAAGATTT
CTTCTCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGGAGTTCCTTCTCTCCAAGTTCGAAGGTTTCACGTTGTTTCGCTGAAGTTCCTTCTCTCCAAGTTTGAAGGTT
TTCACGCTCTTCGCTGTAGTTCCTTCTTCAAGTTCGAAGGTTCTCACGCAGCGCTGTTACGCTGCTTCATTCTCCAAGTTCAAAAGTTTGCACGTTGTGTTGCTGCACGC
TGCTTCCTTCTCTCTCAAAGTTCTCACGTTGCTACGCTGCCTGTTCTGCCAGTCCCTCAAGGTTCGAAGGGCACTGCTACGTTGCCAATCCATTCTCTCTCCAAGTTCGA
AGGTCCTCATGCGTTACGCTACTGTTCCTTCTCAAAGTTCGAAGTTCTTTCCTCCAAGTTCGAAGGTTCTCAGTTCCTTCTCTCTAAGTTTGAAGGTTCTCATGTTGCTT
CGCTGCTGTTCCTTCTCCAAGTTCGAAGGTTCTCATGCCACGTCGCTTTGTTGTTCCTTCTCCAAGTTCAAAAGTTCTCATACTGCGCTGCTACGTTGCTTCCTTCTCCA
AGTTCGAAGGTTCTCAAGTGTTACGTTGTTTTGCTGTTGCTTCCTTCTCCAAGTTCGAAAGTTCACACGTTGTATAGTGCTATTTCTTCTCCAAGTTCGAAGTTCTCGTG
TTACGCTGCTTTGCTTTTCCTTCTTCAAGTTCGAAGGTTCTGCTTTGTTGTAGACGTTGCAGCAGTAAAGGAGTCCAAGTGGGAATTACATCATGTATCATTGGGCCGGG
AATGTCATAACAACAAGAGTCCAAAGACGTCATATGTCCTTGTACTTATGTTGAAAGACGTTGTAGCAGTAAAGGAGTCCAATAATGGAGTCCAAGTGGAAATGACATCA
TGTGTCCTTGGGTCGGAGAACGTCGTAGCAACAAAAGTCCAAGGAACATGTCCTTGTACTTATGCTGAAAGACGTTGCAGCAGTAAAGGAGCCCAAGACATCAACACTTC
TGGAAGACTAAAGACTCCTTCAAGACTGGAAGACTTCAAGCTCCAAGAATCTATTGAAGCTTCAGACTCGAGATCAAGCGTTCCAGCCCTCGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAAGAAGGAAGGAACGGCGAAGAGACTATAGAAGAATCTATGATTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAACGACAAACAAATGGAACATA
TCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGAGGTCGACATCTCTGATATGTTGGAACAACTACTGGAAGCGCAACTGATAGAACTTCCTAAGT
GTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCATGTATTGCAAGTATCATCGAGTTATTGGTCATCTAGAGGAAAGATGCTTCATCCTAAAGGACTTAATTCTA
AAGCTAGCTAAGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTAACTCAATCAAATCTTGCTACAATAAAAAAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGCA
ACTTCAACCCAAGAGGAAGAGGAAAGAAGTTGACAATTCCAAGAAGGGTGAACAAAGGACTTCCGTCTTCGATCTCATCAAGTCTTCAACTACTCCTCCTTCGGTATTCC
AAAGAGTGAGTATGGTCACGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTCCACTCGACCTTCAGCTTTCCGAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGA
CCTTCAACATCCGTTTTTGATCGTCTCAAAGTAATAGACGATCAACCTCAAAGAAAGATGGATAACTTGGAGTTCGATGTTTTCACTCACCCTAAGTTCGTTGTTCCTTC
TCCAAGTTTGAAGGTTCTTCGCTCTTGCGTTGATCCTTCTCCAAGTCTGAAGTTTGAAGGTTCTGACTCTGCGCTGCTTCCTTCACCAAGTTCGAAGGTTCTCACGCTGC
GCTGTTGCGCTGCTTTCTTCTCCAAGTTTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNAAAGTTTTCACGTTGTGCTGCTGCGCGCTGCTCTCTTCTCTCTCCAAGTTTGAAGGTTCTTACGCTACACTGCTTCCTTCACCAAGTTCA
AAGGTTCTCACGCTGCGCTGCTTCGCTATTCCTTCTCCAAGTTCGAAGGTTCTCACGTTGCGCTGCTTCCTTCTCCAAGTTTGAAGGTTCTGACGTTGCGCTGCTTCCTT
CTCCAAGTTCGAAGGTTCTCATGCCATGCTGCTTCGCTGTTCCTTTTCCAAGTTTGAAGGTTCTCATATGGCGCTGTTATGCTGCTTCCTTCTTCAAGTTTGAAGATTCT
CATGTCACGTTGCTTCGTTGTTCCTTCTCCAAGTTTGAAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCATGTGCTTCACTGAAGTTCCTTCTCTCAAAGTTTGAAGATTT
CTTCTCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGGAGTTCCTTCTCTCCAAGTTCGAAGGTTTCACGTTGTTTCGCTGAAGTTCCTTCTCTCCAAGTTTGAAGGTT
TTCACGCTCTTCGCTGTAGTTCCTTCTTCAAGTTCGAAGGTTCTCACGCAGCGCTGTTACGCTGCTTCATTCTCCAAGTTCAAAAGTTTGCACGTTGTGTTGCTGCACGC
TGCTTCCTTCTCTCTCAAAGTTCTCACGTTGCTACGCTGCCTGTTCTGCCAGTCCCTCAAGGTTCGAAGGGCACTGCTACGTTGCCAATCCATTCTCTCTCCAAGTTCGA
AGGTCCTCATGCGTTACGCTACTGTTCCTTCTCAAAGTTCGAAGTTCTTTCCTCCAAGTTCGAAGGTTCTCAGTTCCTTCTCTCTAAGTTTGAAGGTTCTCATGTTGCTT
CGCTGCTGTTCCTTCTCCAAGTTCGAAGGTTCTCATGCCACGTCGCTTTGTTGTTCCTTCTCCAAGTTCAAAAGTTCTCATACTGCGCTGCTACGTTGCTTCCTTCTCCA
AGTTCGAAGGTTCTCAAGTGTTACGTTGTTTTGCTGTTGCTTCCTTCTCCAAGTTCGAAAGTTCACACGTTGTATAGTGCTATTTCTTCTCCAAGTTCGAAGTTCTCGTG
TTACGCTGCTTTGCTTTTCCTTCTTCAAGTTCGAAGGTTCTGCTTTGTTGTAGACGTTGCAGCAGTAAAGGAGTCCAAGTGGGAATTACATCATGTATCATTGGGCCGGG
AATGTCATAACAACAAGAGTCCAAAGACGTCATATGTCCTTGTACTTATGTTGAAAGACGTTGTAGCAGTAAAGGAGTCCAATAATGGAGTCCAAGTGGAAATGACATCA
TGTGTCCTTGGGTCGGAGAACGTCGTAGCAACAAAAGTCCAAGGAACATGTCCTTGTACTTATGCTGAAAGACGTTGCAGCAGTAAAGGAGCCCAAGACATCAACACTTC
TGGAAGACTAAAGACTCCTTCAAGACTGGAAGACTTCAAGCTCCAAGAATCTATTGAAGCTTCAGACTCGAGATCAAGCGTTCCAGCCCTCGAGTGA
Protein sequenceShow/hide protein sequence
MRKEGRNGEETIEESMIVNTTLPKSSSKEKRQTNGTYHLTLKERQKKIYPFPEVDISDMLEQLLEAQLIELPKCKRPEEMEKVDDPMYCKYHRVIGHLEERCFILKDLIL
KLAKEGKIELDLDEVTQSNLATIKKKSKHQRKKDPKQLQPKRKRKEVDNSKKGEQRTSVFDLIKSSTTPPSVFQRVSMVTTEEENQCSVSTSTRPSAFRRLSVSTSKKSR
PSTSVFDRLKVIDDQPQRKMDNLEFDVFTHPKFVVPSPSLKVLRSCVDPSPSLKFEGSDSALLPSPSSKVLTLRCCAAFFSKFXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXKFSRCAAARCSLLSPSLKVLTLHCFLHQVQRFSRCAASLFLLQVRRFSRCAASFSKFEGSDVALLPSPSSKVLMPCCFAVPFPSLKVLIWRCYAASFFKFEDS
HVTLLRCSFSKFEVPSLQVRRFSCASLKFLLSKFEDFFSPSSKVLTRFAGVPSLQVRRFHVVSLKFLLSKFEGFHALRCSSFFKFEGSHAALLRCFILQVQKFARCVAAR
CFLLSQSSHVATLPVLPVPQGSKGTATLPIHSLSKFEGPHALRYCSFSKFEVLSSKFEGSQFLLSKFEGSHVASLLFLLQVRRFSCHVALLFLLQVQKFSYCAATLLPSP
SSKVLKCYVVLLLLPSPSSKVHTLYSAISSPSSKFSCYAALLFLLQVRRFCFVVDVAAVKESKWELHHVSLGRECHNNKSPKTSYVLVLMLKDVVAVKESNNGVQVEMTS
CVLGSENVVATKVQGTCPCTYAERRCSSKGAQDINTSGRLKTPSRLEDFKLQESIEASDSRSSVPALE