; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036485 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036485
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr3:47099906..47112046
RNA-Seq ExpressionLag0036485
SyntenyLag0036485
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-7559.5Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL--------------------------------S
        MS+PEGFIT+GQEQKVCKLNRSIYGLKQASKSWNIRFD AIKS+GFDQNVDEPC+YKKINK KVAFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL--------------------------------S

Query:  PKTPQEVEDMRQIPYASTVGGLI----------------------------------------------------------YTDSDLQFDKDSRKSTSGS
        PKTPQEVEDMR+IPYAS VG L+                                                          YTDSD Q DKDSRKST GS
Subjt:  PKTPQEVEDMRQIPYASTVGGLI----------------------------------------------------------YTDSDLQFDKDSRKSTSGS

Query:  VFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR
        VFTLNGGAVVWRSI+Q CIADSTMEAEYVAACEA K+ VWLRKFL DLEVV NMNLPITLYY NS AVANSKEPRSHKR
Subjt:  VFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR

KAA0034955.1 gag/pol protein [Cucumis melo var. makuwa]5.7e-7946.99Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------
        MSQ EGFITQGQEQKVCKLNRSIYGLKQAS+SWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------

Query:  ---------------------------------SPKTPQEVEDMRQIPYASTVGGLIY----TDSDL---------------------------------
                                          PKTPQEVE+MR+I Y S VG L+Y    T  D+                                 
Subjt:  ---------------------------------SPKTPQEVEDMRQIPYASTVGGLIY----TDSDL---------------------------------

Query:  -------------------QFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAV
                           Q DKDSRKSTS SVF LNGGAVVWRSI+Q CIADSTMEAEYVAACEAAK+ VWLRKFL DLEVV NMNLP+TLY  NSGA+
Subjt:  -------------------QFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAV

Query:  ANSKEPRSHK-----------------------------------------------------------RDCPLICTGESDLHDDSISHHFGDKTEWGAE
        ANSKEPRSHK                                                           RDCPLICT ES    DSI   F DKTEWGA 
Subjt:  ANSKEPRSHK-----------------------------------------------------------RDCPLICTGESDLHDDSISHHFGDKTEWGAE

Query:  NIITQDGIHSFPTLG
        NIITQDGIHSFP LG
Subjt:  NIITQDGIHSFPTLG

KAA0058244.1 gag/pol protein [Cucumis melo var. makuwa]5.0e-7565.71Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL--------------------------------S
        MSQPEGFITQGQEQKVCKLNRSIYGLKQAS+SWNIRFDTAIKS+GFDQNVDEPCVYKKINK K+AFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL--------------------------------S

Query:  PKTPQEVEDMRQIPYASTVGGLIY----TDSDL--------QFDKDS------------RKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEA
        PKTPQEVED+R+IPYASTVG LIY    T  D+        ++  +S            +KSTSGSVFTLNGGAVVW SI+Q CI DSTMEAEYVAACEA
Subjt:  PKTPQEVEDMRQIPYASTVGGLIY----TDSDL--------QFDKDS------------RKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEA

Query:  AKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR
        AK+ VWLRKFL DLEVV NM LPITLY  NSGAVANSKEPRSHKR
Subjt:  AKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]5.1e-8046.17Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------
        MSQPEGFITQGQEQKVCKLNRSIYGLKQAS+SWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------

Query:  ---------------------------------------------------------SPKTPQEVEDMRQIPYASTVGGLI-------------------
                                                                 SPKTPQEVEDMR+IPYAS VG L+                   
Subjt:  ---------------------------------------------------------SPKTPQEVEDMRQIPYASTVGGLI-------------------

Query:  ---------------------------------------YTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLR
                                               YTDSD Q DKDSRKSTSGSVFTLNGGAVVWRSI+Q CIADSTMEAEYVAACEAAK+ VWLR
Subjt:  ---------------------------------------YTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLR

Query:  KFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR--------------------------------------------DCPLICTGESDLH---DDS
        KFL DLEVV NMNLPITLY  NSGAVANSKEPRSHKR                                            +  L   G  D++      
Subjt:  KFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR--------------------------------------------DCPLICTGESDLH---DDS

Query:  ISHHFGDKTEWGAENIITQDGIHSFPTLGKK
         S+HFGD TEW A NIITQDGIHSFP LG K
Subjt:  ISHHFGDKTEWGAENIITQDGIHSFPTLGKK

TYK04889.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]4.5e-7667.8Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------
        + +PEGFITQGQEQKVCKLNRSIYGLKQAS+ WNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------

Query:  --------------SPKTPQEVEDMRQIPYASTVGGLIYTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRK
                       PK PQEVEDMR+IPYASTVG   YTD D Q DKDSRKSTS S+FTLNGGAVVWRSI+Q CIADSTMEAEYVAACEAAK+ +WLRK
Subjt:  --------------SPKTPQEVEDMRQIPYASTVGGLIYTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRK

Query:  FLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR
        FL DLEVV NMNL ITLY  NSGAVANSKEP SHKR
Subjt:  FLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR

TrEMBL top hitse value%identityAlignment
A0A5A7SIN2 Gag/pol protein6.3e-7659.5Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL--------------------------------S
        MS+PEGFIT+GQEQKVCKLNRSIYGLKQASKSWNIRFD AIKS+GFDQNVDEPC+YKKINK KVAFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL--------------------------------S

Query:  PKTPQEVEDMRQIPYASTVGGLI----------------------------------------------------------YTDSDLQFDKDSRKSTSGS
        PKTPQEVEDMR+IPYAS VG L+                                                          YTDSD Q DKDSRKST GS
Subjt:  PKTPQEVEDMRQIPYASTVGGLI----------------------------------------------------------YTDSDLQFDKDSRKSTSGS

Query:  VFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR
        VFTLNGGAVVWRSI+Q CIADSTMEAEYVAACEA K+ VWLRKFL DLEVV NMNLPITLYY NS AVANSKEPRSHKR
Subjt:  VFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR

A0A5A7T0G5 Gag/pol protein2.7e-7946.99Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------
        MSQ EGFITQGQEQKVCKLNRSIYGLKQAS+SWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------

Query:  ---------------------------------SPKTPQEVEDMRQIPYASTVGGLIY----TDSDL---------------------------------
                                          PKTPQEVE+MR+I Y S VG L+Y    T  D+                                 
Subjt:  ---------------------------------SPKTPQEVEDMRQIPYASTVGGLIY----TDSDL---------------------------------

Query:  -------------------QFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAV
                           Q DKDSRKSTS SVF LNGGAVVWRSI+Q CIADSTMEAEYVAACEAAK+ VWLRKFL DLEVV NMNLP+TLY  NSGA+
Subjt:  -------------------QFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAV

Query:  ANSKEPRSHK-----------------------------------------------------------RDCPLICTGESDLHDDSISHHFGDKTEWGAE
        ANSKEPRSHK                                                           RDCPLICT ES    DSI   F DKTEWGA 
Subjt:  ANSKEPRSHK-----------------------------------------------------------RDCPLICTGESDLHDDSISHHFGDKTEWGAE

Query:  NIITQDGIHSFPTLG
        NIITQDGIHSFP LG
Subjt:  NIITQDGIHSFPTLG

A0A5A7USV4 Gag/pol protein2.4e-7565.71Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL--------------------------------S
        MSQPEGFITQGQEQKVCKLNRSIYGLKQAS+SWNIRFDTAIKS+GFDQNVDEPCVYKKINK K+AFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL--------------------------------S

Query:  PKTPQEVEDMRQIPYASTVGGLIY----TDSDL--------QFDKDS------------RKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEA
        PKTPQEVED+R+IPYASTVG LIY    T  D+        ++  +S            +KSTSGSVFTLNGGAVVW SI+Q CI DSTMEAEYVAACEA
Subjt:  PKTPQEVEDMRQIPYASTVGGLIY----TDSDL--------QFDKDS------------RKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEA

Query:  AKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR
        AK+ VWLRKFL DLEVV NM LPITLY  NSGAVANSKEPRSHKR
Subjt:  AKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR

A0A5A7UYE8 Gag/pol protein2.5e-8046.17Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------
        MSQPEGFITQGQEQKVCKLNRSIYGLKQAS+SWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------

Query:  ---------------------------------------------------------SPKTPQEVEDMRQIPYASTVGGLI-------------------
                                                                 SPKTPQEVEDMR+IPYAS VG L+                   
Subjt:  ---------------------------------------------------------SPKTPQEVEDMRQIPYASTVGGLI-------------------

Query:  ---------------------------------------YTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLR
                                               YTDSD Q DKDSRKSTSGSVFTLNGGAVVWRSI+Q CIADSTMEAEYVAACEAAK+ VWLR
Subjt:  ---------------------------------------YTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLR

Query:  KFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR--------------------------------------------DCPLICTGESDLH---DDS
        KFL DLEVV NMNLPITLY  NSGAVANSKEPRSHKR                                            +  L   G  D++      
Subjt:  KFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR--------------------------------------------DCPLICTGESDLH---DDS

Query:  ISHHFGDKTEWGAENIITQDGIHSFPTLGKK
         S+HFGD TEW A NIITQDGIHSFP LG K
Subjt:  ISHHFGDKTEWGAENIITQDGIHSFPTLGKK

A0A5D3C3C9 Retrovirus-related pol polyprotein from transposon tnt 1-942.2e-7667.8Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------
        + +PEGFITQGQEQKVCKLNRSIYGLKQAS+ WNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFL                                 
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL---------------------------------

Query:  --------------SPKTPQEVEDMRQIPYASTVGGLIYTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRK
                       PK PQEVEDMR+IPYASTVG   YTD D Q DKDSRKSTS S+FTLNGGAVVWRSI+Q CIADSTMEAEYVAACEAAK+ +WLRK
Subjt:  --------------SPKTPQEVEDMRQIPYASTVGGLIYTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRK

Query:  FLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR
        FL DLEVV NMNL ITLY  NSGAVANSKEP SHKR
Subjt:  FLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.9e-0934.34Show/hide
Query:  YTDSDLQFDKDSRKSTSGSVFTL-NGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR
        Y DSD    +  RKST+G +F + +   + W +  Q  +A S+ EAEY+A  EA ++ +WL+  LT + +   +  PI +Y  N G ++ +  P  HKR
Subjt:  YTDSDLQFDKDSRKSTSGSVFTL-NGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR

P04146 Copia protein6.9e-0339.29Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVY
        M  P+G         VCKLN++IYGLKQA++ W   F+ A+K   F  +  + C+Y
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVY

P0CV72 Secreted RxLR effector protein 1612.2e-0953.33Show/hide
Query:  YTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWL
        Y+D+D   D +SR+STSG +F LNGG V WRS +QR +A S+ E EY+A  EA ++ VWL
Subjt:  YTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-2024.26Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVY-KKINKNKVAFL--------------------------------
        M QPEGF   G++  VCKLN+S+YGLKQA + W ++FD+ +KS  + +   +PCVY K+ ++N    L                                
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVY-KKINKNKVAFL--------------------------------

Query:  ----------------------------------------------------------SPKTPQEVEDMRQIPYASTVGGLI------------------
                                                                   P T +E  +M ++PY+S VG L+                  
Subjt:  ----------------------------------------------------------SPKTPQEVEDMRQIPYASTVGGLI------------------

Query:  ----------------------------------------YTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWL
                                                YTD+D+  D D+RKS++G +FT +GGA+ W+S  Q+C+A ST EAEY+AA E  K+ +WL
Subjt:  ----------------------------------------YTDSDLQFDKDSRKSTSGSVFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWL

Query:  RKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR
        ++FL +L +         +Y  +  A+  SK    H R
Subjt:  RKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.6e-0537.5Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVY
        MSQP GFI + +   VCKL +++YGLKQA ++W +     + + GF  +V +  ++
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.3e-0535.71Show/hide
Query:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVY
        MSQP GF+ + +   VC+L ++IYGLKQA ++W +   T + + GF  ++ +  ++
Subjt:  MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.7e-0439.68Show/hide
Query:  MSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKI
        M  P G+   QG       VC L +SIYGLKQAS+ W ++F   +  FGF Q+  +   + KI
Subjt:  MSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAGCCCGAAGGGTTCATAACCCAAGGTCAGGAGCAAAAGGTTTGCAAACTGAATCGATCCATTTATGGGTTGAAACAGGCCTCCAAATCTTGGAATATTAGATT
TGATACTGCGATCAAATCTTTTGGCTTTGACCAAAATGTTGATGAGCCTTGTGTATACAAGAAAATCAACAAAAATAAAGTAGCTTTTCTTTCTCCTAAGACACCTCAAG
AAGTTGAGGATATGAGACAGATTCCCTACGCCTCTACTGTGGGCGGCTTAATATACACCGACTCTGACCTCCAGTTTGACAAAGATTCTAGAAAATCCACGTCAGGGTCA
GTGTTCACCCTTAACGGGGGAGCTGTAGTTTGGAGAAGCATCGAGCAAAGATGCATAGCGGACTCAACCATGGAGGCTGAGTATGTAGCTGCTTGTGAAGCAGCTAAGAA
GACAGTTTGGCTAAGGAAGTTCTTGACTGATTTGGAAGTAGTTTCAAATATGAACTTGCCCATTACTTTATACTATTACAATAGTGGGGCTGTAGCCAATTCAAAGGAAC
CTCGCAGCCACAAACGAGATTGTCCTTTGATTTGTACGGGTGAGAGTGACCTGCACGACGACTCAATAAGCCATCATTTTGGGGACAAGACCGAGTGGGGAGCTGAGAAC
ATAATCACACAAGATGGAATTCACTCCTTCCCGACTCTAGGCAAGAAGGAAACGCTGGGGCTATTGTTTTTTGATAACACAAGGAGAATTTGGCGAAATTCTGTCAAGAA
CGACTATGTAATCGAGATAGCCATGTTGAGGACACTTCCTCGGCAGCTCTTTAAATCACTCCCAGGCCTTGAACAACTACTCATCAAACTGCTGTTGGAACGTCCCAATC
TCGGTCCTCAGCCTGACCGTCTAAGTAGGAGGGAAGAATTTCTTTTAAAACGCCTGGACCAAAGCATCCCAAGTAGTAATGCTCTCAGATGTCTAGGAAGGATTTCAGGT
GAGAATTTGGATCCCCTGGTAGCGTTGCGACCCTACCGCCTGCCGCAGATTCTGGAATTGGAAAATTGCAAGTGTCGAGACACTATGGTCACAACGTCTCGACGCTGTGA
CCTGTTCGCGCCTATTTCAAAGATGGAATGTCAGCGTCGAGACGTGGTTGAGGTTAGTGCTGGTCCATTAGGTCTCACCGGTAGCTCACTAGGGGTGTTGAGCAAGAAGG
TTCTTAAAGGGTTGCTGAATTTTCTGCACTCCGTTGGTGAATTTGGCGAATCGATCAAGAACGTCTTCAAAGGTCTTGATCGTTTCCGCTGCGCAAGGGCTTTTATCCCT
TTAAGCTTAACTCCTAGTATAAAAACTATTGTTGATGCAGCTGCAGGTGGGACTCTATTGTCCAAAACTGTTGAGAATACTAAGACTTTGCTAGAGGAAATGACCACCAA
TAGCTATCAGTGGCCATCTGAGCGGTCGGGACCAAAAAAGATTGTTGCTGGACTGTTCGAGATTGATAATGTAAGTGCACTTCAGGCCCAAATGTCCTCCCTTGCTAATG
CGTTTCTGAAATTTTCAGGTACAAGGAGTGCTCAATCGATCGAGTCTGCTGCTGCCCTTGCATCCCAAACTCAGGAGGAAAATCTCGAGCAGAATGTGTTGAATCCTCCA
AGCTTCACTCCTAAGCAAGAAAGTAAACAGTCTTTAGAGGATCTCGTTGGAGCTTTTATTTCAGAATCAAGTAACAGGTCAAATAAGCTTGAGGAGGTTGTGATTGCCAT
AAACACTACTGTCAATGGTCATTCTGCATCCAACAAGAACATTGAGACTCAGCTAGGACAACTGGTAAGTGTTGTCAACACAATGAATAAAGGTAAAGCCCCAACTGAAC
AGGAGAAATCTACATTGGAGTATTGCAAGATCGTCACTATGCATCACGAGGAGGAGACTACTGAAATAGAGGAACTCACTGGAGAAGCTGAGGAGGACACCACCTCAAAC
GAAGCTGAAAAGCTTATACCTGAGCCCTCTATCCCTTCTCCTACTGTTTTAGTTCCTAAGACAAAGAATAAGAAAAAGAAAAACTCTAAGGCTCAGTTTGAAAAGTTTCT
TGATGCTTTTACAGGTTTGACTGTTAATATTCCTTTTGCAGATGCACTGGAGCAGATGCCTCACTACACGAAGTTCATGAAGGAATGGCTCATCAAGAAGAAAAATGAAA
AGCAGATGGAGACTGTATACCTTGCATCAACATGCAGTGCTCGTGTCCAACAGGAAGTACCAGAGAAATTGTCTGATCCAGGGAGTTTTACTATTCCTTATAGTTTTGAT
ATTAAAGAGAATCTTGCCATGCCTATTATATTAGGGAGACCATTCCTTGCTACTGGGAGGGTTATAATTGATATTGAGGGCAGGGAGCTAATCATTAGAGTCCAACAGGA
GAAGGAGATCCTAAAACCTTTTGAGGATCCTCAGAACACATCAGAGACCATGTTGGTGGGGTACAGGAGAGGTCTTGAGATCCCAGTTTCTTCTTTGCTTGCAAGCTTCC
ATAAAAGAGGACATCATCATTCGAGGACAATATTGTTCACTCCTCCTACTCTTATTGTTCTCATGCTTGATTATGCCTTATTTTCTTTATACATTGAGGTCAATGCATGT
TTTAAGTTTGGGGGTGGAATAGCATGCGGTTCAAGGCGGTTGGGCGGTCTGGTTCACACGGTCTGGCTGGACTGGAACCGATTTGGTCCGTTTCAGCTGGAATTTGACTC
GTTTCGTGGTTTTCGTAGATGGTTCGAAGTGGTTCGGGTCGGTTCGGACGATCTAAACCAGTATTTGGAGCTGCTGAAGGATTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCAGCCCGAAGGGTTCATAACCCAAGGTCAGGAGCAAAAGGTTTGCAAACTGAATCGATCCATTTATGGGTTGAAACAGGCCTCCAAATCTTGGAATATTAGATT
TGATACTGCGATCAAATCTTTTGGCTTTGACCAAAATGTTGATGAGCCTTGTGTATACAAGAAAATCAACAAAAATAAAGTAGCTTTTCTTTCTCCTAAGACACCTCAAG
AAGTTGAGGATATGAGACAGATTCCCTACGCCTCTACTGTGGGCGGCTTAATATACACCGACTCTGACCTCCAGTTTGACAAAGATTCTAGAAAATCCACGTCAGGGTCA
GTGTTCACCCTTAACGGGGGAGCTGTAGTTTGGAGAAGCATCGAGCAAAGATGCATAGCGGACTCAACCATGGAGGCTGAGTATGTAGCTGCTTGTGAAGCAGCTAAGAA
GACAGTTTGGCTAAGGAAGTTCTTGACTGATTTGGAAGTAGTTTCAAATATGAACTTGCCCATTACTTTATACTATTACAATAGTGGGGCTGTAGCCAATTCAAAGGAAC
CTCGCAGCCACAAACGAGATTGTCCTTTGATTTGTACGGGTGAGAGTGACCTGCACGACGACTCAATAAGCCATCATTTTGGGGACAAGACCGAGTGGGGAGCTGAGAAC
ATAATCACACAAGATGGAATTCACTCCTTCCCGACTCTAGGCAAGAAGGAAACGCTGGGGCTATTGTTTTTTGATAACACAAGGAGAATTTGGCGAAATTCTGTCAAGAA
CGACTATGTAATCGAGATAGCCATGTTGAGGACACTTCCTCGGCAGCTCTTTAAATCACTCCCAGGCCTTGAACAACTACTCATCAAACTGCTGTTGGAACGTCCCAATC
TCGGTCCTCAGCCTGACCGTCTAAGTAGGAGGGAAGAATTTCTTTTAAAACGCCTGGACCAAAGCATCCCAAGTAGTAATGCTCTCAGATGTCTAGGAAGGATTTCAGGT
GAGAATTTGGATCCCCTGGTAGCGTTGCGACCCTACCGCCTGCCGCAGATTCTGGAATTGGAAAATTGCAAGTGTCGAGACACTATGGTCACAACGTCTCGACGCTGTGA
CCTGTTCGCGCCTATTTCAAAGATGGAATGTCAGCGTCGAGACGTGGTTGAGGTTAGTGCTGGTCCATTAGGTCTCACCGGTAGCTCACTAGGGGTGTTGAGCAAGAAGG
TTCTTAAAGGGTTGCTGAATTTTCTGCACTCCGTTGGTGAATTTGGCGAATCGATCAAGAACGTCTTCAAAGGTCTTGATCGTTTCCGCTGCGCAAGGGCTTTTATCCCT
TTAAGCTTAACTCCTAGTATAAAAACTATTGTTGATGCAGCTGCAGGTGGGACTCTATTGTCCAAAACTGTTGAGAATACTAAGACTTTGCTAGAGGAAATGACCACCAA
TAGCTATCAGTGGCCATCTGAGCGGTCGGGACCAAAAAAGATTGTTGCTGGACTGTTCGAGATTGATAATGTAAGTGCACTTCAGGCCCAAATGTCCTCCCTTGCTAATG
CGTTTCTGAAATTTTCAGGTACAAGGAGTGCTCAATCGATCGAGTCTGCTGCTGCCCTTGCATCCCAAACTCAGGAGGAAAATCTCGAGCAGAATGTGTTGAATCCTCCA
AGCTTCACTCCTAAGCAAGAAAGTAAACAGTCTTTAGAGGATCTCGTTGGAGCTTTTATTTCAGAATCAAGTAACAGGTCAAATAAGCTTGAGGAGGTTGTGATTGCCAT
AAACACTACTGTCAATGGTCATTCTGCATCCAACAAGAACATTGAGACTCAGCTAGGACAACTGGTAAGTGTTGTCAACACAATGAATAAAGGTAAAGCCCCAACTGAAC
AGGAGAAATCTACATTGGAGTATTGCAAGATCGTCACTATGCATCACGAGGAGGAGACTACTGAAATAGAGGAACTCACTGGAGAAGCTGAGGAGGACACCACCTCAAAC
GAAGCTGAAAAGCTTATACCTGAGCCCTCTATCCCTTCTCCTACTGTTTTAGTTCCTAAGACAAAGAATAAGAAAAAGAAAAACTCTAAGGCTCAGTTTGAAAAGTTTCT
TGATGCTTTTACAGGTTTGACTGTTAATATTCCTTTTGCAGATGCACTGGAGCAGATGCCTCACTACACGAAGTTCATGAAGGAATGGCTCATCAAGAAGAAAAATGAAA
AGCAGATGGAGACTGTATACCTTGCATCAACATGCAGTGCTCGTGTCCAACAGGAAGTACCAGAGAAATTGTCTGATCCAGGGAGTTTTACTATTCCTTATAGTTTTGAT
ATTAAAGAGAATCTTGCCATGCCTATTATATTAGGGAGACCATTCCTTGCTACTGGGAGGGTTATAATTGATATTGAGGGCAGGGAGCTAATCATTAGAGTCCAACAGGA
GAAGGAGATCCTAAAACCTTTTGAGGATCCTCAGAACACATCAGAGACCATGTTGGTGGGGTACAGGAGAGGTCTTGAGATCCCAGTTTCTTCTTTGCTTGCAAGCTTCC
ATAAAAGAGGACATCATCATTCGAGGACAATATTGTTCACTCCTCCTACTCTTATTGTTCTCATGCTTGATTATGCCTTATTTTCTTTATACATTGAGGTCAATGCATGT
TTTAAGTTTGGGGGTGGAATAGCATGCGGTTCAAGGCGGTTGGGCGGTCTGGTTCACACGGTCTGGCTGGACTGGAACCGATTTGGTCCGTTTCAGCTGGAATTTGACTC
GTTTCGTGGTTTTCGTAGATGGTTCGAAGTGGTTCGGGTCGGTTCGGACGATCTAAACCAGTATTTGGAGCTGCTGAAGGATTTTTAG
Protein sequenceShow/hide protein sequence
MSQPEGFITQGQEQKVCKLNRSIYGLKQASKSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLSPKTPQEVEDMRQIPYASTVGGLIYTDSDLQFDKDSRKSTSGS
VFTLNGGAVVWRSIEQRCIADSTMEAEYVAACEAAKKTVWLRKFLTDLEVVSNMNLPITLYYYNSGAVANSKEPRSHKRDCPLICTGESDLHDDSISHHFGDKTEWGAEN
IITQDGIHSFPTLGKKETLGLLFFDNTRRIWRNSVKNDYVIEIAMLRTLPRQLFKSLPGLEQLLIKLLLERPNLGPQPDRLSRREEFLLKRLDQSIPSSNALRCLGRISG
ENLDPLVALRPYRLPQILELENCKCRDTMVTTSRRCDLFAPISKMECQRRDVVEVSAGPLGLTGSSLGVLSKKVLKGLLNFLHSVGEFGESIKNVFKGLDRFRCARAFIP
LSLTPSIKTIVDAAAGGTLLSKTVENTKTLLEEMTTNSYQWPSERSGPKKIVAGLFEIDNVSALQAQMSSLANAFLKFSGTRSAQSIESAAALASQTQEENLEQNVLNPP
SFTPKQESKQSLEDLVGAFISESSNRSNKLEEVVIAINTTVNGHSASNKNIETQLGQLVSVVNTMNKGKAPTEQEKSTLEYCKIVTMHHEEETTEIEELTGEAEEDTTSN
EAEKLIPEPSIPSPTVLVPKTKNKKKKNSKAQFEKFLDAFTGLTVNIPFADALEQMPHYTKFMKEWLIKKKNEKQMETVYLASTCSARVQQEVPEKLSDPGSFTIPYSFD
IKENLAMPIILGRPFLATGRVIIDIEGRELIIRVQQEKEILKPFEDPQNTSETMLVGYRRGLEIPVSSLLASFHKRGHHHSRTILFTPPTLIVLMLDYALFSLYIEVNAC
FKFGGGIACGSRRLGGLVHTVWLDWNRFGPFQLEFDSFRGFRRWFEVVRVGSDDLNQYLELLKDF