; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018722 (gene) of Snake gourd v1 genome

Gene IDTan0018722
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG01:67379237..67381065
RNA-Seq ExpressionTan0018722
SyntenyTan0018722
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035138.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.6e-7040.58Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRP-------------------------------------------
        M PRT RRRRQ+Q G Q PTQG S   SS   V+  A + Q   +++   R   + P                                           
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRP-------------------------------------------

Query:  ---------------------------TRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRF
                                   TRRSDAR LDW TFR IFEDKYYP TY EAKRDEFL LKQGS SVAEYE+KYTELSRYADVIVASESDRCRRF
Subjt:  ---------------------------TRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRF

Query:  ERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAH
        ERGLR EIRTPVTAIAKWTNFSQLVETALRVEQSI E                               + RQDFKNR+GGQ SR ++    +QRQSQR  
Subjt:  ERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAH

Query:  SH-ARSTTRPQTMQESVA----------------------------------------------------------------------------------
        S   RST R Q  QES+A                                                                                  
Subjt:  SH-ARSTTRPQTMQESVA----------------------------------------------------------------------------------

Query:  -----------------------------MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGC
                                     +DLLPLELQ LD ILGMDFLF HYA+MDCHRKEVVF+KPG                L+S  KA+KLLRKGC
Subjt:  -----------------------------MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGC

Query:  TTFLAHVVEVQEEKL
        T FLAHVV VQ EKL
Subjt:  TTFLAHVVEVQEEKL

KAA0041132.1 hypothetical protein E6C27_scaffold128G00450 [Cucumis melo var. makuwa]3.2e-7142.86Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSF
        MPPRT RRRRQ+Q G Q PTQ              EA     +I +            RRSDAR LDW TFRGIFEDKYYP TY EA+RDEFL LKQGS 
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSF

Query:  SVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AH
        SVAEYERKYTELSRYADV VASESDRCRRFERGL  EIRTPVTAIAKWTNFSQLVETALRVEQSI E                               + 
Subjt:  SVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AH

Query:  RQDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA----------------------------------------------------
         QDFKNR+GGQ SR ++    +QRQS R  S   RST R Q  QES+A                                                    
Subjt:  RQDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA----------------------------------------------------

Query:  -----------------------------------------------------------------------------------MDLLPLELQVLDAILGM
                                                                                           +DLLPLELQ LD ILGM
Subjt:  -----------------------------------------------------------------------------------MDLLPLELQVLDAILGM

Query:  DFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKL
        DFLF HYA+MDCHRKEVVFRKPG                L+S  KA+KLLRKGCTTFLAH+V VQ EKL
Subjt:  DFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKL

KAA0051348.1 reverse transcriptase [Cucumis melo var. makuwa]6.7e-6939.39Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPT------------------------------------------
        MPPRT RRRRQ+Q G Q PTQG S   SS   V+  A + Q A +++   R   + P+                                          
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPT------------------------------------------

Query:  ----------------------------RRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRF
                                    RRSDA  LDW TFRGIFEDKYYP TY EAKRDEFL LKQGS SVAEYERKYTELSRYADVIVASESDRCRRF
Subjt:  ----------------------------RRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRF

Query:  ERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAH
        ERGLR EIRTPVTAIAKWTNFSQLVETALRVEQSI E                               +   DFKNR+GGQ SR ++    +QRQSQR  
Subjt:  ERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAH

Query:  SH-ARSTTRPQTMQESVA----------------------------------------------------------------------------------
        S   RST R Q  QES+                                                                                   
Subjt:  SH-ARSTTRPQTMQESVA----------------------------------------------------------------------------------

Query:  ------------------------------------------MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LV
                                                  +DLLPLELQ LD ILGMDFLF HYA+MDCHRKEVVFRKPG                L+
Subjt:  ------------------------------------------MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LV

Query:  SAFKAQKLLRKGCTTFLAHVVEVQEEKL
        S  KA+KLLRKGCT FLAH+V VQ EKL
Subjt:  SAFKAQKLLRKGCTTFLAHVVEVQEEKL

KAA0051980.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]4.5e-7341.73Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTR------------------------------RSDARTLDWHT
        MPPRTSRRRRQ+Q   QDPTQGQSE+GSS PR Q EA   +   S++   R+  +RP+                                +DARTLDW T
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTR------------------------------RSDARTLDWHT

Query:  FRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAEAHR
        FRGIFE+KYYP T  EAKRDEFLELKQGS SVA+Y+RKYTELS YA+VI+ASESDRCRRFERGL  EIRTPVTAIAKWT+FSQL+ETALRVEQSI E   
Subjt:  FRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAEAHR

Query:  -------------------------------QDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA----------------------
                                       QDFK R GG+  RQM+   AYQRQSQRA S    S  RP+T QESVA                      
Subjt:  -------------------------------QDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKLDKRRI
             +DLLPLELQ  D ILGMDFLFTHYA+M+ HRKEV FRKPG                L+ A KA KLLRKG T FLAHVVE+QEEKL  + +
Subjt:  -----MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKLDKRRI

KAA0056288.1 uncharacterized protein E6C27_scaffold226G00600 [Cucumis melo var. makuwa]7.0e-7443.51Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAE--------------------------------SRPTRRSDARTLDW
        MPPRT RRRRQ+Q G Q PTQG S   S+  RV+    + Q A ++    R  E                                S   RRSDAR L+W
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAE--------------------------------SRPTRRSDARTLDW

Query:  HTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-
         TFRGIFEDKYYP TY EAKRDEFL LKQGS SVAEYERKYTELSRYADVIVASESDRCRRFERGLR EIRTPVTAIAKWTNFSQLVETAL VEQSI E 
Subjt:  HTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-

Query:  ------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA--------------------
                                      + RQDFKNR+ GQ SR ++    +Q+QSQR  +   RST R Q  QESVA                    
Subjt:  ------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA--------------------

Query:  --------------------------------------------------------------------------------------------MDLLPLEL
                                                                                                    +DLLPLEL
Subjt:  --------------------------------------------------------------------------------------------MDLLPLEL

Query:  QVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKL
        Q LD ILGMDFLF HYA+MDCHRKEVVFRKPG                L+S  KA+KLLRKGCTTFLAH+V VQ E L
Subjt:  QVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKL

TrEMBL top hitse value%identityAlignment
A0A5A7SX06 DNA/RNA polymerases superfamily protein1.7e-7040.58Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRP-------------------------------------------
        M PRT RRRRQ+Q G Q PTQG S   SS   V+  A + Q   +++   R   + P                                           
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRP-------------------------------------------

Query:  ---------------------------TRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRF
                                   TRRSDAR LDW TFR IFEDKYYP TY EAKRDEFL LKQGS SVAEYE+KYTELSRYADVIVASESDRCRRF
Subjt:  ---------------------------TRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRF

Query:  ERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAH
        ERGLR EIRTPVTAIAKWTNFSQLVETALRVEQSI E                               + RQDFKNR+GGQ SR ++    +QRQSQR  
Subjt:  ERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAH

Query:  SH-ARSTTRPQTMQESVA----------------------------------------------------------------------------------
        S   RST R Q  QES+A                                                                                  
Subjt:  SH-ARSTTRPQTMQESVA----------------------------------------------------------------------------------

Query:  -----------------------------MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGC
                                     +DLLPLELQ LD ILGMDFLF HYA+MDCHRKEVVF+KPG                L+S  KA+KLLRKGC
Subjt:  -----------------------------MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGC

Query:  TTFLAHVVEVQEEKL
        T FLAHVV VQ EKL
Subjt:  TTFLAHVVEVQEEKL

A0A5A7TCD9 Reverse transcriptase1.6e-7142.86Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSF
        MPPRT RRRRQ+Q G Q PTQ              EA     +I +            RRSDAR LDW TFRGIFEDKYYP TY EA+RDEFL LKQGS 
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSF

Query:  SVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AH
        SVAEYERKYTELSRYADV VASESDRCRRFERGL  EIRTPVTAIAKWTNFSQLVETALRVEQSI E                               + 
Subjt:  SVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-------------------------------AH

Query:  RQDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA----------------------------------------------------
         QDFKNR+GGQ SR ++    +QRQS R  S   RST R Q  QES+A                                                    
Subjt:  RQDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA----------------------------------------------------

Query:  -----------------------------------------------------------------------------------MDLLPLELQVLDAILGM
                                                                                           +DLLPLELQ LD ILGM
Subjt:  -----------------------------------------------------------------------------------MDLLPLELQVLDAILGM

Query:  DFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKL
        DFLF HYA+MDCHRKEVVFRKPG                L+S  KA+KLLRKGCTTFLAH+V VQ EKL
Subjt:  DFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKL

A0A5A7U9X4 DNA/RNA polymerases superfamily protein2.2e-7341.73Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTR------------------------------RSDARTLDWHT
        MPPRTSRRRRQ+Q   QDPTQGQSE+GSS PR Q EA   +   S++   R+  +RP+                                +DARTLDW T
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTR------------------------------RSDARTLDWHT

Query:  FRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAEAHR
        FRGIFE+KYYP T  EAKRDEFLELKQGS SVA+Y+RKYTELS YA+VI+ASESDRCRRFERGL  EIRTPVTAIAKWT+FSQL+ETALRVEQSI E   
Subjt:  FRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAEAHR

Query:  -------------------------------QDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA----------------------
                                       QDFK R GG+  RQM+   AYQRQSQRA S    S  RP+T QESVA                      
Subjt:  -------------------------------QDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKLDKRRI
             +DLLPLELQ  D ILGMDFLFTHYA+M+ HRKEV FRKPG                L+ A KA KLLRKG T FLAHVVE+QEEKL  + +
Subjt:  -----MDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKLDKRRI

A0A5A7V0A7 DNA/RNA polymerases superfamily protein5.6e-6954.9Show/hide
Query:  RRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETA
        RRSDAR LDW TFRGIFEDKYYP TY EAKRDEFL LKQGS SVAEYERKYTELSRYADVIVASESDRCRRFERGLR EIRTP TAIAKWTNFSQLVE+A
Subjt:  RRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETA

Query:  LRVEQSIAEAHRQDFKNRAGGQTSRQMNISGAYQRQS----QRAHSHAR---------STTRPQTMQE--------------------------------
        LRVEQSI E       +R    TS      G  QR+S         HAR            R QT+++                                
Subjt:  LRVEQSIAEAHRQDFKNRAGGQTSRQMNISGAYQRQS----QRAHSHAR---------STTRPQTMQE--------------------------------

Query:  ------------------SVAMDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVV
                          S+ +DLLPLELQ LD IL MDFLF HYA+M+CHRKEVVFRKPG                L+S  KA+KLLRKGCTTFLAH+V
Subjt:  ------------------SVAMDLLPLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVV

Query:  EVQEEK
         VQ EK
Subjt:  EVQEEK

A0A5D3BZQ5 Retrotrans_gag domain-containing protein3.4e-7443.51Show/hide
Query:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAE--------------------------------SRPTRRSDARTLDW
        MPPRT RRRRQ+Q G Q PTQG S   S+  RV+    + Q A ++    R  E                                S   RRSDAR L+W
Subjt:  MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAE--------------------------------SRPTRRSDARTLDW

Query:  HTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-
         TFRGIFEDKYYP TY EAKRDEFL LKQGS SVAEYERKYTELSRYADVIVASESDRCRRFERGLR EIRTPVTAIAKWTNFSQLVETAL VEQSI E 
Subjt:  HTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYTELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAE-

Query:  ------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA--------------------
                                      + RQDFKNR+ GQ SR ++    +Q+QSQR  +   RST R Q  QESVA                    
Subjt:  ------------------------------AHRQDFKNRAGGQTSRQMNISGAYQRQSQRAHSH-ARSTTRPQTMQESVA--------------------

Query:  --------------------------------------------------------------------------------------------MDLLPLEL
                                                                                                    +DLLPLEL
Subjt:  --------------------------------------------------------------------------------------------MDLLPLEL

Query:  QVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKL
        Q LD ILGMDFLF HYA+MDCHRKEVVFRKPG                L+S  KA+KLLRKGCTTFLAH+V VQ E L
Subjt:  QVLDAILGMDFLFTHYATMDCHRKEVVFRKPG----------------LVSAFKAQKLLRKGCTTFLAHVVEVQEEKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCACGTACTAGCAGACGACGGAGGCAGGATCAGGGCGGGACACAGGATCCTACCCAAGGGCAATCTGAGAAGGGATCTAGTGCCCCGAGAGTCCAAACTGAGGC
CAGAGATCACCAGCATGCCATTTCCTCACGAGGAGGTAGGCGCACTGCAGAGAGCAGGCCAACCAGACGGAGTGATGCACGTACTCTGGATTGGCACACATTTAGAGGCA
TATTTGAAGATAAATATTACCCTGGCACGTACCGCGAAGCGAAGAGAGATGAATTTTTAGAATTAAAGCAAGGATCATTTTCAGTGGCTGAATACGAGAGAAAGTATACT
GAGTTGTCGCGGTATGCTGATGTAATTGTGGCATCTGAGAGTGATAGGTGTCGAAGATTTGAAAGAGGGTTACGACCCGAGATACGTACCCCAGTTACAGCCATTGCTAA
GTGGACTAATTTTTCTCAGTTGGTAGAGACTGCTCTTCGTGTTGAGCAGAGTATAGCAGAGGCGCATCGTCAGGACTTTAAGAATCGAGCTGGCGGCCAAACATCGAGGC
AGATGAATATTAGTGGTGCCTATCAGAGGCAAAGTCAAAGAGCACATAGTCATGCCAGATCCACAACAAGACCACAGACAATGCAGGAGTCTGTTGCCATGGATCTTCTT
CCACTAGAGTTGCAAGTGTTAGATGCAATCTTAGGGATGGATTTCTTATTCACTCATTATGCTACCATGGATTGCCATAGAAAGGAAGTTGTTTTTAGAAAACCAGGTTT
GGTTTCAGCATTCAAAGCTCAAAAGTTGTTGAGAAAAGGTTGCACGACGTTTCTTGCACATGTAGTGGAGGTGCAGGAAGAAAAGCTTGATAAGCGCCGAATTTGTTGGT
TTAACAAAGCTAGAGGTCCTTCTAGGATCTTTGCAAACGTCATGAACAAGGCTAACTTAACGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCACCACGTACTAGCAGACGACGGAGGCAGGATCAGGGCGGGACACAGGATCCTACCCAAGGGCAATCTGAGAAGGGATCTAGTGCCCCGAGAGTCCAAACTGAGGC
CAGAGATCACCAGCATGCCATTTCCTCACGAGGAGGTAGGCGCACTGCAGAGAGCAGGCCAACCAGACGGAGTGATGCACGTACTCTGGATTGGCACACATTTAGAGGCA
TATTTGAAGATAAATATTACCCTGGCACGTACCGCGAAGCGAAGAGAGATGAATTTTTAGAATTAAAGCAAGGATCATTTTCAGTGGCTGAATACGAGAGAAAGTATACT
GAGTTGTCGCGGTATGCTGATGTAATTGTGGCATCTGAGAGTGATAGGTGTCGAAGATTTGAAAGAGGGTTACGACCCGAGATACGTACCCCAGTTACAGCCATTGCTAA
GTGGACTAATTTTTCTCAGTTGGTAGAGACTGCTCTTCGTGTTGAGCAGAGTATAGCAGAGGCGCATCGTCAGGACTTTAAGAATCGAGCTGGCGGCCAAACATCGAGGC
AGATGAATATTAGTGGTGCCTATCAGAGGCAAAGTCAAAGAGCACATAGTCATGCCAGATCCACAACAAGACCACAGACAATGCAGGAGTCTGTTGCCATGGATCTTCTT
CCACTAGAGTTGCAAGTGTTAGATGCAATCTTAGGGATGGATTTCTTATTCACTCATTATGCTACCATGGATTGCCATAGAAAGGAAGTTGTTTTTAGAAAACCAGGTTT
GGTTTCAGCATTCAAAGCTCAAAAGTTGTTGAGAAAAGGTTGCACGACGTTTCTTGCACATGTAGTGGAGGTGCAGGAAGAAAAGCTTGATAAGCGCCGAATTTGTTGGT
TTAACAAAGCTAGAGGTCCTTCTAGGATCTTTGCAAACGTCATGAACAAGGCTAACTTAACGAATTGA
Protein sequenceShow/hide protein sequence
MPPRTSRRRRQDQGGTQDPTQGQSEKGSSAPRVQTEARDHQHAISSRGGRRTAESRPTRRSDARTLDWHTFRGIFEDKYYPGTYREAKRDEFLELKQGSFSVAEYERKYT
ELSRYADVIVASESDRCRRFERGLRPEIRTPVTAIAKWTNFSQLVETALRVEQSIAEAHRQDFKNRAGGQTSRQMNISGAYQRQSQRAHSHARSTTRPQTMQESVAMDLL
PLELQVLDAILGMDFLFTHYATMDCHRKEVVFRKPGLVSAFKAQKLLRKGCTTFLAHVVEVQEEKLDKRRICWFNKARGPSRIFANVMNKANLTN