; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021130 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021130
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr7:4907323..4909553
RNA-Seq ExpressionLag0021130
SyntenyLag0021130
Gene Ontology termsGO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]3.3e-5042.59Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V +K+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ EDL IY +NGL +
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFSSSPSFDG
         YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N   + +++S       +    F+ N   G G G ++         HG+FS      G
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFSSSPSFDG

Query:  ------SRP--PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN
               +P   N  TCQI  + GH ALDC+N MN         + F  RHPP +LAAM   A  NN    F+S V  S     L+D+GCN H+T ++  
Subjt:  ------SRP--PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN

Query:  LSVSNAYNGEENITVGN
        +S++  YNGEE + VGN
Subjt:  LSVSNAYNGEENITVGN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]7.3e-5042.59Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V +K+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ EDL IY +NGL +
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF
         YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N   + +++S       +    FD N   G G G  +         HG+FS  +    
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF

Query:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN
         GS P       N  TCQI  + GH ALDC+N MN         + F  RHPP +LAAM   A  NN    F+S V  S     L+D+GCN  +T ++  
Subjt:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN

Query:  LSVSNAYNGEENITVGN
        +S++  YNGEE + +GN
Subjt:  LSVSNAYNGEENITVGN

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]7.3e-5042.59Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V +K+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ EDL IY +NGL +
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF
         YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N   + +++S       +    FD N   G G G  +         HG+FS  +    
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF

Query:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN
         GS P       N  TCQI  + GH ALDC+N MN         + F  RHPP +LAAM   A  NN    F+S V  S     L+D+GCN  +T ++  
Subjt:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN

Query:  LSVSNAYNGEENITVGN
        +S++  YNGEE + +GN
Subjt:  LSVSNAYNGEENITVGN

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]3.3e-5042.59Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V +K+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ EDL IY +NGL +
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFSSSPSFDG
         YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N   + +++S       +    F+ N   G G G ++         HG+FS      G
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFSSSPSFDG

Query:  ------SRP--PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN
               +P   N  TCQI  + GH ALDC+N MN         + F  RHPP +LAAM   A  NN    F+S V  S     L+D+GCN H+T ++  
Subjt:  ------SRP--PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN

Query:  LSVSNAYNGEENITVGN
        +S++  YNGEE + VGN
Subjt:  LSVSNAYNGEENITVGN

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]4.4e-5544.44Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+TLINATL+  AL+YV+   TSK+VW+ LEKH+SS+++TN+V +K+ LQS+ KK+ ESID YV+RIKEI +K   VS+ I+ E L IY +NGLS+
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNW---RGRFDGNRNGGRGRGTSFSSQITTPSSHGQFSSSPS
         YN   TS+ TR+Q+++F ELH+ MK+EE+A+E+Q+K +++    +   A+S     R +        D  R    GRG +  +   T    G+ SS   
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNW---RGRFDGNRNGGRGRGTSFSSQITTPSSHGQFSSSPS

Query:  FDGSRPPNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSN
        F   +  N+  CQI  K GH ALDCYN MN         F F  RHPP +LAAM     ++ LA   V N +P+    WL+D+ CN H+T +L+NLS+++
Subjt:  FDGSRPPNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSN

Query:  A---YNGEENITVGN
            YNGEENI+VG+
Subjt:  A---YNGEENITVGN

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X23.5e-5042.59Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V +K+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ EDL IY +NGL +
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF
         YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N   + +++S       +    FD N   G G G  +         HG+FS  +    
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF

Query:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN
         GS P       N  TCQI  + GH ALDC+N MN         + F  RHPP +LAAM   A  NN    F+S V  S     L+D+GCN  +T ++  
Subjt:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN

Query:  LSVSNAYNGEENITVGN
        +S++  YNGEE + +GN
Subjt:  LSVSNAYNGEENITVGN

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X33.5e-5042.59Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V +K+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ EDL IY +NGL +
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF
         YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N   + +++S       +    FD N   G G G  +         HG+FS  +    
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF

Query:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN
         GS P       N  TCQI  + GH ALDC+N MN         + F  RHPP +LAAM   A  NN    F+S V  S     L+D+GCN  +T ++  
Subjt:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN

Query:  LSVSNAYNGEENITVGN
        +S++  YNGEE + +GN
Subjt:  LSVSNAYNGEENITVGN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X13.5e-5042.59Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V +K+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ EDL IY +NGL +
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF
         YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N   + +++S       +    FD N   G G G  +         HG+FS  +    
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF

Query:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN
         GS P       N  TCQI  + GH ALDC+N MN         + F  RHPP +LAAM   A  NN    F+S V  S     L+D+GCN  +T ++  
Subjt:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN

Query:  LSVSNAYNGEENITVGN
        +S++  YNGEE + +GN
Subjt:  LSVSNAYNGEENITVGN

A0A5D3CLI6 T4.53.5e-5042.59Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+T+INATL+  AL+YV+G  +SK+VWD L K +SS +++N+V +K+ LQ++ KK  ESID Y++RIKEI +KL  VS  I+ EDL IY +NGL +
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF
         YN F+TS+ TRSQ +TF ELH+L++ EE+AL +Q K D+  N   + +++S       +    FD N   G G G  +         HG+FS  +    
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITTPSSHGQFS--SSPSF

Query:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN
         GS P       N  TCQI  + GH ALDC+N MN         + F  RHPP +LAAM   A  NN    F+S V  S     L+D+GCN  +T ++  
Subjt:  DGSRP------PNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLAN

Query:  LSVSNAYNGEENITVGN
        +S++  YNGEE + +GN
Subjt:  LSVSNAYNGEENITVGN

A0A6J1D9L6 uncharacterized protein LOC1110188922.1e-5544.44Show/hide
Query:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS
        +DQAL+TLINATL+  AL+YV+   TSK+VW+ LEKH+SS+++TN+V +K+ LQS+ KK+ ESID YV+RIKEI +K   VS+ I+ E L IY +NGLS+
Subjt:  RDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIVNKLVAVSVVIDAEDLNIYTINGLSS

Query:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNW---RGRFDGNRNGGRGRGTSFSSQITTPSSHGQFSSSPS
         YN   TS+ TR+Q+++F ELH+ MK+EE+A+E+Q+K +++    +   A+S     R +        D  R    GRG +  +   T    G+ SS   
Subjt:  AYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNW---RGRFDGNRNGGRGRGTSFSSQITTPSSHGQFSSSPS

Query:  FDGSRPPNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSN
        F   +  N+  CQI  K GH ALDCYN MN         F F  RHPP +LAAM     ++ LA   V N +P+    WL+D+ CN H+T +L+NLS+++
Subjt:  FDGSRPPNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLANLSVSN

Query:  A---YNGEENITVGN
            YNGEENI+VG+
Subjt:  A---YNGEENITVGN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCGAGA
TCAAGCGTTAATTACACTGATCAATGCAACCTTGACGCAAACCGCCCTCTCTTATGTAATCGGTTGTCAAACCTCCAAGGAAGTTTGGGATCGACTTGAGAAGCACTTCT
CCTCTTCGACTCAAACGAACATTGTAGGTGTGAAGACAAAGTTGCAAAGTGTTTCAAAGAAATCCGGTGAGTCAATTGATGTTTATGTTCGAAGAATCAAGGAAATTGTG
AACAAATTGGTTGCGGTGTCGGTAGTTATTGATGCTGAAGATCTCAATATTTACACTATTAATGGCCTTTCATCTGCTTACAATGTTTTCAAGACCTCTTTATGCACCAG
ATCTCAAGCCCTAACATTTGTCGAGTTACATATCCTGATGAAGACTGAGGAGACTGCACTTGAACAACAAATAAAAGCTGATGAAATCCCGAATAATTCTCATTTGGCCA
TGGCGGCTAGTTTTGATTTTGGTGGCAGAGGAAATTGGAGAGGTCGTTTTGATGGAAATCGAAATGGAGGTAGAGGTCGTGGTACTTCTTTTTCCTCTCAGATTACTACT
CCTTCATCTCATGGTCAGTTTTCTTCCTCTCCTTCATTTGATGGCAGTCGTCCGCCGAACAAAGTTACCTGTCAAATTTTTCAGAAATACGGCCACAATGCCCTAGACTG
TTATAACATAATGAATCACCTTTCGTATACTCCACTCGTTCCCTTCGCCTTCGTAAGCAGACATCCACCAGCTAAGCTTGCAGCAATGGCTACCTTTGCTCCGTCAAACA
ATTTGGCTCATAATTTTGTTTCTAATGTTGCACCCTCTGACTCACATGTTTGGCTTTCTGACACAGGCTGCAATGCACATTTGACTCATAATCTTGCCAATTTGAGTGTG
TCCAATGCATACAATGGGGAAGAGAACATCACTGTTGGTAATGAAATGAGATCTGCCTCCATTTTCCACTGCACCCGAGGTTGGGCATGTCAAAAGGACCCGTATTGGTC
TATGGGGACAATCTTTGTTCATAGAAGCAGTTCAAGACTATTCAACCCAAAAGGCCTCATACCAATGGAGATAGTTGTCCCTCCAATTGGCGACCGGCGAGGTGAAGAGA
TCGAAATTGCAATTGTAAGCAAAAGACTTGGTGTTGTCTCTTGTATAGTATTCTTTCTTCACTTCCCCGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCGAGA
TCAAGCGTTAATTACACTGATCAATGCAACCTTGACGCAAACCGCCCTCTCTTATGTAATCGGTTGTCAAACCTCCAAGGAAGTTTGGGATCGACTTGAGAAGCACTTCT
CCTCTTCGACTCAAACGAACATTGTAGGTGTGAAGACAAAGTTGCAAAGTGTTTCAAAGAAATCCGGTGAGTCAATTGATGTTTATGTTCGAAGAATCAAGGAAATTGTG
AACAAATTGGTTGCGGTGTCGGTAGTTATTGATGCTGAAGATCTCAATATTTACACTATTAATGGCCTTTCATCTGCTTACAATGTTTTCAAGACCTCTTTATGCACCAG
ATCTCAAGCCCTAACATTTGTCGAGTTACATATCCTGATGAAGACTGAGGAGACTGCACTTGAACAACAAATAAAAGCTGATGAAATCCCGAATAATTCTCATTTGGCCA
TGGCGGCTAGTTTTGATTTTGGTGGCAGAGGAAATTGGAGAGGTCGTTTTGATGGAAATCGAAATGGAGGTAGAGGTCGTGGTACTTCTTTTTCCTCTCAGATTACTACT
CCTTCATCTCATGGTCAGTTTTCTTCCTCTCCTTCATTTGATGGCAGTCGTCCGCCGAACAAAGTTACCTGTCAAATTTTTCAGAAATACGGCCACAATGCCCTAGACTG
TTATAACATAATGAATCACCTTTCGTATACTCCACTCGTTCCCTTCGCCTTCGTAAGCAGACATCCACCAGCTAAGCTTGCAGCAATGGCTACCTTTGCTCCGTCAAACA
ATTTGGCTCATAATTTTGTTTCTAATGTTGCACCCTCTGACTCACATGTTTGGCTTTCTGACACAGGCTGCAATGCACATTTGACTCATAATCTTGCCAATTTGAGTGTG
TCCAATGCATACAATGGGGAAGAGAACATCACTGTTGGTAATGAAATGAGATCTGCCTCCATTTTCCACTGCACCCGAGGTTGGGCATGTCAAAAGGACCCGTATTGGTC
TATGGGGACAATCTTTGTTCATAGAAGCAGTTCAAGACTATTCAACCCAAAAGGCCTCATACCAATGGAGATAGTTGTCCCTCCAATTGGCGACCGGCGAGGTGAAGAGA
TCGAAATTGCAATTGTAAGCAAAAGACTTGGTGTTGTCTCTTGTATAGTATTCTTTCTTCACTTCCCCGTCTAG
Protein sequenceShow/hide protein sequence
MXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDQALITLINATLTQTALSYVIGCQTSKEVWDRLEKHFSSSTQTNIVGVKTKLQSVSKKSGESIDVYVRRIKEIV
NKLVAVSVVIDAEDLNIYTINGLSSAYNVFKTSLCTRSQALTFVELHILMKTEETALEQQIKADEIPNNSHLAMAASFDFGGRGNWRGRFDGNRNGGRGRGTSFSSQITT
PSSHGQFSSSPSFDGSRPPNKVTCQIFQKYGHNALDCYNIMNHLSYTPLVPFAFVSRHPPAKLAAMATFAPSNNLAHNFVSNVAPSDSHVWLSDTGCNAHLTHNLANLSV
SNAYNGEENITVGNEMRSASIFHCTRGWACQKDPYWSMGTIFVHRSSSRLFNPKGLIPMEIVVPPIGDRRGEEIEIAIVSKRLGVVSCIVFFLHFPV