CuGenDBv2

Lag0002093 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0002093
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:39205759..39208766
RNA-Seq Expression: Lag0002093
Synteny: Lag0002093
Gene Ontology terms: NA
InterPro domains: NA
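
The genome location in the summary uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split and the genomic span computed as follows. `parse_location` is a hypothetical helper written for illustration, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr4:39205759..39208766")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # prints: chr4 39205759 39208766 3008
```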


Homology
GenBank top hits | e value | % identity | Alignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | 6.5e-62 | 43.68
Query:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL
        +L   E    + G++++R G S++HL FADDS+LF +A     +A++ + + YE  SGQ +N+ KS  + SPN   +  D + GVL V V  CH+ YLGL
Subjt:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL

Query:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM
        P    + R    + +KD++W+ + GWK KL S  G+E+L+K+V+QAIP YSM+CFR+PK +  E+N  MARFWW+  ++ + IHWV W+ LC+ K   G+
Subjt:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM

Query:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR-------ADVFVGPDPFIIFYGLNHG
        GFRDLE FNQALLAKQ WR+L+ P+S++AR+ + R        +  VG +P  I+  L  G
Subjt:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR-------ADVFVGPDPFIIFYGLNHG

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa] | 1.0e-62 | 50.43
Query:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL
        +L   E    + GL +SR   S+SHLFFADDSLLF +A      A++  L  Y RASGQ +N DKS ++FSPNT  +VQ+S   +L + +  CH+ YLGL
Subjt:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL

Query:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM
        PA+  R +S     IK+++W+ M  W  K+FS GG+EVL+K+VVQ+IP Y+M+CFRLP K+  EI   MA+FWW    +++KIHW  W+ LC+ K   GM
Subjt:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM

Query:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKG
        GFR    FNQALLAKQ WR+ Q+P S+L+RVLKG
Subjt:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKG

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa] | 1.3e-62 | 50.21
Query:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL
        +L   E+   + GL +SR   ++SHL FADDSLLF +A      A++  L  Y RASGQ +N DKS ++FSPNT   VQ+S   +L + +  CH+ YLGL
Subjt:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL

Query:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM
        PA+  R +S     IK+++W+ M  W  K+FS GG+EVL+K+VVQ+IP Y+M+CFRL KK   E+   MARFWW    +++KIHW  WK LC+ K   GM
Subjt:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM

Query:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR
        GFR    FNQALLAKQ WR+ Q P+S+L+RVLKGR
Subjt:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata] | 7.6e-63 | 49.79
Query:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL
        ++   E+   I G+ I R G  +SHLFFADDS+LF RAR+ E Q + D+L  YE+ SGQ +N +K+ I FS NT   +Q  +  +L V     ++ YLGL
Subjt:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL

Query:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM
        PA + R++  S  +IK+RVWRK+QGWK KL S  GREVLIKSV+QAIP Y+M+CF+LPK ++ E+   + +FWW    E++K+HWV W+ LC  K   GM
Subjt:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM

Query:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR
        GF+D+E FN +LLAKQ WR++ NPDS+  RV K R
Subjt:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR

XP_030969964.1 uncharacterized protein LOC115990257 [Quercus lobata] | 2.0e-63 | 50.64
Query:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL
        +LH  ES   I G+ I + G  +SHLFFADDS+LF RA+  E Q + D+L  YER SGQ +N DK+ I F+ NT   VQ  +  +L V     ++ YLGL
Subjt:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL

Query:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM
        PAF+ R++     +IK+R+W+K+QGWK KL S  GREVLIKSV+QAIP YSM+CF+LPK ++ EI   + +FWW    +++K+HWV W+ LC+ K   GM
Subjt:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM

Query:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR
        GF+ +E FN ALLAKQ WR+L+NP+S+  RV K R
Subjt:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR

TrEMBL top hits | e value | % identity | Alignment
A0A2N9EX83 Reverse transcriptase domain-containing protein | 4.7e-66 | 53.51
Query:  ARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRS
        ++ I G+ ISR G  LSHLFFADDS+LF RA + E  A+QDILR YERASGQ +N DK+T+ FS +T    Q+ +   L++ V   +++YLGLP+ + R+
Subjt:  ARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRS

Query:  RSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLET
        +  S   +KDRVW K+QGWKGKL S  GREVLIK+VVQAIP Y+MN F+LPKK+  ++ R +  FWW    E +K+HWV+W  LC+PK   GMGFR+L  
Subjt:  RSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLET

Query:  FNQALLAKQGWRVLQNPDSMLARVLKGR
        FN ALLAKQ WR++ N  S+L RVL+ +
Subjt:  FNQALLAKQGWRVLQNPDSMLARVLKGR

A0A2N9F4S2 Reverse transcriptase domain-containing protein | 9.7e-64 | 50
Query:  ARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRS
        +R + G+ ISR G  L+HLFFADDS+LF RA + E   + +ILR+YERASGQ +N DK+T+ FS +T    Q+ +   LQ+ V   ++ YLGLP+ + RS
Subjt:  ARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRS

Query:  RSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLET
        +  S   +K+R+WRK+QGWK KL +  G+EVLIK+++QAIP Y+M+CF+LPKK+  E+   +  FWW    E +K+HW+ W  LCR KC  GMGFRDL  
Subjt:  RSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLET

Query:  FNQALLAKQGWRVLQNPDSMLARVLKGR
        FN+ALLAKQ WR+L N +S++ +V K +
Subjt:  FNQALLAKQGWRVLQNPDSMLARVLKGR

A0A2N9F5W1 Reverse transcriptase domain-containing protein | 7.4e-64 | 52.19
Query:  ARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRS
        ++ I G+ ISR G  LSHLFFADDS+LF RA + E  A+QDIL  YERAS Q +N DK+T+ FS +T+   Q+ +   L++ V   +++YLGLP+ + R+
Subjt:  ARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRS

Query:  RSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLET
        +  S   +KDRVW K+QGWKGKL S  GREVLIK+VVQAIP Y+MN F+LPKK+  ++ R +  FWW    E +K+HWV+W  LC+PK    MGFR+L  
Subjt:  RSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLET

Query:  FNQALLAKQGWRVLQNPDSMLARVLKGR
        FN ALLAKQ WR++ N  S+L RVL+ +
Subjt:  FNQALLAKQGWRVLQNPDSMLARVLKGR

A0A2N9F9I5 Uncharacterized protein | 9.7e-64 | 52.19
Query:  ARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRS
        ++ I G+ ISR G  L+HLFFADDS+LF RA + E  A+QDILR YER SGQ VN DK+T+ FS +T  ++Q+ +   L +     +++YLGLP+ + R+
Subjt:  ARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRS

Query:  RSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLET
        + NS   +KDRVWRK+QGWKGKL S  GREVLIK+VVQAIP Y+MN F+LPKK+  E+ R +  F W    E +K++WV+W  +C+PK   GMGFR+L  
Subjt:  RSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLET

Query:  FNQALLAKQGWRVLQNPDSMLARVLKGR
        FN+ALLAKQ WR++ N  S+L RV + +
Subjt:  FNQALLAKQGWRVLQNPDSMLARVLKGR

A0A803QGT2 Uncharacterized protein | 4.8e-63 | 50.43
Query:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL
        +L   E    + GL +SR   S+SHLFFADDSLLF +A      A++  L  Y RASGQ +N DKS ++FSPNT  +VQ+S   +L + +  CH+ YLGL
Subjt:  MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGL

Query:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM
        PA+  R +S     IK+++W+ M  W  K+FS GG+EVL+K+VVQ+IP Y+M+CFRLP K+  EI   MA+FWW    +++KIHW  W+ LC+ K   GM
Subjt:  PAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGM

Query:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKG
        GFR    FNQALLAKQ WR+ Q+P S+L+RVLKG
Subjt:  GFRDLETFNQALLAKQGWRVLQNPDSMLARVLKG

SwissProt top hits | e value | % identity | Alignment
P0C2F6 Putative ribonuclease H protein At1g65750 | 4.9e-20 | 33.81
Query:  LPAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWG
        +P    R   ++   I +RV  +M GW+ K  S  GR  L K+V+ ++P +SM+   LP+ I+  +++    F W    E +K H V W  +C PK   G
Subjt:  LPAFMPRSRSNSLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWG

Query:  MGFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGRADV
        +G R  ++ N+AL++K GWR+LQ  +S+   VL+ +  V
Subjt:  MGFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGRADV

P93295 Uncharacterized mitochondrial protein AtMg00310 | 3.4e-21 | 37.16
Query:  AIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPK-CYWGMGFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR-------AD
        A+P Y+M+CFRL K +  ++   M  FWWS  E  +KI WV+W+ LC+ K    G+GFRDL  FNQALLAKQ +R++  P ++L+R+L+ R        +
Subjt:  AIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPK-CYWGMGFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR-------AD

Query:  VFVGPDPFIIFYGLNHG--------------------MVDRMLEDETP
          VG  P   +  + HG                     +DR + DETP
Subjt:  VFVGPDPFIIFYGLNHG--------------------MVDRMLEDETP

Arabidopsis top hits | e value | % identity | Alignment
AT4G29090.1 Ribonuclease H-like superfamily protein | 3.8e-20 | 45.56
Query:  AIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR
        A+P Y+M CF LPK +  +I   +A FWW   +E + +HW +W  L   K   G+GF+D+E FN ALL KQ WR+L  P+S++A+V K R
Subjt:  AIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 2.4e-22 | 37.16
Query:  AIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPK-CYWGMGFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR-------AD
        A+P Y+M+CFRL K +  ++   M  FWWS  E  +KI WV+W+ LC+ K    G+GFRDL  FNQALLAKQ +R++  P ++L+R+L+ R        +
Subjt:  AIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPK-CYWGMGFRDLETFNQALLAKQGWRVLQNPDSMLARVLKGR-------AD

Query:  VFVGPDPFIIFYGLNHG--------------------MVDRMLEDETP
          VG  P   +  + HG                     +DR + DETP
Subjt:  VFVGPDPFIIFYGLNHG--------------------MVDRMLEDETP
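
The unlabelled middle line of each alignment block follows the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch under that assumption, identities and positives can be tallied per block as shown here. `midline_stats` is a hypothetical helper, and the reported % identity values may be computed over a different alignment length than any single block.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Both strings must already have the 'Query:  ' label or the matching
    indentation removed, so that column 0 is the first aligned residue.
    """
    # Trailing blanks are often stripped from the midline; pad it back out.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First 13 columns of the first GenBank alignment on this page.
ident, pos, cols = midline_stats("MLHAVESARAISG", "+L   E    + G")
print(ident, pos, cols)  # prints: 3 5 13
```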


Sequences
CDS sequence
ATGTTGCATGCAGTAGAGTCAGCGAGGGCTATATCGGGTTTGAAGATCTCTCGACAGGGCCTTTCTTTATCGCACCTCTTTTTCGCCGATGATAGTCTTTTATTTTTTCG
AGCACGTATAGGGGAGGCACAGGCTGTTCAGGATATCCTCCGACGCTATGAAAGAGCCTCAGGCCAAACGGTTAATTTTGATAAGTCCACGATTGCTTTTAGCCCAAATA
CAGAGACAAGTGTGCAAGATAGCGTGGGTGGAGTCCTGCAGGTAAAGGTAACTGCATGTCACCAACATTATTTGGGGCTGCCAGCTTTTATGCCCAGAAGTAGATCGAAC
TCCTTGAAGTTTATCAAGGACCGTGTGTGGAGGAAGATGCAGGGGTGGAAGGGGAAGCTTTTCTCAGCAGGTGGGAGGGAAGTCCTTATCAAGTCAGTAGTCCAGGCTAT
ACCGTGTTATTCGATGAACTGTTTCAGGCTCCCAAAGAAGATTGTATGTGAAATCAACCGGACTATGGCTCGCTTCTGGTGGAGTGGGGTTGAGGAGGATCAGAAAATCC
ATTGGGTGAGTTGGAAGGGTCTATGTAGGCCGAAATGTTATTGGGGCATGGGTTTTAGGGACTTGGAGACTTTTAACCAGGCATTGTTGGCGAAACAAGGTTGGAGGGTC
CTTCAGAACCCGGATTCGATGCTTGCTCGTGTGTTAAAAGGGAGGGCAGATGTGTTCGTTGGCCCAGATCCCTTCATCATCTTCTATGGACTCAATCATGGGATGGTGGA
CAGGATGTTGGAAGATGAAACTCCCAAATCTCTGTTTGGTAGGATTCAAGCCGACAGCCCTCTCTTGTTGCTGAGGGATGCGAGAGATGTCCTGGGGTTGGAGCGGTTTG
AGGAACTCATGGTGTTGCTCTGGGGTATTTGGAATTGTAGAAACAAGGTGAAATTTCGTGGGGAAGGTCTGTCTCTGGAGCTCCCATCCTGGGCGGCTGGATATGTGGCA
TCTGTCTGGGGAGGAAAAGGCGTTCAAGAAGGAGCGTCGTTGGGCAGGCTCGAGACTAGTGATTCGGACCATGCCGACGAAGTTATGGCAGTTGCAACTCGAGTCCATGA
ACATCTCTCTGATCTGAGCAATTTCCTGTTGGATGCCTGTGTTGATTGGCCGTGTGTGTGGCCGCTGCGATTCAGCTTCTGCTATCGGGAAGGGAACTGTCTTGCACACC
AATTAGCAGTCCTTGCTTCAGTGGATCAGCGTGATTGTGAGTGGATGGAAGAGGTTCCTGTTTGTGTGCGGAGTTTTTTAGTTTCTGATGTTGATTCCTTATCTGAACCG
AAAACGAGGGAGAGGGATGGATCTGCTACTCAAGAACGTCGTGGACGTCGTGAAGCTGCTGCTCGTTGTGGGTCTGAAACGGATGGCCATGACGCTGCCGTTCGTCGTGG
ATTTGAACCTAGGATCAACAACCCCGAACAACGCCGACGTTCACGAACAAACCCCGAACAGCGCGGCGTTCACGAACCCACACCCTTTTCGACTGCAGATCTACAACCCA
TGCCTTTCACGAACCCACGCCGGCGTTCACGAGTAACAACTCCACGGCTAATGGACGAGAGCAGCACTAGGTTTTCGGTTCAGATCCACGGCGTCCACAAGAGCAACTCC
ACGGCGTTCAAGAGTAGCAGATCTACAACCCTCTCCCTCGTGGTTCAGATATGA
mRNA sequence
ATGTTGCATGCAGTAGAGTCAGCGAGGGCTATATCGGGTTTGAAGATCTCTCGACAGGGCCTTTCTTTATCGCACCTCTTTTTCGCCGATGATAGTCTTTTATTTTTTCG
AGCACGTATAGGGGAGGCACAGGCTGTTCAGGATATCCTCCGACGCTATGAAAGAGCCTCAGGCCAAACGGTTAATTTTGATAAGTCCACGATTGCTTTTAGCCCAAATA
CAGAGACAAGTGTGCAAGATAGCGTGGGTGGAGTCCTGCAGGTAAAGGTAACTGCATGTCACCAACATTATTTGGGGCTGCCAGCTTTTATGCCCAGAAGTAGATCGAAC
TCCTTGAAGTTTATCAAGGACCGTGTGTGGAGGAAGATGCAGGGGTGGAAGGGGAAGCTTTTCTCAGCAGGTGGGAGGGAAGTCCTTATCAAGTCAGTAGTCCAGGCTAT
ACCGTGTTATTCGATGAACTGTTTCAGGCTCCCAAAGAAGATTGTATGTGAAATCAACCGGACTATGGCTCGCTTCTGGTGGAGTGGGGTTGAGGAGGATCAGAAAATCC
ATTGGGTGAGTTGGAAGGGTCTATGTAGGCCGAAATGTTATTGGGGCATGGGTTTTAGGGACTTGGAGACTTTTAACCAGGCATTGTTGGCGAAACAAGGTTGGAGGGTC
CTTCAGAACCCGGATTCGATGCTTGCTCGTGTGTTAAAAGGGAGGGCAGATGTGTTCGTTGGCCCAGATCCCTTCATCATCTTCTATGGACTCAATCATGGGATGGTGGA
CAGGATGTTGGAAGATGAAACTCCCAAATCTCTGTTTGGTAGGATTCAAGCCGACAGCCCTCTCTTGTTGCTGAGGGATGCGAGAGATGTCCTGGGGTTGGAGCGGTTTG
AGGAACTCATGGTGTTGCTCTGGGGTATTTGGAATTGTAGAAACAAGGTGAAATTTCGTGGGGAAGGTCTGTCTCTGGAGCTCCCATCCTGGGCGGCTGGATATGTGGCA
TCTGTCTGGGGAGGAAAAGGCGTTCAAGAAGGAGCGTCGTTGGGCAGGCTCGAGACTAGTGATTCGGACCATGCCGACGAAGTTATGGCAGTTGCAACTCGAGTCCATGA
ACATCTCTCTGATCTGAGCAATTTCCTGTTGGATGCCTGTGTTGATTGGCCGTGTGTGTGGCCGCTGCGATTCAGCTTCTGCTATCGGGAAGGGAACTGTCTTGCACACC
AATTAGCAGTCCTTGCTTCAGTGGATCAGCGTGATTGTGAGTGGATGGAAGAGGTTCCTGTTTGTGTGCGGAGTTTTTTAGTTTCTGATGTTGATTCCTTATCTGAACCG
AAAACGAGGGAGAGGGATGGATCTGCTACTCAAGAACGTCGTGGACGTCGTGAAGCTGCTGCTCGTTGTGGGTCTGAAACGGATGGCCATGACGCTGCCGTTCGTCGTGG
ATTTGAACCTAGGATCAACAACCCCGAACAACGCCGACGTTCACGAACAAACCCCGAACAGCGCGGCGTTCACGAACCCACACCCTTTTCGACTGCAGATCTACAACCCA
TGCCTTTCACGAACCCACGCCGGCGTTCACGAGTAACAACTCCACGGCTAATGGACGAGAGCAGCACTAGGTTTTCGGTTCAGATCCACGGCGTCCACAAGAGCAACTCC
ACGGCGTTCAAGAGTAGCAGATCTACAACCCTCTCCCTCGTGGTTCAGATATGA
Protein sequence
MLHAVESARAISGLKISRQGLSLSHLFFADDSLLFFRARIGEAQAVQDILRRYERASGQTVNFDKSTIAFSPNTETSVQDSVGGVLQVKVTACHQHYLGLPAFMPRSRSN
SLKFIKDRVWRKMQGWKGKLFSAGGREVLIKSVVQAIPCYSMNCFRLPKKIVCEINRTMARFWWSGVEEDQKIHWVSWKGLCRPKCYWGMGFRDLETFNQALLAKQGWRV
LQNPDSMLARVLKGRADVFVGPDPFIIFYGLNHGMVDRMLEDETPKSLFGRIQADSPLLLLRDARDVLGLERFEELMVLLWGIWNCRNKVKFRGEGLSLELPSWAAGYVA
SVWGGKGVQEGASLGRLETSDSDHADEVMAVATRVHEHLSDLSNFLLDACVDWPCVWPLRFSFCYREGNCLAHQLAVLASVDQRDCEWMEEVPVCVRSFLVSDVDSLSEP
KTRERDGSATQERRGRREAAARCGSETDGHDAAVRRGFEPRINNPEQRRRSRTNPEQRGVHEPTPFSTADLQPMPFTNPRRRSRVTTPRLMDESSTRFSVQIHGVHKSNS
TAFKSSRSTTLSLVVQI
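
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper built only on the Python standard library. It is applied to the first 18 nucleotides of the CDS, which yield the first six residues of the protein.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a nucleotide string in frame 0; 'X' marks unknown codons."""
    cds = cds.upper().replace("U", "T")
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


print(translate("ATGTTGCATGCAGTAGAG"))  # prints: MLHAVE
```

Passing the full CDS as a single string with line breaks removed should reproduce the listed protein up to the terminal stop codon.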