CuGenDBv2

Spg019103 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg019103
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase
Genome location: scaffold1:42642179..42652228
RNA-Seq Expression: Spg019103
Synteny: Spg019103
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
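
The genome location above is written as a sequence name followed by a start..end pair. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so), the string could be parsed and the genomic span computed as below. The names `parse_location` and `span_length` are hypothetical helpers for illustration, not part of any CuGenDB tooling.

```python
import re

# Assumed layout: "<sequence id>:<start>..<end>", as on the genome location line.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("scaffold1:42642179..42652228")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 10,050 bp on scaffold1.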


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 3.9e-15; % identity: 62.2)
Query:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS
        GK Q +RLVE+EEQMLYL EVPDS+R+LESRLEEI+ +T+ IDAVAGRV+G PIQEL+ RV+ LE      +  N+ER   S
Subjt:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS

KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 7.7e-72; % identity: 61.78)
Query:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST
        +NL NR       +  H + +CP++TAFNAFQASLT  +  +    E         DNPRMGALKFLS+LQ+K      P+ERGLMYV+ W+NQ+  KST
Subjt:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST

Query:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV
        MVDSGATHNF+TE EA+RLNL+W++D G+MKAVNSAALPI+G+ KR  ++LG W+   DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG  P+V
Subjt:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV

Query:  VTTSIRQPNGVKMISALQLKKAAFR
        V T +RQP+G+KMISA+QLKK   R
Subjt:  VTTSIRQPNGVKMISALQLKKAAFR

KAA0063412.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 3.9e-15; % identity: 62.2)
Query:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS
        GK Q +RLVE+EEQMLYL EVPDS+R+LESRLEEI+ +T+ IDAVAGRV+G PIQEL+ RV+ LE      +  N+ER   S
Subjt:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS

KAA0067557.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 3.4e-11; % identity: 58.97)
Query:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFER
        GK Q +RLVE    +LYL EVPDS+R+LESRLEEI+ +T+ IDAVAGRV+G PIQEL+ RV+ LE      +  N+ER
Subjt:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFER

KAA0067557.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 7.7e-72; % identity: 61.78)
Query:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST
        +NL NR       +  H + +CP++TAFNAFQASLT  +  +    E         DNPRMGALKFLS+LQ+K      P+ERGLMYV+ W+NQ+  KST
Subjt:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST

Query:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV
        MVDSGATHNF+TE EA+RLNL+W++D G+MKAVNSAALPI+G+ KR  ++LG W+   DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG  P+V
Subjt:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV

Query:  VTTSIRQPNGVKMISALQLKKAAFR
        V T +RQP+G+KMISA+QLKK   R
Subjt:  VTTSIRQPNGVKMISALQLKKAAFR

TYK25585.1 uncharacterized protein E5676_scaffold352G007440 [Cucumis melo var. makuwa] (e value: 3.9e-15; % identity: 62.2)
Query:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS
        GK Q +RLVE+EEQMLYL EVPDS+R+LESRLEEI+ +T+ IDAVAGRV+G PIQEL+ RV+ LE      +  N+ER   S
Subjt:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS

TYK25585.1 uncharacterized protein E5676_scaffold352G007440 [Cucumis melo var. makuwa] (e value: 7.7e-72; % identity: 61.78)
Query:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST
        +NL NR       +  H + +CP++TAFNAFQASLT  +  +    E         DNPRMGALKFLS+LQ+K      P+ERGLMYV+ W+NQ+  KST
Subjt:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST

Query:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV
        MVDSGATHNF+TE EA+RLNL+W++D G+MKAVNSAALPI+G+ KR  ++LG W+   DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG  P+V
Subjt:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV

Query:  VTTSIRQPNGVKMISALQLKKAAFR
        V T +RQP+G+KMISA+QLKK   R
Subjt:  VTTSIRQPNGVKMISALQLKKAAFR

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] (e value: 9.1e-81; % identity: 76.1)
Query:  HRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKSTMVDSGATHNFMTEAEA
        HR Y+CP+R A  AFQA+LT     E E  ET+T  E  +DNPRMGALKFLSALQ+K+E  KEPLERGLMYVEAWVNQ+ AKSTMVDSGATHNFMTE EA
Subjt:  HRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKSTMVDSGATHNFMTEAEA

Query:  RRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTVVTTSIRQPNGVKMISA
        RRLNL WD+DPGKMKAVNSAALPIMG+AKR +VKLG W+   DFVIV+MDDFDVVLG++FLLEHKVIPMPLAKCLV+T ++P VV TSI+QP+GVKMISA
Subjt:  RRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTVVTTSIRQPNGVKMISA

Query:  LQLKK
        LQLKK
Subjt:  LQLKK

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] (e value: 1.6e-16; % identity: 52.87)
Query:  VSEIKQLGKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLENRAAKPNNFERAHRS
        +S  KQLGK+ ++RLVEIEE++L+LRE+PD++R++ESRL+EI+T+ D ID V  R+DG+ I+EL++RVE LE++  + +N ER   S
Subjt:  VSEIKQLGKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLENRAAKPNNFERAHRS

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia] (e value: 3.5e-72; % identity: 61.78)
Query:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST
        +NL NR       +  H + +CP++TAFNAFQASL   +  +    E         DNPRMGALKFLS+LQ+K      P+ERGLMYV+ W+NQ+  KST
Subjt:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST

Query:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV
        MVDSGATHNF+TE EA+RLNL+W++D G+MKAVNSAALPI+G+ KR  ++LG W+   DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG  P+V
Subjt:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV

Query:  VTTSIRQPNGVKMISALQLKKAAFR
        V T +RQP+G+KMISA+QLKK  FR
Subjt:  VTTSIRQPNGVKMISALQLKKAAFR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T0E2 Reverse transcriptase (e value: 1.9e-15; % identity: 62.2)
Query:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS
        GK Q +RLVE+EEQMLYL EVPDS+R+LESRLEEI+ +T+ IDAVAGRV+G PIQEL+ RV+ LE      +  N+ER   S
Subjt:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS

A0A5D3BRZ6 Reverse transcriptase (e value: 1.9e-15; % identity: 62.2)
Query:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS
        GK Q +RLVE+EEQMLYL EVPDS+R+LESRLEEI+ +T+ IDAVAGRV+G PIQEL+ RV+ LE      +  N+ER   S
Subjt:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS

A0A5D3BRZ6 Reverse transcriptase (e value: 3.7e-72; % identity: 61.78)
Query:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST
        +NL NR       +  H + +CP++TAFNAFQASLT  +  +    E         DNPRMGALKFLS+LQ+K      P+ERGLMYV+ W+NQ+  KST
Subjt:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST

Query:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV
        MVDSGATHNF+TE EA+RLNL+W++D G+MKAVNSAALPI+G+ KR  ++LG W+   DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG  P+V
Subjt:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV

Query:  VTTSIRQPNGVKMISALQLKKAAFR
        V T +RQP+G+KMISA+QLKK   R
Subjt:  VTTSIRQPNGVKMISALQLKKAAFR

A0A5D3C4R1 Reverse transcriptase (e value: 1.9e-15; % identity: 62.2)
Query:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS
        GK Q +RLVE+EEQMLYL EVPDS+R+LESRLEEI+ +T+ IDAVAGRV+G PIQEL+ RV+ LE      +  N+ER   S
Subjt:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS

A0A5D3C4R1 Reverse transcriptase (e value: 3.7e-72; % identity: 61.78)
Query:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST
        +NL NR       +  H + +CP++TAFNAFQASLT  +  +    E         DNPRMGALKFLS+LQ+K      P+ERGLMYV+ W+NQ+  KST
Subjt:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST

Query:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV
        MVDSGATHNF+TE EA+RLNL+W++D G+MKAVNSAALPI+G+ KR  ++LG W+   DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG  P+V
Subjt:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV

Query:  VTTSIRQPNGVKMISALQLKKAAFR
        V T +RQP+G+KMISA+QLKK   R
Subjt:  VTTSIRQPNGVKMISALQLKKAAFR

A0A5D3DQ20 Retrotrans_gag domain-containing protein (e value: 1.9e-15; % identity: 62.2)
Query:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS
        GK Q +RLVE+EEQMLYL EVPDS+R+LESRLEEI+ +T+ IDAVAGRV+G PIQEL+ RV+ LE      +  N+ER   S
Subjt:  GKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN--RAAKPNNFERAHRS

A0A5D3DQ20 Retrotrans_gag domain-containing protein (e value: 3.7e-72; % identity: 61.78)
Query:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST
        +NL NR       +  H + +CP++TAFNAFQASLT  +  +    E         DNPRMGALKFLS+LQ+K      P+ERGLMYV+ W+NQ+  KST
Subjt:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST

Query:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV
        MVDSGATHNF+TE EA+RLNL+W++D G+MKAVNSAALPI+G+ KR  ++LG W+   DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG  P+V
Subjt:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV

Query:  VTTSIRQPNGVKMISALQLKKAAFR
        V T +RQP+G+KMISA+QLKK   R
Subjt:  VTTSIRQPNGVKMISALQLKKAAFR

A0A6J1DLQ6 uncharacterized protein LOC111022320 (e value: 4.4e-81; % identity: 76.1)
Query:  HRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKSTMVDSGATHNFMTEAEA
        HR Y+CP+R A  AFQA+LT     E E  ET+T  E  +DNPRMGALKFLSALQ+K+E  KEPLERGLMYVEAWVNQ+ AKSTMVDSGATHNFMTE EA
Subjt:  HRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKSTMVDSGATHNFMTEAEA

Query:  RRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTVVTTSIRQPNGVKMISA
        RRLNL WD+DPGKMKAVNSAALPIMG+AKR +VKLG W+   DFVIV+MDDFDVVLG++FLLEHKVIPMPLAKCLV+T ++P VV TSI+QP+GVKMISA
Subjt:  RRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTVVTTSIRQPNGVKMISA

Query:  LQLKK
        LQLKK
Subjt:  LQLKK

A0A6J1DLQ6 uncharacterized protein LOC111022320 (e value: 7.6e-17; % identity: 52.87)
Query:  VSEIKQLGKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLENRAAKPNNFERAHRS
        +S  KQLGK+ ++RLVEIEE++L+LRE+PD++R++ESRL+EI+T+ D ID V  R+DG+ I+EL++RVE LE++  + +N ER   S
Subjt:  VSEIKQLGKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLENRAAKPNNFERAHRS

A0A6J1DLQ6 uncharacterized protein LOC111022320 (e value: 1.7e-72; % identity: 61.78)
Query:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST
        +NL NR       +  H + +CP++TAFNAFQASL   +  +    E         DNPRMGALKFLS+LQ+K      P+ERGLMYV+ W+NQ+  KST
Subjt:  ENLENRAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKST

Query:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV
        MVDSGATHNF+TE EA+RLNL+W++D G+MKAVNSAALPI+G+ KR  ++LG W+   DFV+VKMDDFDVVLGMEFLLEH+VIPMPLAKCLVITG  P+V
Subjt:  MVDSGATHNFMTEAEARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTV

Query:  VTTSIRQPNGVKMISALQLKKAAFR
        V T +RQP+G+KMISA+QLKK  FR
Subjt:  VTTSIRQPNGVKMISALQLKKAAFR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCGCCGCAAGCCTCTGGTATGCCTTCTTTCTCTCTCTCTCTCTCGTCTCTCATCCCCACGACCCTCACTCTCTCACTCGTCTTCAGCGTCGCTGTCGTCGCAAACAG
CCCCAGTGCCGTCGTCGCCTCCTCGCTTCGATCGTCAGTAGCGCAGCCGTTGACCTCCCTCTCTTCTCCAATCTGTCATCGTGTCATCGCTCACACTATCGCACACACCC
GAGTCTGTTGCAAGAAGCTCGGGATCCACGCGTTCAGCTCGTTTCCAACATTTGTACCATCGCTTATCGCTGTGTTTTGTCGATGTCGTCGTCTAATGTTGATTGTAGCA
GCCTTGGGTGTCGTTTGGGGTAAGTTTCGGCATTTCCGGAACGGGGGTAGAGGTTGCGATATTCGCCCGTTGTGCCACCGGCGATCTGTTTTCAAGTTGGCTGACTTACT
TGTGAGCGAAATAAAACAGTTGGGCAAGACCCAAGTGGAACGATTGGTAGAGATCGAAGAGCAGATGCTCTACCTGAGGGAAGTCCCTGATTCAGTTCGCTTCCTAGAAT
CTCGTCTTGAAGAGATTGCGACGAGGACAGATAAGATCGATGCGGTAGCTGGCCGCGTGGATGGAATGCCCATCCAAGAATTGATTATGAGGGTTGAAAACCTCGAGAAC
AGAGCTGCGAAACCTAATAACTTCGAGCGGGCACATCGATCGTACCAGTGCCCTAGTCGAACAGCATTTAATGCCTTTCAAGCTTCGTTGACGGGCAGCGCGGGTCCTGA
GACTGAGATTGTTGAGACGAACACGACGAATGAGACGAATGAGGACAACCCTCGCATGGGGGCCTTGAAGTTTCTATCCGCCCTCCAAAGGAAATCCGAAGGGGCCAAAG
AGCCGCTCGAACGGGGTCTCATGTATGTTGAGGCGTGGGTTAATCAGAGGCAGGCCAAGAGTACAATGGTCGACTCTGGTGCCACCCACAACTTTATGACCGAAGCTGAA
GCACGACGATTGAACCTTAAGTGGGATAGGGATCCCGGAAAGATGAAAGCGGTAAACTCAGCCGCCCTACCGATCATGGGAATTGCTAAGAGAGCAACTGTCAAGTTGGG
GGATTGGAATGACCATGCAGATTTTGTGATTGTAAAGATGGATGATTTCGACGTGGTGTTGGGAATGGAATTCCTTCTCGAACACAAAGTCATTCCAATGCCCCTAGCAA
AATGCTTGGTGATCACAGGAACCAACCCCACGGTCGTGACGACCAGTATCAGACAACCGAATGGAGTGAAAATGATATCAGCCCTCCAGTTAAAGAAAGCGGCTTTCCGG
CTGTTTCTTCCGACCAACGTCACCGGCAGAGGTAGTGTCAGCGACGACGATTTCCGACGGCTCCGTGAGCAACAACGCAAGGTGAGGCAACTGCGTTTTCGAGTTCCAAT
AGGTTCTTCTGCGCGATATCTCTTCCTCCTCGACGCCGGCAACCACGTTCGATCTTCACCAGCGACGATTACCGGCGAAGCATGGCGGCACGTGTGCGGTGGTGGGCGAG
AGACTCCAGCGTTAGGGGTGTTCATCGGTCAGTCGGTTTTCCAGAAATCTGGAAACCGACGACTTCCGATGGGTGCTACCAGCGGACCGACCGAACGGTGTCGTTTCAGC
GGGTTTTTGGGCGATTTCGGTGTTGGTGGGTCATTGTGCTCCGGCAACTTGGTGAAGTTTCAGCATTGGGCTGAAGATTCTGACTTCTCCAAGAAATTGGCTTGCGTGCC
CCAACCATCTCTTGCCAGCTTGCAAAGCAGTCACACGTCAGGGCGACCATCCATCCGGCTGTCATTGTACGATTCACGCTCTGGGATCTGCGCCTACGCTGAAGATAACT
TGGGCAAGGAAATAACTATAGAAATCATGCGTTAA
mRNA sequence
ATGCCGCCGCAAGCCTCTGGTATGCCTTCTTTCTCTCTCTCTCTCTCGTCTCTCATCCCCACGACCCTCACTCTCTCACTCGTCTTCAGCGTCGCTGTCGTCGCAAACAG
CCCCAGTGCCGTCGTCGCCTCCTCGCTTCGATCGTCAGTAGCGCAGCCGTTGACCTCCCTCTCTTCTCCAATCTGTCATCGTGTCATCGCTCACACTATCGCACACACCC
GAGTCTGTTGCAAGAAGCTCGGGATCCACGCGTTCAGCTCGTTTCCAACATTTGTACCATCGCTTATCGCTGTGTTTTGTCGATGTCGTCGTCTAATGTTGATTGTAGCA
GCCTTGGGTGTCGTTTGGGGTAAGTTTCGGCATTTCCGGAACGGGGGTAGAGGTTGCGATATTCGCCCGTTGTGCCACCGGCGATCTGTTTTCAAGTTGGCTGACTTACT
TGTGAGCGAAATAAAACAGTTGGGCAAGACCCAAGTGGAACGATTGGTAGAGATCGAAGAGCAGATGCTCTACCTGAGGGAAGTCCCTGATTCAGTTCGCTTCCTAGAAT
CTCGTCTTGAAGAGATTGCGACGAGGACAGATAAGATCGATGCGGTAGCTGGCCGCGTGGATGGAATGCCCATCCAAGAATTGATTATGAGGGTTGAAAACCTCGAGAAC
AGAGCTGCGAAACCTAATAACTTCGAGCGGGCACATCGATCGTACCAGTGCCCTAGTCGAACAGCATTTAATGCCTTTCAAGCTTCGTTGACGGGCAGCGCGGGTCCTGA
GACTGAGATTGTTGAGACGAACACGACGAATGAGACGAATGAGGACAACCCTCGCATGGGGGCCTTGAAGTTTCTATCCGCCCTCCAAAGGAAATCCGAAGGGGCCAAAG
AGCCGCTCGAACGGGGTCTCATGTATGTTGAGGCGTGGGTTAATCAGAGGCAGGCCAAGAGTACAATGGTCGACTCTGGTGCCACCCACAACTTTATGACCGAAGCTGAA
GCACGACGATTGAACCTTAAGTGGGATAGGGATCCCGGAAAGATGAAAGCGGTAAACTCAGCCGCCCTACCGATCATGGGAATTGCTAAGAGAGCAACTGTCAAGTTGGG
GGATTGGAATGACCATGCAGATTTTGTGATTGTAAAGATGGATGATTTCGACGTGGTGTTGGGAATGGAATTCCTTCTCGAACACAAAGTCATTCCAATGCCCCTAGCAA
AATGCTTGGTGATCACAGGAACCAACCCCACGGTCGTGACGACCAGTATCAGACAACCGAATGGAGTGAAAATGATATCAGCCCTCCAGTTAAAGAAAGCGGCTTTCCGG
CTGTTTCTTCCGACCAACGTCACCGGCAGAGGTAGTGTCAGCGACGACGATTTCCGACGGCTCCGTGAGCAACAACGCAAGGTGAGGCAACTGCGTTTTCGAGTTCCAAT
AGGTTCTTCTGCGCGATATCTCTTCCTCCTCGACGCCGGCAACCACGTTCGATCTTCACCAGCGACGATTACCGGCGAAGCATGGCGGCACGTGTGCGGTGGTGGGCGAG
AGACTCCAGCGTTAGGGGTGTTCATCGGTCAGTCGGTTTTCCAGAAATCTGGAAACCGACGACTTCCGATGGGTGCTACCAGCGGACCGACCGAACGGTGTCGTTTCAGC
GGGTTTTTGGGCGATTTCGGTGTTGGTGGGTCATTGTGCTCCGGCAACTTGGTGAAGTTTCAGCATTGGGCTGAAGATTCTGACTTCTCCAAGAAATTGGCTTGCGTGCC
CCAACCATCTCTTGCCAGCTTGCAAAGCAGTCACACGTCAGGGCGACCATCCATCCGGCTGTCATTGTACGATTCACGCTCTGGGATCTGCGCCTACGCTGAAGATAACT
TGGGCAAGGAAATAACTATAGAAATCATGCGTTAA
Protein sequence
MPPQASGMPSFSLSLSSLIPTTLTLSLVFSVAVVANSPSAVVASSLRSSVAQPLTSLSSPICHRVIAHTIAHTRVCCKKLGIHAFSSFPTFVPSLIAVFCRCRRLMLIVA
ALGVVWGKFRHFRNGGRGCDIRPLCHRRSVFKLADLLVSEIKQLGKTQVERLVEIEEQMLYLREVPDSVRFLESRLEEIATRTDKIDAVAGRVDGMPIQELIMRVENLEN
RAAKPNNFERAHRSYQCPSRTAFNAFQASLTGSAGPETEIVETNTTNETNEDNPRMGALKFLSALQRKSEGAKEPLERGLMYVEAWVNQRQAKSTMVDSGATHNFMTEAE
ARRLNLKWDRDPGKMKAVNSAALPIMGIAKRATVKLGDWNDHADFVIVKMDDFDVVLGMEFLLEHKVIPMPLAKCLVITGTNPTVVTTSIRQPNGVKMISALQLKKAAFR
LFLPTNVTGRGSVSDDDFRRLREQQRKVRQLRFRVPIGSSARYLFLLDAGNHVRSSPATITGEAWRHVCGGGRETPALGVFIGQSVFQKSGNRRLPMGATSGPTERCRFS
GFLGDFGVGGSLCSGNLVKFQHWAEDSDFSKKLACVPQPSLASLQSSHTSGRPSIRLSLYDSRSGICAYAEDNLGKEITIEIMR