; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022864 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022864
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:39589673..39603233
RNA-Seq ExpressionLag0022864
SyntenyLag0022864
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0004518 - nuclease activity (molecular function)
GO:0005488 - binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047446.1 pol protein [Cucumis melo var. makuwa]1.5e-6259.56Show/hide
Query:  SRDLAIVEAVVKWPHSTIVA---------------EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLEL
        S D A +EAV  W   + ++               + L Q+LV+APVLTVPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK++E NYPTHDLEL
Subjt:  SRDLAIVEAVVKWPHSTIVA---------------EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLEL

Query:  AAVVSGLKIWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNF
        AAVV  LKIWR+Y+WLELVKDYD EILYHP K NVVAD LSR+ +H++ALIT Q  +  D+E A I V VG V  QLAQLTI PTLRQ IID+Q +D   
Subjt:  AAVVSGLKIWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNF

Query:  SDLFHQVESSQGDEFSLSSDYGLLF
         +     E+ Q  EFS+SSD GLLF
Subjt:  SDLFHQVESSQGDEFSLSSDYGLLF

KAA0049956.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-6257.14Show/hide
Query:  VKSFWCSRDLAIVEAVVKWPHSTIVAE---------------------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKN
        V S   S D A +EAV  WP  + V+E                     +L Q+LV+APVL VPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK+
Subjt:  VKSFWCSRDLAIVEAVVKWPHSTIVAE---------------------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKN

Query:  YECNYPTHDLELAAVVSGLKIWRYYI-WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLR
        +E NYPTHDLELAAVV  LKIWR+Y+ WLELVKDYD EILYHP K N+VAD LSR+  H++ALIT Q  +  D E   I V VGEV SQLAQL++ PTLR
Subjt:  YECNYPTHDLELAAVVSGLKIWRYYI-WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLR

Query:  QHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF
        Q II +Q +D    +  H VE+ QG++FS+SSD GL+F
Subjt:  QHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF

TYJ98640.1 pol protein [Cucumis melo var. makuwa]1.5e-6259.56Show/hide
Query:  SRDLAIVEAVVKWPHSTIVA---------------EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLEL
        S D A +EAV  W   + ++               + L Q+LV+APVLTVPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK++E NYPTHDLEL
Subjt:  SRDLAIVEAVVKWPHSTIVA---------------EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLEL

Query:  AAVVSGLKIWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNF
        AAVV  LKIWR+Y+WLELVKDYD EILYHP K NVVAD LSR+ +H++ALIT Q  +  D+E A I V VG V  QLAQLTI PTLRQ IID+Q +D   
Subjt:  AAVVSGLKIWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNF

Query:  SDLFHQVESSQGDEFSLSSDYGLLF
         +     E+ Q  EFS+SSD GLLF
Subjt:  SDLFHQVESSQGDEFSLSSDYGLLF

TYK01676.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.9e-6360.37Show/hide
Query:  DLAIVEAVVKWPHSTIVAE---------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLK
        D A +EAV  WP  + V+E          L Q+LV+APVLTVPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK++E NYPTHDLELA VV  LK
Subjt:  DLAIVEAVVKWPHSTIVAE---------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLK

Query:  IWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNFSDLFHQVE
        IWR+Y+WLELVKDYD EILYHP K NVVAD LSR+ +H++ LIT Q  +  D++   I V VG V  QLAQLT+ PTLRQ IID+Q +D    +     E
Subjt:  IWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNFSDLFHQVE

Query:  SSQGDEFSLSSDYGLLF
        + Q  EFS+SSD GLLF
Subjt:  SSQGDEFSLSSDYGLLF

TYK07688.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-6257.14Show/hide
Query:  VKSFWCSRDLAIVEAVVKWPHSTIVAE---------------------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKN
        V S   S D A +EAV  WP  + V+E                     +L Q+LV+APVL VPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK+
Subjt:  VKSFWCSRDLAIVEAVVKWPHSTIVAE---------------------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKN

Query:  YECNYPTHDLELAAVVSGLKIWRYYI-WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLR
        +E NYPTHDLELAAVV  LKIWR+Y+ WLELVKDYD EILYHP K N+VAD LSR+  H++ALIT Q  +  D E   I V VGEV SQLAQL++ PTLR
Subjt:  YECNYPTHDLELAAVVSGLKIWRYYI-WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLR

Query:  QHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF
        Q II +Q +D    +  H VE+ QG++FS+SSD GL+F
Subjt:  QHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF

TrEMBL top hitse value%identityAlignment
A0A5A7TZP2 Pol protein7.3e-6359.56Show/hide
Query:  SRDLAIVEAVVKWPHSTIVA---------------EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLEL
        S D A +EAV  W   + ++               + L Q+LV+APVLTVPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK++E NYPTHDLEL
Subjt:  SRDLAIVEAVVKWPHSTIVA---------------EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLEL

Query:  AAVVSGLKIWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNF
        AAVV  LKIWR+Y+WLELVKDYD EILYHP K NVVAD LSR+ +H++ALIT Q  +  D+E A I V VG V  QLAQLTI PTLRQ IID+Q +D   
Subjt:  AAVVSGLKIWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNF

Query:  SDLFHQVESSQGDEFSLSSDYGLLF
         +     E+ Q  EFS+SSD GLLF
Subjt:  SDLFHQVESSQGDEFSLSSDYGLLF

A0A5A7U6Z9 Ty3-gypsy retrotransposon protein1.6e-6257.14Show/hide
Query:  VKSFWCSRDLAIVEAVVKWPHSTIVAE---------------------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKN
        V S   S D A +EAV  WP  + V+E                     +L Q+LV+APVL VPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK+
Subjt:  VKSFWCSRDLAIVEAVVKWPHSTIVAE---------------------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKN

Query:  YECNYPTHDLELAAVVSGLKIWRYYI-WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLR
        +E NYPTHDLELAAVV  LKIWR+Y+ WLELVKDYD EILYHP K N+VAD LSR+  H++ALIT Q  +  D E   I V VGEV SQLAQL++ PTLR
Subjt:  YECNYPTHDLELAAVVSGLKIWRYYI-WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLR

Query:  QHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF
        Q II +Q +D    +  H VE+ QG++FS+SSD GL+F
Subjt:  QHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF

A0A5D3BFS1 Pol protein7.3e-6359.56Show/hide
Query:  SRDLAIVEAVVKWPHSTIVA---------------EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLEL
        S D A +EAV  W   + ++               + L Q+LV+APVLTVPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK++E NYPTHDLEL
Subjt:  SRDLAIVEAVVKWPHSTIVA---------------EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLEL

Query:  AAVVSGLKIWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNF
        AAVV  LKIWR+Y+WLELVKDYD EILYHP K NVVAD LSR+ +H++ALIT Q  +  D+E A I V VG V  QLAQLTI PTLRQ IID+Q +D   
Subjt:  AAVVSGLKIWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNF

Query:  SDLFHQVESSQGDEFSLSSDYGLLF
         +     E+ Q  EFS+SSD GLLF
Subjt:  SDLFHQVESSQGDEFSLSSDYGLLF

A0A5D3BTX5 Ty3-gypsy retrotransposon protein4.3e-6360.37Show/hide
Query:  DLAIVEAVVKWPHSTIVAE---------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLK
        D A +EAV  WP  + V+E          L Q+LV+APVLTVPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK++E NYPTHDLELA VV  LK
Subjt:  DLAIVEAVVKWPHSTIVAE---------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLK

Query:  IWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNFSDLFHQVE
        IWR+Y+WLELVKDYD EILYHP K NVVAD LSR+ +H++ LIT Q  +  D++   I V VG V  QLAQLT+ PTLRQ IID+Q +D    +     E
Subjt:  IWRYYIWLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNFSDLFHQVE

Query:  SSQGDEFSLSSDYGLLF
        + Q  EFS+SSD GLLF
Subjt:  SSQGDEFSLSSDYGLLF

A0A5D3C942 Ty3-gypsy retrotransposon protein1.6e-6257.14Show/hide
Query:  VKSFWCSRDLAIVEAVVKWPHSTIVAE---------------------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKN
        V S   S D A +EAV  WP  + V+E                     +L Q+LV+APVL VPDGSGSFV+Y+ ASKKGLGCVL+Q GKV+AYAS QLK+
Subjt:  VKSFWCSRDLAIVEAVVKWPHSTIVAE---------------------KLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKN

Query:  YECNYPTHDLELAAVVSGLKIWRYYI-WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLR
        +E NYPTHDLELAAVV  LKIWR+Y+ WLELVKDYD EILYHP K N+VAD LSR+  H++ALIT Q  +  D E   I V VGEV SQLAQL++ PTLR
Subjt:  YECNYPTHDLELAAVVSGLKIWRYYI-WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLR

Query:  QHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF
        Q II +Q +D    +  H VE+ QG++FS+SSD GL+F
Subjt:  QHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.3e-1133.82Show/hide
Query:  EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI---------------WLELVK
        +KL   +   P+L VPD +  F +   AS   LG VL Q+G  ++Y S  L  +E NY T + EL A+V   K +R+Y+               WL  +K
Subjt:  EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI---------------WLELVK

Query:  D--------------YDVEILYHPSKENVVADGLSR
        D              +D +I Y   KEN VAD LSR
Subjt:  D--------------YDVEILYHPSKENVVADGLSR

P0CT34 Transposon Tf2-1 polyprotein1.3e-0827.71Show/hide
Query:  KW-PHSTIVAEKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGK-----VIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI------
        KW P  T   E + Q LVS PVL   D S   ++   AS   +G VL Q         + Y S ++   + NY   D E+ A++  LK WR+Y+      
Subjt:  KW-PHSTIVAEKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGK-----VIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI------

Query:  ---------------------------WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITD
                                   W   ++D++ EI Y P   N +AD LSR    T  +  D
Subjt:  ---------------------------WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITD

P0CT41 Transposon Tf2-12 polyprotein1.3e-0827.71Show/hide
Query:  KW-PHSTIVAEKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGK-----VIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI------
        KW P  T   E + Q LVS PVL   D S   ++   AS   +G VL Q         + Y S ++   + NY   D E+ A++  LK WR+Y+      
Subjt:  KW-PHSTIVAEKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGK-----VIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI------

Query:  ---------------------------WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITD
                                   W   ++D++ EI Y P   N +AD LSR    T  +  D
Subjt:  ---------------------------WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITD

P20825 Retrovirus-related Pol polyprotein from transposon 2978.2e-1133.82Show/hide
Query:  EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI---------------WLELVK
        EKL   ++  P+L +PD    FV+   AS   LG VL QNG  I++ S  L ++E NY   + EL A+V   K +R+Y+               WL  +K
Subjt:  EKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI---------------WLELVK

Query:  --------------DYDVEILYHPSKENVVADGLSR
                      +Y  +I Y   KEN VAD LSR
Subjt:  --------------DYDVEILYHPSKENVVADGLSR

Q9UR07 Transposon Tf2-11 polyprotein1.3e-0827.71Show/hide
Query:  KW-PHSTIVAEKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGK-----VIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI------
        KW P  T   E + Q LVS PVL   D S   ++   AS   +G VL Q         + Y S ++   + NY   D E+ A++  LK WR+Y+      
Subjt:  KW-PHSTIVAEKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGK-----VIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYI------

Query:  ---------------------------WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITD
                                   W   ++D++ EI Y P   N +AD LSR    T  +  D
Subjt:  ---------------------------WLELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTTCAACCCCAATGAAGTCGACCGATCAAGCACTAGGCAACTGGCTATCCCTAGCTCAGTCACTGGCGTTCACCAATACACAATGATCGTAGTTTCTGTTCCCAG
TTTTGCTGCAGCACAGAATTTGCTCCCTCCAGTCGCCAGCAACATATCTGCACAAAATGATTTCCAAGCTTCCATTCTTCCAAATGGGTTAAAACATTTCTTGGACGACG
AGGAGGTGAAAAGACCGGAGCAGATCGAAGCAGGCTTGAAGAGTAGTGCCAAAGAGTCAAAGGGCAACGATCTTAGGTTTGAACCGGAGATCTTAAAAGCAGATACGAGG
TTAAGAAGACGAGCAAGAAGAAGGCTTGTAGATGACCTCGGCGAGGTTGAGGCCCCAGCCATGGTTGAGAGGACTTTGAGGCAGAGCTGCGCCAGAACTGAACCAGCAGT
CACTTTGCATTACCTACCCCGAGACGAAAATGCAAGTGAGGATGATGCGCAATTTGCTTACCGCAAATGTATGGGTCAAATTCAATCGTCTCTGTGCAAGTTTCCCCACC
ACAAGATTCTCAATCATCTCTTGACCCAATATTTCTGTGAAGGATTATTGTCAATGGATATGGGTATGATTAATGCTACTAGCGGAGGAACCTTGTTAGATAAGACCCGT
TCAGAAGTTAGGCAGTTGATCTCAAGCATGCCCAAAAATTCTCAGCAGTTCTGCCCAAGAGAACCTATAGCACAGATTCAAGCAGCTCCAAAAGCGAGTCTCGTCAGGAG
ACATGCAACGCAATCCAAGACTTGGGGAACCAGGATTCCTCCTCCAATCGATGAACCCTCTGAGAGGGAGATAAAAGAGGTTGAATCGAGCCCTCCCAAGTCTCCTACTC
GGGATGTGATTGCTAAAGACGATTGCATGATCGAAGAAGATTCTGAGGACGAGTCAGATGAGAATGGTAGTGAGGTGTCATCCTCTATGTCGTTCCCTAACTCTCACGAC
ATAAATGCTGAAACTCCTTCTCAAGTTGATAAACTTTTGCCTTCTATTGTGCAGGCCCCAAAAGTAGAACTAAAGGCTTTGTCTAAACACCTTATGTATGTATACCTAGG
TGATTCTGAGACCCTACCTGTTATAATCTCCAAAGAGGTTGCGGTCGAGCAGGAGCTGAGATTGATTCGAGTATTGAAAGAAAACAAGCTAGCCATCGGCTGGACCTTAA
CTAACATAAAAGGAATCAGTCCCGCCCTCTGCATGTCCAGAATCCTGATGGAAGATGACGCCAAACCTTCACGAGAACCCCAAAGGAGGATGATGTGCAATTTGCTTACT
GCAAGTGTACGGGTCAAGTCATTTTGGTGTTCTCGGGACCTAGCGATAGTTGAAGCAGTAGTTAAGTGGCCTCATTCAACCATTGTCGCTGAGAAGCTCAATCAAAGATT
GGTTTCAGCACCAGTGCTCACTGTCCCAGATGGATCAGGTTCTTTTGTTGTTTACAATTATGCTTCCAAGAAAGGTTTAGGTTGTGTTCTTATTCAAAACGGGAAGGTTA
TTGCCTATGCTTCTCCCCAGTTGAAGAACTACGAATGCAACTACCCAACTCATGATTTGGAGTTAGCAGCAGTAGTTTCCGGTCTCAAGATTTGGAGATATTACATATGG
CTGGAGTTGGTAAAGGACTATGATGTAGAGATTCTTTACCACCCAAGTAAGGAAAATGTGGTTGCAGATGGGCTAAGTAGAAGAACAACCCACACTTCAGCCTTAATCAC
AGACCAGACTCACGTAAGGTTTGACATGGAGTATGCAGGAATAACAGTTCTAGTGGGAGAAGTTGCCTCACAGTTGGCCCAGTTAACAATTCATCCAACATTGAGACAAC
ATATTATAGATTCCCAGAAAGATGATCGCAACTTCAGTGACCTCTTCCACCAGGTCGAGTCAAGTCAAGGGGATGAGTTCTCCCTATCCTCAGATTATGGCCTTCTCTTT
TGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTTTCAACCCCAATGAAGTCGACCGATCAAGCACTAGGCAACTGGCTATCCCTAGCTCAGTCACTGGCGTTCACCAATACACAATGATCGTAGTTTCTGTTCCCAG
TTTTGCTGCAGCACAGAATTTGCTCCCTCCAGTCGCCAGCAACATATCTGCACAAAATGATTTCCAAGCTTCCATTCTTCCAAATGGGTTAAAACATTTCTTGGACGACG
AGGAGGTGAAAAGACCGGAGCAGATCGAAGCAGGCTTGAAGAGTAGTGCCAAAGAGTCAAAGGGCAACGATCTTAGGTTTGAACCGGAGATCTTAAAAGCAGATACGAGG
TTAAGAAGACGAGCAAGAAGAAGGCTTGTAGATGACCTCGGCGAGGTTGAGGCCCCAGCCATGGTTGAGAGGACTTTGAGGCAGAGCTGCGCCAGAACTGAACCAGCAGT
CACTTTGCATTACCTACCCCGAGACGAAAATGCAAGTGAGGATGATGCGCAATTTGCTTACCGCAAATGTATGGGTCAAATTCAATCGTCTCTGTGCAAGTTTCCCCACC
ACAAGATTCTCAATCATCTCTTGACCCAATATTTCTGTGAAGGATTATTGTCAATGGATATGGGTATGATTAATGCTACTAGCGGAGGAACCTTGTTAGATAAGACCCGT
TCAGAAGTTAGGCAGTTGATCTCAAGCATGCCCAAAAATTCTCAGCAGTTCTGCCCAAGAGAACCTATAGCACAGATTCAAGCAGCTCCAAAAGCGAGTCTCGTCAGGAG
ACATGCAACGCAATCCAAGACTTGGGGAACCAGGATTCCTCCTCCAATCGATGAACCCTCTGAGAGGGAGATAAAAGAGGTTGAATCGAGCCCTCCCAAGTCTCCTACTC
GGGATGTGATTGCTAAAGACGATTGCATGATCGAAGAAGATTCTGAGGACGAGTCAGATGAGAATGGTAGTGAGGTGTCATCCTCTATGTCGTTCCCTAACTCTCACGAC
ATAAATGCTGAAACTCCTTCTCAAGTTGATAAACTTTTGCCTTCTATTGTGCAGGCCCCAAAAGTAGAACTAAAGGCTTTGTCTAAACACCTTATGTATGTATACCTAGG
TGATTCTGAGACCCTACCTGTTATAATCTCCAAAGAGGTTGCGGTCGAGCAGGAGCTGAGATTGATTCGAGTATTGAAAGAAAACAAGCTAGCCATCGGCTGGACCTTAA
CTAACATAAAAGGAATCAGTCCCGCCCTCTGCATGTCCAGAATCCTGATGGAAGATGACGCCAAACCTTCACGAGAACCCCAAAGGAGGATGATGTGCAATTTGCTTACT
GCAAGTGTACGGGTCAAGTCATTTTGGTGTTCTCGGGACCTAGCGATAGTTGAAGCAGTAGTTAAGTGGCCTCATTCAACCATTGTCGCTGAGAAGCTCAATCAAAGATT
GGTTTCAGCACCAGTGCTCACTGTCCCAGATGGATCAGGTTCTTTTGTTGTTTACAATTATGCTTCCAAGAAAGGTTTAGGTTGTGTTCTTATTCAAAACGGGAAGGTTA
TTGCCTATGCTTCTCCCCAGTTGAAGAACTACGAATGCAACTACCCAACTCATGATTTGGAGTTAGCAGCAGTAGTTTCCGGTCTCAAGATTTGGAGATATTACATATGG
CTGGAGTTGGTAAAGGACTATGATGTAGAGATTCTTTACCACCCAAGTAAGGAAAATGTGGTTGCAGATGGGCTAAGTAGAAGAACAACCCACACTTCAGCCTTAATCAC
AGACCAGACTCACGTAAGGTTTGACATGGAGTATGCAGGAATAACAGTTCTAGTGGGAGAAGTTGCCTCACAGTTGGCCCAGTTAACAATTCATCCAACATTGAGACAAC
ATATTATAGATTCCCAGAAAGATGATCGCAACTTCAGTGACCTCTTCCACCAGGTCGAGTCAAGTCAAGGGGATGAGTTCTCCCTATCCTCAGATTATGGCCTTCTCTTT
TGA
Protein sequenceShow/hide protein sequence
MCFNPNEVDRSSTRQLAIPSSVTGVHQYTMIVVSVPSFAAAQNLLPPVASNISAQNDFQASILPNGLKHFLDDEEVKRPEQIEAGLKSSAKESKGNDLRFEPEILKADTR
LRRRARRRLVDDLGEVEAPAMVERTLRQSCARTEPAVTLHYLPRDENASEDDAQFAYRKCMGQIQSSLCKFPHHKILNHLLTQYFCEGLLSMDMGMINATSGGTLLDKTR
SEVRQLISSMPKNSQQFCPREPIAQIQAAPKASLVRRHATQSKTWGTRIPPPIDEPSEREIKEVESSPPKSPTRDVIAKDDCMIEEDSEDESDENGSEVSSSMSFPNSHD
INAETPSQVDKLLPSIVQAPKVELKALSKHLMYVYLGDSETLPVIISKEVAVEQELRLIRVLKENKLAIGWTLTNIKGISPALCMSRILMEDDAKPSREPQRRMMCNLLT
ASVRVKSFWCSRDLAIVEAVVKWPHSTIVAEKLNQRLVSAPVLTVPDGSGSFVVYNYASKKGLGCVLIQNGKVIAYASPQLKNYECNYPTHDLELAAVVSGLKIWRYYIW
LELVKDYDVEILYHPSKENVVADGLSRRTTHTSALITDQTHVRFDMEYAGITVLVGEVASQLAQLTIHPTLRQHIIDSQKDDRNFSDLFHQVESSQGDEFSLSSDYGLLF