; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G12100 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G12100
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationClcChr01:20554450..20566589
RNA-Seq ExpressionClc01G12100
SyntenyClc01G12100
Gene Ontology termsGO:0009231 - riboflavin biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008686 - 3,4-dihydroxy-2-butanone-4-phosphate synthase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7051484.1 unnamed protein product [Microthlaspi erraticum]4.5e-6364.02Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ   PF KLL+SKG CAQYTMP +P QNGV E+RNRTL EMVRSM+++S +P+SLW+YALR  TY+LNR+PSKAVPKTPYELWT RKPSLR+L VWG  
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPII
         ++R+YNPHEKKLDS+T+S +FIGYP+ S GY FYC  HSTRIVE+ NARFIENG  SGS  +  V+I+E   + +       +VVPI+
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPII

KAG7551855.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]1.3e-6564.4Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ   PF KLL+S+G CAQYTMP +P QNGV E+RNRTLM+MVRSM+++S +P+SLW+YAL+  TY+LNR+PSKAVPKTP+ELWT RKPSLR+L VWG  
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVV
         +++ YNPHEKKLDS+T+SG+FIGYP+ S GY FYC NHSTRIVE+ NARFIENG  SGS  +  V+I+E  ++ +    PS+VVVPI+ V
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVV

KAG7564986.1 Integrase catalytic core [Arabidopsis suecica]1.3e-6564.4Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ   PF KLL+S+G CAQYTMP +P QNGV E+RNRTLM+MVRSM+++S +P+SLW+YAL+  TY+LNR+PSKAVPKTP+ELWT RKPSLR+L VWG  
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVV
         +++ YNPHEKKLDS+T+SG+FIGYP+ S GY FYC NHSTRIVE+ NARFIENG  SGS  +  V+I+E  ++ +    PS+VVVPI+ V
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVV

RYE20332.1 transposase, partial [Sphingobacteriaceae bacterium]1.4e-6465.79Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ   PF K L+S+G CAQYTMP +P QNGV E+RNRTLM+MVRSM+++S +P SLWM+AL+   Y+LNR+PSKAVPKTP+ELWT RKPSLR+LHV+G  
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIV
         ++RIYNPHE+KLDS+TISG+FIGYP+ S GYRFYC NHSTRIVE+ NARFIENG VSGS+   +VEI+E  +        SQVVVP+ V
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIV

RZC25410.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]1.2e-6368.82Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ  SPF KLL+ +G CAQYTMP +P QNGV+E+RN+TLM+MVRSM+ +S +P+SLWMYAL+   Y+LNR+PSKAVPKTP+ELWT+R PS+R+LHVWG Q
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKE
         ++RIYNP E+KLD++TISGYFIGYP+ S GY FYC NHSTRIVE+ NARFIENG +SGS    +VEIKE
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKE

TrEMBL top hitse value%identityAlignment
A0A445LQ30 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-6468.82Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ  SPF KLL+ +G CAQYTMP +P QNGV+E+RN+TLM+MVRSM+ +S +P+SLWMYAL+   Y+LNR+PSKAVPKTP+ELWT+R PS+R+LHVWG Q
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKE
         ++RIYNP E+KLD++TISGYFIGYP+ S GY FYC NHSTRIVE+ NARFIENG +SGS    +VEIKE
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKE

A0A4Q3ELL0 Transposase (Fragment)6.7e-6565.79Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ   PF K L+S+G CAQYTMP +P QNGV E+RNRTLM+MVRSM+++S +P SLWM+AL+   Y+LNR+PSKAVPKTP+ELWT RKPSLR+LHV+G  
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIV
         ++RIYNPHE+KLDS+TISG+FIGYP+ S GYRFYC NHSTRIVE+ NARFIENG VSGS+   +VEI+E  +        SQVVVP+ V
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIV

A0A6D2KEK6 Uncharacterized protein2.2e-6364.02Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ   PF KLL+SKG CAQYTMP +P QNGV E+RNRTL EMVRSM+++S +P+SLW+YALR  TY+LNR+PSKAVPKTPYELWT RKPSLR+L VWG  
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPII
         ++R+YNPHEKKLDS+T+S +FIGYP+ S GY FYC  HSTRIVE+ NARFIENG  SGS  +  V+I+E   + +       +VVPI+
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPII

A0A6N2K712 Uncharacterized protein2.8e-6355.41Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ   PF KLL+SKG CAQYTMP +P QNGV E+RNRTLMEMVRSM+++  +P+SLW+YAL+  TYILNR+PSKAVP TP+EL+  RKPSLR+L VWG  
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVVIPPPK----
         +++ YNPHEKKLDS+T+SGYFIGYP+ S G+ FYC +HSTRIVE+ NARFIENG  SGS  +  V IKE  ++ +    P+QVVVP++ V P  +    
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVVIPPPK----

Query:  -----YPLNLGESMRTADTDPLATSTANLIP
              PLN     +  + +P+A    +++P
Subjt:  -----YPLNLGESMRTADTDPLATSTANLIP

A0A6N2L229 Uncharacterized protein1.3e-6362.69Show/hide
Query:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ
        GQ   PF KLL+SKG CAQYTMP +P QNGV E+RNRTLMEMVRSM+++  +P+SLW+YAL+  TYILNR+PSKAVP TP+EL+  RKPSLR+LHVWG  
Subjt:  GQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQ

Query:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVVIP
         +++ YNPHEKKLDS+T++GYFIGYP+ S G+ FYC +HSTRIVE+ NARFIENG  SGS  +  V IKE  ++ +    P+QVV+P++ V P
Subjt:  PKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVVIP

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.6e-1732.79Show/hide
Query:  SGVGQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAV---PKTPYELWTSRKPSLRYL
        +G   +S+  ++    KG     T+P +P  NGV+E+  RT+ E  R+M++ + +  S W  A+   TY++NRIPS+A+    KTPYE+W ++KP L++L
Subjt:  SGVGQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAV---PKTPYELWTSRKPSLRYL

Query:  HVWGWQPKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGV-SGSVGAHDVEIKESLMDQN
         V+G    + I N  + K D K+    F+GY    NG++ + + +   IV +R+    E   V S +V    V +K+S   +N
Subjt:  HVWGWQPKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGV-SGSVGAHDVEIKESLMDQN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-1632.14Show/hide
Query:  GEGIPASASTSGVGQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVP-KTPYELWTS
        G  +    S +G    S  F++   S G   + T+P +P  NGV E+ NRT++E VRSM+  + +P S W  A++   Y++NR PS  +  + P  +WT+
Subjt:  GEGIPASASTSGVGQISSPFKKLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVP-KTPYELWTS

Query:  RKPSLRYLHVWGWQPKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIEN
        ++ S  +L V+G +    +      KLD K+I   FIGY     GYR +      +++ SR+  F E+
Subjt:  RKPSLRYLHVWGWQPKMRIYNPHEKKLDSKTISGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIEN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.0e-1130.88Show/hide
Query:  PRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVP-KTPYELWTSRKPSLRYLHVWG--WQPKMRIYNPHEKKLDSKTIS
        P +P  NG++E+++R ++E   ++++ + IP + W YA  +  Y++NR+P+  +  ++P++      P+   L V+G    P +R YN H  KLD K+  
Subjt:  PRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVP-KTPYELWTSRKPSLRYLHVWG--WQPKMRIYNPHEKKLDSKTIS

Query:  GYFIGYPKMSNGYRFYCSN-HSTRIVESRNARFIEN
          F+GY    + Y   C +  ++R+  SR+ RF EN
Subjt:  GYFIGYPKMSNGYRFYCSN-HSTRIVESRNARFIEN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.5e-1028.16Show/hide
Query:  PRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVP-KTPYELWTSRKPSLRYLHVWG--WQPKMRIYNPHEKKLDSKTIS
        P +P  NG++E+++R ++EM  ++++ + +P + W YA  +  Y++NR+P+  +  ++P++    + P+   L V+G    P +R YN H  KL+ K+  
Subjt:  PRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVP-KTPYELWTSRKPSLRYLHVWG--WQPKMRIYNPHEKKLDSKTIS

Query:  GYFIGYPKMSNGYRFYCSNHST-RIVESRNARFIE--------NGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVVIPPPKYPLNLGESMRTADT
          F+GY    + Y   C +  T R+  SR+ +F E        N GVS S        +E   D  P+  PS   +P   ++ P   P  LG  + T+  
Subjt:  GYFIGYPKMSNGYRFYCSNHST-RIVESRNARFIE--------NGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVVIPPPKYPLNLGESMRTADT

Query:  DPLATS
         P + S
Subjt:  DPLATS

Q9ZUX4 Uncharacterized protein At2g27730, mitochondrial1.6e-1859.6Show/hide
Query:  SARTVARIFSRRFSSSGKILSEEEKAAENVYIKKTEQEKLEKLARKGPKPEEKAGGSVTDSVPSGSASTSGASTEKISTDKHRNYAVVAGTVTILGALG
        + R   RI SRRF SSGK+LSEEE+AAENV+IKK EQEKL+KLAR+GP  E+ AG +    V   +AS S  S  K+S DK+RNYAVVAG V I+G++G
Subjt:  SARTVARIFSRRFSSSGKILSEEEKAAENVYIKKTEQEKLEKLARKGPKPEEKAGGSVTDSVPSGSASTSGASTEKISTDKHRNYAVVAGTVTILGALG

Arabidopsis top hitse value%identityAlignment
AT2G27730.1 copper ion binding1.1e-1959.6Show/hide
Query:  SARTVARIFSRRFSSSGKILSEEEKAAENVYIKKTEQEKLEKLARKGPKPEEKAGGSVTDSVPSGSASTSGASTEKISTDKHRNYAVVAGTVTILGALG
        + R   RI SRRF SSGK+LSEEE+AAENV+IKK EQEKL+KLAR+GP  E+ AG +    V   +AS S  S  K+S DK+RNYAVVAG V I+G++G
Subjt:  SARTVARIFSRRFSSSGKILSEEEKAAENVYIKKTEQEKLEKLARKGPKPEEKAGGSVTDSVPSGSASTSGASTEKISTDKHRNYAVVAGTVTILGALG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTTACGGCAACAAGCACGCATTCTCCATGTGCAGCCATTGTGACTGAAGATCTTCTCCCAAGCTCACAAGTTCTGAAGATTGAAGCCCTTTCCTCCACTGCAAC
TACCAAAAACTCTCTCCTTCTTTCGGAAAGGCGTTCAATCGATATGGCATCAGCTAGGACGGTCGCTAGAATCTTTTCCCGAAGGTTCTCGAGCAGCGGGAAGATTCTCA
GCGAGGAAGAGAAGGCTGCTGAGAATGTCTACATCAAGAAAACTGAACAAGAAAAACTGGAGAAGCTTGCACGCAAGGGACCTAAACCAGAAGAAAAGGCAGGAGGGTCA
GTAACTGATTCCGTTCCCAGTGGCTCGGCCTCAACATCAGGAGCATCGACAGAGAAAATATCTACTGACAAACACCGGAATTATGCTGTTGTAGCTGGAACTGTGACGAT
TCTCGGTGCTCTTGGATGTAGGGTTGGCAAAAAACTAGTGAGGCTGGGTCCCCACGGGGCCCCGACCCGATCTGGTCGGGAAATCCCTAGTTTGACTAGGGATGTAGGTC
AAACCGGGGAGATTCCCCAAGTACCTGTTCAGGGTCAGGGCAGGGATGGGGAGGGTATCCCCGCCTCGGCCTCGACATCGGGAGTTGGACAAATTTCAAGTCCATTTAAG
AAGCTCCTTAAGTCTAAGGGCACTTGTGCGCAGTATACAATGCCCAGATCGCCAAATCAAAATGGTGTAACGGAAAAGCGTAATCGTACATTGATGGAAATGGTAAGGAG
TATGATGAATGATAGTGTTATACCTATTTCATTGTGGATGTATGCATTGAGGATAGTCACATACATATTGAATAGGATACCTAGTAAAGCAGTTCCTAAGACACCTTATG
AACTGTGGACATCTAGGAAGCCTAGTTTAAGATATCTTCATGTGTGGGGCTGGCAACCTAAAATGAGGATATATAATCCACATGAAAAAAAGTTGGATTCCAAGACCATT
AGTGGCTATTTCATTGGATATCCGAAAATGTCAAATGGGTATAGATTCTATTGTTCTAATCATAGTACGAGAATTGTAGAGTCTAGAAATGCTCGCTTCATTGAAAATGG
CGGAGTTAGTGGGAGTGTGGGAGCACATGATGTAGAGATAAAAGAGTCATTGATGGACCAAAATCCATCAAGTGATCCATCTCAAGTTGTTGTTCCTATTATTGTTGTCA
TACCCCCTCCCAAGTACCCTCTTAACCTAGGAGAAAGCATGAGGACAGCGGACACCGACCCTCTTGCGACATCCACTGCCAACCTTATACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTTTACGGCAACAAGCACGCATTCTCCATGTGCAGCCATTGTGACTGAAGATCTTCTCCCAAGCTCACAAGTTCTGAAGATTGAAGCCCTTTCCTCCACTGCAAC
TACCAAAAACTCTCTCCTTCTTTCGGAAAGGCGTTCAATCGATATGGCATCAGCTAGGACGGTCGCTAGAATCTTTTCCCGAAGGTTCTCGAGCAGCGGGAAGATTCTCA
GCGAGGAAGAGAAGGCTGCTGAGAATGTCTACATCAAGAAAACTGAACAAGAAAAACTGGAGAAGCTTGCACGCAAGGGACCTAAACCAGAAGAAAAGGCAGGAGGGTCA
GTAACTGATTCCGTTCCCAGTGGCTCGGCCTCAACATCAGGAGCATCGACAGAGAAAATATCTACTGACAAACACCGGAATTATGCTGTTGTAGCTGGAACTGTGACGAT
TCTCGGTGCTCTTGGATGTAGGGTTGGCAAAAAACTAGTGAGGCTGGGTCCCCACGGGGCCCCGACCCGATCTGGTCGGGAAATCCCTAGTTTGACTAGGGATGTAGGTC
AAACCGGGGAGATTCCCCAAGTACCTGTTCAGGGTCAGGGCAGGGATGGGGAGGGTATCCCCGCCTCGGCCTCGACATCGGGAGTTGGACAAATTTCAAGTCCATTTAAG
AAGCTCCTTAAGTCTAAGGGCACTTGTGCGCAGTATACAATGCCCAGATCGCCAAATCAAAATGGTGTAACGGAAAAGCGTAATCGTACATTGATGGAAATGGTAAGGAG
TATGATGAATGATAGTGTTATACCTATTTCATTGTGGATGTATGCATTGAGGATAGTCACATACATATTGAATAGGATACCTAGTAAAGCAGTTCCTAAGACACCTTATG
AACTGTGGACATCTAGGAAGCCTAGTTTAAGATATCTTCATGTGTGGGGCTGGCAACCTAAAATGAGGATATATAATCCACATGAAAAAAAGTTGGATTCCAAGACCATT
AGTGGCTATTTCATTGGATATCCGAAAATGTCAAATGGGTATAGATTCTATTGTTCTAATCATAGTACGAGAATTGTAGAGTCTAGAAATGCTCGCTTCATTGAAAATGG
CGGAGTTAGTGGGAGTGTGGGAGCACATGATGTAGAGATAAAAGAGTCATTGATGGACCAAAATCCATCAAGTGATCCATCTCAAGTTGTTGTTCCTATTATTGTTGTCA
TACCCCCTCCCAAGTACCCTCTTAACCTAGGAGAAAGCATGAGGACAGCGGACACCGACCCTCTTGCGACATCCACTGCCAACCTTATACCTTGA
Protein sequenceShow/hide protein sequence
MFFTATSTHSPCAAIVTEDLLPSSQVLKIEALSSTATTKNSLLLSERRSIDMASARTVARIFSRRFSSSGKILSEEEKAAENVYIKKTEQEKLEKLARKGPKPEEKAGGS
VTDSVPSGSASTSGASTEKISTDKHRNYAVVAGTVTILGALGCRVGKKLVRLGPHGAPTRSGREIPSLTRDVGQTGEIPQVPVQGQGRDGEGIPASASTSGVGQISSPFK
KLLKSKGTCAQYTMPRSPNQNGVTEKRNRTLMEMVRSMMNDSVIPISLWMYALRIVTYILNRIPSKAVPKTPYELWTSRKPSLRYLHVWGWQPKMRIYNPHEKKLDSKTI
SGYFIGYPKMSNGYRFYCSNHSTRIVESRNARFIENGGVSGSVGAHDVEIKESLMDQNPSSDPSQVVVPIIVVIPPPKYPLNLGESMRTADTDPLATSTANLIP