; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031437 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031437
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr11:8465631..8472444
RNA-Seq ExpressionLag0031437
SyntenyLag0031437
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147763.1 uncharacterized protein LOC111016620 [Momordica charantia]2.0e-9872.59Show/hide
Query:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        MAAN+STN   + GSTI K HAEKPEKFKGENFKRWQQKM+FY TTLNLAHI+KE CP T  + +T ETEAAKQ W+HSDFLC NYILS ++DTLYNVY 
Subjt:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+ TSR LWEALDKKYKLED GTKKFLVGKFLDYKM+DTKLVVN +EELQIIISDLQSEGL I+EPFQV  VIEKL P+W++FKCYLKHK+KELS+ENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGV
         +KLRI+E+N KG K   + EA AH+AE+SRR PKK Q K  N+   PRND NK I  +
Subjt:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGV

XP_022148559.1 uncharacterized protein LOC111017193 [Momordica charantia]9.8e-10672.86Show/hide
Query:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        MAAN+S N   + GSTI+K HAEK EKFKGENFKRWQQKMIFY TTLNLAHILKE CP TP + +TLETEA KQ  +HS+FLC NYILS L+DTL+NVY 
Subjt:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+ TSR LWEALDKKYKLED GTKKFLV KFLDYK+IDTKLV+NQ+EELQII SDLQSE L I+EPFQ+ AVIEKLPP+W++FK YLKHKRKELSMENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSS
         +KLRIEEDNRK  K   + EA AH+ E+SRR PKK Q K  N    PRNDANKRIR + WVCGKS HIAA+ R+KK  S
Subjt:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSS

XP_022150041.1 uncharacterized protein LOC111018314 [Momordica charantia]3.3e-8562.5Show/hide
Query:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        MAAN+S N   + G TI+K H EKPEKFKGENFKRWQQKMIFYLTTLNLAH LK   P TP + +T ETEAAKQ W+HSDFLC NYILS L+DTLYNVY 
Subjt:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+  SR LWEALDKKYKLE                  D +LV+N                    E FQVAAVIEKLPP+W++FKCYLKHKRK+L MENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSS
         +KL IEEDNRKG K   +VEAN H+AE+SRR PKK Q K  NVN  PRNDANKRIR + WVCG S HIAA+CR+KK  S
Subjt:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSS

XP_022155402.1 uncharacterized protein LOC111022547 [Momordica charantia]1.4e-8360.65Show/hide
Query:  MAANSSTNVA---GSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        M  N+STN +   GSTI+K HAEK +KFKGENFKRWQQKMIFYLTTLNLA+ILKE CP T  K +T E EA KQ W+HSDFLCRNYIL+ ++DTLYNVY 
Subjt:  MAANSSTNVA---GSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+ TSR LWEALDKKY+LED                                                VA VIEKLPP+W++FK YLKHKRKELS+ENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKK
         +K+RIEEDNRKG K   +VEANAH+AE+SRR PKK QFK  NVN  PRN+ANKRIR + WVCGKSDHIAA+CR+KK
Subjt:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKK

XP_022156727.1 uncharacterized protein LOC111023572 [Momordica charantia]1.9e-8073.49Show/hide
Query:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        MAAN+STN   + GSTI+K H EK EKF+G+NFK WQ KMIFYLTTLNLAHIL++ CP TP + +  ETEAAKQ W+HSDFL  NYIL+ L+ TL NVY 
Subjt:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+ TSR LW+ LDKKYKLED GTKKFLVGKFLDYKM++TKLVVNQ+EELQII SDLQSEGL I+E FQVAAVIE LP  W++FKCYLKHKRK+LSMENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGK
         +KLRIEED RKG K
Subjt:  AIKLRIEEDNRKGGK

TrEMBL top hitse value%identityAlignment
A0A6J1D271 uncharacterized protein LOC1110166209.6e-9972.59Show/hide
Query:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        MAAN+STN   + GSTI K HAEKPEKFKGENFKRWQQKM+FY TTLNLAHI+KE CP T  + +T ETEAAKQ W+HSDFLC NYILS ++DTLYNVY 
Subjt:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+ TSR LWEALDKKYKLED GTKKFLVGKFLDYKM+DTKLVVN +EELQIIISDLQSEGL I+EPFQV  VIEKL P+W++FKCYLKHK+KELS+ENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGV
         +KLRI+E+N KG K   + EA AH+AE+SRR PKK Q K  N+   PRND NK I  +
Subjt:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGV

A0A6J1D4C8 uncharacterized protein LOC1110171934.7e-10672.86Show/hide
Query:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        MAAN+S N   + GSTI+K HAEK EKFKGENFKRWQQKMIFY TTLNLAHILKE CP TP + +TLETEA KQ  +HS+FLC NYILS L+DTL+NVY 
Subjt:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+ TSR LWEALDKKYKLED GTKKFLV KFLDYK+IDTKLV+NQ+EELQII SDLQSE L I+EPFQ+ AVIEKLPP+W++FK YLKHKRKELSMENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSS
         +KLRIEEDNRK  K   + EA AH+ E+SRR PKK Q K  N    PRNDANKRIR + WVCGKS HIAA+ R+KK  S
Subjt:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSS

A0A6J1DA93 uncharacterized protein LOC1110183141.6e-8562.5Show/hide
Query:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        MAAN+S N   + G TI+K H EKPEKFKGENFKRWQQKMIFYLTTLNLAH LK   P TP + +T ETEAAKQ W+HSDFLC NYILS L+DTLYNVY 
Subjt:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+  SR LWEALDKKYKLE                  D +LV+N                    E FQVAAVIEKLPP+W++FKCYLKHKRK+L MENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSS
         +KL IEEDNRKG K   +VEAN H+AE+SRR PKK Q K  NVN  PRNDANKRIR + WVCG S HIAA+CR+KK  S
Subjt:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSS

A0A6J1DP87 uncharacterized protein LOC1110225476.7e-8460.65Show/hide
Query:  MAANSSTNVA---GSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        M  N+STN +   GSTI+K HAEK +KFKGENFKRWQQKMIFYLTTLNLA+ILKE CP T  K +T E EA KQ W+HSDFLCRNYIL+ ++DTLYNVY 
Subjt:  MAANSSTNVA---GSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+ TSR LWEALDKKY+LED                                                VA VIEKLPP+W++FK YLKHKRKELS+ENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKK
         +K+RIEEDNRKG K   +VEANAH+AE+SRR PKK QFK  NVN  PRN+ANKRIR + WVCGKSDHIAA+CR+KK
Subjt:  AIKLRIEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKK

A0A6J1DSQ3 uncharacterized protein LOC1110235729.0e-8173.49Show/hide
Query:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY
        MAAN+STN   + GSTI+K H EK EKF+G+NFK WQ KMIFYLTTLNLAHIL++ CP TP + +  ETEAAKQ W+HSDFL  NYIL+ L+ TL NVY 
Subjt:  MAANSSTN---VAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYY

Query:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL
        NA+ TSR LW+ LDKKYKLED GTKKFLVGKFLDYKM++TKLVVNQ+EELQII SDLQSEGL I+E FQVAAVIE LP  W++FKCYLKHKRK+LSMENL
Subjt:  NAYTTSRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENL

Query:  AIKLRIEEDNRKGGK
         +KLRIEED RKG K
Subjt:  AIKLRIEEDNRKGGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein4.5e-2424.64Show/hide
Query:  GSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDC--------PVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYYNAYTT
        G +++KI      +F G+++  W  +M  +L  L L ++L E C        P T P+ +T   +A  + W+  D+LC  ++++ L D LY  Y   +  
Subjt:  GSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDC--------PVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYYNAYTT

Query:  SRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENLAIKLR
        ++ LW+ L   Y+ ++  +K+  V K+++++M++ + ++ Q++    I   + S G+ + E F V+ +I K PPSW+ F C    + + L +  L  +++
Subjt:  SRLLWEALDKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENLAIKLR

Query:  IEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVN---PQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSSRA
         EE+  + G   +     A  +    R P      +G+ +    +   + ++R+  V   CG+  H+A  C   K   RA
Subjt:  IEEDNRKGGKASLEVEANAHVAESSRRDPKKRQFKKGNVN---PQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSSRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCAAACTCCTCCACTAATGTTGCTGGATCGACCATCCTCAAAATTCATGCTGAAAAACCAGAGAAATTCAAGGGAGAAAATTTCAAGAGGTGGCAACAGAAGAT
GATCTTCTACCTCACCACACTGAACCTTGCTCACATCTTGAAGGAAGATTGTCCAGTTACCCCACCAAAAGGTGTTACTCTTGAAACTGAAGCTGCCAAGCAGACATGGA
TGCATTCAGATTTTTTATGCCGCAATTATATATTGAGTGGTCTTGAAGACACCTTGTATAATGTCTACTACAATGCCTATACTACTTCAAGGCTATTGTGGGAGGCATTA
GACAAGAAGTATAAGCTGGAAGATGTTGGTACTAAGAAGTTTCTTGTCGGAAAATTCTTAGATTATAAGATGATTGATACCAAGTTGGTAGTCAATCAGATGGAAGAATT
GCAAATTATCATTAGTGATTTGCAAAGTGAAGGATTGGGCATCAGTGAACCATTCCAAGTTGCTGCTGTGATTGAGAAGCTACCTCCTTCTTGGAAGGACTTCAAATGTT
ATCTTAAACACAAGCGAAAGGAGCTTTCCATGGAGAATCTTGCAATCAAACTCCGCATTGAAGAAGATAATAGAAAAGGAGGCAAAGCTTCGTTGGAAGTTGAAGCCAAT
GCTCATGTTGCTGAATCTTCAAGGCGTGATCCCAAGAAGCGACAATTCAAGAAGGGAAATGTGAATCCTCAACCGAGGAATGATGCCAATAAGCGCATTCGTGGAGTCTA
CTGGGTTTGTGGTAAGAGTGACCATATTGCTGCTGATTGTAGAAACAAGAAGGATAGTTCTAGAGCTTGGTCTCACTATTTCTTCGGTAAATTCGATAGATATTCACTAA
GGAAGGTTTTCGATAGAGCAGAAGGGTGTGTGGGTGTGTGGGTACCTCAAATCGCCAAAAAGTTACCTCAAATCACCGGAAAACGCTCGTGGGTTTCATGGGGGAGACCG
TGGGTAAGGTGGAATGACTTTATTGTTGGGGAAGCCATTGGAGAACAAACGGAAGAAATAGAGTTCGCAAGGCTTGCGACGAATTTCGGATTTTCGTCGCAAAGATCTTG
CGACGAATTTCGGATTTTCGTCGCAAAGACCTTGCGACGAATCGTTCCTTCTAGTCGCAGTTATTTTGAGATGAAGTATGCGACGTTTTTTCTCGTTTCGTCGCAACTTG
CAACGAATTTACCTTTTCGTCGCAGGTTGCGACGCATTTTAACAATTCGACGCAAAATCCTGCTTGCGACGGTGTTAAAGGTTATCGATTGTTTGATATCACAAAACGAC
AGGTCCTTATATCTCGAGTCGTGGTTTTCTTTGAAAAATTCTTTTCCATTTCATACTAGTGGTATTTCTGATGACACTATTAATGCCTTATTTGTTGATCATGCTTTGCC
AGGTCCTATTTTGGACCCTATTGTGCTTGAAAGTGTGGTCACTGATTCAAATCAAGAGTCTTCTGATTCTATTCTTGACTCTGGTATTTCTGATGACACTATTGATTCCT
TATTTGTTGATCATGCTTTGCAATGTCCTATTTTGGACCCTATTGTGCTTGAAAGTGTGGTCACTGATCCAAATCAAGAGTCTTCTGATTCTATTCTTGATTTGTCTGTT
GCTAAAGATCATGGAGCACAAAAAACTCAAATTGATGATTCAGAGAATGAGCCACAACCTTCTTGTAAGGATTTGGAATTCCCCAATTCCGTTAAAGCGGAAGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCAAACTCCTCCACTAATGTTGCTGGATCGACCATCCTCAAAATTCATGCTGAAAAACCAGAGAAATTCAAGGGAGAAAATTTCAAGAGGTGGCAACAGAAGAT
GATCTTCTACCTCACCACACTGAACCTTGCTCACATCTTGAAGGAAGATTGTCCAGTTACCCCACCAAAAGGTGTTACTCTTGAAACTGAAGCTGCCAAGCAGACATGGA
TGCATTCAGATTTTTTATGCCGCAATTATATATTGAGTGGTCTTGAAGACACCTTGTATAATGTCTACTACAATGCCTATACTACTTCAAGGCTATTGTGGGAGGCATTA
GACAAGAAGTATAAGCTGGAAGATGTTGGTACTAAGAAGTTTCTTGTCGGAAAATTCTTAGATTATAAGATGATTGATACCAAGTTGGTAGTCAATCAGATGGAAGAATT
GCAAATTATCATTAGTGATTTGCAAAGTGAAGGATTGGGCATCAGTGAACCATTCCAAGTTGCTGCTGTGATTGAGAAGCTACCTCCTTCTTGGAAGGACTTCAAATGTT
ATCTTAAACACAAGCGAAAGGAGCTTTCCATGGAGAATCTTGCAATCAAACTCCGCATTGAAGAAGATAATAGAAAAGGAGGCAAAGCTTCGTTGGAAGTTGAAGCCAAT
GCTCATGTTGCTGAATCTTCAAGGCGTGATCCCAAGAAGCGACAATTCAAGAAGGGAAATGTGAATCCTCAACCGAGGAATGATGCCAATAAGCGCATTCGTGGAGTCTA
CTGGGTTTGTGGTAAGAGTGACCATATTGCTGCTGATTGTAGAAACAAGAAGGATAGTTCTAGAGCTTGGTCTCACTATTTCTTCGGTAAATTCGATAGATATTCACTAA
GGAAGGTTTTCGATAGAGCAGAAGGGTGTGTGGGTGTGTGGGTACCTCAAATCGCCAAAAAGTTACCTCAAATCACCGGAAAACGCTCGTGGGTTTCATGGGGGAGACCG
TGGGTAAGGTGGAATGACTTTATTGTTGGGGAAGCCATTGGAGAACAAACGGAAGAAATAGAGTTCGCAAGGCTTGCGACGAATTTCGGATTTTCGTCGCAAAGATCTTG
CGACGAATTTCGGATTTTCGTCGCAAAGACCTTGCGACGAATCGTTCCTTCTAGTCGCAGTTATTTTGAGATGAAGTATGCGACGTTTTTTCTCGTTTCGTCGCAACTTG
CAACGAATTTACCTTTTCGTCGCAGGTTGCGACGCATTTTAACAATTCGACGCAAAATCCTGCTTGCGACGGTGTTAAAGGTTATCGATTGTTTGATATCACAAAACGAC
AGGTCCTTATATCTCGAGTCGTGGTTTTCTTTGAAAAATTCTTTTCCATTTCATACTAGTGGTATTTCTGATGACACTATTAATGCCTTATTTGTTGATCATGCTTTGCC
AGGTCCTATTTTGGACCCTATTGTGCTTGAAAGTGTGGTCACTGATTCAAATCAAGAGTCTTCTGATTCTATTCTTGACTCTGGTATTTCTGATGACACTATTGATTCCT
TATTTGTTGATCATGCTTTGCAATGTCCTATTTTGGACCCTATTGTGCTTGAAAGTGTGGTCACTGATCCAAATCAAGAGTCTTCTGATTCTATTCTTGATTTGTCTGTT
GCTAAAGATCATGGAGCACAAAAAACTCAAATTGATGATTCAGAGAATGAGCCACAACCTTCTTGTAAGGATTTGGAATTCCCCAATTCCGTTAAAGCGGAAGCGTAA
Protein sequenceShow/hide protein sequence
MAANSSTNVAGSTILKIHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPKGVTLETEAAKQTWMHSDFLCRNYILSGLEDTLYNVYYNAYTTSRLLWEAL
DKKYKLEDVGTKKFLVGKFLDYKMIDTKLVVNQMEELQIIISDLQSEGLGISEPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENLAIKLRIEEDNRKGGKASLEVEAN
AHVAESSRRDPKKRQFKKGNVNPQPRNDANKRIRGVYWVCGKSDHIAADCRNKKDSSRAWSHYFFGKFDRYSLRKVFDRAEGCVGVWVPQIAKKLPQITGKRSWVSWGRP
WVRWNDFIVGEAIGEQTEEIEFARLATNFGFSSQRSCDEFRIFVAKTLRRIVPSSRSYFEMKYATFFLVSSQLATNLPFRRRLRRILTIRRKILLATVLKVIDCLISQND
RSLYLESWFSLKNSFPFHTSGISDDTINALFVDHALPGPILDPIVLESVVTDSNQESSDSILDSGISDDTIDSLFVDHALQCPILDPIVLESVVTDPNQESSDSILDLSV
AKDHGAQKTQIDDSENEPQPSCKDLEFPNSVKAEA