; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022702 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022702
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:35933507..35942823
RNA-Seq ExpressionLag0022702
SyntenyLag0022702
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147763.1 uncharacterized protein LOC111016620 [Momordica charantia]2.2e-9773.49Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MAAN+STN     +M STI K+HAEKPEKFKGENFKRWQQKM+FY TTLNLAHI+KE CP T  E +TPETEAAKQAW+HSDFLC NYILS ++DTLYNV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        YCNA+ TSR LWEALDKK KLEDAGTKKFLV KFLDYKM+DTKLVVN +EELQIIISDLQSEG+ I++PFQV  VIEKL P+W++FKCYLKHK+KELS+E
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPR
        NL +KLRI+E+N KGDK     EA AHIAE+S+  PK+ Q K +N+ PR
Subjt:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPR

XP_022148559.1 uncharacterized protein LOC111017193 [Momordica charantia]2.3e-9472.83Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MAAN+S N    ++M STI+K+HAEK EKFKGENFKRWQQKMIFY TTLNLAHILKE CP TP E +T ETEA KQA +HS+FLC NYILS L+DTL+NV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        YCNA+ TSR LWEALDKK KLEDAGTKKFLVRKFLDYK+IDTKLV+NQ+EELQII SDLQSE + I++PFQ+ AVIEKLPP+W++FK YLKHKRKELSME
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDA
        NL +KLRIEEDNRK DK     EA AHI E+S+  PK+ Q K +N    PRNDA
Subjt:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDA

XP_022150041.1 uncharacterized protein LOC111018314 [Momordica charantia]9.0e-7562.2Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MAAN+S N    ++M  TI+K+H EKPEKFKGENFKRWQQKMIFYLTTLNLAH LK   P TP E +TPETEAAKQAW+HSDFLC NYILS L+DTLYNV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        YCNA+  SR LWEALDKK KLEDA                  +LV+N+                     FQVAAVIEKLPP+W++FKCYLKHKRK+L ME
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDA
        NL +KL IEEDNRKGDK    VEAN HIAE+S+  PK+ Q+K +NVN  PRNDA
Subjt:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDA

XP_022155403.1 uncharacterized protein LOC111022548 [Momordica charantia]2.1e-7157.78Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MA N+S N    +++ STI+K+H EK EKFKGENFKR Q+KMIFYLTTLNLAH+LKE CP T  EG+TPE EA KQAW+HSDFLCRNYILS L+DTLYNV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        Y N   TSR LWE LDKK KL+DA  KKF+V KF                           EG+ I++PFQVAAVIEKLP +W++FK YLKHKRKELSME
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDAKSAFVEFAGFVVRVAM
        NL +KL IE+DN+K DK     +A  HIAE+S+  PK+ Q K ++VN  PRNDA         FV+RV +
Subjt:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDAKSAFVEFAGFVVRVAM

XP_022156727.1 uncharacterized protein LOC111023572 [Momordica charantia]9.9e-8273.73Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MAAN+STN    T+M STI+K H EK EKF+G+NFK WQ KMIFYLTTLNLAHIL++ CP TP E + PETEAAKQAW+HSDFL  NYIL+ L+ TL NV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        YCNA+ TSR LW+ LDKK KLED GTKKFLV KFLDYKM++TKLVVNQ+EELQII SDLQSEG+ I++ FQVAAVIE LP  W++FKCYLKHKRK+LSME
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDK
        NL +KLRIEED RKGDK
Subjt:  NLAIKLRIEEDNRKGDK

TrEMBL top hitse value%identityAlignment
A0A6J1D271 uncharacterized protein LOC1110166201.1e-9773.49Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MAAN+STN     +M STI K+HAEKPEKFKGENFKRWQQKM+FY TTLNLAHI+KE CP T  E +TPETEAAKQAW+HSDFLC NYILS ++DTLYNV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        YCNA+ TSR LWEALDKK KLEDAGTKKFLV KFLDYKM+DTKLVVN +EELQIIISDLQSEG+ I++PFQV  VIEKL P+W++FKCYLKHK+KELS+E
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPR
        NL +KLRI+E+N KGDK     EA AHIAE+S+  PK+ Q K +N+ PR
Subjt:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPR

A0A6J1D4C8 uncharacterized protein LOC1110171931.1e-9472.83Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MAAN+S N    ++M STI+K+HAEK EKFKGENFKRWQQKMIFY TTLNLAHILKE CP TP E +T ETEA KQA +HS+FLC NYILS L+DTL+NV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        YCNA+ TSR LWEALDKK KLEDAGTKKFLVRKFLDYK+IDTKLV+NQ+EELQII SDLQSE + I++PFQ+ AVIEKLPP+W++FK YLKHKRKELSME
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDA
        NL +KLRIEEDNRK DK     EA AHI E+S+  PK+ Q K +N    PRNDA
Subjt:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDA

A0A6J1DA93 uncharacterized protein LOC1110183144.3e-7562.2Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MAAN+S N    ++M  TI+K+H EKPEKFKGENFKRWQQKMIFYLTTLNLAH LK   P TP E +TPETEAAKQAW+HSDFLC NYILS L+DTLYNV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        YCNA+  SR LWEALDKK KLEDA                  +LV+N+                     FQVAAVIEKLPP+W++FKCYLKHKRK+L ME
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDA
        NL +KL IEEDNRKGDK    VEAN HIAE+S+  PK+ Q+K +NVN  PRNDA
Subjt:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDA

A0A6J1DMV2 uncharacterized protein LOC1110225481.0e-7157.78Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MA N+S N    +++ STI+K+H EK EKFKGENFKR Q+KMIFYLTTLNLAH+LKE CP T  EG+TPE EA KQAW+HSDFLCRNYILS L+DTLYNV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        Y N   TSR LWE LDKK KL+DA  KKF+V KF                           EG+ I++PFQVAAVIEKLP +W++FK YLKHKRKELSME
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDAKSAFVEFAGFVVRVAM
        NL +KL IE+DN+K DK     +A  HIAE+S+  PK+ Q K ++VN  PRNDA         FV+RV +
Subjt:  NLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDAKSAFVEFAGFVVRVAM

A0A6J1DSQ3 uncharacterized protein LOC1110235724.8e-8273.73Show/hide
Query:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV
        MAAN+STN    T+M STI+K H EK EKF+G+NFK WQ KMIFYLTTLNLAHIL++ CP TP E + PETEAAKQAW+HSDFL  NYIL+ L+ TL NV
Subjt:  MAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCPVTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNV

Query:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME
        YCNA+ TSR LW+ LDKK KLED GTKKFLV KFLDYKM++TKLVVNQ+EELQII SDLQSEG+ I++ FQVAAVIE LP  W++FKCYLKHKRK+LSME
Subjt:  YCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSME

Query:  NLAIKLRIEEDNRKGDK
        NL +KLRIEED RKGDK
Subjt:  NLAIKLRIEEDNRKGDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein2.1e-2124.7Show/hide
Query:  REGQLRCNQFASVTQFLFEHIANNLKTITIELMAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDC--------P
        R   L  NQ+ SV +  F  I + +K    +L +  +        + + +       K  +F G+++  W  +M  +L  L L ++L E C        P
Subjt:  REGQLRCNQFASVTQFLFEHIANNLKTITIELMAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDC--------P

Query:  VTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNVYCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQ
         T P  +T   +A  + W+  D+LC  ++++ L D LY  Y   +  ++ LW+ L    + +++ +K+  VRK+++++M++ + ++ Q++    I   + 
Subjt:  VTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNVYCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQ

Query:  SEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENLAIKLRIEED
        S GM + + F V+ +I K PPSW+ F C    + + L +  L  +++ EE+
Subjt:  SEGMSISKPFQVAAVIEKLPPSWKDFKCYLKHKRKELSMENLAIKLRIEED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCAACCAGCTATCAATGGCCGACCGAGAGGGGAGCAATTTCTAAGAAGGCTGGAATATATGAATTAGATGAGTCGGGTTCATTGAAGGCACAGATGGCATCTCT
GACCAATGCCCTGAACAAGCTGACTTCATCTGAGGTGGTCAAGATGTGGAAGAGTATTCAAAGAAGGAAGCCACCGCCAGGTTTTGCATCATCTAGTGCGCCTGAAAAGA
AAAATAACCTGGAGGAGATGGTGACTATGTTTATAAAAGAACAAAAAGTGTTGAATGTGAATCTCCAGACGACAGTTAATAACCACGACACAACTCTGAAGAATATGAAA
GTTCAAATAGGTCAAATAGCTTCAGCGGTGAATGCCCTTCAGAAGGGAAAATTTCCAAGCGACACTAAACCTAACCCAAGAGAGCAGTGCAAGATGGTGACACTAAGAAG
TGGTAGGAAGCTGGAGATCAGTTCAGAAAAGAAGAAGGAAGAAGAAAAGAGCAAGGATGAAGATGAAAGGGCTGATGCACAAAAAGCCTCCTCTGAAAGGTCCCAACGTC
CTCCTAACTCTGTTAATTTGAATTGTGGTTTTCTAACTCTTTTGCAGGAAAAGAAAAAGATGAAAGGAATGAAGGACACCGACGTCACCCTCCAGCTTGCTGATAGATCA
ATTACCCACCCGATGGGTGTTGTGGAGGACGTTCTGGTGAAGGTCAACAAATTTATCTTTCCTGTAGACTTTGTGGTACTGGACATGCAGGAGGACAGAGAGGTGCCCAT
TATTCTAGGCAGACCTTTCCTAGCTACAGGTAAAGATGAGATTAGGGTGCATACAGGTAACCTTACCTTGAACATTGATGATGAAAAAGTCGTGTTTAGTATTTTTGGCC
AAGATGAGTCTGTATGTAGTTTGCATACTTGTTTTTCTATTGGTCCTGAATACTTGACTGACGATGATGAGGAGGATGAGCATGAATTGCATAAAAAGGATTACTTGATA
GATAATTTTGAGTCTGATCATGATAATGTTGAATCTATCAAGTCTGATCTTGATATTCCTGAATGCATGAATCCTGACAATGTTAGTACTTTTGATTCGTGTCATGATGA
TGTAGATAGCATAAAATCTGACCCGGAGGAACTCGAATTTGTGCATAATATTGAATCCGATGAATGGACTGACATGATTGATAGGCCATCTTTAGATCCCATGCCAGTAG
ATATCATAACGCTTGATGACTCTGTTAACAATTGTTTTGAGAATAGAGATTTAGAAAAGAGATTTGATACTTGTGCAATTCGGTGCTTGATTTTTTGCAAAGCCCGGAGG
ATGATGAAAACTTTTATTTTGAGCCTCCATAAGCGTGCCGAGCCTCGTCTAGCTAAAGACGTTAAAGAAAGCATTGCTTGGGAGGCAACCCAAGGATTAAATTGGCCCAA
GAAAGAAGCCCAATCCGCAGCAGGAAGTTATGGAAATCGCATAGCGTCGAGAGGCTGCAAGGACAGCGTCGCGACGCTGTCCATTTCTTGGCACCAAGCCGATGACGTCA
CACATCGCGACGCTGTGCCATATAGCGTCGCGACGCTACCCCCTTTCCGAGCCTATATAAGGCGCCCCTTGGTGCCTCATTTTACCATCAATCATCATTTCTTCCATTCT
TCCTTCTTTCCTTGGCTCCTGTGGAGCCTCTCTCAAGGCTTTCGAGCCTTTTTAAGAGTTTCTTTAGTGGGAAAATTTTGGGGAATTTTGGGGAGCATCCTAGGAGGTTC
AAGGAGTGTTTGGAGGGACTCCAAGCGTCAGCGAAGGGGTCTTCAACGATCAATTCCTTCCTTTTATTATGCTCTTAATTATGCTTTCCATTTGCTCAAGGAATCTTTCG
CTTGCTTAATTTTCGCTTCGTCTCCCTGTGGATTCGATACTCGGTATTACCTACCATTTTATATTACTTGCGGCCCGTGCACTTTGCGGTCCATCAATGTTAAAGGTCTC
AGTGACATGTTCTGCCACGTCTTTCCCCTAGAAATAAATTTACTGTTGGTGTCACGTGAAGGTCAGTTAAGGTGTAATCAGTTTGCTAGTGTAACTCAGTTCTTATTTGA
GCATATCGCCAACAATCTGAAGACAATTACTATTGAACTGATGGCTGCAAACTCCTCCACTAATGTTGCTGGTGCTACCATCATGAGATCAACCATCCTCAAAACTCATG
CTGAAAAACCAGAGAAATTTAAGGGAGAAAATTTCAAGAGGTGGCAACAAAAGATGATCTTCTACCTCACCACACTGAACCTTGCTCACATCTTGAAGGAAGATTGCCCA
GTTACCCCACCAGAAGGTGTTACTCCTGAAACTGAAGCTGCCAAGCAGGCATGGATGCATTCAGATTTTTTATGCCGCAATTATATATTGAGTGGTCTTGAAGACACCTT
GTATAATGTCTACTGCAATGCCTATACTACTTCAAGGCTATTGTGGGAGGCGTTAGATAAGAAGGATAAGCTGGAAGATGCTGGTACTAAGAAGTTTCTTGTCAGAAAAT
TCTTAGATTATAAGATGATTGATACCAAGTTGGTAGTCAATCAGATGGAAGAATTGCAAATTATCATTAGTGATTTGCAAAGTGAGGGAATGAGCATCAGCAAACCATTC
CAAGTTGCTGCTGTGATTGAGAAGCTACCTCCTTCTTGGAAGGATTTCAAATGCTATCTTAAGCACAAACGAAAGGAGCTTTCCATGGAGAATCTTGCAATCAAACTCCG
CATTGAAGAAGATAATAGAAAAGGAGACAAAGCTTCGTTGGGGGTTGAAGCTAATGCACATATTGCTGAATCTTCAAAACATGGTCCCAAGGAGCAACAACTCAAGAAGA
GGAATGTGAATCCTCGACCGAGGAATGACGCCAAAAGCGCATTCGTGGAGTTTGCTGGGTTTGTGGTAAGAGTGGCCATGTTTCTGCTGATTGTAGACACAAAAAGGGAC
AAAACTCCAACAATCAGACCAATATTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCAACCAGCTATCAATGGCCGACCGAGAGGGGAGCAATTTCTAAGAAGGCTGGAATATATGAATTAGATGAGTCGGGTTCATTGAAGGCACAGATGGCATCTCT
GACCAATGCCCTGAACAAGCTGACTTCATCTGAGGTGGTCAAGATGTGGAAGAGTATTCAAAGAAGGAAGCCACCGCCAGGTTTTGCATCATCTAGTGCGCCTGAAAAGA
AAAATAACCTGGAGGAGATGGTGACTATGTTTATAAAAGAACAAAAAGTGTTGAATGTGAATCTCCAGACGACAGTTAATAACCACGACACAACTCTGAAGAATATGAAA
GTTCAAATAGGTCAAATAGCTTCAGCGGTGAATGCCCTTCAGAAGGGAAAATTTCCAAGCGACACTAAACCTAACCCAAGAGAGCAGTGCAAGATGGTGACACTAAGAAG
TGGTAGGAAGCTGGAGATCAGTTCAGAAAAGAAGAAGGAAGAAGAAAAGAGCAAGGATGAAGATGAAAGGGCTGATGCACAAAAAGCCTCCTCTGAAAGGTCCCAACGTC
CTCCTAACTCTGTTAATTTGAATTGTGGTTTTCTAACTCTTTTGCAGGAAAAGAAAAAGATGAAAGGAATGAAGGACACCGACGTCACCCTCCAGCTTGCTGATAGATCA
ATTACCCACCCGATGGGTGTTGTGGAGGACGTTCTGGTGAAGGTCAACAAATTTATCTTTCCTGTAGACTTTGTGGTACTGGACATGCAGGAGGACAGAGAGGTGCCCAT
TATTCTAGGCAGACCTTTCCTAGCTACAGGTAAAGATGAGATTAGGGTGCATACAGGTAACCTTACCTTGAACATTGATGATGAAAAAGTCGTGTTTAGTATTTTTGGCC
AAGATGAGTCTGTATGTAGTTTGCATACTTGTTTTTCTATTGGTCCTGAATACTTGACTGACGATGATGAGGAGGATGAGCATGAATTGCATAAAAAGGATTACTTGATA
GATAATTTTGAGTCTGATCATGATAATGTTGAATCTATCAAGTCTGATCTTGATATTCCTGAATGCATGAATCCTGACAATGTTAGTACTTTTGATTCGTGTCATGATGA
TGTAGATAGCATAAAATCTGACCCGGAGGAACTCGAATTTGTGCATAATATTGAATCCGATGAATGGACTGACATGATTGATAGGCCATCTTTAGATCCCATGCCAGTAG
ATATCATAACGCTTGATGACTCTGTTAACAATTGTTTTGAGAATAGAGATTTAGAAAAGAGATTTGATACTTGTGCAATTCGGTGCTTGATTTTTTGCAAAGCCCGGAGG
ATGATGAAAACTTTTATTTTGAGCCTCCATAAGCGTGCCGAGCCTCGTCTAGCTAAAGACGTTAAAGAAAGCATTGCTTGGGAGGCAACCCAAGGATTAAATTGGCCCAA
GAAAGAAGCCCAATCCGCAGCAGGAAGTTATGGAAATCGCATAGCGTCGAGAGGCTGCAAGGACAGCGTCGCGACGCTGTCCATTTCTTGGCACCAAGCCGATGACGTCA
CACATCGCGACGCTGTGCCATATAGCGTCGCGACGCTACCCCCTTTCCGAGCCTATATAAGGCGCCCCTTGGTGCCTCATTTTACCATCAATCATCATTTCTTCCATTCT
TCCTTCTTTCCTTGGCTCCTGTGGAGCCTCTCTCAAGGCTTTCGAGCCTTTTTAAGAGTTTCTTTAGTGGGAAAATTTTGGGGAATTTTGGGGAGCATCCTAGGAGGTTC
AAGGAGTGTTTGGAGGGACTCCAAGCGTCAGCGAAGGGGTCTTCAACGATCAATTCCTTCCTTTTATTATGCTCTTAATTATGCTTTCCATTTGCTCAAGGAATCTTTCG
CTTGCTTAATTTTCGCTTCGTCTCCCTGTGGATTCGATACTCGGTATTACCTACCATTTTATATTACTTGCGGCCCGTGCACTTTGCGGTCCATCAATGTTAAAGGTCTC
AGTGACATGTTCTGCCACGTCTTTCCCCTAGAAATAAATTTACTGTTGGTGTCACGTGAAGGTCAGTTAAGGTGTAATCAGTTTGCTAGTGTAACTCAGTTCTTATTTGA
GCATATCGCCAACAATCTGAAGACAATTACTATTGAACTGATGGCTGCAAACTCCTCCACTAATGTTGCTGGTGCTACCATCATGAGATCAACCATCCTCAAAACTCATG
CTGAAAAACCAGAGAAATTTAAGGGAGAAAATTTCAAGAGGTGGCAACAAAAGATGATCTTCTACCTCACCACACTGAACCTTGCTCACATCTTGAAGGAAGATTGCCCA
GTTACCCCACCAGAAGGTGTTACTCCTGAAACTGAAGCTGCCAAGCAGGCATGGATGCATTCAGATTTTTTATGCCGCAATTATATATTGAGTGGTCTTGAAGACACCTT
GTATAATGTCTACTGCAATGCCTATACTACTTCAAGGCTATTGTGGGAGGCGTTAGATAAGAAGGATAAGCTGGAAGATGCTGGTACTAAGAAGTTTCTTGTCAGAAAAT
TCTTAGATTATAAGATGATTGATACCAAGTTGGTAGTCAATCAGATGGAAGAATTGCAAATTATCATTAGTGATTTGCAAAGTGAGGGAATGAGCATCAGCAAACCATTC
CAAGTTGCTGCTGTGATTGAGAAGCTACCTCCTTCTTGGAAGGATTTCAAATGCTATCTTAAGCACAAACGAAAGGAGCTTTCCATGGAGAATCTTGCAATCAAACTCCG
CATTGAAGAAGATAATAGAAAAGGAGACAAAGCTTCGTTGGGGGTTGAAGCTAATGCACATATTGCTGAATCTTCAAAACATGGTCCCAAGGAGCAACAACTCAAGAAGA
GGAATGTGAATCCTCGACCGAGGAATGACGCCAAAAGCGCATTCGTGGAGTTTGCTGGGTTTGTGGTAAGAGTGGCCATGTTTCTGCTGATTGTAGACACAAAAAGGGAC
AAAACTCCAACAATCAGACCAATATTGTAG
Protein sequenceShow/hide protein sequence
MVATSYQWPTERGAISKKAGIYELDESGSLKAQMASLTNALNKLTSSEVVKMWKSIQRRKPPPGFASSSAPEKKNNLEEMVTMFIKEQKVLNVNLQTTVNNHDTTLKNMK
VQIGQIASAVNALQKGKFPSDTKPNPREQCKMVTLRSGRKLEISSEKKKEEEKSKDEDERADAQKASSERSQRPPNSVNLNCGFLTLLQEKKKMKGMKDTDVTLQLADRS
ITHPMGVVEDVLVKVNKFIFPVDFVVLDMQEDREVPIILGRPFLATGKDEIRVHTGNLTLNIDDEKVVFSIFGQDESVCSLHTCFSIGPEYLTDDDEEDEHELHKKDYLI
DNFESDHDNVESIKSDLDIPECMNPDNVSTFDSCHDDVDSIKSDPEELEFVHNIESDEWTDMIDRPSLDPMPVDIITLDDSVNNCFENRDLEKRFDTCAIRCLIFCKARR
MMKTFILSLHKRAEPRLAKDVKESIAWEATQGLNWPKKEAQSAAGSYGNRIASRGCKDSVATLSISWHQADDVTHRDAVPYSVATLPPFRAYIRRPLVPHFTINHHFFHS
SFFPWLLWSLSQGFRAFLRVSLVGKFWGILGSILGGSRSVWRDSKRQRRGLQRSIPSFYYALNYAFHLLKESFACLIFASSPCGFDTRYYLPFYITCGPCTLRSINVKGL
SDMFCHVFPLEINLLLVSREGQLRCNQFASVTQFLFEHIANNLKTITIELMAANSSTNVAGATIMRSTILKTHAEKPEKFKGENFKRWQQKMIFYLTTLNLAHILKEDCP
VTPPEGVTPETEAAKQAWMHSDFLCRNYILSGLEDTLYNVYCNAYTTSRLLWEALDKKDKLEDAGTKKFLVRKFLDYKMIDTKLVVNQMEELQIIISDLQSEGMSISKPF
QVAAVIEKLPPSWKDFKCYLKHKRKELSMENLAIKLRIEEDNRKGDKASLGVEANAHIAESSKHGPKEQQLKKRNVNPRPRNDAKSAFVEFAGFVVRVAMFLLIVDTKRD
KTPTIRPIL