; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G021290 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G021290
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr05:28116852..28126439
RNA-Seq ExpressionLsi05G021290
SyntenyLsi05G021290
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR003173 - Transcriptional coactivator p15 (PC4), C-terminal
IPR009044 - ssDNA-binding transcriptional regulator
IPR014876 - DEK, C-terminal
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK20834.1 Zinc knuckle family protein, putative isoform 2 [Cucumis melo var. makuwa]2.2e-20481.56Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+ KKE NDDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ KDGKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSEHDADKIGA+S  T +T PKFPIE TIR
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR

Query:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKIAYVLS+QCPTAVLG ESSS NAAQSKVA QKWMSDDHMC  NILNSLSDRLFNEY  K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF
        GTKRSQV KYLEFKMVEEKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LMHE YLPL KL DRLRIEEQLRTQKNS LS VS 
Subjt:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF

Query:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN
         PN  GQH+AANHPSKMGDP   ++PL K+E QK+VKTLLCL+CGKEGHTSPNCP +KVDN
Subjt:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN

XP_004134299.1 uncharacterized protein LOC101205072 [Cucumis sativus]2.7e-21082.69Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        M+ ET+RRIEE VI+VLKKS+MEDTTE+KVR QVEERLGIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        ENKAVEQ+IV KKE NDDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVS--TTVITPPKFPIETTI
        L+ICRLSNNRSVTIHKF+GA MVS+RQ++EKDGKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSEHDA+KIGA S  TT +T PK+PIE TI
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVS--TTVITPPKFPIETTI

Query:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE
        RFDGKNY AWA QME LLQ LKIAYVLS+QCPTAVLG ESSS NAAQSK A QKWM DDHMCR NILNSLSDRLFNEY  KTMSASELWKELKLLY LEE
Subjt:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE

Query:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS
        FGTKRSQV KYLEFKMVEEKSILEQVEELN+IADSI S+GT IDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPL KL DRLRIEEQLRTQKNS LSGVS
Subjt:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS

Query:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDNEVTRER
         +P P GQH+AANHPSKMGDPK  ++PL K+E QK+VKTLLCL+CGKEGHTSPNCP +KV+NEV R+R
Subjt:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDNEVTRER

XP_008437880.1 PREDICTED: uncharacterized protein LOC103483179 [Cucumis melo]1.9e-20381.34Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+ KKE NDDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ KDGKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSEHDADKIGA+S  T +T PKFPIE TIR
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR

Query:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKIAYVLS+QCPTAVLG ESSS NAAQSKVA QKWMSDDHMC  NILNSLSDRLFNEY  K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF
        GTKRSQV KYLEFKMVEEKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LM E YLPL KL DRLRIEEQLRTQKNS LS VS 
Subjt:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF

Query:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN
         PN  GQH+AANHPSKMGDP   ++PL K+E QK+VKTLLCL+CGKEGHTSPNCP +KVDN
Subjt:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN

XP_022945450.1 uncharacterized protein LOC111449676 [Cucurbita moschata]1.8e-16168.41Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        MD ET+RRI+ETVID+LK SNME+ TEYK+R + E+RLG+DLS+ Q K LVR+VVE FL S +ER   GKE         ENKA EQEIV+KKEIN D D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK-KRSEHDADKIGAVSTTVI-TPPKFPIETTI
         VIC+LSNNR+VT+H+F+G  +VSIRQ++EKDGKQLP +KGIS++TEQWSAF+SNIPAIEEAILQMKRK KRSEHDA+  GAVS     + PKFP E TI
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK-KRSEHDADKIGAVSTTVI-TPPKFPIETTI

Query:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE
        RFDGKNYR WARQME LL+ LKIAYVLSD  PT++LGPESSS N ++SK + Q+WMSDDHMCRH ILNSLSD LF++Y  +TMSA ELWKEL  LY L +
Subjt:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE

Query:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS
        +GT+RSQV KYLEF+MVEEKSILEQVEELNNIA+SI+SAG RIDEDFHVSAIISKLP SW NV+V LM E++LP   LIDRLR EE+LRTQ+NSH S   
Subjt:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS

Query:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRK
             GG+    NH  KMGD    SLP  KREW+ DVKTLLCLNCGKEGH S +CP  K
Subjt:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRK

XP_038878142.1 uncharacterized protein LOC120070296 [Benincasa hispida]1.9e-21185.4Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        MD+ETQ +IEETVIDVLKKSNME+TTE+KVRGQVEERLGIDLSNR+YKLLVRNVVESFLLSMSE+VCMGKED        ENK ++QEI   KE NDDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVS--TTVITPPKFPIETTI
        LVICRLSNNRSVTIHKFRGA MVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIE+AILQMK KKRSEHDADKIGAVS  T  +TPP FP E TI
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVS--TTVITPPKFPIETTI

Query:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE
        RFDGKNY  WA QMESLLQHLKIAYVLS+QCPT VLGPESSS N AQ+K A QKWMSDD MC  NILNSLSDRLFNEY TKTMSASELW ELKLLYFLEE
Subjt:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE

Query:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS
        FGTKRSQV KYLEF+MVEEKSILEQVE+LNNIADSIVSAGT IDEDFHVSAIISKLPLSW +VWVNLMHEQYL L KLIDRLRIEEQLRTQKNSHLSGVS
Subjt:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS

Query:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRK
         AP+  GQH+ ANH SKM DPKLSSLPL KREWQ DVKTLLCLNCGKEGHTSPNCP RK
Subjt:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRK

TrEMBL top hitse value%identityAlignment
A0A0A0L3U5 CCHC-type domain-containing protein1.3e-21082.69Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        M+ ET+RRIEE VI+VLKKS+MEDTTE+KVR QVEERLGIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        ENKAVEQ+IV KKE NDDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVS--TTVITPPKFPIETTI
        L+ICRLSNNRSVTIHKF+GA MVS+RQ++EKDGKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSEHDA+KIGA S  TT +T PK+PIE TI
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVS--TTVITPPKFPIETTI

Query:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE
        RFDGKNY AWA QME LLQ LKIAYVLS+QCPTAVLG ESSS NAAQSK A QKWM DDHMCR NILNSLSDRLFNEY  KTMSASELWKELKLLY LEE
Subjt:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE

Query:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS
        FGTKRSQV KYLEFKMVEEKSILEQVEELN+IADSI S+GT IDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPL KL DRLRIEEQLRTQKNS LSGVS
Subjt:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS

Query:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDNEVTRER
         +P P GQH+AANHPSKMGDPK  ++PL K+E QK+VKTLLCL+CGKEGHTSPNCP +KV+NEV R+R
Subjt:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDNEVTRER

A0A1S3AV18 uncharacterized protein LOC1034831799.1e-20481.34Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+ KKE NDDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ KDGKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSEHDADKIGA+S  T +T PKFPIE TIR
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR

Query:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKIAYVLS+QCPTAVLG ESSS NAAQSKVA QKWMSDDHMC  NILNSLSDRLFNEY  K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF
        GTKRSQV KYLEFKMVEEKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LM E YLPL KL DRLRIEEQLRTQKNS LS VS 
Subjt:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF

Query:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN
         PN  GQH+AANHPSKMGDP   ++PL K+E QK+VKTLLCL+CGKEGHTSPNCP +KVDN
Subjt:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN

A0A5A7TZ44 Zinc knuckle family protein, putative isoform 29.1e-20481.34Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+ KKE NDDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ KDGKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSEHDADKIGA+S  T +T PKFPIE TIR
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR

Query:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKIAYVLS+QCPTAVLG ESSS NAAQSKVA QKWMSDDHMC  NILNSLSDRLFNEY  K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF
        GTKRSQV KYLEFKMVEEKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LM E YLPL KL DRLRIEEQLRTQKNS LS VS 
Subjt:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF

Query:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN
         PN  GQH+AANHPSKMGDP   ++PL K+E QK+VKTLLCL+CGKEGHTSPNCP +KVDN
Subjt:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN

A0A5D3DBA1 Zinc knuckle family protein, putative isoform 21.1e-20481.56Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        M+ ET+R+IEE VI+VLK+SN+EDTTE+KVR QVEER+GIDLSN+Q KLLVRNVVESFLLSMSERVCMGKED        EN+AVEQ+I+ KKE NDDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR
        L+ICRLSNNRSVTIHKF+G  MVSIRQ++ KDGKQLPTLKGISM TEQWS FKSNIPAI EAILQMKR KRSEHDADKIGA+S  T +T PKFPIE TIR
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVST-TVITPPKFPIETTIR

Query:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF
        FDGKNY AWA QME LLQ LKIAYVLS+QCPTAVLG ESSS NAAQSKVA QKWMSDDHMC  NILNSLSDRLFNEY  K MSASELWKELKLLYFLEEF
Subjt:  FDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF

Query:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF
        GTKRSQV KYLEFKMVEEKSILEQVEELN+IADSI SAGT IDEDFHVSAIISKLPLSWKNVW++LMHE YLPL KL DRLRIEEQLRTQKNS LS VS 
Subjt:  GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSF

Query:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN
         PN  GQH+AANHPSKMGDP   ++PL K+E QK+VKTLLCL+CGKEGHTSPNCP +KVDN
Subjt:  APNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDN

A0A6J1G0Z2 uncharacterized protein LOC1114496768.6e-16268.41Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD
        MD ET+RRI+ETVID+LK SNME+ TEYK+R + E+RLG+DLS+ Q K LVR+VVE FL S +ER   GKE         ENKA EQEIV+KKEIN D D
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKED--------ENKAVEQEIVQKKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK-KRSEHDADKIGAVSTTVI-TPPKFPIETTI
         VIC+LSNNR+VT+H+F+G  +VSIRQ++EKDGKQLP +KGIS++TEQWSAF+SNIPAIEEAILQMKRK KRSEHDA+  GAVS     + PKFP E TI
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK-KRSEHDADKIGAVSTTVI-TPPKFPIETTI

Query:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE
        RFDGKNYR WARQME LL+ LKIAYVLSD  PT++LGPESSS N ++SK + Q+WMSDDHMCRH ILNSLSD LF++Y  +TMSA ELWKEL  LY L +
Subjt:  RFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEE

Query:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS
        +GT+RSQV KYLEF+MVEEKSILEQVEELNNIA+SI+SAG RIDEDFHVSAIISKLP SW NV+V LM E++LP   LIDRLR EE+LRTQ+NSH S   
Subjt:  FGTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVS

Query:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRK
             GG+    NH  KMGD    SLP  KREW+ DVKTLLCLNCGKEGH S +CP  K
Subjt:  FAPNPGGQHNAANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRK

SwissProt top hitse value%identityAlignment
O65154 RNA polymerase II transcriptional coactivator KIWI3.9e-1036.05Show/hide
Query:  EDENKAVEQEIVQKKEINDDG-DLVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAI
        E E  A  +++ +  + +D   D+V+C +S NR V++  + G   + IR+F+ KDGK LP  KGIS+S +QW+  +++   IE+A+
Subjt:  EDENKAVEQEIVQKKEINDDG-DLVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAI

O65155 RNA polymerase II transcriptional coactivator KELP2.4e-3649.11Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVQ--------KKEINDDGD
        M++ET+ +IE+TVI++L +S+M++ TE+KVR    E+L IDLS + +K  VR+VVE FL    ER    +E EN  V +E            KE +DDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVQ--------KKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK
        L+ICRLS+ R VTI +F+G ++VSIR++++KDGK+LPT KGIS++ EQWS FK N+PAIE A+ +M+ +
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK

P87294 Putative RNA polymerase II transcriptional coactivator8.4e-0533.33Show/hide
Query:  SNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAI
        +  + +T+ +FRG   V IR+++EKDG  LP  KGI+++  +W   K  I  +++++
Subjt:  SNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAI

Q63396 Activated RNA polymerase II transcriptional coactivator p154.2e-0427.85Show/hide
Query:  IVQKKEINDDGDLVICRLSNNRSVTIHKFRGATMVSIRQFF-EKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQM
        +   K+ +   D  + ++   R V++  F+G  ++ IR+++ + +G+  P  KGIS++ EQWS  K  I  I++A+ ++
Subjt:  IVQKKEINDDGDLVICRLSNNRSVTIHKFRGATMVSIRQFF-EKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQM

Q94045 Putative RNA polymerase II transcriptional coactivator2.2e-0535.48Show/hide
Query:  SERVCMGKEDENKAVEQEIVQKKEINDDGDLVICRLSNNRSVTIHKFRGATMVSIRQFF-EKDG-KQLPTLKGISMSTEQWSAFKSNIPAIEE
        SE V   K++  KA  +E V  +  + DG+ +   + N R  T+ KF+G   V+IR+++ ++D  K +P+ KGIS+S  QW+  K  IP I++
Subjt:  SERVCMGKEDENKAVEQEIVQKKEINDDGDLVICRLSNNRSVTIHKFRGATMVSIRQFF-EKDG-KQLPTLKGISMSTEQWSAFKSNIPAIEE

Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein1.8e-7938.74Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVQKKEINDDGDLVICRLSN
        M+    ++IEETV  +L +S+M+  TE+K+R     +LGIDLS   +K LVR+V+E FLLS      + +       E   V    +  + +  IC+LS 
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVQKKEINDDGDLVICRLSN

Query:  NRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVSTTVITPPK--FPIETTIRFDGKNYR
         ++ T+ ++RG   +SI    ++ GK     +G  +ST QWS  K N  AIE+ I Q + K +SE  A + G  S  V       F +    RFDGK+Y 
Subjt:  NRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVSTTVITPPK--FPIETTIRFDGKNYR

Query:  AWARQMESLLQHLKIAYVLSDQCPT--AVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEFGTKRS
         WA QME  L+ LK+ YVLS+ CP+  +  GPE++     ++   G+KW+ DD++C  +++NSLSD L+  Y  K   A ELW ELK +Y  +E  +KRS
Subjt:  AWARQMESLLQHLKIAYVLSDQCPT--AVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEFGTKRS

Query:  QVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSFAPNPG
        QV KY+EF+MVEE+ ILEQV+  N IADSIVSAG  +DE FHVS IISK P SW+     LM E+YLP+  L++R++ EE+L     +   GV++ P  G
Subjt:  QVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSFAPNPG

Query:  GQHNAANHPSKMGDPKLSSLPLG--KREWQKDVKTLL-CLNCGKEGHTSPNCPIRKVDNEVT
           +       +G     S  +G  ++E ++D + ++ C NCG++GH + +C   K D   +
Subjt:  GQHNAANHPSKMGDPKLSSLPLG--KREWQKDVKTLL-CLNCGKEGHTSPNCPIRKVDNEVT

AT4G10920.1 transcriptional coactivator p15 (PC4) family protein (KELP)1.7e-3749.11Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVQ--------KKEINDDGD
        M++ET+ +IE+TVI++L +S+M++ TE+KVR    E+L IDLS + +K  VR+VVE FL    ER    +E EN  V +E            KE +DDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVQ--------KKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK
        L+ICRLS+ R VTI +F+G ++VSIR++++KDGK+LPT KGIS++ EQWS FK N+PAIE A+ +M+ +
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK

AT4G10920.2 transcriptional coactivator p15 (PC4) family protein (KELP)1.7e-3749.11Show/hide
Query:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVQ--------KKEINDDGD
        M++ET+ +IE+TVI++L +S+M++ TE+KVR    E+L IDLS + +K  VR+VVE FL    ER    +E EN  V +E            KE +DDGD
Subjt:  MDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKEDENKAVEQEIVQ--------KKEINDDGD

Query:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK
        L+ICRLS+ R VTI +F+G ++VSIR++++KDGK+LPT KGIS++ EQWS FK N+PAIE A+ +M+ +
Subjt:  LVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRK

AT5G09240.1 ssDNA-binding transcriptional regulator2.4e-0734.44Show/hide
Query:  EDENKAVEQEIVQKKEINDDGDLVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLP--TLKGISMSTEQWSAFKSNIPAIEEAILQM
        E E  A  +++   K  ++  D+ IC L  NR V +    G   ++IRQFF KDG  LP  + +GIS+S EQW+  +++   I++A+ ++
Subjt:  EDENKAVEQEIVQKKEINDDGDLVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLP--TLKGISMSTEQWSAFKSNIPAIEEAILQM

AT5G09250.1 ssDNA-binding transcriptional regulator2.8e-1136.05Show/hide
Query:  EDENKAVEQEIVQKKEINDDG-DLVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAI
        E E  A  +++ +  + +D   D+V+C +S NR V++  + G   + IR+F+ KDGK LP  KGIS+S +QW+  +++   IE+A+
Subjt:  EDENKAVEQEIVQKKEINDDG-DLVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTGAAGCAAGGGTTTTGTTTTACCGTGAACGAGGGTGGGGTTTTCGAAAATCTCCCTCATCGGGTTAAAGTTCGAGGCCAGGTCGAAGAACGACTCGGAGTATACAATAT
AAGAACGTGCCCAAGAGCATTTTACTTTTCAATGTCTGAGCGGGTGTGTCTAGGGAAAGAGGATGAGACAGGACGTAGTGTTCGTTGCGATAATAATGCGGTGGAGCAGG
AGATAGTCCCGAAGAAGGAGATTAACGATGATGGCGACCTTGTGAATTTGCCGCTATCTGATAACAAGTGTATGCATAACAAGTGCGTGCAAGTGCATAAATATAGAAAC
GCAACTATGGTATCAATTAGTGAGTTTTATGAAAAAGATGGAAGGCAGGTTCCTACTATTACAGAAGTCAACATGTCACCTAAATCATGGGCAACCTTTAAGAGTAATAT
TCCTGCTATAGAGGAAGCTATTTTGCAGCTGAAAGGAAAAACAAGATCTGAACATGATGCTGAGAAAATTGGTGGTGTTTCGGAACCGGCGACTAGGGTAACTCCTCCAA
AATTTCCAATTGGACCTGTCCGATTTGACGGAAAAAACTACATTGTATGGGGACGTCAGATGGAGTTTTTGCTGCAGCAGCTCAAGATTGCTTATGTACTTTTAGATCAA
TGTCCTACTGACGTGCTTGGGCTAGAATCAAGCTCCGGAAATGATGCTCAATCCCAGACTGCTGAACAGAAATGGATGAATGATGATTACATGTGTCGCCGCCTCATTCT
GGGCGCCCTCTCCGATAGGTTTTTTAATGAATACACAAAGAAAACTATGAGTGCCACGGAACTTTGGACGGAACTAAAATTGCTTTATTTTAGGGAGGAGTTTGGCGACA
AGAGGTCTCAAGTAAAAAAGTATCTTGAATTCAACATAGATGAGGAGAAGACAACAATTTCTTCACCCGCCGTGCAAAGCCGAAGCCAATGGAATTCCCTCTACAAATTA
CAACACTTCCAATCGGAGACGGCCATTCACAAACGAACTGTTCTTCACCAAGAAACGGCCGGCGTTTTCCTTTCAACTCCTTCTTTTCTGGTATTTTCCGTCTTGTTTCC
AAGAAAGATGGACCAAGAGACCCAACGGAGAATCGAGGAAACCGTGATTGACGTATTGAAGAAATCGAACATGGAAGACACGACGGAGTATAAAGTTCGAGGCCAGGTCG
AAGAACGGCTCGGAATCGATCTCTCAAATAGACAATACAAGTTGCTGGTGAGGAACGTGGTGGAGAGCTTTTTACTTTCAATGTCGGAGCGGGTGTGTATGGGGAAAGAG
GATGAGAATAAAGCGGTGGAGCAGGAGATAGTCCAGAAGAAGGAGATTAACGATGATGGCGACCTAGTGATTTGCCGGCTATCTAATAACAGGAGTGTGACAATTCATAA
ATTTAGAGGGGCAACTATGGTATCAATTAGGCAGTTTTTTGAAAAAGATGGAAAACAGCTTCCTACCCTTAAAGGAATCAGCATGTCAACTGAACAATGGTCAGCCTTTA
AGAGTAATATTCCTGCTATAGAGGAAGCTATTTTGCAGATGAAAAGAAAAAAAAGATCTGAACATGATGCTGACAAGATTGGTGCTGTCTCAACGACTGTGATAACTCCT
CCAAAATTTCCAATTGAAACTACTATTCGATTTGATGGAAAAAACTACAGGGCATGGGCACGCCAGATGGAGTCTTTGCTGCAGCACCTAAAGATTGCTTACGTACTATC
TGATCAATGTCCTACTGCTGTGCTTGGGCCAGAATCAAGCTCTGTAAATGCTGCTCAATCCAAGGTTGCTGGACAGAAATGGATGAGTGATGACCACATGTGTCGCCACA
ACATTCTGAACTCCCTCTCAGATAGGCTTTTTAATGAATACGTAACGAAAACAATGAGTGCTAGTGAACTTTGGAAGGAGCTAAAATTGCTTTATTTTTTGGAAGAGTTT
GGCACTAAGAGGTCTCAAGTGAATAAGTATCTGGAATTCAAGATGGTTGAGGAGAAGTCAATATTAGAACAAGTTGAAGAACTTAATAACATTGCTGATTCCATTGTTTC
TGCTGGAACGCGGATTGATGAGGATTTTCATGTTAGTGCCATTATTTCAAAGCTTCCACTTTCTTGGAAGAATGTTTGGGTGAACTTAATGCATGAGCAGTATCTTCCCC
TTCCGAAGTTGATAGATCGATTGAGGATTGAAGAACAATTACGTACACAAAAAAACTCACATCTCTCAGGAGTGTCTTTCGCTCCTAATCCAGGAGGCCAACATAATGCT
GCAAATCACCCATCAAAGATGGGAGACCCGAAGCTCTCAAGCCTACCACTGGGGAAAAGGGAATGGCAAAAGGATGTCAAAACTTTGCTCTGCTTGAATTGCGGCAAAGA
AGGGCACACATCTCCAAATTGTCCGATTAGGAAAGTCGATAATGAAGTAACTCGGGAAAGAAGATAA
mRNA sequenceShow/hide mRNA sequence
CTGAAGCAAGGGTTTTGTTTTACCGTGAACGAGGGTGGGGTTTTCGAAAATCTCCCTCATCGGGTTAAAGTTCGAGGCCAGGTCGAAGAACGACTCGGAGTATACAATAT
AAGAACGTGCCCAAGAGCATTTTACTTTTCAATGTCTGAGCGGGTGTGTCTAGGGAAAGAGGATGAGACAGGACGTAGTGTTCGTTGCGATAATAATGCGGTGGAGCAGG
AGATAGTCCCGAAGAAGGAGATTAACGATGATGGCGACCTTGTGAATTTGCCGCTATCTGATAACAAGTGTATGCATAACAAGTGCGTGCAAGTGCATAAATATAGAAAC
GCAACTATGGTATCAATTAGTGAGTTTTATGAAAAAGATGGAAGGCAGGTTCCTACTATTACAGAAGTCAACATGTCACCTAAATCATGGGCAACCTTTAAGAGTAATAT
TCCTGCTATAGAGGAAGCTATTTTGCAGCTGAAAGGAAAAACAAGATCTGAACATGATGCTGAGAAAATTGGTGGTGTTTCGGAACCGGCGACTAGGGTAACTCCTCCAA
AATTTCCAATTGGACCTGTCCGATTTGACGGAAAAAACTACATTGTATGGGGACGTCAGATGGAGTTTTTGCTGCAGCAGCTCAAGATTGCTTATGTACTTTTAGATCAA
TGTCCTACTGACGTGCTTGGGCTAGAATCAAGCTCCGGAAATGATGCTCAATCCCAGACTGCTGAACAGAAATGGATGAATGATGATTACATGTGTCGCCGCCTCATTCT
GGGCGCCCTCTCCGATAGGTTTTTTAATGAATACACAAAGAAAACTATGAGTGCCACGGAACTTTGGACGGAACTAAAATTGCTTTATTTTAGGGAGGAGTTTGGCGACA
AGAGGTCTCAAGTAAAAAAGTATCTTGAATTCAACATAGATGAGGAGAAGACAACAATTTCTTCACCCGCCGTGCAAAGCCGAAGCCAATGGAATTCCCTCTACAAATTA
CAACACTTCCAATCGGAGACGGCCATTCACAAACGAACTGTTCTTCACCAAGAAACGGCCGGCGTTTTCCTTTCAACTCCTTCTTTTCTGGTATTTTCCGTCTTGTTTCC
AAGAAAGATGGACCAAGAGACCCAACGGAGAATCGAGGAAACCGTGATTGACGTATTGAAGAAATCGAACATGGAAGACACGACGGAGTATAAAGTTCGAGGCCAGGTCG
AAGAACGGCTCGGAATCGATCTCTCAAATAGACAATACAAGTTGCTGGTGAGGAACGTGGTGGAGAGCTTTTTACTTTCAATGTCGGAGCGGGTGTGTATGGGGAAAGAG
GATGAGAATAAAGCGGTGGAGCAGGAGATAGTCCAGAAGAAGGAGATTAACGATGATGGCGACCTAGTGATTTGCCGGCTATCTAATAACAGGAGTGTGACAATTCATAA
ATTTAGAGGGGCAACTATGGTATCAATTAGGCAGTTTTTTGAAAAAGATGGAAAACAGCTTCCTACCCTTAAAGGAATCAGCATGTCAACTGAACAATGGTCAGCCTTTA
AGAGTAATATTCCTGCTATAGAGGAAGCTATTTTGCAGATGAAAAGAAAAAAAAGATCTGAACATGATGCTGACAAGATTGGTGCTGTCTCAACGACTGTGATAACTCCT
CCAAAATTTCCAATTGAAACTACTATTCGATTTGATGGAAAAAACTACAGGGCATGGGCACGCCAGATGGAGTCTTTGCTGCAGCACCTAAAGATTGCTTACGTACTATC
TGATCAATGTCCTACTGCTGTGCTTGGGCCAGAATCAAGCTCTGTAAATGCTGCTCAATCCAAGGTTGCTGGACAGAAATGGATGAGTGATGACCACATGTGTCGCCACA
ACATTCTGAACTCCCTCTCAGATAGGCTTTTTAATGAATACGTAACGAAAACAATGAGTGCTAGTGAACTTTGGAAGGAGCTAAAATTGCTTTATTTTTTGGAAGAGTTT
GGCACTAAGAGGTCTCAAGTGAATAAGTATCTGGAATTCAAGATGGTTGAGGAGAAGTCAATATTAGAACAAGTTGAAGAACTTAATAACATTGCTGATTCCATTGTTTC
TGCTGGAACGCGGATTGATGAGGATTTTCATGTTAGTGCCATTATTTCAAAGCTTCCACTTTCTTGGAAGAATGTTTGGGTGAACTTAATGCATGAGCAGTATCTTCCCC
TTCCGAAGTTGATAGATCGATTGAGGATTGAAGAACAATTACGTACACAAAAAAACTCACATCTCTCAGGAGTGTCTTTCGCTCCTAATCCAGGAGGCCAACATAATGCT
GCAAATCACCCATCAAAGATGGGAGACCCGAAGCTCTCAAGCCTACCACTGGGGAAAAGGGAATGGCAAAAGGATGTCAAAACTTTGCTCTGCTTGAATTGCGGCAAAGA
AGGGCACACATCTCCAAATTGTCCGATTAGGAAAGTCGATAATGAAGTAACTCGGGAAAGAAGATAAAAGATTCTTACTGAGGTAAATATGTCTGACTTCTGAGGATGAA
AATAGTGGATTGACATTTAGAGGCCACACTTCTTGATTCATATGTTCTTTCAAAGCATGGAAGACAATAGTCAATTTGAACAATTTGAACAATTTTCAAGGACTCTGGAA
CTATCATGCGCTTAGGAGCTTTCAAAGAGCGGAGTCAAGGCTCTATGCCATGTATGTGCATAACTGATATTCTTGTTGATTTTAACTCAGTTTCCTTATAATAGTTATAT
GTTCTCATTCTTCTTCTTGAAGCTTAAGTGAATTGCAATGTGTTCGTC
Protein sequenceShow/hide protein sequence
LKQGFCFTVNEGGVFENLPHRVKVRGQVEERLGVYNIRTCPRAFYFSMSERVCLGKEDETGRSVRCDNNAVEQEIVPKKEINDDGDLVNLPLSDNKCMHNKCVQVHKYRN
ATMVSISEFYEKDGRQVPTITEVNMSPKSWATFKSNIPAIEEAILQLKGKTRSEHDAEKIGGVSEPATRVTPPKFPIGPVRFDGKNYIVWGRQMEFLLQQLKIAYVLLDQ
CPTDVLGLESSSGNDAQSQTAEQKWMNDDYMCRRLILGALSDRFFNEYTKKTMSATELWTELKLLYFREEFGDKRSQVKKYLEFNIDEEKTTISSPAVQSRSQWNSLYKL
QHFQSETAIHKRTVLHQETAGVFLSTPSFLVFSVLFPRKMDQETQRRIEETVIDVLKKSNMEDTTEYKVRGQVEERLGIDLSNRQYKLLVRNVVESFLLSMSERVCMGKE
DENKAVEQEIVQKKEINDDGDLVICRLSNNRSVTIHKFRGATMVSIRQFFEKDGKQLPTLKGISMSTEQWSAFKSNIPAIEEAILQMKRKKRSEHDADKIGAVSTTVITP
PKFPIETTIRFDGKNYRAWARQMESLLQHLKIAYVLSDQCPTAVLGPESSSVNAAQSKVAGQKWMSDDHMCRHNILNSLSDRLFNEYVTKTMSASELWKELKLLYFLEEF
GTKRSQVNKYLEFKMVEEKSILEQVEELNNIADSIVSAGTRIDEDFHVSAIISKLPLSWKNVWVNLMHEQYLPLPKLIDRLRIEEQLRTQKNSHLSGVSFAPNPGGQHNA
ANHPSKMGDPKLSSLPLGKREWQKDVKTLLCLNCGKEGHTSPNCPIRKVDNEVTRERR