CuGenDBv2

Tan0008251 (gene) of Snake gourd v1 genome

Gene ID: Tan0008251
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Single-stranded nucleic acid binding R3H protein
Genome location: LG08:45420729..45428785
RNA-Seq Expression: Tan0008251
Synteny: Tan0008251
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001374 - R3H domain
IPR024771 - SUZ domain
IPR036867 - R3H domain superfamily
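
The genome location above is given in the form sequence:start..end. As a minimal sketch, the snippet below splits such a string and computes the span it covers. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG08:45420729..45428785")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span includes introns and untranslated regions, so it is expected to be longer than the mRNA and CDS sequences listed on this page.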


Homology
GenBank top hits (accession | description | e-value | % identity, each followed by its alignment)
KAG6580868.1 | R3H domain-containing protein 2, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.5e-183 | % identity: 91.24
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        M+SASF+ TPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSG+DGFGN+ILVRK 
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
        VENRLPSVRLSEIPAKQ DNEK+E+VK VIRPRPNKMSE SANDGGLKQ+SVRS+EERKEEYDRARARIFSSPSSPELDDTTSQ+PSE KYACSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        G RTL  DLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPF MQK+QPPFVQ D GYSLMGHIPGTQ S+ YGPHPSPVVSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        F A+G+N+TSRD SYEQWQSAAMMYAHSY+QLRHSAFQAPFCQQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

XP_004136625.2 | uncharacterized protein LOC101215817 isoform X1 [Cucumis sativus] | e-value: 1.5e-186 | % identity: 92.09
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        M+SASFD TPQGF DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSG+DGFGN+I+VRKM
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
         ENRLPSVRLSEIPAKQLDNEKHE+VKIVIRPRPNKMS ISAN+GG KQSSVRS+EERKEEYDRARARIFSSPSSPE+DDT SQ PSE KYACSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        GCRTLG +LEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIM K+QPPFVQYDSGYSL+GH+PGTQASV YGPHPSPVVSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        FCA+GLNQ SRD SYEQWQSAAMMYAHSY+Q RHSAFQAPFCQQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

XP_008443221.1 | PREDICTED: uncharacterized protein LOC103486865 isoform X2 [Cucumis melo] | e-value: 2.9e-185 | % identity: 91.81
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        M+SASFD TPQGF DKESMVDPFLVEALQNPRHRLTILRMELDIQKFL+NPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSG+DGFGN+I+VRKM
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
        VENRLPSVRLSEIPAKQLDNEKHE+VKIVIRPRPNKMS ISAN+GG KQSSVR++EERKEEYDRARARIFSSPSSPE+DDT SQ PSE KYACSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        GCRTLG DLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIM KIQPPFVQYDSGYSL+GH+PGTQASV YGPHPS VVSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        FCA+GLNQ SRD SYEQWQ+AAMMYAHSY+Q RHSAFQAPFCQQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

XP_022944145.1 | uncharacterized protein LOC111448686 [Cucurbita moschata] | e-value: 1.6e-183 | % identity: 91.81
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        MD+ASFD  PQGF   +SMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQ FEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGN+ILVRK 
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
         E+RLPSV LSEIPAKQLDNEKHE+VKIVIRPRPNKMSEISANDGGLKQSSVRS+EERKEEYDRARARIFSSPSSPELDDTTSQAPSE KY CSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        GCRTLG DLEKFNGRD MTSRVAIFKDREKDRSDPDYDRNYDRYIRNLP NQN SLAPF+M KIQPPFVQYDSGYSLMGHIPGTQASV YGPHPSP VSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        FC VGLNQTSRD SYEQWQSAAMMYAHSY+QLRHSAFQAPF QQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

XP_038903441.1 | uncharacterized protein LOC120090027 [Benincasa hispida] | e-value: 9.6e-189 | % identity: 93.5
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        M+SASFD TPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSG+DGFGN+ILVRK 
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
        VENRLPSV LSEIPAKQLDNEKHE+VKIVIRPRPNKMS ISAN+GG KQSSVRS+EERKEEYD+ARARIFSSPSSPE+D+TTSQ PSE KYACSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        GCRTLG DLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGH+PGTQASV YGPHPSPVVSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        FCA+GLNQ SRD SYEQWQSAAMMYAHSY+QLRHSAFQAPFCQQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

TrEMBL top hits (accession | description | e-value | % identity, each followed by its alignment)
A0A0A0LFE9 | Uncharacterized protein | e-value: 7.4e-187 | % identity: 92.09
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        M+SASFD TPQGF DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSG+DGFGN+I+VRKM
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
         ENRLPSVRLSEIPAKQLDNEKHE+VKIVIRPRPNKMS ISAN+GG KQSSVRS+EERKEEYDRARARIFSSPSSPE+DDT SQ PSE KYACSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        GCRTLG +LEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIM K+QPPFVQYDSGYSL+GH+PGTQASV YGPHPSPVVSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        FCA+GLNQ SRD SYEQWQSAAMMYAHSY+Q RHSAFQAPFCQQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

A0A1S3B8A9 | uncharacterized protein LOC103486865 isoform X2 | e-value: 1.4e-185 | % identity: 91.81
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        M+SASFD TPQGF DKESMVDPFLVEALQNPRHRLTILRMELDIQKFL+NPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSG+DGFGN+I+VRKM
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
        VENRLPSVRLSEIPAKQLDNEKHE+VKIVIRPRPNKMS ISAN+GG KQSSVR++EERKEEYDRARARIFSSPSSPE+DDT SQ PSE KYACSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        GCRTLG DLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIM KIQPPFVQYDSGYSL+GH+PGTQASV YGPHPS VVSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        FCA+GLNQ SRD SYEQWQ+AAMMYAHSY+Q RHSAFQAPFCQQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

A0A6J1F3M8 | uncharacterized protein LOC111441873 | e-value: 1.7e-183 | % identity: 91.24
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        M+SASF+ TPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSG+DGFGN+ILVRK 
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
        VENRLPSVRLSEIPAKQ DNEK+E+VK VIRPRPNKMSE SANDGGLKQ+SVRS+EERKEEYDRARARIFSSPSSPELDDTTSQ+PSE KYACSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        G RTL  DLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPF MQK+QPPFVQ D GYSLMGHIPGTQ S+ YGPHPSPVVSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        F A+G+N+TSRD SYEQWQSAAMMYAHSY+QLRHSAFQAPFCQQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

A0A6J1FYH7 | uncharacterized protein LOC111448686 | e-value: 7.6e-184 | % identity: 91.81
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        MD+ASFD  PQGF   +SMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQ FEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGN+ILVRK 
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
         E+RLPSV LSEIPAKQLDNEKHE+VKIVIRPRPNKMSEISANDGGLKQSSVRS+EERKEEYDRARARIFSSPSSPELDDTTSQAPSE KY CSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        GCRTLG DLEKFNGRD MTSRVAIFKDREKDRSDPDYDRNYDRYIRNLP NQN SLAPF+M KIQPPFVQYDSGYSLMGHIPGTQASV YGPHPSP VSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        FC VGLNQTSRD SYEQWQSAAMMYAHSY+QLRHSAFQAPF QQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

A0A6J1J0D7 | uncharacterized protein LOC111482320 | e-value: 2.2e-183 | % identity: 91.24
Query:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM
        M+SASF+ TPQGFK+KESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSG+DGFGN+ILVRK 
Subjt:  MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKM

Query:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE
        VENRLPSVRLSEIPAKQLDNEK+E+VK VIRPRPNKMSE SANDGGLKQ+SVRS+EERKEEYDRARARIFSSPSSPELDDTTSQ+PSE KYACSNRDETE
Subjt:  VENRLPSVRLSEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETE

Query:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP
        G RTL  D EKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPF MQKIQPPFVQ D GYSLMGHIPGTQ S+ YGPHPSPVVSP
Subjt:  GCRTLGSDLEKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH
        F A+G+N+TSRD SYEQWQSAAMMYAHSY+QLRHSAFQAPFCQQPLSFDYSQNH
Subjt:  FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH

SwissProt top hits (accession | description | e-value | % identity, each followed by its alignment)
A0JNC2 | R3H domain-containing protein 2 | e-value: 9.8e-11 | % identity: 32.69
Query:  FLVEAL-QNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFP--TSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLD
        FLV  L +NPR R+ +L++E +I  F+++ + Q   F+ FP  TSY R+  HRVA ++G+     D  VD  G  +++ K    R+P  R SE    + +
Subjt:  FLVEAL-QNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFP--TSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLD

Query:  NEKHEKVKI----VIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFS
         E  ++  +        R +    +   DG       +S+EER+EEY R R RIF+
Subjt:  NEKHEKVKI----VIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFS

Q15032 | R3H domain-containing protein 1 | e-value: 1.3e-10 | % identity: 35.26
Query:  FLVEALQ-NPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLDNE
        FLV  L+ NPR R+ +L++E +I  F+ N +    +F    TSY R+  HRVA ++GL     D  VD  G  ++V K    R+P  + +E      D++
Subjt:  FLVEALQ-NPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLDNE

Query:  KHEKVKIVIRPRPNKMSEISANDGGLKQSSVR---SMEERKEEYDRARARIFSSPS
          +  K  I  R N   +   N   ++    R   S+EER+EEY RAR RIFS  S
Subjt:  KHEKVKIVIRPRPNKMSEISANDGGLKQSSVR---SMEERKEEYDRARARIFSSPS

Q80TM6 | R3H domain-containing protein 2 | e-value: 7.5e-11 | % identity: 32.69
Query:  FLVEAL-QNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFP--TSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLD
        FLV  L +NPR R+ +L++E +I  F+++ + Q   F+ FP  TSY R+  HRVA ++G+     D  VD  G  +++ K    R+P  R SE    + +
Subjt:  FLVEAL-QNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFP--TSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLD

Query:  NEKHEKVKI----VIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFS
         E  ++  +        R +    +   DG       +S+EER+EEY R R RIF+
Subjt:  NEKHEKVKI----VIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFS

Q9DCB4 | cAMP-regulated phosphoprotein 21 | e-value: 7.0e-09 | % identity: 31.21
Query:  FLVEALQ-NPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLDNE
        FL+  L+ N R R+ +L+ME ++  F+ + +    +F    +SY R+  HRVA ++GL     D  VD  G  +++ K    R+P  R  E      D +
Subjt:  FLVEALQ-NPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLDNE

Query:  KHEKVKIVIRPRPNKMSEISANDGGL----KQSSVRSMEERKEEYDRARARIFSSPS
          E  K  I  R N   +   N   +         +S+EER+EEY R R RIF+  S
Subjt:  KHEKVKIVIRPRPNKMSEISANDGGL----KQSSVRSMEERKEEYDRARARIFSSPS

Q9UBL0 | cAMP-regulated phosphoprotein 21 | e-value: 1.6e-08 | % identity: 32.7
Query:  FLVEALQ-NPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLDNE
        FL+  L+ N R R+ +L+ME +I  F+ + +    +F    +SY R+  HRVA ++GL     D  VD  G  +++ K    R+P  R  E     L +E
Subjt:  FLVEALQ-NPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIPAKQLDNE

Query:  K-HEKVKIVIRPRPNKMSEISANDGG-----LKQSSVRSMEERKEEYDRARARIFSSPS
        K  E  K  I  R N   +   N              +S+EER+EEY R R RIF+  S
Subjt:  K-HEKVKIVIRPRPNKMSEISANDGG-----LKQSSVRSMEERKEEYDRARARIFSSPS

Arabidopsis top hits (accession | description | e-value | % identity, each followed by its alignment)
AT2G40960.1 | Single-stranded nucleic acid binding R3H protein | e-value: 3.6e-85 | % identity: 52.48
Query:  DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIP
        DK   VDPFLVEALQNPRHRLTILRMELD+Q+FL +PDQQ FEFQHFPTSYLR AAHRVA HYGL T V+D G DG  ++I+V K  ++R P+ RLSEIP
Subjt:  DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIP

Query:  AKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETEGCR--TLGSDLEKF
         KQ +  K E +K+VI+PRP K S +  ++  LK   ++S+EERKE+YDRARARIF+   + + +D++S+         S+RDE +  +   + +D    
Subjt:  AKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETEGCR--TLGSDLEKF

Query:  NGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSPFCAVGLNQTSRD
        + +    SR+AI +DREKDR DPDYDR+  RYI NLP +QNL +A F +Q +  P   YD G+   G+ P    S+  G H   V+SP    GLNQ SRD
Subjt:  NGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSPFCAVGLNQTSRD

Query:  VS-YEQW-QSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQN
         + Y QW  +AA+MY HSY   R+S FQA F  QPLSFDY QN
Subjt:  VS-YEQW-QSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQN

AT3G10770.1 | Single-stranded nucleic acid binding R3H protein | e-value: 2.2e-74 | % identity: 49.42
Query:  DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGV---DGFGNKILVRKMVENRLPSVRLS
        +KE+MVDPFLVEALQNPRHRLTILRMELDIQKF  NP+Q  FEF  FPTSYLRLAAHRVAQHYGL TM  D+G    DG  N+ILV K  E+R P V LS
Subjt:  DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGV---DGFGNKILVRKMVENRLPSVRLS

Query:  EIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPS--EVKYACSNRDETEGCRTLGSDL
        EIP KQ +N + E  KI I+PRP + S    +  G++Q+ +RS+EERKEEYD+ARARIF+SPSS + +D++S  P   EV+  C NR+ETE      S +
Subjt:  EIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPS--EVKYACSNRDETEGCRTLGSDL

Query:  EKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYD--------RYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIP-GTQASVGYGPHPSPVVSP
        +      G TSRVAI +DREKDR DPDYDR+YD        RY+R +P+ Q+ S  P          + +  G+    H+P G QA++ YG   +P +SP
Subjt:  EKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYD--------RYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIP-GTQASVGYGPHPSPVVSP

Query:  FCAVGLNQTSRDVSYEQW-QSAAMMYAHSYSQLRHSAFQAPFCQQP
        F       T    SY  W  S  M YA   +    + ++ P    P
Subjt:  FCAVGLNQTSRDVSYEQW-QSAAMMYAHSYSQLRHSAFQAPFCQQP

AT3G10770.2 | Single-stranded nucleic acid binding R3H protein | e-value: 1.4e-76 | % identity: 50.59
Query:  DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGV---DGFGNKILVRKMVENRLPSVRLS
        +KE+MVDPFLVEALQNPRHRLTILRMELDIQKF  NP+Q  FEF  FPTSYLRLAAHRVAQHYGL TM  D+G    DG  N+ILV K  E+R P V LS
Subjt:  DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGV---DGFGNKILVRKMVENRLPSVRLS

Query:  EIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPS--EVKYACSNRDETEGCRTLGSDL
        EIP KQ +N + E  KI I+PRP + S    +  G++Q+ +RS+EERKEEYD+ARARIF+SPSS + +D++S  P   EV+  C NR+ETE      S +
Subjt:  EIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPS--EVKYACSNRDETEGCRTLGSDL

Query:  EKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIP-GTQASVGYGPHPSPVVSPFCAVGLNQ
        +      G TSRVAI +DREKDR DPDYDR+YDRY+R +P+ Q+ S  P          + +  G+    H+P G QA++ YG   +P +SPF       
Subjt:  EKFNGRDGMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIP-GTQASVGYGPHPSPVVSPFCAVGLNQ

Query:  TSRDVSYEQW-QSAAMMYAHSYSQLRHSAFQAPFCQQP
        T    SY  W  S  M YA   +    + ++ P    P
Subjt:  TSRDVSYEQW-QSAAMMYAHSYSQLRHSAFQAPFCQQP

AT3G56680.1 | Single-stranded nucleic acid binding R3H protein | e-value: 3.0e-92 | % identity: 54.6
Query:  VDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIP-AKQL
        VDPFLVEAL N RHRLTILRMELD+Q+ L NP+QQ FEFQHFPTSYLRLAAHRVA HYGL T VQ+SG DG  N+ILV K  E++ P+VRLSEIP AKQ 
Subjt:  VDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEIP-AKQL

Query:  DNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETEGCRTLGSDLEKFNG--RD
        +N K E  K+ I+ RP+K S   A D    +  +RS+EERKEEYD+AR RIFS  +    DD++S+     + A  +RD+ +  +    +++K       
Subjt:  DNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETEGCRTLGSDLEKFNG--RD

Query:  GMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSPFCAVGLNQTSRDVSYE
        G TSRVAIF+DREKDR DPDYDR + RYIR+LP NQN +L PF +Q+I  P+  Y+ G++    IP   A +G+GPHPS ++SP+       T+ D  Y 
Subjt:  GMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSPFCAVGLNQTSRDVSYE

Query:  QWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQN
         W +AAMMYAH Y Q R+ + QA F QQPLSFDY QN
Subjt:  QWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQN

AT5G05100.1 | Single-stranded nucleic acid binding R3H protein | e-value: 1.3e-71 | % identity: 50.88
Query:  DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGL-QTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEI
        DKE+MVDPFLVEALQNPRHRLTILRMELDIQKF  +P+QQ +EFQ  PTSYLRL AHRVAQHYGL  T ++  GVDG GN+IL  K VE+R P V LSEI
Subjt:  DKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGL-QTMVQDSGVDGFGNKILVRKMVENRLPSVRLSEI

Query:  PAKQLD-NEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAP--------SEVKYACSNRDETEGCRT
        P KQ + + + E  KI I+PRPN+ +  S +  G++++ +RS+EERKEEYD+ARARIF+SPSS + +D++++AP        S+   AC +R+ETE    
Subjt:  PAKQLD-NEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAP--------SEVKYACSNRDETEGCRT

Query:  LGS---DLEKFNG--RD-GMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVV
          S   D+E+ NG  RD G TSRVAI +DREKDR DPDYDRN  RY+R  P  QN +  P        P   +D   ++   +P TQAS+ YG HP    
Subjt:  LGS---DLEKFNG--RD-GMTSRVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVV

Query:  SPFCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAP
               LN      SY +W SA M YA + +      F+ P
Subjt:  SPFCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAP
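
In the alignments above, the unlabeled middle line of each block appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Under that assumption, the sketch below estimates the percent identity of one alignment block from its query line and middle line. The helper name `percent_identity` is hypothetical, and a single block will generally not reproduce the whole-alignment figure reported for a hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a letter."""
    if not query:
        raise ValueError("empty alignment block")
    # Trailing spaces in the midline may have been trimmed; pad to the query length.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Final block of the first GenBank hit listed on this page.
query = "FCAVGLNQTSRDVSYEQWQSAAMMYAHSYSQLRHSAFQAPFCQQPLSFDYSQNH"
midline = "F A+G+N+TSRD SYEQWQSAAMMYAHSY+QLRHSAFQAPFCQQPLSFDYSQNH"
print(round(percent_identity(query, midline), 2))
```

To approximate the reported value for a hit, the identical-column counts and block lengths would need to be summed across all blocks of that alignment before dividing.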


Sequences
CDS sequence
ATGGATTCAGCTTCCTTTGATGCCACCCCACAAGGGTTCAAGGACAAAGAGTCAATGGTTGACCCCTTCTTGGTTGAGGCTCTCCAGAATCCTCGACATCGTCTCACCAT
TTTACGAATGGAACTTGATATCCAGAAGTTTTTGCATAATCCGGATCAGCAGCTTTTTGAGTTCCAGCACTTTCCTACTTCATACCTTCGACTTGCTGCTCACCGTGTTG
CTCAACACTATGGCCTTCAAACCATGGTTCAGGACTCTGGCGTAGATGGTTTTGGAAATAAGATCTTGGTTCGTAAAATGGTGGAAAACAGACTCCCTTCTGTGCGTTTA
TCTGAAATACCTGCAAAGCAATTGGACAATGAGAAACATGAGAAGGTCAAAATTGTCATCAGACCTAGGCCTAACAAAATGTCTGAAATTTCTGCCAATGATGGAGGTCT
CAAACAAAGTTCTGTCAGAAGCATGGAAGAGAGAAAGGAGGAATATGATAGGGCACGAGCTCGAATCTTCAGCTCCCCAAGTAGTCCCGAATTAGATGATACAACATCTC
AGGCCCCATCTGAAGTAAAGTATGCATGCTCGAACAGGGATGAGACTGAAGGCTGTAGAACTTTGGGTAGCGATTTGGAAAAATTTAATGGCAGGGATGGCATGACTTCT
CGGGTTGCCATTTTCAAAGATAGGGAAAAGGATCGTAGTGACCCTGATTATGATCGCAATTACGATAGGTATATTAGGAACCTTCCAACCAATCAAAACTTAAGCTTGGC
GCCCTTTATTATGCAAAAAATTCAGCCTCCATTTGTACAATATGATTCGGGTTATTCTCTCATGGGTCATATACCAGGAACTCAAGCTTCAGTTGGCTATGGGCCTCATC
CGAGTCCTGTTGTAAGCCCGTTCTGTGCAGTGGGTTTAAACCAGACATCTAGAGATGTATCTTATGAGCAGTGGCAGAGTGCTGCAATGATGTATGCCCATTCCTACAGC
CAGCTCAGGCATTCTGCTTTTCAGGCCCCATTCTGCCAGCAGCCCCTCAGCTTTGACTATTCTCAAAACCATTAG
mRNA sequence
GTAATATAAAATCATTGCCAAATTCCATCACTGTTTCTCCTTTTACCCCTTTCCCCTCCGCCCGGTGGGATAAGAAAGAGGGTTTTCTCTCTCTTTCTCTCTCGATCTTC
CTTCTCCTCCTCCTGATGCTAAGAATTAAACCCTTTTCCTGTTTCTTCACCTTCCCTCTTCTTCTTCTTCTTCTCTGACCCAGAAGGGGAAGAAGAAACGCCTTTCCAAG
TACCCAACAGTTTCAGAGTTTTGTGCGTTCAATAATAAAGAGAGCTCTCTTCTTCCATGGATTCAGCTTCCTTTGATGCCACCCCACAAGGGTTCAAGGACAAAGAGTCA
ATGGTTGACCCCTTCTTGGTTGAGGCTCTCCAGAATCCTCGACATCGTCTCACCATTTTACGAATGGAACTTGATATCCAGAAGTTTTTGCATAATCCGGATCAGCAGCT
TTTTGAGTTCCAGCACTTTCCTACTTCATACCTTCGACTTGCTGCTCACCGTGTTGCTCAACACTATGGCCTTCAAACCATGGTTCAGGACTCTGGCGTAGATGGTTTTG
GAAATAAGATCTTGGTTCGTAAAATGGTGGAAAACAGACTCCCTTCTGTGCGTTTATCTGAAATACCTGCAAAGCAATTGGACAATGAGAAACATGAGAAGGTCAAAATT
GTCATCAGACCTAGGCCTAACAAAATGTCTGAAATTTCTGCCAATGATGGAGGTCTCAAACAAAGTTCTGTCAGAAGCATGGAAGAGAGAAAGGAGGAATATGATAGGGC
ACGAGCTCGAATCTTCAGCTCCCCAAGTAGTCCCGAATTAGATGATACAACATCTCAGGCCCCATCTGAAGTAAAGTATGCATGCTCGAACAGGGATGAGACTGAAGGCT
GTAGAACTTTGGGTAGCGATTTGGAAAAATTTAATGGCAGGGATGGCATGACTTCTCGGGTTGCCATTTTCAAAGATAGGGAAAAGGATCGTAGTGACCCTGATTATGAT
CGCAATTACGATAGGTATATTAGGAACCTTCCAACCAATCAAAACTTAAGCTTGGCGCCCTTTATTATGCAAAAAATTCAGCCTCCATTTGTACAATATGATTCGGGTTA
TTCTCTCATGGGTCATATACCAGGAACTCAAGCTTCAGTTGGCTATGGGCCTCATCCGAGTCCTGTTGTAAGCCCGTTCTGTGCAGTGGGTTTAAACCAGACATCTAGAG
ATGTATCTTATGAGCAGTGGCAGAGTGCTGCAATGATGTATGCCCATTCCTACAGCCAGCTCAGGCATTCTGCTTTTCAGGCCCCATTCTGCCAGCAGCCCCTCAGCTTT
GACTATTCTCAAAACCATTAGATATGAGGAGTTTAGGTGAAAAAAAACTTTACAATTAGATTTTGTACTTGTAGGGGCTGCCCTTCACTCTTGTTTGACTGCTTGGAACT
CAGTTCTAGTTTCTCTCTTTTGTTCTTCATTTGGATCTTCTTATCATAAGAATTTGTAAGGATTGTGAGGACTTTCTGATATCTTTTGAACAGATTTACATACACTCCAT
TATGCCATTAGTTGAATTTCTTCATCTTATACTTTTGCTTGTATTGCTGCATTTAATCGTGTACCATTTTAAATTTCTCTTTAATAACATATCTTTCATTGGACTT
Protein sequence
MDSASFDATPQGFKDKESMVDPFLVEALQNPRHRLTILRMELDIQKFLHNPDQQLFEFQHFPTSYLRLAAHRVAQHYGLQTMVQDSGVDGFGNKILVRKMVENRLPSVRL
SEIPAKQLDNEKHEKVKIVIRPRPNKMSEISANDGGLKQSSVRSMEERKEEYDRARARIFSSPSSPELDDTTSQAPSEVKYACSNRDETEGCRTLGSDLEKFNGRDGMTS
RVAIFKDREKDRSDPDYDRNYDRYIRNLPTNQNLSLAPFIMQKIQPPFVQYDSGYSLMGHIPGTQASVGYGPHPSPVVSPFCAVGLNQTSRDVSYEQWQSAAMMYAHSYS
QLRHSAFQAPFCQQPLSFDYSQNH
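
The CDS and protein sequences above should correspond through codon-by-codon translation. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the page itself does not state which table applies. The `translate` helper and the table construction are illustrative, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed above.
print(translate("ATGGATTCAGCTTCCTTTGATGCC"))  # MDSASFDA
print(translate("CAAAACCATTAG"))              # QNH
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, joined into one string, would confirm that the two records agree.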