; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039705 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039705
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr2:48625486..48633388
RNA-Seq ExpressionLag0039705
SyntenyLag0039705
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8717380.1 hypothetical protein F3Y22_tig00110050pilonHSYRG00143 [Hibiscus syriacus]1.7e-6757.04Show/hide
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P+         EG   +    SS    KS MS+E+WEE+D+RA              NV   S+
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST

Query:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL
         KELWEKLE MYQA+S SNRLYLKEKF+ L+MEEGTKISDHLS LN I+SELE I V+I+DEDKA +LI SLP+SYEHM+ +LMYGKE ++F++VTSKL+
Subjt:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL

Query:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV
        SEERRLK+    S E  AL V  N KK K S +K  C  C Q GH+KKDC N  G++  +GS +D  ++V
Subjt:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV

KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus]2.9e-7058.78Show/hide
Query:  SPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHG
        SP++  VEK+DGR+NFGLWQVQVKDVLIQSGLHKAL+G+P+  +S+  SG                 S   DE+WE++DLRA              NVHG
Subjt:  SPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHG

Query:  ISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTS
        ISTAK+LWEKLE +YQ +  SNRLYLKE+F+TLRM+  TKISDHLSVLN+I+SELE I VK+EDEDKA +LILSL +SYEHMKPILMYGKETL +ADVT 
Subjt:  ISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTS

Query:  KLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGS
        KLLSEE+RL S G TS E + L+  N  KKK   +   C +C QSGH+K++CP  A S+  S
Subjt:  KLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGS

KAF5765959.1 putative RNA-directed DNA polymerase [Helianthus annuus]9.9e-7158.78Show/hide
Query:  SPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHG
        SP++ DVEK+DGR+NFGLWQVQVKDVLIQSGLHKAL+G+P+  +S+  SG                 S   DE+WE++DLRA              NVHG
Subjt:  SPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHG

Query:  ISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTS
        ISTAK+LWEKLE +YQ +   NRLYLKE+F+TLRM+  TKISDHLSVLN+I+SELE I VK+EDEDKA +LILSL +SYEHMKPILMYGKETL +ADVT 
Subjt:  ISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTS

Query:  KLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGS
        KLLSEE+RL S G TS E + L+  N  KKK   +   C +C QSGH+K++CP  A S+  S
Subjt:  KLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGS

KAG7577502.1 F-box associated domain type 1 [Arabidopsis thaliana x Arabidopsis arenosa]3.0e-6756.98Show/hide
Query:  VKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIS
        +KM++EKFDGR+NFGLWQVQVKD+LIQ GLHKALKG+P+            PV    G+  G  K  +SD DWE++DLRA              NVHGIS
Subjt:  VKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIS

Query:  TAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKL
        TAKELWEKLE +YQA+  SNR+YLKEKF+TLRM EGT +SDHLSVLN I+SELE I VK++DED A +LI SLP+SYEHMKPIL++GKE + F +VTSKL
Subjt:  TAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKL

Query:  LSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDAD
         SEE+RL +       +SALVA N KK+    +K  C  C QSGH+K++CPN  G S    S+ D
Subjt:  LSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDAD

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]5.2e-10479.17Show/hide
Query:  EAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA----------
        EA+MS FMSPVK+DVEKFDG +NFGLWQVQVKDVLIQS LHKALKGRPS+GASE+LS +GGP+ESSGGSSRG KKSSMS EDWEEMDLRA          
Subjt:  EAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA----------

Query:  ---GNVHGISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKET
            NVH ISTAKELWEKLEA+YQA+  SNRLYLKE+F+TL+MEEG KISDHLS LNSII ELE IEVKI+DEDKA +LILSLP SYEHMKPILMYGK+T
Subjt:  ---GNVHGISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKET

Query:  LSFADVTSKLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCS-ECRQSGHMKKDCPNR
        L+FA+VTSKLLSEERRLKSEGRTS EDSALV SNWKKKK+S+QKK C   C QSGHMKKDCPNR
Subjt:  LSFADVTSKLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCS-ECRQSGHMKKDCPNR

TrEMBL top hitse value%identityAlignment
A0A6A2YS90 Transcription initiation factor IIA subunit 27.1e-6756.67Show/hide
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P+         EG   +    SS    KS MS+E+WEE+D+RA              NV   S+
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST

Query:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL
         KELWEKLE MYQA+S SNRLYLKEKF+ L+MEEGTKISDHLS LN I+SELE I V+I+DEDKA +LI SL +SYEHM+ +LMYGKE ++F++VTSKL+
Subjt:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL

Query:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV
        SEERRLK+    S E  AL V  N KK K S +K  C  C Q GH+KKDC N  G++  +GS +D  ++V
Subjt:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV

A0A6A3BK59 CCHC-type domain-containing protein8.4e-6857.04Show/hide
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P+         EG   +    SS    KS MS+E+WEE+D+RA              NV   S+
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST

Query:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL
         KELWEKLE MYQA+S SNRLYLKEKF+ L+MEEGTKISDHLS LN I+SELE I V+I+DEDKA +LI SLP+SYEHM+ +LMYGKE ++F++VTSKL+
Subjt:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL

Query:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV
        SEERRLK+    S E  AL V  N KK K S +K  C  C Q GH+KKDC N  G++  +GS +D  ++V
Subjt:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV

A0A6A3CWI3 CCHC-type domain-containing protein2.5e-6757.04Show/hide
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P+         EG   +    SS    KS MS+E+WEE+D+RA              NV   S+
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST

Query:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL
         KELWEKLE MYQA+S SNRLYLKEKF+ L+MEEGTKISDHLS LN I+SELE I V I+DEDKA +LI SLP+SYEHM+ +LMYGKE ++F++VTSKL+
Subjt:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL

Query:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV
        SEERRLK+    S E  AL V  N KK K S +K  C  C Q GH+KKDC N  G++  +GS +D  ++V
Subjt:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV

A0A6A3DA47 CCHC-type domain-containing protein3.2e-6757.04Show/hide
Query:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST
        + D+EKFDGR+NFGLWQVQVKD+LIQSGL+KALKG+P+         EG   +    SS    KS MS+E+WEE+D+RA              NV   S+
Subjt:  KMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVHGIST

Query:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL
         KELWEKLE MYQA+S SNRLYLKEKF+ L+MEEGTKISDHLS LN I+SELE I V+I+DEDKA +LI SLP+SYEHM+ +LMYGKE ++F++VTSKL+
Subjt:  AKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVTSKLL

Query:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV
        SEERRLK+    S E  AL V  N KK K S +K  C  C Q GH+KKDC N  G++  +GS +D  ++V
Subjt:  SEERRLKSEGRTSQEDSAL-VASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLV

A0A6J1CG82 uncharacterized protein LOC1110105212.5e-10479.17Show/hide
Query:  EAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA----------
        EA+MS FMSPVK+DVEKFDG +NFGLWQVQVKDVLIQS LHKALKGRPS+GASE+LS +GGP+ESSGGSSRG KKSSMS EDWEEMDLRA          
Subjt:  EAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA----------

Query:  ---GNVHGISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKET
            NVH ISTAKELWEKLEA+YQA+  SNRLYLKE+F+TL+MEEG KISDHLS LNSII ELE IEVKI+DEDKA +LILSLP SYEHMKPILMYGK+T
Subjt:  ---GNVHGISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKET

Query:  LSFADVTSKLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCS-ECRQSGHMKKDCPNR
        L+FA+VTSKLLSEERRLKSEGRTS EDSALV SNWKKKK+S+QKK C   C QSGHMKKDCPNR
Subjt:  LSFADVTSKLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCS-ECRQSGHMKKDCPNR

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-1124.05Show/hide
Query:  MSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKG-RPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEE-MDLRAGNVHGISTAKELWEK
        M   K +++ FDG   + +W+ +++ +L +  + K + G  P++            V+ S   +    KS++ +   +  ++    ++    TA+++ E 
Subjt:  MSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKG-RPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEE-MDLRAGNVHGISTAKELWEK

Query:  LEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEH-MKPILMYGKETLSFADVTSKLLSEERRL
        L+A+Y+ +S +++L L+++  +L++     +  H  + + +ISEL     KIE+ DK   L+++LP+ Y+  +  I    +E L+ A V ++LL +E ++
Subjt:  LEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEH-MKPILMYGKETLSFADVTSKLLSEERRL

Query:  KSEGRTSQED--SALVASN-------------WKKKK----ESMQKKGCSECRQSGHMKKDC
        K++   + +   +A+V +N              K KK     S  K  C  C + GH+KKDC
Subjt:  KSEGRTSQED--SALVASN-------------WKKKK----ESMQKKGCSECRQSGHMKKDC

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.3e-3534.74Show/hide
Query:  MSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVH
        MS VK +V KF+G   F  WQ +++D+LIQ GLHK L                  V+S        K  +M  EDW ++D RA              N+ 
Subjt:  MSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMSDEDWEEMDLRA-------------GNVH

Query:  GISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVT
           TA+ +W +LE++Y +++ +N+LYLK++ Y L M EGT    HL+V N +I++L  + VKIE+EDKA  L+ SLP+SY+++   +++GK T+   DVT
Subjt:  GISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLSFADVT

Query:  SKLLSEER----------------RLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPN-RAGSSKGSGSDAD
        S LL  E+                R +S  R+S       A    K +   + + C  C Q GH K+DCPN R G  + SG   D
Subjt:  SKLLSEER----------------RLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPN-RAGSSKGSGSDAD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCCATCATGCGGGCTTGGACTAACCAAGTCTACCTCACCCCATCTAGCTCGGGCCAATGCACTTCATCTAAGTCGAGCTCACCCAGCAGAGGCCGTGGCCGAGCG
CCTCTTGCCAAGGCCGAGCACAAACTCTAAAGGGAGTGTCTACTTCTATCCTACTTTGCAGGATGACGACAAAGCTACACGTAACCCAGCCAAGGAAATTTTGGACCACC
CCGATGTACGAGGAGCTGACGAGGACAATCGGGGAGAAATCGGGCTGGGATACAGGCCAAGGAGGCGGAGCCGGCAAGCGGGACGGGCCAAGGCCGAAGGGGTCGGGTTT
TTGACCCGACCCATGCTCAGCCTCGGCCATGGGCCGAGGCCGACCCTCAGCCCGCTCGCGCGGGCCGAGCCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGCTACCC
CGGTTTCGCCTGGTTTGACCTAAAACGCCTCAGAATACCTAAATACCCTAGGAGGATGAGCAGGTATTTATATCCCTCTTCGTCACTGAAGAGGGGATCCCGAATTCTAT
CCCTAAACTCTACTCTCTATTCTCTGCTTTCTCCTCTTGCTCTTACTTTTCCGCTCCCTACCGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTGTGGCGAGCACCACA
CCGGTGTGCAGGTTACTGTCTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGATTAGCTAGGAGTGAAGCAGAAATGTCAAG
CTTCATGAGTCCAGTGAAGATGGATGTGGAGAAATTTGATGGAAGGATGAACTTCGGCTTGTGGCAAGTGCAAGTCAAGGATGTGTTGATACAATCTGGGTTACACAAGG
CTTTGAAGGGAAGACCAAGCGATGGTGCTTCTGAAAGATTAAGCGGTGAAGGTGGTCCAGTGGAGTCCAGTGGCGGTTCCAGCAGAGGTTTGAAGAAGTCCAGCATGAGT
GATGAAGATTGGGAGGAAATGGATTTGAGAGCTGGAAATGTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAAAAGCTTGAAGCAATGTATCAGGCAAGGAGCACCTC
GAATCGGTTGTACCTGAAGGAGAAGTTTTACACGTTGCGAATGGAGGAAGGTACGAAAATTTCAGATCATCTGAGTGTTCTCAATAGCATCATTTCGGAGCTGGAGGTGA
TCGAAGTTAAGATAGAGGATGAGGATAAGGCATTCAAGCTTATCTTGTCACTTCCAACTTCTTATGAACACATGAAGCCAATCTTGATGTACGGGAAGGAAACTTTAAGT
TTTGCTGATGTTACTAGTAAACTCTTATCAGAAGAAAGAAGGCTGAAGAGTGAAGGGCGTACTTCCCAGGAGGATTCAGCGCTAGTAGCTAGCAATTGGAAGAAGAAGAA
AGAGTCCATGCAGAAGAAAGGTTGCTCGGAATGCAGACAGTCTGGACACATGAAAAAAGATTGTCCTAACAGAGCAGGTTCGTCAAAGGGCTCTGGGTCGGATGCTGACA
TTGTCTCTCTCGTCAGAAGAGTCAGTGAATTGCTCTGGAGAAAGACAGAATTCATCCTCATGGTATGTCCGCTTTATCATGATAGAGGATGTGATGTTAGCGGTTCCACA
AGTTTGCACACGGGCATTGGCTTGACAGTTATGCAAGGTGTGTGGTGGAAGTTATGTCCATGGCTGACGAACTTCCAGGAGGGAAAGACCGAAGGAGTTGGGCTGGCCCA
ACTCCTTGGCCTCGGCCATGGTCTCGGCCGAATCCCGACCCACCCCGGTGGGTCGAGTCCCTTCCCCTCCGTTTGGCTTCATGTGTCCCGGATCAGCCCGGTTCGAGCAG
TTTCAGTCCTGAATCGTTTCCACGCGCCTAGAAACCTTAAGAACTTCCTTCAGCTTTCTGATTTAAGCATCGGAGGCAGTGTGGTAAGCACCACACCGATGTGTAGGTTT
ACCTTGCCTTGCAGGCCACATCTTTCCCCTCTCATAAAAATTTACCGTTGGTGTCACGTGAGAGTCAGGCTGGCATCAACAGAAAAGACTTTTGCCGACGCTACTTCATT
GGCAATGGCGAAGAAAAGCTGGAGGGACTTTTGCAAGTCAGCATCAACACAACTAGTAGATGGAACTTCTGTTGACACAGTCAAGATCACGTCGACAGAAGTCACTTTTG
CCAATACCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCCATCATGCGGGCTTGGACTAACCAAGTCTACCTCACCCCATCTAGCTCGGGCCAATGCACTTCATCTAAGTCGAGCTCACCCAGCAGAGGCCGTGGCCGAGCG
CCTCTTGCCAAGGCCGAGCACAAACTCTAAAGGGAGTGTCTACTTCTATCCTACTTTGCAGGATGACGACAAAGCTACACGTAACCCAGCCAAGGAAATTTTGGACCACC
CCGATGTACGAGGAGCTGACGAGGACAATCGGGGAGAAATCGGGCTGGGATACAGGCCAAGGAGGCGGAGCCGGCAAGCGGGACGGGCCAAGGCCGAAGGGGTCGGGTTT
TTGACCCGACCCATGCTCAGCCTCGGCCATGGGCCGAGGCCGACCCTCAGCCCGCTCGCGCGGGCCGAGCCCGTTCGGTCTCGTCTGGTCCCCACCGCCTCTGGCTACCC
CGGTTTCGCCTGGTTTGACCTAAAACGCCTCAGAATACCTAAATACCCTAGGAGGATGAGCAGGTATTTATATCCCTCTTCGTCACTGAAGAGGGGATCCCGAATTCTAT
CCCTAAACTCTACTCTCTATTCTCTGCTTTCTCCTCTTGCTCTTACTTTTCCGCTCCCTACCGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTGTGGCGAGCACCACA
CCGGTGTGCAGGTTACTGTCTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGATTAGCTAGGAGTGAAGCAGAAATGTCAAG
CTTCATGAGTCCAGTGAAGATGGATGTGGAGAAATTTGATGGAAGGATGAACTTCGGCTTGTGGCAAGTGCAAGTCAAGGATGTGTTGATACAATCTGGGTTACACAAGG
CTTTGAAGGGAAGACCAAGCGATGGTGCTTCTGAAAGATTAAGCGGTGAAGGTGGTCCAGTGGAGTCCAGTGGCGGTTCCAGCAGAGGTTTGAAGAAGTCCAGCATGAGT
GATGAAGATTGGGAGGAAATGGATTTGAGAGCTGGAAATGTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAAAAGCTTGAAGCAATGTATCAGGCAAGGAGCACCTC
GAATCGGTTGTACCTGAAGGAGAAGTTTTACACGTTGCGAATGGAGGAAGGTACGAAAATTTCAGATCATCTGAGTGTTCTCAATAGCATCATTTCGGAGCTGGAGGTGA
TCGAAGTTAAGATAGAGGATGAGGATAAGGCATTCAAGCTTATCTTGTCACTTCCAACTTCTTATGAACACATGAAGCCAATCTTGATGTACGGGAAGGAAACTTTAAGT
TTTGCTGATGTTACTAGTAAACTCTTATCAGAAGAAAGAAGGCTGAAGAGTGAAGGGCGTACTTCCCAGGAGGATTCAGCGCTAGTAGCTAGCAATTGGAAGAAGAAGAA
AGAGTCCATGCAGAAGAAAGGTTGCTCGGAATGCAGACAGTCTGGACACATGAAAAAAGATTGTCCTAACAGAGCAGGTTCGTCAAAGGGCTCTGGGTCGGATGCTGACA
TTGTCTCTCTCGTCAGAAGAGTCAGTGAATTGCTCTGGAGAAAGACAGAATTCATCCTCATGGTATGTCCGCTTTATCATGATAGAGGATGTGATGTTAGCGGTTCCACA
AGTTTGCACACGGGCATTGGCTTGACAGTTATGCAAGGTGTGTGGTGGAAGTTATGTCCATGGCTGACGAACTTCCAGGAGGGAAAGACCGAAGGAGTTGGGCTGGCCCA
ACTCCTTGGCCTCGGCCATGGTCTCGGCCGAATCCCGACCCACCCCGGTGGGTCGAGTCCCTTCCCCTCCGTTTGGCTTCATGTGTCCCGGATCAGCCCGGTTCGAGCAG
TTTCAGTCCTGAATCGTTTCCACGCGCCTAGAAACCTTAAGAACTTCCTTCAGCTTTCTGATTTAAGCATCGGAGGCAGTGTGGTAAGCACCACACCGATGTGTAGGTTT
ACCTTGCCTTGCAGGCCACATCTTTCCCCTCTCATAAAAATTTACCGTTGGTGTCACGTGAGAGTCAGGCTGGCATCAACAGAAAAGACTTTTGCCGACGCTACTTCATT
GGCAATGGCGAAGAAAAGCTGGAGGGACTTTTGCAAGTCAGCATCAACACAACTAGTAGATGGAACTTCTGTTGACACAGTCAAGATCACGTCGACAGAAGTCACTTTTG
CCAATACCAAATGA
Protein sequenceShow/hide protein sequence
MSPSCGLGLTKSTSPHLARANALHLSRAHPAEAVAERLLPRPSTNSKGSVYFYPTLQDDDKATRNPAKEILDHPDVRGADEDNRGEIGLGYRPRRRSRQAGRAKAEGVGF
LTRPMLSLGHGPRPTLSPLARAEPVRSRLVPTASGYPGFAWFDLKRLRIPKYPRRMSRYLYPSSSLKRGSRILSLNSTLYSLLSPLALTFPLPTVLFADLSIGAGVASTT
PVCRLLSCRPRLPPHLQIYRWWHVKVRLARSEAEMSSFMSPVKMDVEKFDGRMNFGLWQVQVKDVLIQSGLHKALKGRPSDGASERLSGEGGPVESSGGSSRGLKKSSMS
DEDWEEMDLRAGNVHGISTAKELWEKLEAMYQARSTSNRLYLKEKFYTLRMEEGTKISDHLSVLNSIISELEVIEVKIEDEDKAFKLILSLPTSYEHMKPILMYGKETLS
FADVTSKLLSEERRLKSEGRTSQEDSALVASNWKKKKESMQKKGCSECRQSGHMKKDCPNRAGSSKGSGSDADIVSLVRRVSELLWRKTEFILMVCPLYHDRGCDVSGST
SLHTGIGLTVMQGVWWKLCPWLTNFQEGKTEGVGLAQLLGLGHGLGRIPTHPGGSSPFPSVWLHVSRISPVRAVSVLNRFHAPRNLKNFLQLSDLSIGGSVVSTTPMCRF
TLPCRPHLSPLIKIYRWCHVRVRLASTEKTFADATSLAMAKKSWRDFCKSASTQLVDGTSVDTVKITSTEVTFANTK