CuGenDBv2

Lag0041919 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041919
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr13:31351359..31353584
RNA-Seq Expression: Lag0041919
Synteny: Lag0041919
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8732531.1 Histidine kinase 5 [Hibiscus syriacus] (e-value: 5.7e-30, identity: 30.7%)
Query:  LRVGVMGFLEPKIFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKALK-ERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLTNR---
        +RVG M   E K+   + KFDG +FG+WKMQ++D+L  K +++ L  ++P+ M D DW  LD +A+  IR+ LS +VA  +A E T   LM +L++    
Subjt:  LRVGVMGFLEPKIFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKALK-ERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLTNR---

Query:  -QGSNKESIVGSAF--------------------------VMIKGKDKV----------DEDNEPSSSRKKWK-----------------GRNEVECYYC
           SNK  ++   F                          V I+  D+V          D  N   ++RK ++                 G+ +  CY C
Subjt:  -QGSNKESIVGSAF--------------------------VMIKGKDKV----------DEDNEPSSSRKKWK-----------------GRNEVECYYC

Query:  HKKGHFKYRCWKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGK
         KKGHFK  C   K+D   +  AN+ EE   A + S    ++    WILDS  S H    + ++ ++     G + + +  T K  G GD+ LK      
Subjt:  HKKGHFKYRCWKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGK

Query:  LILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLY
          L  VR+ P +K NLIS+G+L  +GY   F   + K+  G+ V+A G K  TLY
Subjt:  LILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLY

KAF7129225.1 hypothetical protein RHSIM_Rhsim10G0050800 [Rhododendron simsii] (e-value: 5.7e-30, identity: 39.22%)
Query:  GKDKVDEDNEPSSSRKKWKGRNEVECYYCHKKGHFKYRCWKFKEDQKRKPEANIV-----------EEVVLACVESDTKYSNHSSDWILDSATSVHIASD
        G     +D   S SR   K R+E+EC++CHK GH +  C   +++ K+   A I             EV++ C +     +     W++DS  S H+ S 
Subjt:  GKDKVDEDNEPSSSRKKWKGRNEVECYYCHKKGHFKYRCWKFKEDQKRKPEANIV-----------EEVVLACVESDTKYSNHSSDWILDSATSVHIASD

Query:  KSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELN
        +    S+T    G +RMGN   SK  G+GDV L+T  G KL+L DVR+ PNI++NLIS GKL D+GY  +FG  + KL  GS VVA G K STLY  +  
Subjt:  KSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELN

Query:  VAKG
        ++KG
Subjt:  VAKG

RVW30183.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 5.7e-30, identity: 30.5%)
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLTNRQGSNKESI----------
        G+ KFDG +F YW+MQ++DYL  +K+H   L  +P+ M   +W  LD + +  IR+ LS  VA  V  E T   LM++L+    + KE +          
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLTNRQGSNKESI----------

Query:  --------------VGSAF-VMIKGKDKVDEDNEPSS-------SRKKWKGRNEVECYYCHKKGHFKYRC---WKFKEDQKRKPEANIVEEVVLACVESD
                       GSA  +  +G+      N+  S       +R K +   +V+C+ C K GHFK +C    K  ED         V++ +L  V+S 
Subjt:  --------------VGSAF-VMIKGKDKVDEDNEPSS-------SRKKWKGRNEVECYYCHKKGHFKYRC---WKFKEDQKRKPEANIVEEVVLACVESD

Query:  TKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCK
                DW+LDS  S H  S + +I ++     G + + +G      G+GDV +    G   +L  VR+ P+++ NLIS+G+L D+G+   F     K
Subjt:  TKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCK

Query:  LKFGSQVVAVGHKKSTLY
        +  G++V+A G K  TLY
Subjt:  LKFGSQVVAVGHKKSTLY

RVW94144.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 2.8e-29, identity: 35.71%)
Query:  TAVKLMESLTNRQGSNKESIVGSAFVMI-------KGKDKVDEDNEPSSSRKKWKGRNEVECYYCHKKGHFKYRC--WKFKE------DQKRKPEANIVE
        T  ++  SL N +   K S    +  ++       K K K   + + S  R     + +VECYYCHKKGH K  C   KFKE       +K++ +  +V 
Subjt:  TAVKLMESLTNRQGSNKESIVGSAFVMI-------KGKDKVDEDNEPSSSRKKWKGRNEVECYYCHKKGHFKYRC--WKFKE------DQKRKPEANIVE

Query:  EVVLACVESDTKYS--NHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADD
        +  L  +  D   +     +DW++DS  S H+ S     TS++    G +RMGN   SK  G+GD+ L+T  G KL+L DVR+ P+I++NLIS GKL D+
Subjt:  EVVLACVESDTKYS--NHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADD

Query:  GYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAK
        GY   F   + KL  GS VVA G K  +LY  +  + K
Subjt:  GYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAK

VFQ72060.1 unnamed protein product [Cuscuta campestris] (e-value: 3.7e-29, identity: 38.71%)
Query:  IRMCLSMDVARLVAHETTAVKLMESLTNRQGSNKESIVGSAFVMIKGKDKVDEDNEPSSSRKKWKGRNEVECYYCHKKGHFKYRCWKFKEDQKRKPEANI
        ++  +  +  R   HET+  K    +T  +G +K+   G      +GK +       S SR K+K    VEC+YCHKKGH    C+K K   K KPE   
Subjt:  IRMCLSMDVARLVAHETTAVKLMESLTNRQGSNKESIVGSAFVMIKGKDKVDEDNEPSSSRKKWKGRNEVECYYCHKKGHFKYRCWKFKEDQKRKPEANI

Query:  VE--EVVLA------CVESDTKYSNHSSD---WILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKM
         E    V A       V SD    N +SD   W++DS  + H  S K   TS+T    G+L+MGN   S+  GIG V L+T+ G KL+L +VR+ P+I++
Subjt:  VE--EVVLA------CVESDTKYSNHSSD---WILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKM

Query:  NLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVA
        NLIS   L D+GYM  FG  QCKL  GS +VA G K S LY    +V+
Subjt:  NLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVA

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9FSS1 Uncharacterized protein (e-value: 2.5e-31, identity: 30.84%)
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESL---------------------
        G+ KFDG +FGYW+MQ++DYL  KK+H   L E+P++M D +W  LD + +  IR+ LS  VA  V  E T  +LM +L                     
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESL---------------------

Query:  --------TNRQGSNKESIVGSAFVMIKGKDKVDEDNEPSSSRKKWKGRN------EVECYYCHKKGHFKYRCWKFKEDQKRKPEANIVEEV---VLACV
                  R+ + + S  GSA + ++ + +V + N      K  KGR+      ++EC+ C K GH +  CW+ K+  +      + EEV   +L  V
Subjt:  --------TNRQGSNKESIVGSAFVMIKGKDKVDEDNEPSSSRKKWKGRN------EVECYYCHKKGHFKYRCWKFKEDQKRKPEANIVEEV---VLACV

Query:  ESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGYMCEFGSR
        +S  +       W+LDS  S H  + + +I ++     G + + +       G+GDV +    G   +L  VR+ P +K NLIS+G+L  +G+   F   
Subjt:  ESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGYMCEFGSR

Query:  QCKLKFGSQVVAVGHKKSTLY
          K+  G+ VVA G K  TLY
Subjt:  QCKLKFGSQVVAVGHKKSTLY

A0A2N9GR56 Uncharacterized protein (e-value: 3.3e-31, identity: 32.14%)
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLT-------NRQGSNKESIV--
        G+ KFDG NF YWKMQ++DYL  KK+H   L  +P +M D  W  LD + +  IR+ LS  V   V  ETT V LM +L+       N+ GSN+E +   
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLT-------NRQGSNKESIV--

Query:  --GSAFVMI-------------------KGKDKVDED----NEPSSSRKKWKGRNEVECYYCHKKGHFKYRC--WKFKE----DQKRKPEANIV-----E
          GS  V +                   K K++   D    N+ S SR + + +   EC++  KKGH +  C  WK ++    DQK   E +       E
Subjt:  --GSAFVMI-------------------KGKDKVDED----NEPSSSRKKWKGRNEVECYYCHKKGHFKYRC--WKFKE----DQKRKPEANIV-----E

Query:  EVVLACVESD--TKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADD
        EVV+  V+        N+  +W+ D A + ++   K L  ++  E    ++MGN   SK  GIGDVS+KT  G  +IL +V++  N++ NLIS   +   
Subjt:  EVVLACVESD--TKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADD

Query:  GYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAKGSKRQWMPVKAADGSCRGTVEPTARI
        GY    G+ + KL  G  V   G     LY   +   K  K+++  V        GT+E T ++
Subjt:  GYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAKGSKRQWMPVKAADGSCRGTVEPTARI

A0A2N9J4N9 CCHC-type domain-containing protein (e-value: 5.6e-31, identity: 30.58%)
Query:  LLRVGVMGFLEPKIFDGVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESL-----
        L+RVG M   + K   G+ KFDG +FGYWKMQ++DYL  KK+H   L ++P +M D +W  LD + +  IR+ LS  VA  V  ETT V LM +L     
Subjt:  LLRVGVMGFLEPKIFDGVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESL-----

Query:  ----------TNRQGSNKESIVGSAFVMI---------------------KGKDKVDE-----------------DNEPSS---------SRKKWKGRNE
                  TN+  S +         +I                     KGK K D+                 DN             SR KW+    
Subjt:  ----------TNRQGSNKESIVGSAFVMI---------------------KGKDKVDE-----------------DNEPSS---------SRKKWKGRNE

Query:  V-----------------ECYYCHKKGHFKYRC--WKFK----EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSATSVHIASDKSLI
                          EC++C KKGH +  C  W+ +    ED K   E         EEVV+  V+        N   +W++DSA + H+   K L 
Subjt:  V-----------------ECYYCHKKGHFKYRC--WKFK----EDQKRKPEANIV-----EEVVLACVESD--TKYSNHSSDWILDSATSVHIASDKSLI

Query:  TSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAK
        T++     G ++MGN   SK  GIGDV +KT  G  ++L +VR+ P++  NLIS   +   GY    G+ + KL  G  VVA G     LY+  +   K
Subjt:  TSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAK

A0A2N9JC56 Uncharacterized protein (e-value: 3.3e-31, identity: 32.14%)
Query:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLT-------NRQGSNKESIV--
        G+ KFDG NF YWKMQ++DYL  KK+H   L  +P +M D  W  LD + +  IR+ LS  V   V  ETT V LM +L+       N+ GSN+E +   
Subjt:  GVMKFDGKNFGYWKMQVKDYLTCKKVH-KALKERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLT-------NRQGSNKESIV--

Query:  --GSAFVMI-------------------KGKDKVDED----NEPSSSRKKWKGRNEVECYYCHKKGHFKYRC--WKFKE----DQKRKPEANIV-----E
          GS  V +                   K K++   D    N+ S SR + + +   EC++  KKGH +  C  WK ++    DQK   E +       E
Subjt:  --GSAFVMI-------------------KGKDKVDED----NEPSSSRKKWKGRNEVECYYCHKKGHFKYRC--WKFKE----DQKRKPEANIV-----E

Query:  EVVLACVESD--TKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADD
        EVV+  V+        N+  +W+ D A + ++   K L  ++  E    ++MGN   SK  GIGDVS+KT  G  +IL +V++  N++ NLIS   +   
Subjt:  EVVLACVESD--TKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADD

Query:  GYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAKGSKRQWMPVKAADGSCRGTVEPTARI
        GY    G+ + KL  G  V   G     LY   +   K  K+++  V        GT+E T ++
Subjt:  GYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAKGSKRQWMPVKAADGSCRGTVEPTARI

A0A6A3CZB7 Histidine kinase 5 (e-value: 2.8e-30, identity: 30.7%)
Query:  LRVGVMGFLEPKIFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKALK-ERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLTNR---
        +RVG M   E K+   + KFDG +FG+WKMQ++D+L  K +++ L  ++P+ M D DW  LD +A+  IR+ LS +VA  +A E T   LM +L++    
Subjt:  LRVGVMGFLEPKIFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKALK-ERPKEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLTNR---

Query:  -QGSNKESIVGSAF--------------------------VMIKGKDKV----------DEDNEPSSSRKKWK-----------------GRNEVECYYC
           SNK  ++   F                          V I+  D+V          D  N   ++RK ++                 G+ +  CY C
Subjt:  -QGSNKESIVGSAF--------------------------VMIKGKDKV----------DEDNEPSSSRKKWK-----------------GRNEVECYYC

Query:  HKKGHFKYRCWKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGK
         KKGHFK  C   K+D   +  AN+ EE   A + S    ++    WILDS  S H    + ++ ++     G + + +  T K  G GD+ LK      
Subjt:  HKKGHFKYRCWKFKEDQKRKPEANIVEEVVLACVESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGK

Query:  LILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLY
          L  VR+ P +K NLIS+G+L  +GY   F   + K+  G+ V+A G K  TLY
Subjt:  LILGDVRYEPNIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLY

SwissProt top hits (e-value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 2.5e-20, identity: 30.31%)
Query:  DVARLVAHETTAVKLME----SLTNRQGSNKESIVGSAFVMI-KGKDKVDEDNEPSSSRKKWKGRNEVE-----CYYCHKKGHFKYRCWKFK----EDQK
        ++A  + H  T ++L +     L N +   K    G A +   +G+      N    S  + K +N  +     CY C++ GHFK  C   +    E   
Subjt:  DVARLVAHETTAVKLME----SLTNRQGSNKESIVGSAFVMI-KGKDKVDEDNEPSSSRKKWKGRNEVE-----CYYCHKKGHFKYRCWKFK----EDQK

Query:  RKPEANIV------EEVVLACVESD--TKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEP
        +K + N        + VVL   E +     S   S+W++D+A S H    + L   +     G ++MGN   SK  GIGD+ +KT  G  L+L DVR+ P
Subjt:  RKPEANIV------EEVVLACVESD--TKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEP

Query:  NIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAKG
        +++MNLIS   L  DGY   F +++ +L  GS V+A G  + TLYR    + +G
Subjt:  NIKMNLISIGKLADDGYMCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAKG

P25601 Putative transposon Ty5-1 protein YCL075W (e-value: 3.0e-05, identity: 34.48%)
Query:  CVESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISI
        C+ S T  +  SS+WI D+  + H+  D+S+ +SFT   R     G G +    G G V++     G + L DV Y P++ +NLIS+
Subjt:  CVESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISI

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGAGGACTTTTCTTGAAGTTGCGGCTACTTGTTTTTCTTTACAGCAAGCTGACATGCGGCAGTGAGATATTTGGAGAGAATGAGATTTATTCTCTTGTGGGATTGAG
GGTGTTTTGTTTCTTTGTTTATCCCAAGCTATCGGGTTGTGATACTTGGAAGTCAATAGCTTTTGTTGGGCTATTGAGGGTTGGAGTCATGGGTTTTCTAGAGCCAAAAA
TTTTCGATGGAGTCATGAAGTTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAAGATTATTTAACTTGCAAGAAAGTGCATAAGGCATTGAAGGAGAGACCG
AAAGAGATGTCAGACGGAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGTAACCATAAGGATGTGTTTGTCGATGGATGTGGCACGTCTAGTAGCCCATGAGACAACTGC
AGTCAAGTTGATGGAATCGCTTACAAACAGGCAGGGTAGTAATAAAGAGTCTATTGTAGGGTCAGCTTTCGTTATGATTAAAGGTAAAGATAAGGTCGATGAAGATAATG
AACCGAGTAGTAGTAGGAAAAAGTGGAAAGGTAGAAATGAGGTAGAATGTTATTACTGCCATAAGAAAGGTCACTTCAAGTATCGGTGTTGGAAATTTAAAGAGGATCAG
AAAAGAAAACCAGAGGCAAATATAGTGGAGGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTCATCAGATTGGATATTAGACAGTGCAACTTC
TGTTCACATAGCTTCAGATAAAAGTTTGATCACATCATTCACAGGAGAGCATCGTGGCCTATTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGGGATTGGAGATG
TTAGTCTGAAGACAGAATGTGGAGGTAAATTGATACTGGGAGATGTCAGGTACGAGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGGCAGATGATGGTTAC
ATGTGTGAGTTTGGTAGTCGCCAGTGTAAACTCAAGTTCGGATCCCAAGTAGTGGCAGTTGGTCACAAGAAATCTACATTGTACAGATGTGAGTTGAATGTTGCCAAAGG
TTCAAAGAGACAGTGGATGCCGGTTAAAGCTGCAGATGGTAGTTGTAGAGGTACAGTTGAGCCAACAGCGAGGATAGTCAATTTCGATCAGTTCGATCAAGATCCTTCAG
TTCACAAACAATTGGGAAGTCTAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGGTTAGGCGAGTTGAGGCATCA
AAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTTTAGCTTGGTAACAGGTTTGAATAGAGTATTCAAGCCATTCTCAGAGTGTATCTTCTTTAGGAACAG
TTGTTCGGATTGGAAGAAGATGACAGGTATTTTGAGCCTTGGAGTTTACCAAGAAGATCAGAGATAA
mRNA sequence
ATGAGAGGACTTTTCTTGAAGTTGCGGCTACTTGTTTTTCTTTACAGCAAGCTGACATGCGGCAGTGAGATATTTGGAGAGAATGAGATTTATTCTCTTGTGGGATTGAG
GGTGTTTTGTTTCTTTGTTTATCCCAAGCTATCGGGTTGTGATACTTGGAAGTCAATAGCTTTTGTTGGGCTATTGAGGGTTGGAGTCATGGGTTTTCTAGAGCCAAAAA
TTTTCGATGGAGTCATGAAGTTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAAGATTATTTAACTTGCAAGAAAGTGCATAAGGCATTGAAGGAGAGACCG
AAAGAGATGTCAGACGGAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGTAACCATAAGGATGTGTTTGTCGATGGATGTGGCACGTCTAGTAGCCCATGAGACAACTGC
AGTCAAGTTGATGGAATCGCTTACAAACAGGCAGGGTAGTAATAAAGAGTCTATTGTAGGGTCAGCTTTCGTTATGATTAAAGGTAAAGATAAGGTCGATGAAGATAATG
AACCGAGTAGTAGTAGGAAAAAGTGGAAAGGTAGAAATGAGGTAGAATGTTATTACTGCCATAAGAAAGGTCACTTCAAGTATCGGTGTTGGAAATTTAAAGAGGATCAG
AAAAGAAAACCAGAGGCAAATATAGTGGAGGAGGTTGTCTTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTCATCAGATTGGATATTAGACAGTGCAACTTC
TGTTCACATAGCTTCAGATAAAAGTTTGATCACATCATTCACAGGAGAGCATCGTGGCCTATTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGGGATTGGAGATG
TTAGTCTGAAGACAGAATGTGGAGGTAAATTGATACTGGGAGATGTCAGGTACGAGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAGTTGGCAGATGATGGTTAC
ATGTGTGAGTTTGGTAGTCGCCAGTGTAAACTCAAGTTCGGATCCCAAGTAGTGGCAGTTGGTCACAAGAAATCTACATTGTACAGATGTGAGTTGAATGTTGCCAAAGG
TTCAAAGAGACAGTGGATGCCGGTTAAAGCTGCAGATGGTAGTTGTAGAGGTACAGTTGAGCCAACAGCGAGGATAGTCAATTTCGATCAGTTCGATCAAGATCCTTCAG
TTCACAAACAATTGGGAAGTCTAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGGTTAGGCGAGTTGAGGCATCA
AAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTTTAGCTTGGTAACAGGTTTGAATAGAGTATTCAAGCCATTCTCAGAGTGTATCTTCTTTAGGAACAG
TTGTTCGGATTGGAAGAAGATGACAGGTATTTTGAGCCTTGGAGTTTACCAAGAAGATCAGAGATAA
Protein sequence
MRGLFLKLRLLVFLYSKLTCGSEIFGENEIYSLVGLRVFCFFVYPKLSGCDTWKSIAFVGLLRVGVMGFLEPKIFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKALKERP
KEMSDGDWEALDEEAVVTIRMCLSMDVARLVAHETTAVKLMESLTNRQGSNKESIVGSAFVMIKGKDKVDEDNEPSSSRKKWKGRNEVECYYCHKKGHFKYRCWKFKEDQ
KRKPEANIVEEVVLACVESDTKYSNHSSDWILDSATSVHIASDKSLITSFTGEHRGLLRMGNGRTSKTRGIGDVSLKTECGGKLILGDVRYEPNIKMNLISIGKLADDGY
MCEFGSRQCKLKFGSQVVAVGHKKSTLYRCELNVAKGSKRQWMPVKAADGSCRGTVEPTARIVNFDQFDQDPSVHKQLGSLGEKVDGYRESPVVRRSNELKKSVRRVEAS
KWKARAVAKVKGQVFSLVTGLNRVFKPFSECIFFRNSCSDWKKMTGILSLGVYQEDQR