CuGenDBv2

Lag0015381 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015381
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr12:11930079..11931068
RNA-Seq Expression: Lag0015381
Synteny: Lag0015381
Gene Ontology terms: NA
InterPro domains:
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
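The zinc knuckle signature name CX2CX4HX4C can be read as a spacing pattern: Cys, two residues, Cys, four residues, His, four residues, Cys. The sketch below is a minimal illustration of that reading as a regular expression, applied to the C-terminal stretch of the query shown in the alignments on this page. It is an assumption that a literal regex is a fair stand-in here; InterPro assigns domains with its member-database models, not with this pattern, and the function name `find_zinc_knuckles` is hypothetical.

```python
import re

# Literal reading of the signature name "CX2CX4HX4C":
# Cys, any 2 residues, Cys, any 4 residues, His, any 4 residues, Cys.
# This is an illustrative pattern, not the model InterPro actually uses.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein):
    """Return (0-based start, matched residues) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# C-terminal stretch of the Lag0015381 query as shown in the alignments.
fragment = "TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG"
print(find_zinc_knuckles(fragment))
```

On this fragment the pattern picks out the stretch CGYCGIIGHSVKDC, which is consistent with the CCHC-type description of the gene.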


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] (e-value: 2.5e-58; identity: 45.38%)
Query:  MALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIP-SGLTVEKLGPNLFLFSLRSEEEQARILQQGP
        MA  ++L  W+NF LT+EE+  AVD+D  A E T + L  SLI KL+S R I   V++   K AW +     +V+ +G N+FLF+     ++ RIL+ GP
Subjt:  MALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIP-SGLTVEKLGPNLFLFSLRSEEEQARILQQGP

Query:  WLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCW
        W FD+ L+++  P+ + KP  M+F+ V+ WVHF++L +   N +M TRLGNAIG FE+ +    NF W   LRVRV  D+ KPL R IK+NLD PMG CW
Subjt:  WLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCW

Query:  TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG
         PI+YE+LPD   +CG + H +KDCS       S S+  QYG WL++ G
Subjt:  TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia] (e-value: 3.8e-38; identity: 36.29%)
Query:  LEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLF
        ++++ + WE+F  T +E  T V +DR    +T+ ++   ++ KL + + I  + +R   KS W + +    E LG N+++   +S  E++R+L  GPW F
Subjt:  LEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLF

Query:  DKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWK-ESLRVRVILDITKPLRRCIKVNLDEPMGSCWTP
        +K LLVL+ P    +P  M F F AFW+  + +P E  +  M   LG  +G  EE +  G + GW    +RVRV +D++KPLRR IK+   +     W P
Subjt:  DKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWK-ESLRVRVILDITKPLRRCIKVNLDEPMGSCWTP

Query:  IRYEKLPDLCGYCGIIGHSVKDCS--AYYIAAGSPSQCNQYGMWLQYT
        +RYEKLPD C  CG IGHS ++C   +  +   SP    QYG WL+ T
Subjt:  IRYEKLPDLCGYCGIIGHSVKDCS--AYYIAAGSPSQCNQYGMWLQYT

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] (e-value: 1.9e-66; identity: 48.37%)
Query:  EDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLFD
        E++L  W+ F LT+EE+  A+DVD  A ++  + L +SL+GKL++ R I  DV+ +    AW +   LTVE +G NLFLF    E +  R+++ GPW FD
Subjt:  EDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLFD

Query:  KFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPIR
        K L+VL KP        +EF  VAFW+H ++LPM   N +M  RLGNAIG F + D   + F W  SLR+RV++DITKPLRR IK+N+D PMG CW PI+
Subjt:  KFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPIR

Query:  YEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQC-NQYGMWLQYTG
        YE+LPD C +CG+IGHS  DC A Y+AA   S+  ++YG WL++ G
Subjt:  YEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQC-NQYGMWLQYTG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e-value: 1.2e-52; identity: 42.97%)
Query:  MALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIP-SGLTVEKLGPNLFLFSLRSEEEQARILQQGP
        MA  D+L  W+NF LT+EEE TA+DVD  A   T   L   L+GKL   R I   VM+   ++AW +  +   V+ LG NLFLFS     ++ +I + GP
Subjt:  MALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIP-SGLTVEKLGPNLFLFSLRSEEEQARILQQGP

Query:  WLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCW
        W FD+ L++++KP+ ++ P  ++F  +  WV F++LP+      M  RLGNA+G FEE D    N  W  +LRVRV+LDI+KPLRR IK+NLD P+G  W
Subjt:  WLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCW

Query:  TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG
         PI+YE+LPD C +CG+                S  + +QYG WL+Y G
Subjt:  TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] (e-value: 1.0e-35; identity: 35.1%)
Query:  EDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLFD
        + +L+     +LT+EE+   V +  ++T +        L+GKL++ R    + M+    S W    G+ V  +G NLF+F      ++ R+L  GPW FD
Subjt:  EDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLFD

Query:  KFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLD--EPMGSCWTP
        K LL+L +  P V+P  ++   V FWVH   LP+ L N  +   +GNA+G+F + D       W  ++R+RV LD+ KPLRR +K+ L   EP+   W  
Subjt:  KFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLD--EPMGSCWTP

Query:  IRYEKLPDLCGYCGIIGHSVKDCSAYYIAA-GSPSQCNQYGMWLQ
         +YE+LP  C +CG +GHS ++C     +A GS     QYG WL+
Subjt:  IRYEKLPDLCGYCGIIGHSVKDCSAYYIAA-GSPSQCNQYGMWLQ

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9GWE9 Uncharacterized protein (e-value: 7.7e-37; identity: 34.02%)
Query:  EDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAW-NIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLF
        ++++  W  F+L TE EG  V +   A EV+       L+GKL + ++   + ++      W  +  G+T   +G NLF+F  R + E+ R++   PWLF
Subjt:  EDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAW-NIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLF

Query:  DKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPI
        D  LL+L +         ++F +  FWV FY +P+         ++G+  G+ EE D+     GW   LRVR+ LDITKP+ R   V  +  +G  W   
Subjt:  DKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPI

Query:  RYEKLPDLCGYCGIIGHSVKDC-SAYYIAAGSPSQCNQYGMWLQ
        +YE+LP LC +CG+IGH  +DC S     + SP    QYG WL+
Subjt:  RYEKLPDLCGYCGIIGHSVKDC-SAYYIAAGSPSQCNQYGMWLQ

A0A6J1BSZ1 uncharacterized protein LOC111005481 (e-value: 1.2e-58; identity: 45.38%)
Query:  MALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIP-SGLTVEKLGPNLFLFSLRSEEEQARILQQGP
        MA  ++L  W+NF LT+EE+  AVD+D  A E T + L  SLI KL+S R I   V++   K AW +     +V+ +G N+FLF+     ++ RIL+ GP
Subjt:  MALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIP-SGLTVEKLGPNLFLFSLRSEEEQARILQQGP

Query:  WLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCW
        W FD+ L+++  P+ + KP  M+F+ V+ WVHF++L +   N +M TRLGNAIG FE+ +    NF W   LRVRV  D+ KPL R IK+NLD PMG CW
Subjt:  WLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCW

Query:  TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG
         PI+YE+LPD   +CG + H +KDCS       S S+  QYG WL++ G
Subjt:  TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG

A0A6J1D765 uncharacterized protein LOC111017902 (e-value: 1.8e-38; identity: 36.29%)
Query:  LEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLF
        ++++ + WE+F  T +E  T V +DR    +T+ ++   ++ KL + + I  + +R   KS W + +    E LG N+++   +S  E++R+L  GPW F
Subjt:  LEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLF

Query:  DKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWK-ESLRVRVILDITKPLRRCIKVNLDEPMGSCWTP
        +K LLVL+ P    +P  M F F AFW+  + +P E  +  M   LG  +G  EE +  G + GW    +RVRV +D++KPLRR IK+   +     W P
Subjt:  DKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWK-ESLRVRVILDITKPLRRCIKVNLDEPMGSCWTP

Query:  IRYEKLPDLCGYCGIIGHSVKDCS--AYYIAAGSPSQCNQYGMWLQYT
        +RYEKLPD C  CG IGHS ++C   +  +   SP    QYG WL+ T
Subjt:  IRYEKLPDLCGYCGIIGHSVKDCS--AYYIAAGSPSQCNQYGMWLQYT

A0A6J1DU55 uncharacterized protein LOC111023135 (e-value: 9.3e-67; identity: 48.37%)
Query:  EDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLFD
        E++L  W+ F LT+EE+  A+DVD  A ++  + L +SL+GKL++ R I  DV+ +    AW +   LTVE +G NLFLF    E +  R+++ GPW FD
Subjt:  EDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLFD

Query:  KFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPIR
        K L+VL KP        +EF  VAFW+H ++LPM   N +M  RLGNAIG F + D   + F W  SLR+RV++DITKPLRR IK+N+D PMG CW PI+
Subjt:  KFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPIR

Query:  YEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQC-NQYGMWLQYTG
        YE+LPD C +CG+IGHS  DC A Y+AA   S+  ++YG WL++ G
Subjt:  YEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQC-NQYGMWLQYTG

A0A6J1DX30 uncharacterized protein LOC111024874 (e-value: 5.8e-53; identity: 42.97%)
Query:  MALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIP-SGLTVEKLGPNLFLFSLRSEEEQARILQQGP
        MA  D+L  W+NF LT+EEE TA+DVD  A   T   L   L+GKL   R I   VM+   ++AW +  +   V+ LG NLFLFS     ++ +I + GP
Subjt:  MALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIP-SGLTVEKLGPNLFLFSLRSEEEQARILQQGP

Query:  WLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCW
        W FD+ L++++KP+ ++ P  ++F  +  WV F++LP+      M  RLGNA+G FEE D    N  W  +LRVRV+LDI+KPLRR IK+NLD P+G  W
Subjt:  WLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCW

Query:  TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG
         PI+YE+LPD C +CG+                S  + +QYG WL+Y G
Subjt:  TPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTG

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT2G01050.1 zinc ion binding;nucleic acid binding (e-value: 5.8e-13; identity: 22.22%)
Query:  EEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLFDKFLLVLSKPIPMVK
        E+E   + +  +  E  +      +I K++  +  +  V+ +  +  W     +TV  L    F+     EEE    L  GPW      L++        
Subjt:  EEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNLFLFSLRSEEEQARILQQGPWLFDKFLLVLSKPIPMVK

Query:  PKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPIRYEKLPDLCGYCGII
        P   +      WV    +P   ++  +   +   +GR  + D+   NF      RV + +++ KPL+  + +N D         + YE L  +C  CGI 
Subjt:  PKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPIRYEKLPDLCGYCGII

Query:  GHSVKDC
        GH V  C
Subjt:  GHSVKDC

AT3G31430.1 unknown protein (e-value: 3.4e-13; identity: 26.94%)
Query:  FSLIGKLVSPRFIVGDVMRKNFKS-------AWNIPSGLTVEK-LGPNLFLFSLRSEEEQARILQQGPWLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVH
        F L G+ V PR       R+N +S        W   SGL   + +    F F    EE    +L++GPW F+ ++++L +     +P+   F F+ FWV 
Subjt:  FSLIGKLVSPRFIVGDVMRKNFKS-------AWNIPSGLTVEK-LGPNLFLFSLRSEEEQARILQQGPWLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVH

Query:  FYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPIRYEKLPDLCGYCGIIGHSVKDC
           +P +  N  +   +G A+G+  + D         +  RV +  DIT PLR          + +     RYE+L   C  CG++ H    C
Subjt:  FYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVNLDEPMGSCWTPIRYEKLPDLCGYCGIIGHSVKDC

AT3G42140.1 zinc ion binding;nucleic acid binding (e-value: 1.1e-08; identity: 24.65%)
Query:  FSLRSEEEQARILQQGPWLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKP
        F  +SEE    IL++GPW F+ ++ V+ +   +      EFK + FW+    +P+    A + T +G  +G F E ++G                     
Subjt:  FSLRSEEEQARILQQGPWLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKP

Query:  LRRCIKVNLDEPMGSCWTPIRYEKLPDLCGYCGIIGHSVKDC
          R + V             +YEKL + C  CG++ H   +C
Subjt:  LRRCIKVNLDEPMGSCWTPIRYEKLPDLCGYCGIIGHSVKDC


Sequences
CDS sequence
ATGGTGGTTAGTCATCGCAATCGGTGCAGTCGGAAATTCCTTTTCTTCTTTGCAGGTTTTGTTAAAGTCAGAGATTCGACTTTTGATTCAATGGCTCTCGAAGACATGCT
CAATCACTGGGAAAATTTCAATCTCACTACTGAGGAGGAAGGAACAGCGGTTGATGTGGACCGACAAGCGACAGAGGTCACTAGCAGATCGTTGGGGTTTAGTCTTATTG
GCAAACTGGTATCTCCTCGCTTTATCGTCGGTGACGTTATGCGGAAAAACTTCAAATCCGCCTGGAACATACCATCAGGCCTCACTGTGGAGAAATTAGGACCAAATTTA
TTCCTTTTTTCTCTGCGATCAGAAGAGGAACAAGCTCGTATTCTTCAGCAGGGACCTTGGTTATTTGACAAGTTTTTACTTGTTCTTTCAAAACCAATCCCAATGGTTAA
ACCGAAAGCTATGGAGTTCAAGTTTGTGGCGTTCTGGGTCCATTTTTACGAACTTCCGATGGAGTTGTTCAACGCATCAATGACGACACGACTAGGAAACGCCATCGGAC
GTTTCGAGGAATATGATATTGGAGGACGCAACTTTGGTTGGAAGGAAAGTCTTCGCGTCCGTGTTATCCTTGATATCACCAAACCACTTCGACGGTGTATCAAAGTCAAT
CTTGATGAACCTATGGGGAGTTGCTGGACTCCAATCCGCTATGAGAAGCTTCCGGATCTGTGTGGTTACTGTGGGATTATCGGACACAGTGTCAAAGATTGCAGTGCTTA
CTATATAGCGGCTGGATCTCCGTCACAATGTAATCAGTATGGTATGTGGCTACAATATACAGGGAGAACGACCACTTTGTTTCGATCCCCTAGCACCAGTCCGATGAGCA
TGAACAGGATGGCGGTGGATCAACCCTCTGCTCTTCAGTCTCCTCTAGTCGCCGGGAAATTTAGCCATCCACGTTTTCTTGGCCAACTTGGTGGTTCGGTTAATACCTAA
mRNA sequence
ATGGTGGTTAGTCATCGCAATCGGTGCAGTCGGAAATTCCTTTTCTTCTTTGCAGGTTTTGTTAAAGTCAGAGATTCGACTTTTGATTCAATGGCTCTCGAAGACATGCT
CAATCACTGGGAAAATTTCAATCTCACTACTGAGGAGGAAGGAACAGCGGTTGATGTGGACCGACAAGCGACAGAGGTCACTAGCAGATCGTTGGGGTTTAGTCTTATTG
GCAAACTGGTATCTCCTCGCTTTATCGTCGGTGACGTTATGCGGAAAAACTTCAAATCCGCCTGGAACATACCATCAGGCCTCACTGTGGAGAAATTAGGACCAAATTTA
TTCCTTTTTTCTCTGCGATCAGAAGAGGAACAAGCTCGTATTCTTCAGCAGGGACCTTGGTTATTTGACAAGTTTTTACTTGTTCTTTCAAAACCAATCCCAATGGTTAA
ACCGAAAGCTATGGAGTTCAAGTTTGTGGCGTTCTGGGTCCATTTTTACGAACTTCCGATGGAGTTGTTCAACGCATCAATGACGACACGACTAGGAAACGCCATCGGAC
GTTTCGAGGAATATGATATTGGAGGACGCAACTTTGGTTGGAAGGAAAGTCTTCGCGTCCGTGTTATCCTTGATATCACCAAACCACTTCGACGGTGTATCAAAGTCAAT
CTTGATGAACCTATGGGGAGTTGCTGGACTCCAATCCGCTATGAGAAGCTTCCGGATCTGTGTGGTTACTGTGGGATTATCGGACACAGTGTCAAAGATTGCAGTGCTTA
CTATATAGCGGCTGGATCTCCGTCACAATGTAATCAGTATGGTATGTGGCTACAATATACAGGGAGAACGACCACTTTGTTTCGATCCCCTAGCACCAGTCCGATGAGCA
TGAACAGGATGGCGGTGGATCAACCCTCTGCTCTTCAGTCTCCTCTAGTCGCCGGGAAATTTAGCCATCCACGTTTTCTTGGCCAACTTGGTGGTTCGGTTAATACCTAA
Protein sequence
MVVSHRNRCSRKFLFFFAGFVKVRDSTFDSMALEDMLNHWENFNLTTEEEGTAVDVDRQATEVTSRSLGFSLIGKLVSPRFIVGDVMRKNFKSAWNIPSGLTVEKLGPNL
FLFSLRSEEEQARILQQGPWLFDKFLLVLSKPIPMVKPKAMEFKFVAFWVHFYELPMELFNASMTTRLGNAIGRFEEYDIGGRNFGWKESLRVRVILDITKPLRRCIKVN
LDEPMGSCWTPIRYEKLPDLCGYCGIIGHSVKDCSAYYIAAGSPSQCNQYGMWLQYTGRTTTLFRSPSTSPMSMNRMAVDQPSALQSPLVAGKFSHPRFLGQLGGSVNT
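
The CDS and protein sequences above can be cross-checked by translating the nucleotides with the standard genetic code. The sketch below is a minimal, dependency-free illustration that translates only the first and last few codons of the CDS as listed on this page. The helper name `translate` is hypothetical, and it is an assumption that the standard nuclear code applies and that the CDS is read in frame from its first base.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # first base T
         "LLLLPPPPHHQQRRRR"   # first base C
         "IIIMTTTTNNKKSSRR"   # first base A
         "VVVVAAAADDEEGGGG")  # first base G
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS as listed above.
cds_start = "ATGGTGGTTAGTCATCGCAATCGGTGCAGT"
cds_end = "GGTTCGGTTAATACCTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two translated fragments can then be compared by eye with the start (MVVSHRNRCS) and end (GSVNT) of the listed protein sequence.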