; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041388 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041388
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr13:16940880..16941764
RNA-Seq ExpressionLag0041388
SyntenyLag0041388
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]5.3e-6851.54Show/hide
Query:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG
        M N   +    + +S GFS+PPLNQ+LNQ+ T+KLDR NYLLWK LALPIL+ YKLEGHL G  P P   +   +   ++  T    GADA+       G
Subjt:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG

Query:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG
        ASSS T   VN L+E W+TTD L LGWLYNSMT  VA Q+MGF N ++LW+A Q+ FG+QSRAEED+LRQ+ Q TRKGNTKM +YL +MK++ DNLGQ G
Subjt:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG

Query:  SPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNA---KKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSN
        SPV  R LISQVLLG+DE YN V+ +IQG+ D+SW ++Q++LL+FEK L+ QN    KK       + ++N+A     +   N  N + +  N
Subjt:  SPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNA---KKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSN

TYJ96311.1 uncharacterized protein E5676_scaffold1970G00140 [Cucumis melo var. makuwa]3.4e-5456.87Show/hide
Query:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG
        M N   +    + +S GFS+PPLNQ+LNQ+ T+KLDR NYLLWK LALPIL+ YKLEGHL G  P P   +   +   ++  T    GADA+       G
Subjt:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG

Query:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG
        ASSS T   VN L+E W+TTD L LGWLYNSMT  VA Q+MGF N ++LW+A Q+ FG+QSRAEED+LRQ+ Q TRKGNTKM +YL +MK++ DNLGQ G
Subjt:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG

Query:  SPVTPRNLISQ
        SPV  R LISQ
Subjt:  SPVTPRNLISQ

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]2.3e-7152.82Show/hide
Query:  PIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGGASSSTTAM
        P A  +   F+SPPLNQLLNQIT+IK+DRGN+LLW+NLALPILRSYKL  +L G KP PP  +  T                   +  +  G++SS ++ 
Subjt:  PIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGGASSSTTAM

Query:  TVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNL
        T+NP YEAWI  D+L LGWLYNSM + VA QVMGF  ++ELW A+Q LFG+QSRAE DYL+Q+FQQT KG+ +M +YL+LMKSH+DNL  AGS V+ R+L
Subjt:  TVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNL

Query:  ISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSNNN
        +SQVL G+DEEYNP+V  +QG+ ++SWSE+ AELL +EKRLE QN+ K+ +  N T + +V      SF +N   + G NS+ +
Subjt:  ISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSNNN

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]2.8e-6153.51Show/hide
Query:  TTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPP---MMINQTT----EGITNGATGAAAGADASSSTRSAGGASSSTTAMTVNPLYEAWITTDQL
        TTIKLD+ NYLLW+NLALPILRSY+LEGHL G  P PP   +  +Q+T     G   G  G  +G  + +  +    AS+S+  + VNP YE+    DQL
Subjt:  TTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPP---MMINQTT----EGITNGATGAAAGADASSSTRSAGGASSSTTAMTVNPLYEAWITTDQL

Query:  FLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNLISQVLLGMDEEYNPV
         LGWLYN MT++VA QVMG+EN K LW AIQ LFG+QSRA EDYLRQ+FQQT KG  KM +YLR+MK+HSDNLG  GSPV  R L+SQVLLG+DEE+NP 
Subjt:  FLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNLISQVLLGMDEEYNPV

Query:  VAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSNNNQR
        VA IQGR ++SW+ +Q ELL FEKR    N                  N RG    N G  R WN+N+  R
Subjt:  VAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSNNNQR

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]5.7e-5458.37Show/hide
Query:  EGITNGATGAAAGADASSSTRSAGGASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQT
        E I  G+TGA A A  S    S  GASSS TA+ VNP YE+W+  DQL LGWLYNSMT +VA QVMG E AK+LW +I  LFG+QSR EEDYLR +FQ T
Subjt:  EGITNGATGAAAGADASSSTRSAGGASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQT

Query:  RKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNH--TASVNVASNS
        RKGN KM +YL+ MK ++DNL QAGSP+ PR L+SQVLLG+DEEYN +VAMIQGR D+SW ++Q+ELL++E+RLE Q+ +KT+V FN    ASVN+ +N+
Subjt:  RKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNH--TASVNVASNS

Query:  RGSFNSNPGNSRGWNSNNNQR
        R    +N  NS   +    QR
Subjt:  RGSFNSNPGNSRGWNSNNNQR

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein2.6e-6851.54Show/hide
Query:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG
        M N   +    + +S GFS+PPLNQ+LNQ+ T+KLDR NYLLWK LALPIL+ YKLEGHL G  P P   +   +   ++  T    GADA+       G
Subjt:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG

Query:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG
        ASSS T   VN L+E W+TTD L LGWLYNSMT  VA Q+MGF N ++LW+A Q+ FG+QSRAEED+LRQ+ Q TRKGNTKM +YL +MK++ DNLGQ G
Subjt:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG

Query:  SPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNA---KKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSN
        SPV  R LISQVLLG+DE YN V+ +IQG+ D+SW ++Q++LL+FEK L+ QN    KK       + ++N+A     +   N  N + +  N
Subjt:  SPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNA---KKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSN

A0A5A7VPY0 Uncharacterized protein1.1e-5056.67Show/hide
Query:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG
        MTN       AT  +T F++P LNQ+LNQ+TTIKLDRGNYLLWK LALPIL+SYKL  HL G  P  P +I  TT+   N +    AG  +  +      
Subjt:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG

Query:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG
        +SSST  +TVNP YE WITTD L LGWLYNSMT +V  Q+MGF NAK+LWEA Q+LFGIQSRA+ED+L Q FQ T+KGN  M +YLR MK++ +NLGQA 
Subjt:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG

Query:  SPVTPRNLIS
        S V    ++S
Subjt:  SPVTPRNLIS

A0A5D3BCH9 Uncharacterized protein1.6e-5456.87Show/hide
Query:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG
        M N   +    + +S GFS+PPLNQ+LNQ+ T+KLDR NYLLWK LALPIL+ YKLEGHL G  P P   +   +   ++  T    GADA+       G
Subjt:  MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGG

Query:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG
        ASSS T   VN L+E W+TTD L LGWLYNSMT  VA Q+MGF N ++LW+A Q+ FG+QSRAEED+LRQ+ Q TRKGNTKM +YL +MK++ DNLGQ G
Subjt:  ASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG

Query:  SPVTPRNLISQ
        SPV  R LISQ
Subjt:  SPVTPRNLISQ

A0A6J1D5J0 uncharacterized protein LOC1110175013.1e-5359.69Show/hide
Query:  DASSSTRSAGGASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLM
        + S +  SA  +SS  T   +NPLYE+W+TTDQL LGWLYNSMT +VATQVMG+ENA +LW AIQ LFG+QS+AEEDYLRQ+FQQTRKG+ KMTD+LR+M
Subjt:  DASSSTRSAGGASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLM

Query:  KSHSDNLGQAGSPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSR
        KSH+DNLGQAGSPV  R+LISQVLLG+DEEYNPVVA IQG+  +SW E+QAE        + QN                  NS+  FN+N G  R
Subjt:  KSHSDNLGQAGSPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSR

A0A6J1DCW4 uncharacterized protein LOC1110195981.1e-7152.82Show/hide
Query:  PIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGGASSSTTAM
        P A  +   F+SPPLNQLLNQIT+IK+DRGN+LLW+NLALPILRSYKL  +L G KP PP  +  T                   +  +  G++SS ++ 
Subjt:  PIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGGASSSTTAM

Query:  TVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNL
        T+NP YEAWI  D+L LGWLYNSM + VA QVMGF  ++ELW A+Q LFG+QSRAE DYL+Q+FQQT KG+ +M +YL+LMKSH+DNL  AGS V+ R+L
Subjt:  TVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNL

Query:  ISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSNNN
        +SQVL G+DEEYNP+V  +QG+ ++SWSE+ AELL +EKRLE QN+ K+ +  N T + +V      SF +N   + G NS+ +
Subjt:  ISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSNNN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.5e-0423.72Show/hide
Query:  ITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGGASSSTTAMTVNPLYEAWITTDQLFLGWLY
        I  +  D  NY+ WK      LR  K  G + GT P P                                           +PLY+ W   + + + WL 
Subjt:  ITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGGASSSTTAMTVNPLYEAWITTDQLFLGWLY

Query:  NSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDY
        NSMT K+   VM  E A ++WE ++ +F      +   LR+     R+G   + +Y
Subjt:  NSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDY

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.6e-0424.87Show/hide
Query:  GASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSK-VATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQ
        G    T A  VN     W   D +    LY ++T K      +    ++++W  I+N F     A    L    +    G+ ++ DY R MK  +D+L  
Subjt:  GASSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSK-VATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQ

Query:  AGSPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRG----SFNSNPGNSRGW
           PVT RNL+  VL G++ +++ ++ +I+ R      +  A +L  E+    +  K      +H++S  V + S      +F  + GN  G+
Subjt:  AGSPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRG----SFNSNPGNSRGW

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.7e-0927.65Show/hide
Query:  SSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFE-NAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG
        SS+ T MT     + W   D L   W+Y ++T  +   ++     A++LW +++NLF     A         + T   +  + +Y + +KS SD L    
Subjt:  SSSTTAMTVNPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFE-NAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAG

Query:  SPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDV-SWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNV
        SP++ R L+  +L G+ E+Y+ ++ +I+ +    S++E ++ LL+ E RL  ++    S T NH +  NV
Subjt:  SPVTPRNLISQVLLGMDEEYNPVVAMIQGRGDV-SWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAACGTCACAACCTCTGGGCCTATTGCAACCAGCAATTCGACTGGCTTCAGCTCACCTCCTCTCAACCAGCTTTTGAACCAAATCACTACCATCAAATTGGACAG
AGGTAACTACTTACTATGGAAAAATCTCGCTCTTCCAATCCTAAGAAGCTACAAATTGGAGGGGCATCTTCTGGGGACAAAACCTTATCCACCAATGATGATAAATCAGA
CCACAGAAGGTATCACTAATGGTGCAACTGGAGCAGCCGCAGGTGCAGATGCATCAAGCTCAACGAGGTCAGCTGGGGGAGCGTCGAGCTCTACAACAGCAATGACAGTC
AATCCTTTATATGAAGCTTGGATCACTACAGATCAATTGTTTCTCGGATGGCTATACAACTCCATGACATCCAAAGTTGCCACTCAGGTCATGGGTTTTGAGAATGCGAA
AGAGCTATGGGAAGCGATTCAGAATCTATTTGGCATACAGTCAAGAGCGGAGGAGGACTACCTTCGTCAAATTTTTCAACAGACTCGAAAAGGTAACACTAAAATGACAG
ATTATTTGAGGCTAATGAAATCTCATTCAGACAATTTAGGGCAAGCAGGTAGTCCCGTCACGCCTAGAAACCTAATCTCCCAAGTTTTACTTGGGATGGATGAAGAATAC
AACCCTGTCGTCGCCATGATTCAAGGAAGAGGCGATGTTTCGTGGTCTGAACTTCAAGCAGAACTCCTTGTGTTCGAAAAGAGACTGGAGTTACAGAATGCTAAAAAAAC
CTCTGTCACCTTCAATCATACAGCCTCTGTAAATGTAGCAAGCAACAGTAGAGGTTCGTTTAATTCAAACCCAGGGAATTCTAGAGGATGGAACTCTAACAATAATCAGC
GATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCAACGTCACAACCTCTGGGCCTATTGCAACCAGCAATTCGACTGGCTTCAGCTCACCTCCTCTCAACCAGCTTTTGAACCAAATCACTACCATCAAATTGGACAG
AGGTAACTACTTACTATGGAAAAATCTCGCTCTTCCAATCCTAAGAAGCTACAAATTGGAGGGGCATCTTCTGGGGACAAAACCTTATCCACCAATGATGATAAATCAGA
CCACAGAAGGTATCACTAATGGTGCAACTGGAGCAGCCGCAGGTGCAGATGCATCAAGCTCAACGAGGTCAGCTGGGGGAGCGTCGAGCTCTACAACAGCAATGACAGTC
AATCCTTTATATGAAGCTTGGATCACTACAGATCAATTGTTTCTCGGATGGCTATACAACTCCATGACATCCAAAGTTGCCACTCAGGTCATGGGTTTTGAGAATGCGAA
AGAGCTATGGGAAGCGATTCAGAATCTATTTGGCATACAGTCAAGAGCGGAGGAGGACTACCTTCGTCAAATTTTTCAACAGACTCGAAAAGGTAACACTAAAATGACAG
ATTATTTGAGGCTAATGAAATCTCATTCAGACAATTTAGGGCAAGCAGGTAGTCCCGTCACGCCTAGAAACCTAATCTCCCAAGTTTTACTTGGGATGGATGAAGAATAC
AACCCTGTCGTCGCCATGATTCAAGGAAGAGGCGATGTTTCGTGGTCTGAACTTCAAGCAGAACTCCTTGTGTTCGAAAAGAGACTGGAGTTACAGAATGCTAAAAAAAC
CTCTGTCACCTTCAATCATACAGCCTCTGTAAATGTAGCAAGCAACAGTAGAGGTTCGTTTAATTCAAACCCAGGGAATTCTAGAGGATGGAACTCTAACAATAATCAGC
GATGA
Protein sequenceShow/hide protein sequence
MTNVTTSGPIATSNSTGFSSPPLNQLLNQITTIKLDRGNYLLWKNLALPILRSYKLEGHLLGTKPYPPMMINQTTEGITNGATGAAAGADASSSTRSAGGASSSTTAMTV
NPLYEAWITTDQLFLGWLYNSMTSKVATQVMGFENAKELWEAIQNLFGIQSRAEEDYLRQIFQQTRKGNTKMTDYLRLMKSHSDNLGQAGSPVTPRNLISQVLLGMDEEY
NPVVAMIQGRGDVSWSELQAELLVFEKRLELQNAKKTSVTFNHTASVNVASNSRGSFNSNPGNSRGWNSNNNQR