; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031564 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031564
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr11:10096508..10099569
RNA-Seq ExpressionLag0031564
SyntenyLag0031564
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152309.1 uncharacterized protein LOC111020057 [Momordica charantia]8.4e-4251.58Show/hide
Query:  QRRNSRDIYKVDPNL-RMKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLF-LGRRNLEHPIDSWNDLKQLMREHF
        QRR  R+  +VD NL  +K K+ KF+G T+ +EY+QWE +V+ VF C   SEEKK++L V+  + Y  TWWDKL    RRNLE PI SW +  + ++E F
Subjt:  QRRNSRDIYKVDPNL-RMKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLF-LGRRNLEHPIDSWNDLKQLMREHF

Query:  VPKLF-----YNLWTLRQGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTR
        VPK F       L  LRQG+KSVE YY E+  L+D +DL ED    MARF   LNKEIA  +DLQ Y D++EM++LAIKIE+ +QRKS R
Subjt:  VPKLF-----YNLWTLRQGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTR

XP_022157414.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024115 [Momordica charantia]4.3e-4643.91Show/hide
Query:  RLHRSMEKMTMQFEKLDARIKS---VEKQPAFLIRKERSNPMHHKEYWE---------------------VKGKEYGGGFQAVQRRNSRDIYKVDPNL-R
        R+ RSME +T +  +L+ + ++   V   P   +  E        ++WE                      +G+  G G+   QR   R   +VD NL  
Subjt:  RLHRSMEKMTMQFEKLDARIKS---VEKQPAFLIRKERSNPMHHKEYWE---------------------VKGKEYGGGFQAVQRRNSRDIYKVDPNL-R

Query:  MKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLF-LGRRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLR
        +K KL KFYG T+ + YIQW ++V+ VFDC   SEEKKV+L ++  +DY +TWWDKL  + RRNLE PIDSW + K+L+R  FVP+ F+      L  LR
Subjt:  MKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLF-LGRRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLR

Query:  QGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKS
        QGSKSVE YY EM  L++ LDL ED    MARF  GLNKEIA  +DLQ Y +++EM++LAIKIEK LQR+S
Subjt:  QGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKS

XP_023520835.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111784339 [Cucurbita pepo subsp. pepo]5.8e-4341.06Show/hide
Query:  QEDFARLHRSMEKMTMQFEKLDARIKSVEKQP---AFLIRKERSNPMHHKE------YWEVKGKEYGGGFQAVQRRNSRDIYKVDPNL-RMKPKLPKFYG
        Q    RL R +E++T +  +L+ + ++ ++ P         E  N  HH++      +  ++G+++G  +  +Q+R   D  ++D N+  +K KLPKFYG
Subjt:  QEDFARLHRSMEKMTMQFEKLDARIKSVEKQP---AFLIRKERSNPMHHKE------YWEVKGKEYGGGFQAVQRRNSRDIYKVDPNL-RMKPKLPKFYG

Query:  STNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLF-----YNLWTLRQGSKSVEAYY
         T+ +EY+QWE+ V+ VF+C   S++KKV L ++  + Y   WWDKL    RRNLE PIDSW + K+ MR+ FVP+ F       L  L+QG KSVE YY
Subjt:  STNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLF-----YNLWTLRQGSKSVEAYY

Query:  MEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTR
         EM  L+D LDL ED    MARF  GLN EIA   DLQ Y +++E++++AIKIE+ +QR+S R
Subjt:  MEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTR

XP_023553652.1 uncharacterized protein LOC111811140 [Cucurbita pepo subsp. pepo]2.9e-4240.3Show/hide
Query:  QEDFARLHRSMEKMTMQFEKLDARIKSVEKQP---AFLIRKERSNPMHHKE------YWEVKGKEYGGGFQAVQRRNSRDIYKVDPNL-RMKPKLPKFYG
        Q    RL R +E++T +  +L+ + ++ ++ P         E  N  HH++      +  ++G+++G  +  +Q+R   D  ++D N+  +K KLPKFYG
Subjt:  QEDFARLHRSMEKMTMQFEKLDARIKSVEKQP---AFLIRKERSNPMHHKE------YWEVKGKEYGGGFQAVQRRNSRDIYKVDPNL-RMKPKLPKFYG

Query:  STNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLF-----YNLWTLRQGSKSVEAYY
         T+ +EY++WE+ ++ VF C   S++KKV L ++  + Y   WWDKL    RRNLE PIDSW + K+ MR+ FVP+ F       L  L+QG KSVE YY
Subjt:  STNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLF-----YNLWTLRQGSKSVEAYY

Query:  MEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTR
         EM  L+D LDL ED    MARF  GLN EIA   DLQ Y +++E++++AIKIE+ +QR+S R
Subjt:  MEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTR

XP_038887118.1 uncharacterized protein K02A2.6-like [Benincasa hispida]2.0e-4352.69Show/hide
Query:  MKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLF-----YNLWTLR
        +K K+PKF+G T+ +EYI+WE++V++VF C   S+++KV+  V+  +DY  TWWDKL  G RRNLE PI SW + K  MR+HFVP  F       L  LR
Subjt:  MKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLF-----YNLWTLR

Query:  QGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTRQLDCKDLFDCNPS
        QG+KSVE YY EM  L+D LDL ED  T MARF  GLNKEIA  +DLQ Y D++EM++LAIK+EKHL  K  R    K     N S
Subjt:  QGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTRQLDCKDLFDCNPS

TrEMBL top hitse value%identityAlignment
A0A2N9FBC2 Uncharacterized protein7.2e-3937.36Show/hide
Query:  QEDFARLHRSMEKMTMQFEKLDARIKSVE-----KQPAFLIRKERSNPMHHKEYWEVKGKEYGGGFQAVQR-------RNSRDIYKVDPNL-RMKPKLPK
        Q+ F RL+  + ++  + +  +A I++++     +Q    +  E  N    ++  ++  K   G  +  +R       R  RD   +D NL  +K K+P 
Subjt:  QEDFARLHRSMEKMTMQFEKLDARIKSVE-----KQPAFLIRKERSNPMHHKEYWEVKGKEYGGGFQAVQR-------RNSRDIYKVDPNL-RMKPKLPK

Query:  FYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLRQGSKSVE
        F G TN + Y++WE+++D VFDC   SEEKK+KL V    DY I WWD+L    RRN E P+++W +LK LMR  FVP  FY      L  L QGS+SVE
Subjt:  FYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLRQGSKSVE

Query:  AYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKST
         Y+ EM+  + + ++ ED   TMARFF GLN++IA +++LQ Y ++++M+++A+K+E+ L+RK T
Subjt:  AYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKST

A0A2N9GNR6 Uncharacterized protein3.2e-3929.9Show/hide
Query:  QEDFARLHRSMEKMTMQFEKLDARIKSVE-----KQPAFLIRKERSNPMHHKEYWEVKGKEYGGGFQAVQRRNSRDIYK-----VDPNL-RMKPKLPKFY
        Q+ F RL+  + ++  + +  +A I++++     ++    +  E  N    ++  ++  +   G  + V+R    +        VD +L  +K K+P F 
Subjt:  QEDFARLHRSMEKMTMQFEKLDARIKSVE-----KQPAFLIRKERSNPMHHKEYWEVKGKEYGGGFQAVQRRNSRDIYK-----VDPNL-RMKPKLPKFY

Query:  GSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLRQGSKSVEAY
        G T+ + Y++WE+++D VFDC   SEEKKVKL V    DY I WWD+L    RRN E P+++W +LK LMR  FVP  FY      L  L QGS+SVE Y
Subjt:  GSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLG-RRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLRQGSKSVEAY

Query:  YMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTRQLDCKDLFDCNPSTIVIPTHDEEAKTIIEKLDSKFP
        + EM+  +   ++ ED   TMARF  GLN++IA +++LQ Y ++++MV++A+K+E+ L+RK T     + +     +  V+   ++++  + E +D+   
Subjt:  YMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTRQLDCKDLFDCNPSTIVIPTHDEEAKTIIEKLDSKFP

Query:  SKV-----NNGNQHMNINANAF-----------LERQNSKFA---------------SKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGGLP-
          V      +GN    +  + F           L+RQ  +FA                      T K    GDWVW H  K+ +  ++K K   +G  P 
Subjt:  SKV-----NNGNQHMNINANAF-----------LERQNSKFA---------------SKFIHGSTHKVFKPGDWVWKHYWKDPYSFNKKPKWRSKGGLP-

Query:  -----ITSHDYKIALQGE
             I  + +K+ L GE
Subjt:  -----ITSHDYKIALQGE

A0A2N9HDW6 Integrase catalytic domain-containing protein1.6e-3837.22Show/hide
Query:  QEDFARLHRSMEKMTMQFEKLDARIKSVE-------KQPAFLIRKERSNPMHHKEYWEVKGKEYGGGFQAVQRR-------NSRDIYKVDPNL-RMKPKL
        Q+ F RL+  + ++  + +  +A I++++       ++P   +  E  N    ++  +++ +   G  + V+R          RD   VD NL  +K K+
Subjt:  QEDFARLHRSMEKMTMQFEKLDARIKSVE-------KQPAFLIRKERSNPMHHKEYWEVKGKEYGGGFQAVQRR-------NSRDIYKVDPNL-RMKPKL

Query:  PKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLGRRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLRQGSKSV
        P F G T+ + Y++WE+++D VFDC T SEEKKVKL V    DY I WWD+L   RRN E PI++W +LK LMR  F+P  FY      L  L QGS+SV
Subjt:  PKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLFLGRRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLRQGSKSV

Query:  EAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKST
        E Y+ EM+  +   ++ ED   TMARF  GLN++IA +++LQ Y ++++MV++A+K+E+ L++K T
Subjt:  EAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKST

A0A6J1DFN2 uncharacterized protein LOC1110200574.1e-4251.58Show/hide
Query:  QRRNSRDIYKVDPNL-RMKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLF-LGRRNLEHPIDSWNDLKQLMREHF
        QRR  R+  +VD NL  +K K+ KF+G T+ +EY+QWE +V+ VF C   SEEKK++L V+  + Y  TWWDKL    RRNLE PI SW +  + ++E F
Subjt:  QRRNSRDIYKVDPNL-RMKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLF-LGRRNLEHPIDSWNDLKQLMREHF

Query:  VPKLF-----YNLWTLRQGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTR
        VPK F       L  LRQG+KSVE YY E+  L+D +DL ED    MARF   LNKEIA  +DLQ Y D++EM++LAIKIE+ +QRKS R
Subjt:  VPKLF-----YNLWTLRQGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKSTR

A0A6J1DWE9 LOW QUALITY PROTEIN: uncharacterized protein LOC1110241152.1e-4643.91Show/hide
Query:  RLHRSMEKMTMQFEKLDARIKS---VEKQPAFLIRKERSNPMHHKEYWE---------------------VKGKEYGGGFQAVQRRNSRDIYKVDPNL-R
        R+ RSME +T +  +L+ + ++   V   P   +  E        ++WE                      +G+  G G+   QR   R   +VD NL  
Subjt:  RLHRSMEKMTMQFEKLDARIKS---VEKQPAFLIRKERSNPMHHKEYWE---------------------VKGKEYGGGFQAVQRRNSRDIYKVDPNL-R

Query:  MKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLF-LGRRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLR
        +K KL KFYG T+ + YIQW ++V+ VFDC   SEEKKV+L ++  +DY +TWWDKL  + RRNLE PIDSW + K+L+R  FVP+ F+      L  LR
Subjt:  MKPKLPKFYGSTNSKEYIQWERQVDHVFDCCTLSEEKKVKLIVSHLRDYVITWWDKLF-LGRRNLEHPIDSWNDLKQLMREHFVPKLFY-----NLWTLR

Query:  QGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKS
        QGSKSVE YY EM  L++ LDL ED    MARF  GLNKEIA  +DLQ Y +++EM++LAIKIEK LQR+S
Subjt:  QGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAALLDLQLYGDLDEMVNLAIKIEKHLQRKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAAGAGGATTTTGCAAGACTTCATCGAAGCATGGAGAAAATGACCATGCAATTTGAGAAATTGGATGCAAGGATCAAATCCGTGGAGAAACAACCAGCCTTTCT
TATCCGAAAGGAGAGATCTAATCCAATGCACCATAAGGAGTATTGGGAGGTCAAAGGAAAAGAATATGGTGGAGGTTTCCAAGCTGTCCAACGGAGAAATTCAAGAGACA
TTTATAAAGTTGATCCCAACTTGCGCATGAAGCCTAAACTTCCTAAATTCTATGGAAGTACAAACTCAAAGGAGTACATTCAATGGGAAAGGCAAGTTGATCATGTCTTT
GATTGTTGCACTTTAAGTGAGGAGAAAAAGGTAAAACTTATTGTTTCTCATCTTAGAGATTATGTCATTACTTGGTGGGATAAATTGTTTTTGGGTAGGAGGAACCTTGA
ACATCCAATAGATTCATGGAATGATTTGAAGCAATTAATGCGAGAGCACTTTGTTCCTAAGCTTTTCTATAACTTGTGGACTTTGAGACAAGGGAGCAAAAGTGTGGAGG
CTTACTACATGGAGATGCAAAAATTGCTTGATGAACTTGATCTTTATGAGGATGAGATGACTACCATGGCTCGTTTCTTTAGAGGACTTAATAAGGAGATTGCTGCCCTA
CTTGATCTTCAACTTTATGGGGATTTAGACGAGATGGTGAACTTAGCCATAAAGATTGAAAAACATCTCCAAAGGAAGTCTACAAGGCAATTAGATTGCAAAGACTTGTT
TGATTGTAACCCTTCTACTATTGTTATACCTACTCATGATGAAGAAGCCAAAACAATCATTGAGAAGCTTGATTCAAAGTTCCCTTCTAAAGTCAACAATGGCAACCAAC
ACATGAACATTAATGCCAATGCATTTCTTGAGAGGCAAAACTCCAAATTTGCTTCTAAGTTCATCCATGGCAGCACGCATAAGGTGTTCAAACCGGGTGATTGGGTTTGG
AAACACTATTGGAAGGATCCTTATTCTTTTAATAAGAAACCCAAGTGGAGATCCAAAGGCGGTTTACCAATCACATCCCATGACTACAAAATTGCTCTACAAGGCGAGAG
GGATGACCATTCAAGTAGCTTTGATCGAACTACTTGTGGAGTGATTCGAATCTCAAGCAAGCTTCTTGCCTTGAGATATTTGATCGACAAGATTAGAGGAGAAAATGGAG
ATGGGAAGAGAAAACGACGAACCTCACTTTGTTCCGACTGCCACCCATTCCGGGTAAAGAATTCTCGCGCGTCGCCGGAGGAAAATGACCTTTCTCGTCGGAGGAAAACG
AGAGCCAAGGGAAAGAAAGGCAAGGGAAGAAAAATGAAAAGGGGAAGAGAGGAAGTGAGGAAAGAAAATGAAAGGGTGTGCGATGAAATCCCCCTTATCGACGCACGGTT
GCGTCGTTTCAGACAGTTTCGACGCAACGTTGCGTCAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCAAGAGGATTTTGCAAGACTTCATCGAAGCATGGAGAAAATGACCATGCAATTTGAGAAATTGGATGCAAGGATCAAATCCGTGGAGAAACAACCAGCCTTTCT
TATCCGAAAGGAGAGATCTAATCCAATGCACCATAAGGAGTATTGGGAGGTCAAAGGAAAAGAATATGGTGGAGGTTTCCAAGCTGTCCAACGGAGAAATTCAAGAGACA
TTTATAAAGTTGATCCCAACTTGCGCATGAAGCCTAAACTTCCTAAATTCTATGGAAGTACAAACTCAAAGGAGTACATTCAATGGGAAAGGCAAGTTGATCATGTCTTT
GATTGTTGCACTTTAAGTGAGGAGAAAAAGGTAAAACTTATTGTTTCTCATCTTAGAGATTATGTCATTACTTGGTGGGATAAATTGTTTTTGGGTAGGAGGAACCTTGA
ACATCCAATAGATTCATGGAATGATTTGAAGCAATTAATGCGAGAGCACTTTGTTCCTAAGCTTTTCTATAACTTGTGGACTTTGAGACAAGGGAGCAAAAGTGTGGAGG
CTTACTACATGGAGATGCAAAAATTGCTTGATGAACTTGATCTTTATGAGGATGAGATGACTACCATGGCTCGTTTCTTTAGAGGACTTAATAAGGAGATTGCTGCCCTA
CTTGATCTTCAACTTTATGGGGATTTAGACGAGATGGTGAACTTAGCCATAAAGATTGAAAAACATCTCCAAAGGAAGTCTACAAGGCAATTAGATTGCAAAGACTTGTT
TGATTGTAACCCTTCTACTATTGTTATACCTACTCATGATGAAGAAGCCAAAACAATCATTGAGAAGCTTGATTCAAAGTTCCCTTCTAAAGTCAACAATGGCAACCAAC
ACATGAACATTAATGCCAATGCATTTCTTGAGAGGCAAAACTCCAAATTTGCTTCTAAGTTCATCCATGGCAGCACGCATAAGGTGTTCAAACCGGGTGATTGGGTTTGG
AAACACTATTGGAAGGATCCTTATTCTTTTAATAAGAAACCCAAGTGGAGATCCAAAGGCGGTTTACCAATCACATCCCATGACTACAAAATTGCTCTACAAGGCGAGAG
GGATGACCATTCAAGTAGCTTTGATCGAACTACTTGTGGAGTGATTCGAATCTCAAGCAAGCTTCTTGCCTTGAGATATTTGATCGACAAGATTAGAGGAGAAAATGGAG
ATGGGAAGAGAAAACGACGAACCTCACTTTGTTCCGACTGCCACCCATTCCGGGTAAAGAATTCTCGCGCGTCGCCGGAGGAAAATGACCTTTCTCGTCGGAGGAAAACG
AGAGCCAAGGGAAAGAAAGGCAAGGGAAGAAAAATGAAAAGGGGAAGAGAGGAAGTGAGGAAAGAAAATGAAAGGGTGTGCGATGAAATCCCCCTTATCGACGCACGGTT
GCGTCGTTTCAGACAGTTTCGACGCAACGTTGCGTCAATTTAG
Protein sequenceShow/hide protein sequence
MSQEDFARLHRSMEKMTMQFEKLDARIKSVEKQPAFLIRKERSNPMHHKEYWEVKGKEYGGGFQAVQRRNSRDIYKVDPNLRMKPKLPKFYGSTNSKEYIQWERQVDHVF
DCCTLSEEKKVKLIVSHLRDYVITWWDKLFLGRRNLEHPIDSWNDLKQLMREHFVPKLFYNLWTLRQGSKSVEAYYMEMQKLLDELDLYEDEMTTMARFFRGLNKEIAAL
LDLQLYGDLDEMVNLAIKIEKHLQRKSTRQLDCKDLFDCNPSTIVIPTHDEEAKTIIEKLDSKFPSKVNNGNQHMNINANAFLERQNSKFASKFIHGSTHKVFKPGDWVW
KHYWKDPYSFNKKPKWRSKGGLPITSHDYKIALQGERDDHSSSFDRTTCGVIRISSKLLALRYLIDKIRGENGDGKRKRRTSLCSDCHPFRVKNSRASPEENDLSRRRKT
RAKGKKGKGRKMKRGREEVRKENERVCDEIPLIDARLRRFRQFRRNVASI