Maintaining Large Vista Installations Amy Edwards, Ezra Freelove, & George Hernandez July 12, 2007

Page 1: Maintaining Large Vista Installations

Maintaining Large Vista Installations

Amy Edwards, Ezra Freelove, & George Hernandez

July 12, 2007

Page 2: Maintaining Large Vista Installations

Agenda

• Comparisons

• Who is USG

• Automation

• Monitoring

• Maintenance

• More Tricks

• Questions?

Page 3: Maintaining Large Vista Installations

Informal Poll - Number of nodes

(All prod clusters) now:

• 1-10
• 11-20
• 21-50
• 50-70
• 70+

• Ours in bold

(All prod clusters) by December:

• 1-10
• 11-20
• 21-50
• 50-70
• 70+

Page 4: Maintaining Large Vista Installations

Informal Poll – Number of DB Instances

Including secondary and non-production

• 1-2
• 3-6
• 7-10
• 10+

• Ours in bold

Page 5: Maintaining Large Vista Installations

Vista Architecture

Page 6: Maintaining Large Vista Installations

GeorgiaVIEW Project

• University System of Georgia (USG)

• Vista 3.0.7

• Host 32 institutions & multiple consortial programs

• >150,000 active students
  – Active is 100+ actions

• >11,000 active sections / term

Page 7: Maintaining Large Vista Installations

Issues

• Handling performance issues

• Capacity planning

• Upgrades

• Replication

• JMS sensitivity

• Integration

Page 8: Maintaining Large Vista Installations

Automation

• Rolling Restarts
  – Managed nodes restarted weekly
    • except JMS

• Log cleanup to preserve space

• Error reporting
  – application, tracking, vulnerabilities

• Thread dumps

• Sync admin node with backup

• LDIS batch integration
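The weekly rolling restart could be driven by a small script along these lines. This is a minimal sketch, not the USG tooling: the node names, the `restart` callback, and the pause length are hypothetical. The only rules taken from the slide are that managed nodes are restarted and the JMS node is skipped; restarting one node at a time is the usual meaning of "rolling" and is assumed here.

```python
import time
from typing import Callable, Iterable, List


def rolling_restart(
    nodes: Iterable[str],
    jms_nodes: Iterable[str],
    restart: Callable[[str], bool],
    pause_seconds: float = 0.0,
) -> List[str]:
    """Restart managed nodes one at a time, skipping JMS nodes.

    `restart` is a hypothetical callback that restarts one node and
    returns True on success. The loop stops at the first failure so the
    remaining nodes keep serving. Returns the nodes restarted successfully.
    """
    skip = set(jms_nodes)
    restarted: List[str] = []
    for node in nodes:
        if node in skip:
            continue  # the JMS node is excluded from the weekly restart
        if not restart(node):
            break  # leave the remaining nodes untouched for an operator
        restarted.append(node)
        time.sleep(pause_seconds)  # give the node time to rejoin the cluster
    return restarted
```

A cron entry would call this once a week with the real node list and a `restart` function that wraps whatever stop/start command the site uses.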

Page 9: Maintaining Large Vista Installations

Monitoring

• Nagios
  – http://www.nagios.org/
  – Sends alerts

• Stats
  – Custom AJAX web app
  – Watch changes over time

• AWStats
  – http://www.awstats.org/

Page 10: Maintaining Large Vista Installations

Nagios Example

Page 11: Maintaining Large Vista Installations

Nagios Monitors

• OS / Hardware
  – Load
  – Temperature
  – Free space

• Database
  – Tablespace free space
  – Listener
  – Oracle processes

• Application
  – Direct-login
  – Weblogic processes
  – Java MBeans
    • Default/Primary Pending Requests Current Count
    • Java Heap Current
    • JDBC Waiting for Connection Current Count
    • Multicast Messages Lost
    • Primary count
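Each monitor in the list is a Nagios check. Nagios plugins report status through their exit code (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) plus one line of text. As an illustration of the "Free space" monitor, here is a minimal sketch of such a check. The 20% and 10% thresholds and the message wording are assumptions, not values from the presentation.

```python
import shutil
from typing import Tuple

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def classify_free_space(free_pct: float, warn_pct: float, crit_pct: float) -> int:
    """Map a free-space percentage to a Nagios status code."""
    if free_pct < crit_pct:
        return CRITICAL
    if free_pct < warn_pct:
        return WARNING
    return OK


def check_free_space(
    path: str, warn_pct: float = 20.0, crit_pct: float = 10.0
) -> Tuple[int, str]:
    """Return (exit_code, status_line) for the filesystem holding `path`."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        return UNKNOWN, f"DISK UNKNOWN - {path}: {exc}"
    if usage.total == 0:
        return UNKNOWN, f"DISK UNKNOWN - {path}: reported size is zero"
    free_pct = 100.0 * usage.free / usage.total
    status = classify_free_space(free_pct, warn_pct, crit_pct)
    return status, f"DISK {LABELS[status]} - {free_pct:.1f}% free on {path}"
```

To run it as a plugin, a wrapper would print the status line and call `sys.exit` with the returned code.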

Page 12: Maintaining Large Vista Installations

Stats

• Short and long term analysis
  – 21 months of data

• Graphs all Nagios data collected

• Flexible creation of reports

• Built with AJAX

Page 13: Maintaining Large Vista Installations

Stats Examples I of III

Page 14: Maintaining Large Vista Installations

Stats Examples II of III

Page 15: Maintaining Large Vista Installations

Stats Examples III of III

Page 16: Maintaining Large Vista Installations

AWStats

• Records data from web server logs

• Custom script grabs data from webserver.log files

• Runs daily
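The presentation does not show the custom script. One plausible shape for the daily job is to gather every node's `webserver.log` into a single file that AWStats can then process. The sketch below assumes a hypothetical layout in which each node's logs have been copied under one root directory. Only the `webserver.log` file name comes from the slide.

```python
from pathlib import Path
from typing import List


def collect_webserver_logs(root: Path, merged: Path) -> List[Path]:
    """Concatenate every webserver.log found under `root` into `merged`.

    Files are read in sorted path order; lines are not re-sorted by
    timestamp. Returns the list of source files that were read.
    """
    sources = sorted(p for p in root.rglob("webserver.log") if p != merged)
    with merged.open("w", encoding="utf-8") as out:
        for source in sources:
            with source.open("r", encoding="utf-8", errors="replace") as src:
                for line in src:
                    # Make sure a file without a trailing newline does not
                    # run into the first line of the next file.
                    out.write(line if line.endswith("\n") else line + "\n")
    return sources
```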

Page 17: Maintaining Large Vista Installations

AWStats Examples I of II

Page 18: Maintaining Large Vista Installations

AWStats Examples II of II

Page 19: Maintaining Large Vista Installations

Specialized Nodes

• Admin

• JMS

• Institutional Admin
  – Integration

• Chat

Page 20: Maintaining Large Vista Installations

JMS Node

• Provides special services
  – Mail, LC creation, chat

• Failure or migration of JMS node hinders usage

• Services do not migrate well
  – Allow targeted migration
  – OTHERS: Pin JMS to a specific node

Page 21: Maintaining Large Vista Installations

Integration

• Batched LDIS data files

• Cron runs nightly

• Files broken up by:
  – type
  – “reasonable” number of records

• Done on Inst node
  – Issues with import can kill node
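Breaking files up by type and by a "reasonable" number of records can be sketched as below. The record representation (a type label plus an opaque payload string) and the `max_records` parameter are assumptions for illustration. The slide does not say what the real limit is or how LDIS records are structured.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def split_batches(
    records: Iterable[Tuple[str, str]], max_records: int
) -> Dict[str, List[List[str]]]:
    """Group (record_type, payload) pairs by type, then cut each group
    into chunks of at most `max_records`, preserving input order."""
    if max_records < 1:
        raise ValueError("max_records must be at least 1")
    by_type: Dict[str, List[str]] = defaultdict(list)
    for record_type, payload in records:
        by_type[record_type].append(payload)
    return {
        record_type: [
            items[i : i + max_records] for i in range(0, len(items), max_records)
        ]
        for record_type, items in by_type.items()
    }
```

Each inner list would become one import file, so a failed import affects only a bounded number of records of one type.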

Page 22: Maintaining Large Vista Installations

Touching Nodes

• ssh & dsh
  – Touch groups of nodes at once
  – Useful for:
    • Installs
    • Gathering logs
    • Locating a session
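Where dsh is not available, the same idea of touching a group of nodes at once can be approximated with plain ssh in a loop. This is a minimal sketch: the host names and the example command are hypothetical, and it assumes key-based login is already set up (`BatchMode=yes` is the OpenSSH option that disables password prompts).

```python
import subprocess
from typing import Callable, Dict, Iterable, List, Optional, Tuple

Runner = Callable[[List[str]], Tuple[int, str]]


def ssh_argv(host: str, command: str) -> List[str]:
    """Build the ssh command line for running `command` on `host`."""
    return ["ssh", "-o", "BatchMode=yes", host, command]


def _run(argv: List[str]) -> Tuple[int, str]:
    """Default runner: execute argv and return (exit code, stdout)."""
    done = subprocess.run(argv, capture_output=True, text=True)
    return done.returncode, done.stdout


def run_on_nodes(
    hosts: Iterable[str], command: str, runner: Optional[Runner] = None
) -> Dict[str, Tuple[int, str]]:
    """Run `command` on each host in turn and collect the results.

    `runner` can be swapped out for testing without a network.
    """
    run = runner if runner is not None else _run
    return {host: run(ssh_argv(host, command)) for host in hosts}
```

For the "locating a session" case, the command could be a `grep` for the session identifier in each node's log; hosts that return exit code 0 are the ones that saw it.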

Page 23: Maintaining Large Vista Installations

Maintenance Page

• Hosted on opposite f5

• Two versions
  – Scheduled maintenance
  – Unscheduled outage

• In an f5 outage, move DNS to other f5 so message still appears

Page 24: Maintaining Large Vista Installations

Installs and Upgrades

• Silent install scripts

• Test in both development environments
  – Create against a small database
  – Measure time to complete against a full-size copy of production

• Install to production

Page 25: Maintaining Large Vista Installations

Powerlinks and Custom Development

• Test in development

• Try to break

• Pilot in production

• Release to all

Page 26: Maintaining Large Vista Installations

Questions?

Page 27: Maintaining Large Vista Installations

Want More?

• To view my resources and references for this presentation, visit www.scholar.com

• Simply click “Advanced Search” and search by ezrafreelove and tag: ‘bbworld07’